fix(hooks): scrub secrets and harden hook security by jtzingsheim1 · Pull Request #348 · affaan-m/everything-claude-code

jtzingsheim1 · 2026-03-07T05:43:47Z

Summary

Addresses #347 — defense-in-depth security hardening across 4 hook/library files.

Scrub secrets from observation logs — regex-replace common secret patterns (api_key, token, password, etc.) with [REDACTED] before writing to JSONL, plus auto-purge observation files older than 30 days (observe.sh)
Strip auth tokens from git remote URLs — remove embedded credentials from URLs before persisting to projects.json (detect-project.sh)
Add command prefix allowlist to runCommand — only git, node, npx, which, where are permitted (utils.js)
Sanitize session ID in temp file paths — strip non-alphanumeric characters from CLAUDE_SESSION_ID to prevent path traversal (suggest-compact.js)

All patches are conservative — they constrain without changing existing functionality.

Test plan

node tests/lib/utils.test.js — 140/141 pass (1 pre-existing Windows symlink failure, unrelated)
node tests/hooks/suggest-compact.test.js — 20/20 pass
Manual: work with a mock API key in conversation, verify it's scrubbed in observation logs
Manual: clone a repo with token-in-URL remote, verify token stripped in projects.json
Manual: verify suggest-compact still works with sanitized session IDs

🤖 Generated with Claude Code

Summary by cubic

Hardened hook security and improved secret scrubbing to prevent credential leaks; now blocks $() and backticks even inside quotes, and prevents newline injection. Addresses Linear #347 with defense-in-depth updates across hooks and utils.

Bug Fixes
- Strengthened secret scrubbing in observation logs: preserves Authorization scheme; scrubs parse_error fallback; fixes regex quoting; project‑scoped 30‑day auto‑purge of archived files (once per session).
- Removed embedded credentials from git remotes before saving to projects.json; migrate legacy project directories if the hash changes.
- Tightened runCommand: allowlist prefixes (git, node, npx, which, where); block $ and backticks anywhere; block ;, |, &, and newline when unquoted; suppress blocked command echo.
- Sanitized CLAUDE_SESSION_ID in temp paths with a safe charset and a fallback to "default" if empty.

^{Written for commit 1cab6a7. Summary will update on new commits.}

Summary by CodeRabbit

Security
- Blocked unsafe command execution via an allowlist and metacharacter checks; added secret scrubbing for persisted observations.
Bug Fixes
- Sanitized session identifiers for reliable file handling.
- Backward-compatible handling of legacy project identifiers to avoid duplicate or missing project data.
Maintenance
- Automatic purge of observation files older than 30 days; improved observation persistence flow.
Tests
- Added tests covering command-safety and secret-masking behavior.

- Scrub common secret patterns (api_key, token, password, etc.) from observation logs before persisting to JSONL (observe.sh) - Auto-purge observation files older than 30 days (observe.sh) - Strip embedded credentials from git remote URLs before saving to projects.json (detect-project.sh) - Add command prefix allowlist to runCommand — only git, node, npx, which, where are permitted (utils.js) - Sanitize CLAUDE_SESSION_ID in temp file paths to prevent path traversal (suggest-compact.js) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

coderabbitai · 2026-03-07T05:44:02Z

Note

Reviews paused

It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the reviews.auto_review.auto_pause_after_reviewed_commits setting.

Use the following commands to manage reviews:

@coderabbitai resume to resume automatic reviews.
@coderabbitai review to trigger a single review.

Use the checkboxes below for quick actions:

▶️ Resume reviews
🔍 Trigger review

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 1b62efe8-abad-4d49-89e6-066bb89c337a

📥 Commits

Reviewing files that changed from the base of the PR and between 597d431 and 1cab6a7.

📒 Files selected for processing (2)

scripts/lib/utils.js
tests/lib/utils.test.js

🚧 Files skipped from review as they are similar to previous changes (2)

tests/lib/utils.test.js
scripts/lib/utils.js

📝 Walkthrough

Walkthrough

Sanitizes session IDs, restricts runCommand to an allowlist and block unquoted shell metacharacters, adds secret-scrubbing and a 30‑day auto‑purge for observations, and strips credentials from git remote URLs with migration of legacy project hash directories.

Changes

Cohort / File(s)	Summary
Hooks — session & compacting `scripts/hooks/suggest-compact.js`	Sanitizes `sessionId` by removing non-alphanumeric/underscore/dash characters before using it in counter file paths.
CLI utility `scripts/lib/utils.js`	`runCommand` enforces an allowlist of prefixes (`git` , `node` , `npx` , `which` , `where` ) and rejects commands containing unquoted shell metacharacters (e.g., `;
Continuous learning — observe `skills/continuous-learning-v2/hooks/observe.sh`	Adds regex-based secret scrubbing for input/output, constructs scrubbed JSON observation payloads, persists observations, signals observers, and adds a session-scoped auto-purge removing observation files older than 30 days.
Continuous learning — project detection `skills/continuous-learning-v2/scripts/detect-project.sh`	Strips embedded credentials from `remote_url` before hashing to compute `project_id`, computes a legacy hash, and migrates/renames legacy project directories to the new sanitized-hash directory when appropriate.
Tests `tests/lib/utils.test.js`	Adds comprehensive tests for `runCommand` allowlist and metacharacter-blocking, covering quoted/unquoted edge cases, prefix matching, and ensuring blocked commands do not leak sensitive strings.

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Possibly related issues

Issue #347: Edits the same files and implements the same objectives — sessionId sanitization, runCommand allowlist and metacharacter checks, observation secret-scrubbing and purge, and git credential stripping/migration.

Possibly related PRs

PR #31: Modifies scripts/hooks/suggest-compact.js and scripts/lib/utils.js; closely related to session sanitization and runCommand guards.
PR #41: Touches the suggest-compact hook; related to session/compacting hook behavior.
PR #207: Overlaps with skills/continuous-learning-v2/hooks/observe.sh changes around observation handling and secret-scrubbing.

Suggested reviewers

affaan-m

Poem

"I nibble logs and tidy tracks, 🐇
I tuck away secrets in soft packs,
I sanitize, scrub, and hop along,
Rename the paths and guard the song,
A careful rabbit keeps code strong."

🚥 Pre-merge checks | ✅ 2 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 50.00% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title clearly and concisely summarizes the main security-focused changes: secret redaction (scrub secrets) and command validation hardening (harden hook security) across multiple hook and library files.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

cubic-dev-ai

2 issues found across 4 files

Prompt for AI agents (unresolved issues)


Check if these issues are valid — if so, understand the root cause of each and fix them. If appropriate, use sub-agents to investigate and fix each issue separately.


<file name="scripts/hooks/suggest-compact.js">

<violation number="1" location="scripts/hooks/suggest-compact.js:28">
P2: Sanitization can produce an empty `sessionId`, losing session isolation. If `CLAUDE_SESSION_ID` contains only non-alphanumeric characters (e.g. `"//../.."` → `""`), the `|| 'default'` fallback has already been bypassed. Add a second fallback after `.replace()` to guarantee a non-empty value.</violation>
</file>

<file name="scripts/lib/utils.js">

<violation number="1" location="scripts/lib/utils.js:343">
P1: The prefix allowlist is trivially bypassable because the command is still passed to `execSync`, which invokes a shell. Any input like `'git log; evil_command'` passes the `'git '` prefix check yet executes arbitrary shell commands. Consider either validating that the command contains no shell metacharacters, or splitting the command and using `spawnSync` (no shell) instead of `execSync`.</violation>
</file>

Since this is your first cubic review, here's how it works:

cubic automatically reviews your code and comments on bugs and improvements
Teach cubic by replying to its comments. cubic learns from your replies and gets better over time
Add one-off context when rerunning by tagging @cubic-dev-ai with guidance or docs links (including llms.txt)
Ask questions if you need clarification on any suggestion

_{Reply with feedback, questions, or to request a fix. Tag @cubic-dev-ai to re-run a review.}

coderabbitai

Actionable comments posted: 3

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (2)

scripts/lib/utils.js (1)
342-356: ⚠️ Potential issue | 🟠 Major

execSync with prefix-only validation allows shell metacharacter injection.

The startsWith() allowlist check on line 344 does not prevent shell metacharacter injection. A command like node -e "..." ; printf "malicious" passes the allowlist but the shell still executes the chained suffix. Additionally, line 345 echoes the full blocked command in the error message, leaking any secrets in the arguments.

Replace this with spawnSync(command, args, { shell: false }) to avoid shell interpretation, or at minimum reject shell metacharacters (; | & < > $ ( ) { } etc.) and return a generic error message instead of the user-supplied command.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@scripts/lib/utils.js` around lines 342 - 356, The allowlist using
allowedPrefixes plus execSync allows shell metacharacter injection and leaks
secrets by returning the full cmd; update the implementation that currently uses
execSync and allowedPrefixes to instead parse the input into a command and args
and call spawnSync(command, args, { shell: false, encoding: 'utf8', stdio:
['pipe','pipe','pipe'], ...options }) (or otherwise validate and split safely),
or at minimum reject any shell metacharacters (e.g. ; | & < > $ ( ) { } `)
before executing; also stop returning the raw cmd in error responses—return a
generic error message and include only non-sensitive error details (err.message)
to avoid leaking secrets (referencing allowedPrefixes, execSync, spawnSync, and
the error return that echoes cmd).
skills/continuous-learning-v2/hooks/observe.sh (1)
165-183: ⚠️ Potential issue | 🔴 Critical

The embedded Python scrubber is syntactically invalid.

This python3 -c payload is wrapped in a double-quoted shell string, so the '\'' fragments on lines 180–181 are passed straight through to Python instead of acting as shell escapes. Python then sees r'([\"'\''\\s:=]+)' and raises SyntaxError: unexpected character after line continuation character, which aborts the hook before any observation is written.
Fix
-    r'([\"'\''\\s:=]+)'
-    r'[\"'\''\\s]*'
+    r"([\"'\s:=]+)"
+    r"[\"'\s]*"
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@skills/continuous-learning-v2/hooks/observe.sh` around lines 165 - 183, The
embedded Python scrubber in the hook is failing due to incorrect quoting in the
regex literal _SECRET_RE (the r'([\"'\''\\s:=]+)' fragment) being broken by the
surrounding double-quoted shell string; update the hook so the Python payload is
not corrupted by shell escapes (for example, switch the inline Python to a
here-doc or use a triple-quoted Python string literal for the payload) and fix
the regex literal in _SECRET_RE to valid Python quoting (e.g., use a single
Python string that contains escaped single-quote and backslash sequences or a
triple-quoted raw string) so the pattern for ([\"'\s:=]+) compiles correctly;
ensure the change is made where _SECRET_RE is defined in the embedded Python and
that the rest of the scrubber code (the observation construction) remains
unchanged.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@skills/continuous-learning-v2/hooks/observe.sh`:
- Around line 75-80: The purge currently searches
"${_CLV2_PROJECTS_DIR:-$CONFIG_DIR/projects}" for files named
"observations*.jsonl", which can match the live
${PROJECT_DIR}/observations.jsonl and cross-delete other projects' live queues;
update the find call to target only archived files in the current project by
changing the search root to "${PROJECT_DIR}" and the name filter to
"observations-*.jsonl" (so it won't match the live "observations.jsonl"); keep
the existing PURGE_MARKER logic and error redirection intact.
- Around line 177-189: The _SECRET_RE regex used by scrub fails to redact
Authorization headers with schemes like "Bearer" because the final character
class ([A-Za-z0-9_\-/.+=]{8,}) doesn't allow a scheme and space before the
token; update _SECRET_RE so it optionally accepts an auth scheme (e.g., allow an
optional word and whitespace like (?:[A-Za-z]+[ \t]+)? or accept any
non-newline, non-quote characters between the separator and the token) before
matching the token, then keep scrub unchanged to substitute the captured secret
with [REDACTED]; make sure to still require a minimum token length (e.g., {8,})
and preserve the captured groups \1 and \2 in the replacement.

In `@skills/continuous-learning-v2/scripts/detect-project.sh`:
- Around line 67-74: The script now computes project_id from the sanitized
remote (hash_input) which breaks continuity with the legacy hash and can orphan
existing data; update detect-project.sh to detect an existing project dir under
the legacy hash and either reuse it or migrate its contents into the new
${_CLV2_PROJECTS_DIR}/${project_id}. Specifically, after computing the new
project_id, compute the legacy hash (the previous behavior using the original
remote/project_root input) and check both ${_CLV2_PROJECTS_DIR}/${legacy_hash}
and ${_CLV2_PROJECTS_DIR}/${project_id}; if the legacy directory exists and the
new one does not, set project_id to the legacy hash or move the legacy directory
into the new project_id location (preserving files and permissions), otherwise
leave project_id as computed. Ensure this logic references the variables
hash_input, project_id, and the legacy hash variable you introduce so future
changes are obvious.

---

Outside diff comments:
In `@scripts/lib/utils.js`:
- Around line 342-356: The allowlist using allowedPrefixes plus execSync allows
shell metacharacter injection and leaks secrets by returning the full cmd;
update the implementation that currently uses execSync and allowedPrefixes to
instead parse the input into a command and args and call spawnSync(command,
args, { shell: false, encoding: 'utf8', stdio: ['pipe','pipe','pipe'],
...options }) (or otherwise validate and split safely), or at minimum reject any
shell metacharacters (e.g. ; | & < > $ ( ) { } `) before executing; also stop
returning the raw cmd in error responses—return a generic error message and
include only non-sensitive error details (err.message) to avoid leaking secrets
(referencing allowedPrefixes, execSync, spawnSync, and the error return that
echoes cmd).

In `@skills/continuous-learning-v2/hooks/observe.sh`:
- Around line 165-183: The embedded Python scrubber in the hook is failing due
to incorrect quoting in the regex literal _SECRET_RE (the r'([\"'\''\\s:=]+)'
fragment) being broken by the surrounding double-quoted shell string; update the
hook so the Python payload is not corrupted by shell escapes (for example,
switch the inline Python to a here-doc or use a triple-quoted Python string
literal for the payload) and fix the regex literal in _SECRET_RE to valid Python
quoting (e.g., use a single Python string that contains escaped single-quote and
backslash sequences or a triple-quoted raw string) so the pattern for
([\"'\s:=]+) compiles correctly; ensure the change is made where _SECRET_RE is
defined in the embedded Python and that the rest of the scrubber code (the
observation construction) remains unchanged.

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 2e19644e-aaa5-4264-ba8f-f0e5cff74c85

📥 Commits

Reviewing files that changed from the base of the PR and between 03b3e0d and 4734a14.

📒 Files selected for processing (4)

scripts/hooks/suggest-compact.js
scripts/lib/utils.js
skills/continuous-learning-v2/hooks/observe.sh
skills/continuous-learning-v2/scripts/detect-project.sh

- Reject shell command-chaining operators (;|&`) in runCommand, strip quoted sections before checking to avoid false positives (utils.js) - Remove command string from blocked error message to avoid leaking secrets (utils.js) - Fix Python regex quoting: switch outer shell string from double to single quotes so regex compiles correctly (observe.sh) - Add optional auth scheme match (Bearer, Basic) to secret scrubber regex (observe.sh) - Scope auto-purge to current project dir and match only archived files (observations-*.jsonl), not live queue (observe.sh) - Add second fallback after session ID sanitization to prevent empty string (suggest-compact.js) - Preserve backward compatibility when credential stripping changes project hash — detect and migrate legacy directories (detect-project.sh) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

jtzingsheim1 · 2026-03-07T06:14:04Z

Thanks for the thorough review! All 6 findings were valid — addressed in 953babd:

runCommand metacharacter injection — now rejects `;|& outside quoted strings (strips quoted sections before checking)
Python regex SyntaxError — switched from double-quoted to single-quoted shell string with proper escaping
Purge matching live observations.jsonl — scoped to ${PROJECT_DIR} and changed glob to observations-*.jsonl (archives only)
Empty session ID after sanitization — added || 'default' fallback after .replace()
Bearer token not matched — added optional auth scheme (?:[A-Za-z]+\s+)? before token match
Credential stripping orphans existing data — computes legacy hash first, migrates existing directory if found

cubic-dev-ai

2 issues found across 4 files (changes from recent commits).

Prompt for AI agents (unresolved issues)


Check if these issues are valid — if so, understand the root cause of each and fix them. If appropriate, use sub-agents to investigate and fix each issue separately.


<file name="skills/continuous-learning-v2/hooks/observe.sh">

<violation number="1" location="skills/continuous-learning-v2/hooks/observe.sh:182">
P2: The non-capturing auth-scheme group `(?:[A-Za-z]+\s+)?` is consumed by the match but not included in the replacement `\1\2[REDACTED]`. This silently drops the scheme word from the output (e.g. `Authorization: Bearer token` → `Authorization: [REDACTED]` instead of `Authorization: Bearer [REDACTED]`). Make the group capturing and include it in the replacement, or move it inside the secret-value capture so the redaction doesn't eat non-secret text.</violation>
</file>

<file name="scripts/lib/utils.js">

<violation number="1" location="scripts/lib/utils.js:350">
P1: The metacharacter regex misses `$(...)` command substitution and newline-based command separation. A payload like `git log $(malicious)` passes both the prefix allowlist and this check, yet `execSync` will still execute the substitution via the shell. Consider adding `$` and `\n` to the blocked characters.</violation>
</file>

_{Reply with feedback, questions, or to request a fix. Tag @cubic-dev-ai to re-run a review.}

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (1)

scripts/lib/utils.js (1)
348-352: Quote-stripping has edge cases with escaped quotes, but acceptable for current scope.

The regex /"[^"]*"/g doesn't handle escaped quotes (e.g., "foo\"bar"), which could leave partial strings. However, given:

Commands come from trusted internal code

The allowlist limits the attack surface

This is defense-in-depth, not the primary security boundary

This is acceptable. If stricter parsing is ever needed, consider a state-machine tokenizer.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@scripts/lib/utils.js` around lines 348 - 352, The quote-stripping regexes
(/"[^"]*"/g and /'[^']*'/g) used when building the unquoted string in runCommand
can miss escaped quotes (e.g., "foo\"bar"), so add a short inline comment above
the unquoted = cmd.replace(...) line noting this limitation and that it’s an
accepted trade-off for now (trusted internal input + allowlist), and include a
TODO suggesting a state-machine/tokenizer if stricter parsing is required in the
future; keep the current logic unchanged.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@scripts/lib/utils.js`:
- Around line 342-346: Add unit tests for runCommand to verify the allowlist and
shell metacharacter protections: create tests that call runCommand with
disallowed prefixes like "rm ", "curl ", "bash " and assert the return equals {
success: false, output: 'runCommand blocked: unrecognized command prefix' }, and
create tests that call runCommand with commands containing metacharacters such
as "git status; echo pwned" and 'node -e "$(whoami)"' and assert the return
equals { success: false, output: 'runCommand blocked: shell metacharacters not
allowed' }; reference the allowedPrefixes constant and the runCommand function
in your test descriptions so failures point back to the allowlist/shell
metacharacter logic.

---

Nitpick comments:
In `@scripts/lib/utils.js`:
- Around line 348-352: The quote-stripping regexes (/"[^"]*"/g and /'[^']*'/g)
used when building the unquoted string in runCommand can miss escaped quotes
(e.g., "foo\"bar"), so add a short inline comment above the unquoted =
cmd.replace(...) line noting this limitation and that it’s an accepted trade-off
for now (trusted internal input + allowlist), and include a TODO suggesting a
state-machine/tokenizer if stricter parsing is required in the future; keep the
current logic unchanged.

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 3f656702-8642-4300-b760-d6d6c1160240

📥 Commits

Reviewing files that changed from the base of the PR and between 4734a14 and 953babd.

📒 Files selected for processing (4)

scripts/hooks/suggest-compact.js
scripts/lib/utils.js
skills/continuous-learning-v2/hooks/observe.sh
skills/continuous-learning-v2/scripts/detect-project.sh

🚧 Files skipped from review as they are similar to previous changes (1)

scripts/hooks/suggest-compact.js

…y tests - Add $ and \n to blocked shell metacharacters in runCommand to prevent command substitution via $(cmd) and newline injection (utils.js) - Make auth scheme group capturing so Bearer/Basic is preserved in redacted output instead of being silently dropped (observe.sh) - Add 10 unit tests covering runCommand allowlist blocking (rm, curl, bash prefixes) and metacharacter rejection (;|&`$ chaining), plus error message leak prevention (utils.test.js) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

jtzingsheim1 · 2026-03-07T06:33:16Z

Round 2 feedback addressed in a187e72:

$() and newline bypass — added $ and \n to blocked metacharacters (stripped quoted sections still apply)
Bearer scheme dropped in redaction — made auth scheme group capturing, used lambda replacement to preserve "Bearer"/"Basic" in output
Security tests — added 10 unit tests covering prefix allowlist blocking (rm, curl, bash), metacharacter rejection (;|&$` chaining), quoted metacharacter passthrough, and error message leak prevention. 150/151 pass (1 pre-existing Windows symlink EPERM).

coderabbitai

Actionable comments posted: 3

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@skills/continuous-learning-v2/hooks/observe.sh`:
- Around line 157-198: The parse-error branch is writing the raw hook body
unsanitized; run the same Python scrubber used for parsed payloads on the raw
payload before appending to OBSERVATIONS_FILE (or stop storing the raw body).
Specifically, ensure the branch that currently writes raw stdin uses the same
scrub(val) function and _SECRET_RE regex from the python block (the code that
builds observation with "timestamp", "event", "tool", "session",
"project_id"/"project_name" and scrubs "input"/"output") to sanitize the raw
payload (e.g., apply scrub(raw_payload) and store it under "input" or "raw" in
the observation) before printing/json-dumping to OBSERVATIONS_FILE so no
tokens/passwords are persisted unredacted.

In `@tests/lib/utils.test.js`:
- Around line 1007-1035: Add a regression test to tests/lib/utils.test.js that
verifies utils.runCommand also blocks newline injection (\n); specifically,
mirror the existing tests that call utils.runCommand and assert result.success
is false and result.output includes 'metacharacters not allowed', but use an
input string containing a newline (e.g., "git status\necho pwned") to ensure
scripts/lib/utils.js runCommand rejects multiline commands as well.
- Around line 1037-1040: Update the "runCommand allows metacharacters inside
double quotes" test so it actually exercises quote-stripping by including shell
metacharacters inside the double quotes passed to utils.runCommand (instead of
just 'node -e "process.exit(0)"'). For example, call utils.runCommand with a
string containing a metacharacter like ';' inside the quoted segment (e.g.,
'node -e "console.log(1);process.exit(0)"' or use 'sh -c "echo a; echo b"') and
assert result.success is true; change the test block that currently calls
utils.runCommand('node -e "process.exit(0)"') to one of these variants to ensure
the quote-handling branch in utils.runCommand is exercised.

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 17eb2702-cd7e-47e6-8dc0-75083edf8590

📥 Commits

Reviewing files that changed from the base of the PR and between 953babd and a187e72.

📒 Files selected for processing (3)

scripts/lib/utils.js
skills/continuous-learning-v2/hooks/observe.sh
tests/lib/utils.test.js

🚧 Files skipped from review as they are similar to previous changes (1)

scripts/lib/utils.js

Address remaining reviewer feedback from CodeRabbit and Cubic: - Scrub secrets in observe.sh parse-error fallback path (was writing raw unsanitized input to observations file) - Remove redundant re.IGNORECASE flag ((?i) inline flag already set) - Add inline comment documenting quote-stripping limitation trade-off - Fix misleading test name for error-output test - Add 5 new security tests: single-quote passthrough, mixed quoted+unquoted metacharacters, prefix boundary (no trailing space), npx acceptance, and newline injection - Improve existing quoted-metacharacter test to actually exercise quote-stripping logic Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

jtzingsheim1 · 2026-03-07T12:15:34Z

Round 3 feedback addressed in 597d431:

Parse-error fallback now scrubs secrets — the fallback path in observe.sh (lines 134-153) was writing raw unsanitized input to the observations file. Now applies the same _SECRET_RE scrubber as the main path before persisting. (Identified by CodeRabbit)
Removed redundant re.IGNORECASE — the (?i) inline flag already applies case-insensitive matching to the entire pattern; the re.IGNORECASE argument was redundant.
Added inline comment documenting quote-stripping limitation — the "[^"]*" regex doesn't handle escaped quotes; documented as accepted trade-off given trusted internal callers + allowlist. (Requested by CodeRabbit)
Fixed misleading test name — "falls back to err.message when no stderr" renamed to "returns error output on failed command" since git does produce stderr for unknown subcommands.
Added 5 new security tests — single-quote metacharacter passthrough, mixed quoted+unquoted metacharacters, prefix boundary without trailing space (gitconfig blocked), npx prefix acceptance, and newline command injection.
Improved existing test — quoted-metacharacter test now uses "console.log(1);process.exit(0)" to actually include a semicolon inside quotes, exercising the quote-stripping logic.

All 155 tests pass (1 pre-existing Windows symlink EPERM, unrelated).

coderabbitai

Actionable comments posted: 1

♻️ Duplicate comments (1)

scripts/lib/utils.js (1)

348-357: ⚠️ Potential issue | 🔴 Critical

Quoted command substitution is still a shell-injection bypass.

Line 351 removes quoted substrings before scanning, but Line 357 still passes the original string to execSync(). node -e "$(whoami)" or git show "`cat secret`" will pass this check and still execute through the shell. At minimum, reject ` and $ anywhere in cmd; the safer fix is to stop shelling out here and use spawnSync/execFileSync with argv.

🔒 Minimal hardening patch

-  const unquoted = cmd.replace(/"[^"]*"/g, '').replace(/'[^']*'/g, '');
-  if (/[;|&`$\n]/.test(unquoted)) {
+  const unquoted = cmd.replace(/"[^"]*"/g, '').replace(/'[^']*'/g, '');
+  if (/[;|&\n]/.test(unquoted) || /[`$]/.test(cmd)) {
     return { success: false, output: 'runCommand blocked: shell metacharacters not allowed' };
   }

In Node.js `child_process.execSync`, are shell command substitutions like `$()` and backticks still evaluated when they appear inside double-quoted arguments passed as part of the command string?

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In `@scripts/lib/utils.js` around lines 348 - 357, The current check strips quoted
substrings into unquoted and then still passes the original cmd to execSync,
allowing command substitution via `$()` or backticks; update the runCommand
logic to stop using execSync with a shell string and instead invoke
child_process.spawnSync or execFileSync with an explicit argv array (pass the
binary as the command and arguments separately) so no shell interpolation
occurs; if you must keep a quick mitigation, expand the rejection check on
cmd/unquoted to include backtick (`) and dollar-sign-substitution ($) (i.e.,
reject if /[`$]/ appears) and ensure execSync is not called with shell: true —
prefer spawnSync/execFileSync with args to fully eliminate shell evaluation of
cmd.

🧹 Nitpick comments (1)

skills/continuous-learning-v2/hooks/observe.sh (1)
141-146: Consider extracting the duplicate regex definition.

_SECRET_RE is defined identically in both Python blocks (lines 141-146 and 188-193). While acceptable given the separate python3 -c invocations, extracting to a shared Python module would reduce maintenance risk if the pattern needs updating.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@skills/continuous-learning-v2/hooks/observe.sh` around lines 141 - 146, The
same regex _SECRET_RE is duplicated across two separate python3 -c invocations;
extract it into a shared Python module (e.g., a new small helper file) and
import or source that module in both places so both invocations reference the
single authoritative _SECRET_RE definition; update the two call sites that
currently define _SECRET_RE inline to instead import/use the shared symbol
(keeping the exact pattern and name _SECRET_RE) to avoid future divergence.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@scripts/lib/utils.js`:
- Around line 342-345: The current allowlist hardcodes 'npx ' and blocks other
package managers; update the logic that builds allowedPrefixes (used where
allowedPrefixes and cmd.startsWith(...) are checked) to include the repo's
configured package manager by reading CLAUDE_PACKAGE_MANAGER (or project config)
and adding its prefix (e.g., "npm ", "pnpm ", "yarn ", or "bun ") to the allowed
set, or alternatively expand the static list to include all supported managers;
ensure the resulting allowedPrefixes array includes the chosen manager before
the existing startsWith check so scripts/**/*.js hooks can run with the selected
package manager.

---

Duplicate comments:
In `@scripts/lib/utils.js`:
- Around line 348-357: The current check strips quoted substrings into unquoted
and then still passes the original cmd to execSync, allowing command
substitution via `$()` or backticks; update the runCommand logic to stop using
execSync with a shell string and instead invoke child_process.spawnSync or
execFileSync with an explicit argv array (pass the binary as the command and
arguments separately) so no shell interpolation occurs; if you must keep a quick
mitigation, expand the rejection check on cmd/unquoted to include backtick (`)
and dollar-sign-substitution ($) (i.e., reject if /[`$]/ appears) and ensure
execSync is not called with shell: true — prefer spawnSync/execFileSync with
args to fully eliminate shell evaluation of cmd.

---

Nitpick comments:
In `@skills/continuous-learning-v2/hooks/observe.sh`:
- Around line 141-146: The same regex _SECRET_RE is duplicated across two
separate python3 -c invocations; extract it into a shared Python module (e.g., a
new small helper file) and import or source that module in both places so both
invocations reference the single authoritative _SECRET_RE definition; update the
two call sites that currently define _SECRET_RE inline to instead import/use the
shared symbol (keeping the exact pattern and name _SECRET_RE) to avoid future
divergence.

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 629b889a-96ab-4cc4-8180-e406a27085a8

📥 Commits

Reviewing files that changed from the base of the PR and between a187e72 and 597d431.

📒 Files selected for processing (3)

scripts/lib/utils.js
skills/continuous-learning-v2/hooks/observe.sh
tests/lib/utils.test.js

🚧 Files skipped from review as they are similar to previous changes (1)

tests/lib/utils.test.js

coderabbitai · 2026-03-07T12:19:30Z

+  // Allowlist: only permit known-safe command prefixes
+  const allowedPrefixes = ['git ', 'node ', 'npx ', 'which ', 'where '];
+  if (!allowedPrefixes.some(prefix => cmd.startsWith(prefix))) {
+    return { success: false, output: 'runCommand blocked: unrecognized command prefix' };


⚠️ Potential issue | 🟠 Major

Allow the configured package manager instead of hardcoding only npx.

This guard now rejects npm, pnpm, yarn, and bun, so hooks that follow the repo’s package-manager selection will fail even though scripts/**/*.js is supposed to support them. Please derive the allowed package-manager prefix from CLAUDE_PACKAGE_MANAGER / project config, or include the supported set here.

As per coding guidelines, "scripts/**/*.js: Support multiple package managers: npm, pnpm, yarn, bun, with configurable selection via CLAUDE_PACKAGE_MANAGER env var or project config."

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@scripts/lib/utils.js` around lines 342 - 345, The current allowlist hardcodes 'npx ' and blocks other package managers; update the logic that builds allowedPrefixes (used where allowedPrefixes and cmd.startsWith(...) are checked) to include the repo's configured package manager by reading CLAUDE_PACKAGE_MANAGER (or project config) and adding its prefix (e.g., "npm ", "pnpm ", "yarn ", or "bun ") to the allowed set, or alternatively expand the static list to include all supported managers; ensure the resulting allowedPrefixes array includes the chosen manager before the existing startsWith check so scripts/**/*.js hooks can run with the selected package manager.

Shell evaluates $() and backticks inside double quotes, so checking only the unquoted portion was insufficient. Now $ and ` are rejected anywhere in the command string, while ; | & remain quote-aware. Addresses CodeRabbit and Cubic review feedback on PR affaan-m#348. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

jtzingsheim1 · 2026-03-07T12:35:28Z

Round 5 — Quoted command substitution bypass

Addressed: CodeRabbit (critical, duplicate) + Cubic (P1) — $() and backticks inside double quotes bypass the metacharacter check.

Root cause

Our previous fix (commit a187e72) added $ to the blocked characters, but only checked the unquoted portion of the command string. In shell semantics, $() and backticks are still evaluated inside double quotes (unlike ;|& which are literal). So node -e "$(whoami)" would pass the check but the shell would still execute the substitution.

Fix (commit `1cab6a7`)

Split the metacharacter check into two parts:

if (/[;|&\n]/.test(unquoted) || /[`$]/.test(cmd)) {

;|&\n → checked only in unquoted (safe inside double quotes)
$` → checked in full cmd (unsafe even inside double quotes)

This implements the exact patch suggested by CodeRabbit.

Tests added

runCommand blocks $() inside double quotes (shell still evaluates)
runCommand blocks backtick inside double quotes (shell still evaluates)

All 24 runCommand security tests pass.

Out of scope items (noted for future PRs)

Extract _SECRET_RE to shared module (CodeRabbit nitpick) — code organization, not security
Add other package managers to allowlist (CodeRabbit suggestion) — feature enhancement
Switch to spawnSync (CodeRabbit/Cubic recurring suggestion) — valid hardening but a larger refactor that changes the function's API contract; the current defense-in-depth (allowlist + comprehensive metacharacter blocking) is sufficient given runCommand is only called with trusted hardcoded strings, never user input

Notable upstream changes: - feat: v1.8.0 harness reliability and parity updates - feat: project-scoped instinct isolation - feat: automatic project type/framework detection (affaan-m#293) - feat: chief-of-staff communication triage agent (affaan-m#280) - feat: Cursor, Codex, OpenCode, Antigravity IDE harnesses - feat: auto-start dev servers in tmux (hooks refactor to run-with-flags) - feat: new skills — autonomous-loops, plankton-code-quality, liquid-glass-design, foundation-models-on-device, swift-concurrency-6-2, skill-stocktake, search-first, visa-doc-translate, swiftui-patterns, cpp-coding-standards, regex-vs-llm - fix: hook security hardening — secret scrubbing (affaan-m#348) - fix: UTF-8 for instinct CLI file I/O - fix: .prettierrc added for consistent formatting Conflict resolutions: - .github/workflows/{release,reusable-release}.yml: kept deleted (private fork) - agents/chief-of-staff.md: took upstream (gogcli → gog rename) - hooks/hooks.json: took upstream (run-with-flags.js refactor) - tests/integration/hooks.test.js: took upstream (updated to match new hooks)

Kyle-Peters-SkillSafe · 2026-03-16T05:23:29Z

Really solid defense-in-depth work here — the runCommand allowlist and the quote-stripping metacharacter check are exactly the kind of hardening that should be standard in any skill that executes shell commands.

The secret scrubbing in observation logs is particularly important. Snyk/Max Planck's SKILL-INJECT research found that 36% of skills in the wild contain prompt injection payloads, and a big part of the attack surface is skills that silently exfiltrate data through observation/logging channels. Having the scrubber run on the fallback path too (round 3 fix) closes what would've been a real bypass.

One thing this PR highlights: even well-intentioned skills need this level of scrutiny. The broader ecosystem doesn't have a great answer for "how do I know the skill I'm installing has done this work." SkillSafe has been tackling the distribution side of this — namespace verification, scan-before-share, and dual-side verification so both the publisher and consumer can independently confirm integrity. But the in-skill hardening you're doing here is the other half of the equation that no registry can replace.

Nice work on the test coverage too — 155 passing with the metacharacter edge cases is reassuring.

Merge upstream/main which includes: - fix: observer memory explosion with throttling and tail sampling (affaan-m#536) - fix: observer hooks hardening, secret scrubbing (affaan-m#348) - fix: lazy-start observer logic (affaan-m#508) - fix: read tool_response field in observe.sh (affaan-m#377) - feat: add C++, Java, Rust, PyTorch language support - feat: SQLite state store, skill evolution, session adapters - feat: orchestration harness and selective install - feat: DevFleet multi-agent orchestration skill Conflict resolution: - observe.sh: kept our hook_event_name + tool_response fixes, merged upstream's $PYTHON_CMD, secret scrubbing, lazy-start, throttled signaling, and session guards - package.json: accepted upstream's new scripts and dependencies

* fix(hooks): scrub secrets and harden hook security - Scrub common secret patterns (api_key, token, password, etc.) from observation logs before persisting to JSONL (observe.sh) - Auto-purge observation files older than 30 days (observe.sh) - Strip embedded credentials from git remote URLs before saving to projects.json (detect-project.sh) - Add command prefix allowlist to runCommand — only git, node, npx, which, where are permitted (utils.js) - Sanitize CLAUDE_SESSION_ID in temp file paths to prevent path traversal (suggest-compact.js) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix(hooks): address review feedback from CodeRabbit and Cubic - Reject shell command-chaining operators (;|&`) in runCommand, strip quoted sections before checking to avoid false positives (utils.js) - Remove command string from blocked error message to avoid leaking secrets (utils.js) - Fix Python regex quoting: switch outer shell string from double to single quotes so regex compiles correctly (observe.sh) - Add optional auth scheme match (Bearer, Basic) to secret scrubber regex (observe.sh) - Scope auto-purge to current project dir and match only archived files (observations-*.jsonl), not live queue (observe.sh) - Add second fallback after session ID sanitization to prevent empty string (suggest-compact.js) - Preserve backward compatibility when credential stripping changes project hash — detect and migrate legacy directories (detect-project.sh) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix(hooks): block $() substitution, fix Bearer redaction, add security tests - Add $ and \n to blocked shell metacharacters in runCommand to prevent command substitution via $(cmd) and newline injection (utils.js) - Make auth scheme group capturing so Bearer/Basic is preserved in redacted output instead of being silently dropped (observe.sh) - Add 10 unit tests covering runCommand allowlist blocking (rm, curl, bash prefixes) and metacharacter rejection (;|&`$ chaining), plus error message leak prevention (utils.test.js) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix(hooks): scrub parse-error fallback, strengthen security tests Address remaining reviewer feedback from CodeRabbit and Cubic: - Scrub secrets in observe.sh parse-error fallback path (was writing raw unsanitized input to observations file) - Remove redundant re.IGNORECASE flag ((?i) inline flag already set) - Add inline comment documenting quote-stripping limitation trade-off - Fix misleading test name for error-output test - Add 5 new security tests: single-quote passthrough, mixed quoted+unquoted metacharacters, prefix boundary (no trailing space), npx acceptance, and newline injection - Improve existing quoted-metacharacter test to actually exercise quote-stripping logic Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix(security): block $() and backtick inside quotes in runCommand Shell evaluates $() and backticks inside double quotes, so checking only the unquoted portion was insufficient. Now $ and ` are rejected anywhere in the command string, while ; | & remain quote-aware. Addresses CodeRabbit and Cubic review feedback on PR affaan-m#348. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>

cubic-dev-ai Bot reviewed Mar 7, 2026

View reviewed changes

Comment thread scripts/lib/utils.js

Comment thread scripts/hooks/suggest-compact.js Outdated

coderabbitai Bot reviewed Mar 7, 2026

View reviewed changes

Comment thread skills/continuous-learning-v2/hooks/observe.sh

Comment thread skills/continuous-learning-v2/hooks/observe.sh Outdated

Comment thread skills/continuous-learning-v2/scripts/detect-project.sh

cubic-dev-ai Bot reviewed Mar 7, 2026

View reviewed changes

Comment thread scripts/lib/utils.js Outdated

Comment thread skills/continuous-learning-v2/hooks/observe.sh Outdated

coderabbitai Bot reviewed Mar 7, 2026

View reviewed changes

Comment thread scripts/lib/utils.js

coderabbitai Bot reviewed Mar 7, 2026

View reviewed changes

Comment thread skills/continuous-learning-v2/hooks/observe.sh

Comment thread tests/lib/utils.test.js

Comment thread tests/lib/utils.test.js

coderabbitai Bot reviewed Mar 7, 2026

View reviewed changes

affaan-m merged commit 9661a6f into affaan-m:main Mar 7, 2026
3 checks passed

This was referenced Mar 7, 2026

Security hardening: secret scrubbing, credential stripping, and input validation in hooks #347

Closed

fix: harden hook telemetry handling for secrets and unsafe inputs #351

Closed

jtzingsheim1 deleted the fix/hook-security-hardening branch March 7, 2026 23:36

jtzingsheim1 mentioned this pull request Mar 7, 2026

bug: observe.sh hardcodes python3 which doesn't exist on Windows #349

Closed

coderabbitai Bot mentioned this pull request Mar 24, 2026

fix: sanitize session ID to prevent silent data loss #857

Closed

7 tasks

coderabbitai Bot mentioned this pull request Mar 26, 2026

fix(#843): session-manager hooks — ecc-sessions dir + hookSpecificOutput format #948

Closed

4 tasks

This was referenced Apr 14, 2026

fix(hooks): fix bug that gateguard can't get session id and block tool call repeatedly #1411

Open

fix(hooks): isolate session-end.js filename using transcript_path UUID (#1494) #1495

Merged

Uh oh!

Conversation

jtzingsheim1 commented Mar 7, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Summary by cubic

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented Mar 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reviews paused

Walkthrough

Changes

Estimated code review effort

Possibly related issues

Possibly related PRs

Suggested reviewers

Poem

❌ Failed checks (1 warning)

Uh oh!

cubic-dev-ai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jtzingsheim1 commented Mar 7, 2026

Uh oh!

cubic-dev-ai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

jtzingsheim1 commented Mar 7, 2026

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jtzingsheim1 commented Mar 7, 2026

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot Mar 7, 2026

Choose a reason for hiding this comment

Uh oh!

jtzingsheim1 commented Mar 7, 2026

Round 5 — Quoted command substitution bypass

Root cause

Fix (commit 1cab6a7)

Tests added

Out of scope items (noted for future PRs)

Uh oh!

Uh oh!

Kyle-Peters-SkillSafe commented Mar 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

jtzingsheim1 commented Mar 7, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented Mar 7, 2026 •

edited

Loading

Fix (commit `1cab6a7`)