fix(ci): improve Claude Code review reliability by henrypark133 · Pull Request #955 · nearai/ironclaw

henrypark133 · 2026-03-11T18:47:04Z

Summary

Add missing tools to --allowedTools: Read, Glob, Grep, Agent — the review prompt requires these but they weren't permitted, causing 8-9 permission denials per run and ~40% of reviews exiting without posting a comment
Simplify prompt: merge per-issue scoring agents (old step 4) into the review agents themselves, cutting agent spawns from 6+N to 6 and staying well within the 50-turn budget
Add guardrails: only the main process may post PR comments, and it must post exactly one comment before finishing (prevents fragmented output and ensures the staging gate always gets a comment to evaluate)

Evidence of the problem

PR	Claude comments	Permission denials	Outcome
#925	0	9	Gate FAILED
#950	0	8	Gate blocked
#912	0	unknown	No comment posted
#830	4 (fragmented)	9	Gate confused

Test plan

Merge to staging and wait for next staging-ci run to trigger a claude-review
Verify the Claude Code Review job has 0 (or near-0) permission denials in its result JSON
Verify the promotion PR receives exactly 1 Claude comment (not 0, not 4)
Confirm the staging gate passes without needing re-runs

🤖 Generated with Claude Code

The Claude review step was failing ~40% of the time because: - --allowedTools didn't include Read, Glob, Grep, Agent, causing 8-9 permission denials per run and preventing Claude from reading files or spawning the subagents the prompt required - Step 4 spawned N additional scoring agents per issue found, exhausting the 50-turn budget before the PR comment could be posted - Subagents could independently post PR comments, causing fragmented output Fix: add missing tools to --allowedTools, merge per-issue scoring into the review agents themselves, and add guardrails ensuring exactly one consolidated comment is always posted. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

gemini-code-assist · 2026-03-11T18:47:10Z

Note

Gemini is unable to generate a summary for this pull request due to the file types involved not being currently supported.

Copilot

Pull request overview

Updates the Claude Code review GitHub Action workflow to reduce tool permission denials, simplify the review prompt/agent flow, and add guardrails to ensure consistent PR comment output for staging promotion gates.

Changes:

Expand --allowedTools to include Read, Glob, Grep, and Agent.
Simplify the prompt by embedding severity/confidence scoring into the 4 review agents instead of spawning per-issue scoring agents.
Add “single commenter / single comment” guardrails to reduce fragmented or missing output.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

The Claude review step was failing ~40% of the time because: - --allowedTools didn't include Read, Glob, Grep, Agent, causing 8-9 permission denials per run and preventing Claude from reading files or spawning the subagents the prompt required - Step 4 spawned N additional scoring agents per issue found, exhausting the 50-turn budget before the PR comment could be posted - Subagents could independently post PR comments, causing fragmented output Fix: add missing tools to --allowedTools, merge per-issue scoring into the review agents themselves, and add guardrails ensuring exactly one consolidated comment is always posted. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>

Copilot AI review requested due to automatic review settings March 11, 2026 18:47

github-actions bot added scope: ci CI/CD workflows size: M 50-199 changed lines risk: medium Business logic, config, or moderate-risk modules contributor: core 20+ merged PRs labels Mar 11, 2026

Copilot started reviewing on behalf of henrypark133 March 11, 2026 18:47 View session

Copilot AI reviewed Mar 11, 2026

View reviewed changes

Comment thread .github/workflows/claude-review.yml

nickpismenkov approved these changes Mar 11, 2026

View reviewed changes

henrypark133 merged commit d313f44 into staging Mar 11, 2026
12 of 13 checks passed

henrypark133 deleted the fix/claude-review-reliability branch March 11, 2026 21:05

github-actions bot mentioned this pull request Mar 11, 2026

chore: release v0.19.0 #973

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(ci): improve Claude Code review reliability#955

fix(ci): improve Claude Code review reliability#955
henrypark133 merged 1 commit intostagingfrom
fix/claude-review-reliability

henrypark133 commented Mar 11, 2026

Uh oh!

gemini-code-assist bot commented Mar 11, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

henrypark133 commented Mar 11, 2026

Summary

Evidence of the problem

Test plan

Uh oh!

gemini-code-assist bot commented Mar 11, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants