fix: restore issue-2402 v2 gate resume and action alias consistency by serrrfirat · Pull Request #2458 · nearai/ironclaw

serrrfirat · 2026-04-14T12:21:01Z

Summary

add a staging regression for auth/external gate resumes that proved v2 could lose ActionResult call pairing when pending.call_id was empty or stale, then fix the resume branches to resolve or synthesize the correct correlator before resuming the thread
add a staging regression for hyphen/underscore action aliases in the real structured executor path, then make granted-action matching consistent so lease preflight, policy, and consumption agree on aliased action names
validate the fixes with targeted staging regressions, adjacent router/capability tests, and cargo check -p ironclaw_engine -p ironclaw

Validation

cargo test -p ironclaw_engine alias_normalization_stays_consistent_between_preflight_and_consume
cargo test -p ironclaw covers_action
cargo test -p ironclaw resolve_gate_repairs_call_id_for_resume_output_auth_resume
cargo test -p ironclaw resolved_call_id_
cargo test -p ironclaw handle_with_engine_reemits_approval_status_for_pending_gate
cargo check -p ironclaw_engine -p ironclaw

Keep lease preflight, policy, and consumption consistent for hyphen/underscore action aliases so installed tools do not fail mid-turn after being allowed. Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

Resolve or synthesize the original action call id when resuming auth or external callback gates so resumed ActionResult messages remain correctly paired with the waiting assistant call. Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

gemini-code-assist

Code Review

This pull request introduces normalization for action name aliases, allowing both hyphens and underscores to be used interchangeably when checking granted actions. It also refactors the resolution of call IDs for pending gates into a shared helper function, ensuring that historical call IDs are correctly recovered or synthesized when resuming threads. This fix prevents correlation issues for the LLM when processing action results after a gate resolution. New tests were added to verify alias normalization and call ID repair logic. I have no feedback to provide.

Add a higher-fidelity v2 gate integration regression that proves an install auth resume can flow directly into an aliased follow-up tool call and still complete the thread instead of stalling. Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

Resolve structured preflight action definitions with the same hyphen/underscore alias semantics as lease matching so aliased calls cannot bypass approval or deny policies. Ultraworked with [Sisyphus](https://github.com/code-yeongyu/oh-my-openagent) Co-authored-by: Sisyphus <clio-agent@sisyphuslabs.ai>

henrypark133

Review: Gate resume + action alias consistency (Risk: Medium)

Clean fix that correctly addresses the gate resume call_id corruption and action name alias inconsistency.

Positives:

action_name_matches() provides clean bidirectional hyphen/underscore normalization
GrantedActions::covers() updated with same normalization — transitively fixes find_and_consume
resolved_or_synthetic_call_id_for_pending_action fixes both resume_output paths in resolve_gate
Comprehensive test suite: 4 new tests including full install→auth→aliased-follow-up integration test
aliased_action_name_still_triggers_policy_approval verifies policy enforcement isn't bypassed

Convention notes:

The new helper loads the thread from store for call_id resolution — minor extra DB read but necessary since the resume_output paths don't have the thread pre-loaded
Test structure follows the "test through the caller" pattern well

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

standardtoaster

One question on resolved_call_id_for_pending_action (router.rs:189-197): the resolved_ids set is built from thread.messages, but in production the orchestrator writes ActionResult messages to thread.internal_messages via sync_runtime_state. The test uses thread.add_message() which goes to messages, so it passes, but the production resolved_ids set may always be empty.

This probably doesn't matter in practice since new gates always have call_id set (line 185 short-circuits), but if the legacy fallback path is meant to work, it should scan internal_messages instead of messages. Is this a known limitation?

@standardtoaster

The `resolved_call_id_for_pending_action` legacy fallback scanned only `thread.messages`, but in production the orchestrator writes ActionResult and assistant messages to `thread.internal_messages` via `sync_runtime_state`. This meant the `resolved_ids` set was always empty and the fallback never found a match, silently falling through to a synthetic id. Scan both `messages` and `internal_messages` so the legacy path works correctly for orchestrator-driven threads. Addresses review feedback from @standardtoaster on #2458. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

serrrfirat

Good catch @standardtoaster — you're right. resolved_call_id_for_pending_action was only scanning thread.messages, but in production the orchestrator writes ActionResult (and assistant-with-actions) messages to thread.internal_messages via sync_runtime_state. The resolved_ids set was always empty and the fallback never matched, silently falling through to a synthetic id.

Fixed in 893e319: the function now chains both messages and internal_messages iterators. Added a regression test (resolved_call_id_legacy_fallback_scans_internal_messages) that puts messages in internal_messages only — this test would have failed before the fix.

henrypark133

Review: Gate resume + action alias consistency

No verified findings in the current diff. I rechecked the prior internal_messages concern and the fix now scans both visible and internal transcripts before synthesizing a fallback call id. The added engine-v2 coverage also exercises the auth-resume plus aliased follow-up path that previously risked hanging.

standardtoaster

LGTM. I'll move persist_v2_tool_calls into the Completed match arm on #2452 so it doesn't fire during GatePaused.

serrrfirat and others added 2 commits April 14, 2026 15:20

github-actions Bot added size: L 200-499 changed lines risk: low Changes to docs, tests, or low-risk modules contributor: core 20+ merged PRs labels Apr 14, 2026

gemini-code-assist Bot reviewed Apr 14, 2026

View reviewed changes

github-actions Bot added size: XL 500+ changed lines and removed size: L 200-499 changed lines labels Apr 14, 2026

serrrfirat linked an issue Apr 14, 2026 that may be closed by this pull request

[QA] Bot enters infinite "Calling LLM" loop after tool operations #2402

Closed

3 tasks

henrypark133 previously approved these changes Apr 14, 2026

View reviewed changes

style: cargo fmt

4114833

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

zmanian dismissed henrypark133’s stale review via 4114833 April 14, 2026 18:29

github-actions Bot added scope: agent Agent core (agent loop, router, scheduler) risk: medium Business logic, config, or moderate-risk modules and removed risk: low Changes to docs, tests, or low-risk modules labels Apr 14, 2026

standardtoaster reviewed Apr 14, 2026

View reviewed changes

serrrfirat commented Apr 15, 2026

View reviewed changes

github-actions Bot added size: L 200-499 changed lines and removed size: XL 500+ changed lines labels Apr 15, 2026

serrrfirat requested a review from standardtoaster April 15, 2026 11:15

style: format bridge router

2882221

henrypark133 approved these changes Apr 15, 2026

View reviewed changes

standardtoaster approved these changes Apr 15, 2026

View reviewed changes

serrrfirat merged commit 853b6a5 into staging Apr 16, 2026
15 checks passed

serrrfirat deleted the sisyphus/issue-2402-staging branch April 16, 2026 12:11

github-actions Bot mentioned this pull request Apr 16, 2026

chore: promote staging to staging-promote/ecd37e10-24483216739 (2026-04-16 12:18 UTC) #2525

Merged

henrypark133 mentioned this pull request Apr 21, 2026

chore: release #2606

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: restore issue-2402 v2 gate resume and action alias consistency#2458

fix: restore issue-2402 v2 gate resume and action alias consistency#2458
serrrfirat merged 7 commits intostagingfrom
sisyphus/issue-2402-staging

serrrfirat commented Apr 14, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

henrypark133 left a comment

Uh oh!

standardtoaster left a comment

Uh oh!

serrrfirat left a comment

Uh oh!

henrypark133 left a comment

Uh oh!

standardtoaster left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

serrrfirat commented Apr 14, 2026

Summary

Validation

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

henrypark133 left a comment

Choose a reason for hiding this comment

Review: Gate resume + action alias consistency (Risk: Medium)

Uh oh!

standardtoaster left a comment

Choose a reason for hiding this comment

Uh oh!

serrrfirat left a comment

Choose a reason for hiding this comment

Uh oh!

henrypark133 left a comment

Choose a reason for hiding this comment

Review: Gate resume + action alias consistency

Uh oh!

standardtoaster left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants