Replace PR agent with pr-review skill and decouple comment posting by kubaflo · Pull Request #34460 · dotnet/maui

kubaflo · 2026-03-12T18:14:17Z

Note

Are you waiting for the changes in this PR to be merged?
It would be very helpful if you could test the resulting artifacts from this PR and let us know in a comment if this change resolves your issue. Thank you!

🏗️ Overview

This PR replaces the monolithic PR agent with a modular, skill-based PR review orchestrator. The legacy 2-file agent is decomposed into 4 dedicated phase skills, each with clear boundaries, inputs, and outputs.

TL;DR — One big agent file → four focused skills. Cleaner, composable, easier to maintain.

🔀 Before → After

┌─────────────────────────────────┐       ┌─────────────────────────────────────────────┐
│          BEFORE (Agent)         │       │              AFTER (Skills)                  │
├─────────────────────────────────┤       ├─────────────────────────────────────────────┤
│                                 │       │                                             │
│  agents/pr.md (310 lines)      │       │  skills/pr-review/SKILL.md  (orchestrator)  │
│         +                      │  ──►  │     ├─ skills/pr-preflight/SKILL.md         │
│  agents/pr/post-gate.md (331)  │       │     ├─ skills/pr-gate/SKILL.md              │
│  agents/pr/SHARED-RULES.md     │       │     ├─ try-fix  (existing skill)            │
│  agents/pr/PLAN-TEMPLATE.md    │       │     └─ skills/pr-report/SKILL.md            │
│                                 │       │                                             │
│  5 comment scripts              │       │  2-section unified comment                  │
│                                 │       │                                             │
└─────────────────────────────────┘       └─────────────────────────────────────────────┘

✨ Key Changes

1️⃣ Agent → Skill Architecture

Phase	Skill	Responsibility
0	`pr-review`	🎯 Master orchestrator — sets up branch, invokes phases
1	`pr-preflight`	📋 Context gathering — reads issue, PR, comments, classifies files
2	`pr-gate`	🚦 Test verification — tests must FAIL without fix, PASS with fix
3	`try-fix` (existing)	🔧 Multi-model fix exploration (Sonnet → Opus)
4	`pr-report`	📊 Final recommendation — APPROVE or REQUEST CHANGES

2️⃣ Decoupled Comment Posting

pr-review produces output files only (CustomAgentLogsTmp/PRState/). Comments are posted separately via ai-summary-comment — no tight coupling.

3️⃣ Unified Comment Model

Consolidated from 5 separate comment scripts → 2 sections in a single unified comment:

❌ Removed Script	✅ Now Part Of
`post-try-fix-comment.ps1`	PR-REVIEW section
`post-verify-tests-comment.ps1`	PR-REVIEW section
`post-write-tests-comment.ps1`	PR-REVIEW section

post-pr-finalize-comment.ps1 now injects into the unified comment by default (use -Standalone for legacy behavior).

4️⃣ Simplified CI & Orchestration

Review-PR.ps1 went from complex 3-phase script → clean 4-step orchestrator:

Step 1 → pr-review skill     (all 4 phases)
Step 2 → pr-finalize skill   (verify title/description)
Step 3 → ai-summary-comment  (post unified comment)
Step 4 → Update-AgentLabels  (apply labels)

ci-copilot.yml — removed explicit -RunFinalize -PostSummaryComment flags (all phases run automatically).

📁 Files Changed

✅ Added (4 files)

File	Purpose
`.github/skills/pr-review/SKILL.md`	Master orchestrator (Phase 0–4)
`.github/skills/pr-preflight/SKILL.md`	Phase 1: Context gathering
`.github/skills/pr-gate/SKILL.md`	Phase 2: Test verification gate
`.github/skills/pr-report/SKILL.md`	Phase 4: Final recommendation

❌ Deleted (7 files)

File	Reason
`.github/agents/pr.md`	Replaced by `pr-review` skill
`.github/agents/pr/PLAN-TEMPLATE.md`	Skills use direct output files
`.github/agents/pr/SHARED-RULES.md`	Rules distributed to phase skills
`.github/agents/pr/post-gate.md`	Replaced by `pr-gate` + `pr-report`
`post-try-fix-comment.ps1`	Merged into unified PR-REVIEW comment
`post-verify-tests-comment.ps1`	Merged into unified PR-REVIEW comment
`post-write-tests-comment.ps1`	Not part of main review workflow

🔧 Modified (14 files)

File	Change
`.github/scripts/Review-PR.ps1`	Major refactor → 4-step skill orchestration
`.github/skills/ai-summary-comment/SKILL.md`	Simplified to 2-section model
`.github/skills/ai-summary-comment/.../post-ai-summary-comment.ps1`	Updated for unified comment
`.github/skills/ai-summary-comment/.../post-pr-finalize-comment.ps1`	Injects into unified comment
`.github/copilot-instructions.md`	Replaced agent entry with skill entries
`.github/README-AI.md`	Updated architecture docs
`.github/docs/agent-labels.md`	Terminology: "Gate" → "Validate"
`.github/scripts/shared/Update-AgentLabels.ps1`	Updated for phase structure
`.github/scripts/shared/Start-Emulator.ps1`	Minor utility updates
`.github/skills/pr-finalize/SKILL.md`	Minor context update
`.github/skills/verify-tests-fail-without-fix/SKILL.md`	Minor context update
`.github/skills/ai-summary-comment/NO-EXTERNAL-REFERENCES-RULE.md`	Updated for 2-section model
`.github/instructions/sandbox.instructions.md`	Updated references
`eng/pipelines/ci-copilot.yml`	Simplified invocation

📊 Impact

 25 files changed
 843 insertions(+)
 3,271 deletions(-)

Net reduction of ~2,400 lines — less code, better structure. 🎉

github-actions · 2026-03-12T18:14:29Z

🚀 Dogfood this PR with:

⚠️ WARNING: Do not do this without first carefully reviewing the code of this PR to satisfy yourself it is safe.

curl -fsSL https://raw.githubusercontent.com/dotnet/maui/main/eng/scripts/get-maui-pr.sh | bash -s -- 34460

Or

Run remotely in PowerShell:

iex "& { $(irm https://raw.githubusercontent.com/dotnet/maui/main/eng/scripts/get-maui-pr.ps1) } 34460"

Copilot

Pull request overview

This PR migrates the Copilot-driven PR review automation from the legacy “pr agent” docs/scripts to a skill-based workflow centered on a new pr-review orchestrator (plus phase skills), and updates CI + commenting/labeling plumbing accordingly.

Changes:

Introduces new PR review skills (pr-review, pr-preflight, pr-gate, pr-report) and removes legacy PR agent instruction files.
Refactors Review-PR.ps1 to run a 4-step skill sequence (review → finalize → post summary → apply labels) and updates CI invocation.
Simplifies AI summary comment tooling by removing try-fix / write-tests / verify-tests comment scripts, and updates related docs.

Reviewed changes

Copilot reviewed 25 out of 25 changed files in this pull request and generated 7 comments.

Show a summary per file

File	Description
eng/pipelines/ci-copilot.yml	Updates CI to invoke `Review-PR.ps1` without legacy flags; publishes artifacts.
.github/skills/verify-tests-fail-without-fix/SKILL.md	Documentation wording adjustment for where verification output is consumed.
.github/skills/pr-review/SKILL.md	New orchestrator skill describing the 4-phase PR review workflow.
.github/skills/pr-report/SKILL.md	New “report/final recommendation” phase skill documentation.
.github/skills/pr-preflight/SKILL.md	New “context gathering” phase skill documentation.
.github/skills/pr-gate/SKILL.md	New “test verification” phase skill documentation and output format.
.github/skills/pr-finalize/SKILL.md	Updates wording to reference `pr-review` skill rather than the old agent.
.github/skills/ai-summary-comment/SKILL.md	Updates AI summary comment skill docs to match the new workflow structure.
.github/skills/ai-summary-comment/scripts/post-write-tests-comment.ps1	Deletes legacy unified-comment section updater for write-tests attempts.
.github/skills/ai-summary-comment/scripts/post-verify-tests-comment.ps1	Deletes legacy unified-comment section updater for verification results.
.github/skills/ai-summary-comment/scripts/post-try-fix-comment.ps1	Deletes legacy unified-comment section updater for try-fix attempts.
.github/skills/ai-summary-comment/scripts/post-pr-finalize-comment.ps1	Changes default behavior to inject into unified AI Summary comment; renames flag to `-Standalone`.
.github/skills/ai-summary-comment/scripts/post-ai-summary-comment.ps1	Adjusts how review sessions are wrapped and displayed (commit info moved to top-level summary).
.github/skills/ai-summary-comment/NO-EXTERNAL-REFERENCES-RULE.md	Terminology updates (“PR agent” → “agent”).
.github/scripts/shared/Update-AgentLabels.ps1	Terminology update in header; label parsing still reads `gate/content.md`.
.github/scripts/shared/Start-Emulator.ps1	Adds `-Headless` flag and uses headless mode by default in CI contexts.
.github/scripts/Review-PR.ps1	Major refactor to a 4-step skill-based orchestration flow.
.github/README-AI.md	Updates repository AI docs to point to the new skill-based workflow.
.github/instructions/sandbox.instructions.md	Updates guidance to reference `pr-review` skill instead of the old agent.
.github/docs/agent-labels.md	Updates documentation terminology and architecture description for the new workflow.
.github/copilot-instructions.md	Updates “available agents/skills” documentation to include `pr-review` and phase skills.
.github/agents/pr/SHARED-RULES.md	Removes legacy agent shared rules doc.
.github/agents/pr/post-gate.md	Removes legacy agent Phase 3/4 doc.
.github/agents/pr/PLAN-TEMPLATE.md	Removes legacy plan template for the old agent.
.github/agents/pr.md	Removes legacy PR agent definition.

Comments suppressed due to low confidence (1)

.github/skills/ai-summary-comment/SKILL.md:282

The expected directory structure uses PRAgent/validate/, but the actual posting/labeling scripts auto-load from PRAgent/gate/ (e.g., post-ai-summary-comment.ps1 and Update-AgentLabels.ps1). Please align this doc with the implemented folder names, otherwise agents will write files to the wrong location and the summary/labels won’t pick them up.

The agent writes phase output files that comment scripts auto-load:

CustomAgentLogsTmp/PRState/{PRNumber}/
├── PRAgent/
│ ├── pre-flight/
│ │ └── content.md # Phase 1 output (auto-loaded by post-ai-summary-comment.ps1)
│ ├── validate/
│ │ ├── content.md # Phase 2 output (auto-loaded by post-ai-summary-comment.ps1)
│ │ └── verify-tests-fail/ # Script output from verify-tests-fail.ps1
│ │ ├── verification-report.md
│ │ ├── verification-log.txt
│ │ ├── test-without-fix.log
│ │ └── test-with-fix.log
│ ├── try-fix/

</details>

eng/pipelines/ci-copilot.yml

.github/scripts/Review-PR.ps1

.github/pr-review/pr-gate.md

.github/skills/pr-review/SKILL.md

.github/docs/agent-labels.md

PureWeen · 2026-03-16T19:55:48Z

Code Review: Skills Architecture Refactoring

Overall: Good refactoring. The shift from agent-based to skill-based orchestration cuts ~3,800 lines, cleanly separates "what to do" (skill/phase files) from "how to post it" (scripts), and moves deterministic work out of LLM control. The 4-phase workflow is preserved intact.

🔴 Should fix before merge

1. git checkout - on error paths (Review-PR.ps1 lines 201/208/235)
In non-CI local mode, git checkout - toggles to main instead of the user's original branch after failures. CI mode is unaffected (auto-sets -UseCurrentBranch). Fix: capture original branch name before any checkout and use it explicitly on error paths.

�� Nice to fix

2. Fork fetch git errors swallowed (Review-PR.ps1 line 205)
git fetch ... 2>&1 | Out-Null suppresses the actual git error message. There IS a Write-Error with the fork URL, so it's not silent — but the reason for failure (auth, network, bad ref) is lost. Suggest capturing stderr.

3. git pull --ff-only non-fatal (Review-PR.ps1 lines 171-174)
If pull fails, script continues with only a yellow warning and could review stale code. Consider making it fatal or at least logging what commit is being used as the base.

4. Orphaned legacy parameters (post-pr-finalize-comment.ps1 lines 49-59)
10 legacy parameters are accepted but explicitly ignored. Should be removed to avoid confusion.

📋 Separate PR

5. provision.yml Xcode fallback (eng/pipelines/common/provision.yml)
Good change, but purely infrastructure — unrelated to the skills refactoring. Would be cleaner as its own PR.

✅ Already fixed

~~Model count ×2/×4 inconsistency in Quick Reference table~~ (pushed fix in 5ba6389)

✅ Things done well

Autonomous execution rules prevent "waiting for human" hangs
EstablishBrokenBaseline.ps1 auto-restore fixes the 3.7hr CI waste
Phase files in .github/pr-review/ are easy to read and update independently
post-ai-summary-comment.ps1 has robust downstream validation of phase outputs
No stale references to deleted agents/pr/ or old scripts

kubaflo · 2026-03-16T21:14:39Z

Thanks for the review @PureWeen ! All actionable items addressed in 7755046:

1. ✅ git checkout - on error paths — Captured original branch before any checkout, use it explicitly on all 3 error paths instead of git checkout -.

2. ✅ Fork fetch errors swallowed — git fetch output is now captured and included in the Write-Error message so the failure reason (auth, network, bad ref) is visible.

3. ✅ git pull --ff-only non-fatal — Now logs the pull failure output and prints the base commit SHA (Review base: main @ <sha>) so it's clear what code is being reviewed.

4. ✅ Orphaned legacy parameters — Removed all 11 unused legacy parameters from post-pr-finalize-comment.ps1.

Refactor the monolithic PR agent into a pr-review orchestrator and distinct phase skills. Removed legacy agent files (agents/pr.md, PLAN-TEMPLATE.md, SHARED-RULES.md, post-gate.md) and added/registered modular skills (pr-review, pr-preflight, pr-gate, pr-report) with updated SKILL.md files. Updated docs (README-AI.md, copilot-instructions.md) and scripts to reference the new pr-review workflow, adjusted related PowerShell scripts and AI-summary tooling, and updated CI pipeline entries. This consolidates the workflow into an orchestrator that invokes separate preflight/gate/try-fix/report skills for clearer responsibilities and reuse. Replace pr agent with pr-review skill (split phases) Refactor the monolithic PR agent into a pr-review orchestrator and distinct phase skills. Removed legacy agent files (agents/pr.md, PLAN-TEMPLATE.md, SHARED-RULES.md, post-gate.md) and added/registered modular skills (pr-review, pr-preflight, pr-gate, pr-report) with updated SKILL.md files. Updated docs (README-AI.md, copilot-instructions.md) and scripts to reference the new pr-review workflow, adjusted related PowerShell scripts and AI-summary tooling, and updated CI pipeline entries. This consolidates the workflow into an orchestrator that invokes separate preflight/gate/try-fix/report skills for clearer responsibilities and reuse. Co-Authored-By: Copilot <223556219+Copilot@users.noreply.github.com>

Introduce a new Step 0 (branch setup) into .github/scripts/Review-PR.ps1 and update the PR review skill doc. The script now documents a 5-step workflow, adds a -UseCurrentBranch switch and DryRun handling, and by default checks out main to create a pr-review-{PRNumber} branch. It fetches the PR (origin or fork), cherry-picks PR commits squashed into a single review commit, handles conflicts/cleanup, and prints helpful status messages. Also removed the inline Phase 0 instructions from .github/skills/pr-review/SKILL.md and clarified the rule to never switch branches (stay on the review branch set up by the caller). Examples and description text were updated accordingly.

Reorganize PR review documentation by moving pr-preflight, pr-gate, and pr-report from .github/skills/ to .github/pr-review/ and converting them into phase instruction files. Update .github/copilot-instructions.md and .github/skills/pr-review/SKILL.md to reference the new .github/pr-review/* paths and adjust wording to treat those documents as phase instructions (while leaving try-fix as the phase skill). Remove redundant frontmatter from the moved files and make small clarifications (branch setup note, no-comments reminder, and quick-reference table updates). This clarifies phase responsibilities and centralizes PR review guidance.

If the script is called while a previous baseline is still active (state file exists), auto-restore the previous baseline first before establishing the new one. This prevents the dirty-tree error that caused agents to loop endlessly trying to re-establish baseline. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Refactor review/commenting flow: move/rename the ai-summary comment script into .github/scripts, add a new post-pr-finalize-comment.ps1 script to inject pr-finalize summaries into the unified AI Summary comment (with auto-discovery, dry-run preview and update/create behavior), and update Review-PR.ps1 to invoke these scripts directly for Step 3. Enhance agent runtime logging in Review-PR.ps1: richer JSON-stream parsing with model display, turn/tool counting, tool icons, intent reporting, content previews, failure tracking, and a final stats summary. Remove legacy ai-summary-comment skill docs and duplicate script artifacts under .github/skills/ai-summary-comment (IMPROVEMENTS.md, NO-EXTERNAL-REFERENCES-RULE.md, SKILL.md and an old script were deleted).

Update PR review skill to mandate using mode: "sync" for all try-fix task invocations and forbid mode: "background" (background mode causes the orchestrator to proceed before attempts finish and try-fix results are lost). Require updating try-fix/content.md after every attempt (immediately after completion) and add corresponding checklist entries and common-mistakes notes to ensure try-fix outputs are persisted and visible in PR comments. (Changes to .github/skills/pr-review/SKILL.md)

Replace the previous cherry-pick flow with a git merge --squash workflow when preparing a review branch. Adds conflict handling: aborts the merge on conflicts, resets state, posts a comment on the PR notifying about merge conflicts, and exits the script. Also updates dry-run messaging, handles detached HEAD display for current branch, and cleans up temporary branches after the squash-merge. Improves robustness and provides clearer notification when a PR cannot be squashed for review. Co-Authored-By: Copilot <223556219+Copilot@users.noreply.github.com>

Phase 3 try-fix was updated to use four AI models (claude-opus-4.6-1m, claude-sonnet-4.6, gpt-5.3-codex, gemini-3-pro-preview) instead of two. Adjusted multi-model configuration, model order, and all checklist items to reflect 4 sequential attempts and per-attempt content updates. Clarified cross-pollination wording (now requires querying all models), preserved the mandatory mode: "sync" requirement, and updated exhaustion guidance to reference 4 models. Co-Authored-By: Copilot <223556219+Copilot@users.noreply.github.com>

The body consistently references 4 models but the Quick Reference table said ×2. Updated to ×4 to match. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Capture and restore the original branch instead of relying on `git checkout -`, and surface git pull/fetch outputs in warnings to aid debugging. Review-PR.ps1: record the original branch (or HEAD), use it on error and cleanup, include pull/fetch stdout in warnings, and print the review base SHA. post-pr-finalize-comment.ps1: simplify the parameter list by keeping only PreviewFile and removing many legacy/ignored parameters. These changes make branch restoration more reliable and improve diagnostic information while cleaning up the post-PR script API.

PureWeen · 2026-03-17T15:46:17Z

Re-Review (post-rebase)

All previously flagged issues have been fixed ✅ — git checkout - replaced with $originalBranch, fork fetch errors captured, legacy params removed, model count consistent at ×4.

New changes since last review

pr-build-status skill → azdo-build-investigator: Self-contained PowerShell scripts replaced by a thin MAUI-context layer that delegates to the ci-analysis skill from dotnet/arcade-skills plugin. Reasonable architecture — CI investigation logic is shared across dotnet repos rather than duplicated locally.

.github/copilot/settings.json: Registers the dotnet/arcade-skills plugin. This is the dependency powering the new skill. Looks fine.

🟡 One remaining issue

Stale references to deleted pr-build-status in 3 places:

.github/README-AI.md (~line 253 and ~368)
.github/copilot-instructions.md (~line 320)

These still list pr-build-status as a skill. Should be updated to reference azdo-build-investigator and document the new skill + its ci-analysis dependency.

Otherwise looks good to merge.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

kubaflo · 2026-03-17T15:50:20Z

Stale pr-build-status references updated to azdo-build-investigator in 4aaa18a — all 3 spots in README-AI.md and copilot-instructions.md now reflect the new skill name and its ci-analysis dependency.

PureWeen · 2026-03-17T17:05:36Z

Multi-Model Code Review (5 LLMs × consensus filter)

Consensus Findings (2+ models agree)

🟡 MODERATE -- Review-PR.ps1 ~line 227-229 -- Incomplete cleanup on git commit failure (5/5 models)

When git merge --squash succeeds but git commit fails, the error path only deletes $tempBranch then exits. Missing:

git reset --hard HEAD -- staged index left dirty
git checkout $originalBranch -- script exits with HEAD on pr-review-$PRNumber
git branch -D $reviewBranch -- orphan branch left behind

The conflict path (lines 237-243) handles all three correctly. Suggested fix -- mirror it:

if ($LASTEXITCODE -ne 0) {
    git branch -D $tempBranch 2>$null
    git reset --hard HEAD 2>$null
    git checkout $originalBranch 2>$null
    git branch -D $reviewBranch 2>$null
    Write-Error "Failed to create squashed commit"; exit 1
}

🟡 MODERATE -- provision.yml ~line 146-158 -- Silent Xcode fallback affects all CI pipelines (4/5 models)

Old behavior: exit 1 with clear message when requested Xcode missing.
New behavior: Warning + silently selects latest installed Xcode.

This affects all pipelines consuming provision.yml (ci.yml, ci-official.yml, device-tests.yml, handlers.yml -- 11 consumers), not just ci-copilot.yml. Consider scoping via opt-in env var or adding ##vso[task.logissue type=warning] for AzDO visibility.

Review generated by 5-model consensus (claude-opus-4.6 ×2, claude-sonnet-4.6, gemini-3-pro-preview, gpt-5.3-codex). Only issues flagged by 2+ models are included.

PureWeen · 2026-03-17T17:07:34Z

Multi-Model Re-Review (Gemini 3 Pro, GPT 5.1, Sonnet 4.5)

Previous fixes all confirmed intact ✅ (originalBranch, fork errors, legacy params, model count)

Stale pr-build-status references also fixed in latest commit ✅

New findings from multi-model review

🟡 Invoke-CopilotStep exit codes ignored (Review-PR.ps1 lines 447, 461)
If Step 1 (PR Review) fails, the script proceeds to Step 2 (PR Finalize) without checking. Consider checking $LASTEXITCODE or the return value and at least logging a clear warning.
(Found by: Gemini, GPT)

🟡 Process cleanup too aggressive (Review-PR.ps1 lines 539-543)
Get-Process -Name "copilot" | Stop-Process kills ALL copilot instances on the machine, not just the one started by this script. On a dev box, this could terminate other Copilot sessions. Consider scoping by PID or skipping cleanup when not in CI.
(Found by: Gemini, GPT)

🟡 Try-Fix mandatory vs skip contradiction (SKILL.md)
Lines 96-99 say Phase 3 is "MANDATORY… NEVER SKIP… NO EXCEPTIONS", but the Gate failure path (lines 86-89) says "Tests FAIL with fix → Skip Try-Fix, proceed to Report." These should be reconciled.
(Found by: GPT)

🟢 Minor: README-AI.md skill listing
Lists pr-preflight, pr-gate, pr-report as standalone skills (~lines 62-65, 78-82). These are phase docs in .github/pr-review/, not skills. Could be misleading.
(Found by: GPT)

Overall

All three models agreed this is a good refactoring with clean architecture. The items above are the only net-new findings beyond what was already addressed.

kubaflo · 2026-03-17T17:20:20Z

Re: Multi-Model Re-Review Findings

Thanks for running the multi-model review — appreciate the thoroughness. However, I disagree with most of the findings here. Let me explain:

🟡 `Invoke-CopilotStep` exit codes — Disagree, by design

Steps 1 and 2 are intentionally independent. Step 1 (PR Review) does code review + test verification. Step 2 (PR Finalize) checks title/description accuracy. These serve completely different purposes.

If Step 1 crashes or times out, Step 2's analysis is still valuable — knowing the PR title doesn't match the implementation is useful regardless of whether code review completed. Adding exit code gating would create unnecessary coupling between independent phases and reduce the overall value of a partial run.

The downstream script (post-ai-summary-comment.ps1) already handles missing phase output gracefully — if a phase produced no content files, that section is simply omitted from the comment. So a Step 1 failure doesn't corrupt Step 2 output.

🟡 Process cleanup — Disagree, correctly scoped for its context

This cleanup runs at the very end of Review-PR.ps1, which is the CI orchestrator. By this point, ALL Invoke-CopilotStep calls have completed — any remaining copilot or copilot-related node processes are orphans from this script's own execution.

🟡 Try-Fix "contradiction" — Disagree, not a contradiction

These target different scenarios:

Line 88 ("Skip Try-Fix"): Gate found the fix is broken — tests FAIL with the fix applied. Running 4 Try-Fix models against a broken baseline would be meaningless since there's nothing working to compare alternatives against. Going straight to Report is the only sensible action.
Line 96 ("MANDATORY, NEVER SKIP"): This targets the LLM's tendency to shortcut. When Gate passes (fix works, tests pass), the LLM naturally wants to say "fix looks correct, no need to explore alternatives." The emphatic language prevents that — you MUST explore alternatives even when the fix looks perfect.

These aren't contradictory — line 88 is a degenerate failure case, line 96 is about the happy path. The "NO EXCEPTIONS" applies to the scenario where Try-Fix would actually be meaningful.

🟢 README phase listing — Agree, minor doc fix

Fair point — I'll clarify that pr-preflight, pr-gate, and pr-report are phase docs consumed by the pr-review orchestrator, not independently invokable skills.

… condition - README-AI.md: Clarify that pr-preflight, pr-gate, pr-report are phase docs in .github/pr-review/, not standalone skills in skills/ - README-AI.md: Fix skill count to distinguish skills from phase docs - SKILL.md: Clarify Try-Fix mandatory language to explicitly state the exception (broken fix) alongside the rule (never skip when Gate passes) Co-Authored-By: Copilot <223556219+Copilot@users.noreply.github.com>

kubaflo marked this pull request as ready for review March 12, 2026 18:29

Copilot AI review requested due to automatic review settings March 12, 2026 18:29

kubaflo added area-ai-agents Copilot CLI agents, agent skills, AI-assisted development copilot labels Mar 12, 2026

Copilot started reviewing on behalf of kubaflo March 12, 2026 18:30 View session

Copilot AI reviewed Mar 12, 2026

View reviewed changes

kubaflo force-pushed the review-skills branch from 29a4abd to 37ad92c Compare March 12, 2026 23:25

github-actions bot mentioned this pull request Mar 13, 2026

[repo-status] Daily Repo Status - March 13, 2026 🌟 #34468

Closed

kubaflo force-pushed the review-skills branch from d82778a to 82db03c Compare March 14, 2026 00:22

github-actions bot mentioned this pull request Mar 14, 2026

[repo-status] Daily Repo Status - March 14, 2026 🌟 #34475

Closed

This was referenced Mar 14, 2026

PR finalize: Skip Note block for test-only PRs, recommend [Testing] prefix #34185

Closed

PR agent: Add test-only PR workflow handling #34187

Closed

kubaflo force-pushed the review-skills branch from 82db03c to fdabf07 Compare March 15, 2026 16:20

github-actions bot mentioned this pull request Mar 16, 2026

[repo-status] Daily Repo Status - March 16, 2026 🌟 #34487

Closed

PureWeen previously approved these changes Mar 16, 2026

View reviewed changes

kubaflo and others added 12 commits March 16, 2026 22:44

Update Review-PR.ps1

6181548

xcode

e09fce3

Update SKILL.md

f37dfc6

Fix model count inconsistency in pr-review Quick Reference table

b34a7ec

The body consistently references 4 models but the Quick Reference table said ×2. Updated to ×4 to match. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

kubaflo dismissed PureWeen’s stale review via 6cae70b March 16, 2026 21:47

kubaflo force-pushed the review-skills branch from 7755046 to 6cae70b Compare March 16, 2026 21:47

github-actions bot mentioned this pull request Mar 17, 2026

[repo-status] Daily Repo Status - March 17, 2026 🌟 #34507

Closed

Update stale pr-build-status references to azdo-build-investigator

4aaa18a

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

PureWeen approved these changes Mar 17, 2026

View reviewed changes

PureWeen merged commit 541a7a3 into main Mar 17, 2026
3 of 12 checks passed

PureWeen deleted the review-skills branch March 17, 2026 18:08

This was referenced Mar 18, 2026

[repo-status] 📊 Daily Repo Status - March 18, 2026 #34523

Closed

[repo-status] 🌟 Daily Repo Status - March 21, 2026 #34588

Closed

Conversation

kubaflo commented Mar 12, 2026

🏗️ Overview

🔀 Before → After

✨ Key Changes

1️⃣ Agent → Skill Architecture

2️⃣ Decoupled Comment Posting

3️⃣ Unified Comment Model

4️⃣ Simplified CI & Orchestration

📁 Files Changed

📊 Impact

Uh oh!

github-actions bot commented Mar 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

PureWeen commented Mar 16, 2026

Code Review: Skills Architecture Refactoring

🔴 Should fix before merge

�� Nice to fix

📋 Separate PR

✅ Already fixed

✅ Things done well

Uh oh!

kubaflo commented Mar 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

PureWeen commented Mar 17, 2026

Re-Review (post-rebase)

New changes since last review

🟡 One remaining issue

Uh oh!

kubaflo commented Mar 17, 2026

Uh oh!

PureWeen commented Mar 17, 2026

Multi-Model Code Review (5 LLMs × consensus filter)

Consensus Findings (2+ models agree)

Uh oh!

PureWeen commented Mar 17, 2026

Multi-Model Re-Review (Gemini 3 Pro, GPT 5.1, Sonnet 4.5)

New findings from multi-model review

Overall

Uh oh!

kubaflo commented Mar 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Re: Multi-Model Re-Review Findings

🟡 Invoke-CopilotStep exit codes — Disagree, by design

🟡 Process cleanup — Disagree, correctly scoped for its context

🟡 Try-Fix "contradiction" — Disagree, not a contradiction

🟢 README phase listing — Agree, minor doc fix

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

github-actions bot commented Mar 12, 2026 •

edited

Loading

kubaflo commented Mar 16, 2026 •

edited

Loading

kubaflo commented Mar 17, 2026 •

edited

Loading

🟡 `Invoke-CopilotStep` exit codes — Disagree, by design