Skip to content

feat: proactive queue monitoring in HEARTBEAT#38

Open
SevFle wants to merge 1 commit intomainfrom
feat/proactive-queue-monitoring
Open

feat: proactive queue monitoring in HEARTBEAT#38
SevFle wants to merge 1 commit intomainfrom
feat/proactive-queue-monitoring

Conversation

@SevFle
Copy link
Copy Markdown

@SevFle SevFle commented Feb 13, 2026

What This Does

Adds automatic WedPilot queue health monitoring to HEARTBEAT.md.

Features

Every 30 min check:

  • PRs stuck in ready-for-implementation >2 hours
  • Workflows that skipped due to missing specs
  • Offline or stuck runners
  • Queue processor health on strato

Auto-Fix Actions:

  • Missing specs → copy from develop, re-trigger
  • Skipped workflow → re-trigger with correct params
  • Runner stuck → alert user (can't auto-fix)
  • Processor down → attempt SSH restart

Related


🤖 Part of Atlas proactive monitoring system

- Check queue health every heartbeat (30 min)
- Auto-fix stuck PRs (missing specs, skipped workflows)
- Alert on unfixable issues (runners, processor down)
- Part of proactive failure detection system
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant