TestOps Copilot

Your test failures now have a detective on payroll.

An AI-powered test operations platform with an agentic copilot, a virtual team of 9 specialist personas, and graduated autonomy that lets you control how much the AI does on its own.

Quick Start · Demo Guide · How It Works · MCP Server · Changelog

The Pitch

Your CI pipeline breaks at 2 AM. By the time you open your laptop, the AI copilot has already searched Jira for similar failures, pulled the relevant Confluence runbook, identified the root cause from your failure knowledge base, and is waiting with a one-click fix.

You review it, hit approve, and move on with your coffee.

3-column Mission Control: navigation | main content | AI Copilot panel with persona routing

Get Running in 2 Minutes

git clone https://github.com/rayalon1984/testops-copilot.git
cd testops-copilot && npm install && npm run dev:simple

No PostgreSQL. No Redis. No API keys. Login with engineer@testops.ai / demo123.

That's it. You're talking to the copilot.

Want production mode? See the Production Quickstart for Docker + PostgreSQL + real AI providers.

What Makes v3 Different

Your AI Team, Not Just a Chatbot

Every query is routed to the right specialist before a single token is generated:

You Ask	Routed To	Why
"Why are my tests flaky?"	Test Engineer	Flaky test analysis, coverage, CI quality
"Pipeline is broken"	DevOps Engineer	Pipelines, deployments, CI/CD infra
"Is there a security vulnerability?"	Security Engineer	Auth, secrets, vulnerabilities
"Schema migration failed"	Data Engineer	Database, schema, migrations
"Page loads too slowly"	Performance Engineer	Latency, throughput, profiling
"What can this tool do?"	Product Manager	Feature discovery, onboarding

9 personas in total. Routing is two-tier: keyword rules fire in <1ms at zero cost. When that misses, a lightweight LLM micro-classification kicks in (~200 tokens). You see who's handling your query in real time: "Test Engineer is on it."

Graduated Autonomy — You Set the Dial

Not everyone wants the same level of AI independence:

Mode	What Happens
Conservative	AI investigates and recommends. You approve everything.
Balanced	Low-risk actions (searches, reads, labels) auto-execute. Team-visible actions show one-click approval cards.
Autonomous	The AI handles what it can, escalates what it should. Destructive actions always need your sign-off.

22 tools. 8 auto-execute. 11 with tiered approval. 3 housekeeping. Every write operation goes through a human-in-the-loop confirmation gate with a 5-minute TTL.

Bring Your Own Provider

Hot-swap AI providers mid-conversation from the in-chat picker:

Provider	What You Get
Anthropic Claude	Direct API — Opus, Sonnet, Haiku
AWS Bedrock	Claude via IAM role — zero credential management in AWS
OpenAI	GPT-4o, o1
Google Gemini	Gemini Pro, Flash
Azure OpenAI	Enterprise Azure deployments
OpenRouter	100+ models through a single gateway

Test Intelligence That Learns

Your failure knowledge base gets smarter with every test run:

Predictive Failure Analysis — Risk scores per test, trend aggregation, z-score anomaly detection that catches problems before they become patterns
Flaky Test Detection — Statistical scoring across historical pass/fail data surfaces the tests you can't trust
Smart Test Selection — Changed 3 files? The platform tells you which 12 of your 400 tests actually need to run
Failure Fingerprinting — Same root cause, different stack trace? The knowledge base links them automatically
Context Enrichment — Pulls context from Jira, Confluence, and GitHub simultaneously so the AI has the full picture

The Full Platform

Area	What You Get
Multi-Pipeline Dashboard	Unified view for Jenkins, GitHub Actions, and custom CI
3-Column Mission Control	Real-time dashboard with integrated AI copilot panel
Team Workspaces	Teams, members (OWNER/ADMIN/MEMBER/VIEWER), scoped pipelines
Collaborative RCA	Comments on failures, version-tracked RCA revisions with optimistic locking
Failure Knowledge Base	Smart fingerprinting, historical trending, category analytics
Auto-Fix Workflow	Analyzes failure, creates branch, commits fix, opens PR
Chat Session Persistence	Full message history stored and retrievable across sessions
Cost Tracking	Per-session cost breakdown by tool and provider with budget alerts

Integrations

TestOps Copilot plugs into your existing stack:

Service	What It Does	Setup
Jira	Issue creation, bi-directional sync, similar issue search (JQL)	`JIRA_BASE_URL` + `JIRA_API_TOKEN`
GitHub	Commit diffs, PR lookup, branch/file ops, workflow triggering	`GITHUB_TOKEN`
Confluence	Knowledge reader (CQL), RCA doc publishing, runbook lookup	`CONFLUENCE_BASE_URL` + `CONFLUENCE_API_TOKEN`
Slack	Push notifications on failures and pipeline status changes	`SLACK_WEBHOOK_URL`
Monday.com	Work OS integration for task management	`MONDAY_API_TOKEN`
TestRail	Test case management sync	`TESTRAIL_HOST` + `TESTRAIL_API_KEY`
Grafana/Prometheus	Metrics at `/metrics`, pre-built dashboards	Built-in

Production-Hardened

This isn't a prototype. v3 went through a dedicated security audit:

SQL injection prevention — Prisma.sql tagged templates, no raw queries
CSRF protection — Double-submit cookie pattern
Redis-backed sessions — Automatic token blacklisting
Structured logging — Every request gets an X-Request-ID you can trace end-to-end
Deep health checks — /health/ready (DB + Redis), /health/live (liveness), /health/full (all services)
Enterprise auth — SSO/SAML 2.0 (Okta, Azure AD), RBAC with 5 roles, audit logging with PII redaction
Clean audit — 0 high/critical vulnerabilities

Demo Mode vs Production Mode

	Demo Mode	Production Mode
Database	SQLite (file-based)	PostgreSQL 14+
AI Provider	Mock (realistic demo data)	Anthropic / OpenAI / Google / Azure / Bedrock
Integrations	Simulated responses	Real Jira, Slack, GitHub, Confluence
Setup time	~2 minutes	~15 minutes
Docker required	No	Yes
Best for	Evaluation, demos, training	Production deployments

Demo credentials (all use password demo123):

Email	Role
`admin@testops.ai`	Site Admin
`lead@testops.ai`	QA Lead
`engineer@testops.ai`	QA Engineer
`viewer@testops.ai`	Stakeholder

Production setup:

cp .env.production.example .env.production   # Edit secrets!
docker compose -f docker-compose.ghcr.yml up -d

Tech Stack

Layer	Technologies
Backend	Node.js 18+ · TypeScript · Express.js · Prisma ORM · PostgreSQL · Redis
Frontend	React 18 · TypeScript · Material-UI v5 · Zustand · React Query · Vite
AI	Anthropic Claude · OpenAI · Google Gemini · Azure OpenAI · AWS Bedrock · OpenRouter
Vector DB	Weaviate for semantic failure matching
Infra	Docker · GitHub Actions · Prometheus · Grafana · OpenTelemetry · Playwright E2E

Development

# Demo mode — SQLite, mock AI, auto-open browser
npm run dev:simple

# Full stack — PostgreSQL + Redis + Weaviate via Docker
npm run local:start && npm run dev

Command	What It Does
`npm run test`	Run all 760 tests (Jest + Vitest)
`npm run typecheck`	Type check backend + frontend
`npm run lint`	ESLint both projects
`npm run build`	Build all packages
`npm run check:architecture`	Verify no layer violations
`npm run check:health`	Flag oversized files/functions

API Reference

Core

POST /api/auth/login                          # Login
POST /api/auth/register                       # Register
GET  /api/pipelines                           # List pipelines
GET  /api/test-runs                           # List test runs

AI Copilot

POST /api/v1/ai/chat                         # SSE streaming chat (ReAct loop)
GET  /api/v1/ai/personas                      # List virtual team personas
GET  /api/v1/ai/config                        # Current provider config
PUT  /api/v1/ai/config                        # Update provider (admin)
POST /api/v1/ai/config/test                   # Test provider connection
POST /api/v1/ai/confirm                       # Approve/deny pending action
GET  /api/v1/ai/health                        # AI services health
GET  /api/v1/ai/costs                         # Cost summary

Failure Knowledge Base

POST /api/v1/failure-archive                  # Create failure entry
PUT  /api/v1/failure-archive/:id/document-rca # Document RCA (version-aware)
GET  /api/v1/failure-archive/trends           # Time-series failure trends
GET  /api/v1/failure-archive/predictions      # Risk scores per test
GET  /api/v1/failure-archive/anomalies        # Anomaly detection

Full API reference: docs/api.md

Project Structure

testops-copilot/
├── backend/
│   ├── prisma/                          # Schema (dev + production) & migrations
│   └── src/
│       ├── routes/ai/                   # AI & copilot REST routes
│       ├── services/ai/
│       │   ├── AIChatService.ts         # ReAct loop + SSE streaming
│       │   ├── PersonaRouter.ts         # Two-tier query classifier
│       │   ├── tools/                   # 22 agentic tool wrappers
│       │   ├── providers/               # 6 AI provider adapters
│       │   └── features/               # RCA, categorization, enrichment
│       ├── middleware/                   # Auth, validation, error handling
│       └── utils/                       # Logger, validators, helpers
├── frontend/
│   └── src/
│       ├── components/AICopilot/        # Copilot panel + cards + persona badge
│       ├── hooks/useAICopilot.ts        # SSE chat hook with persona support
│       ├── pages/                       # Dashboard, KB, Teams, Settings
│       └── services/                    # API clients
├── mcp-server/                          # Model Context Protocol server (8 tools)
├── specs/                               # Living specification documents
│   ├── features/                        # 16 YAML feature manifests (229 assertions)
│   └── team/                            # 9 persona specs + routing rubric
├── docs/                                # User & developer documentation
└── .github/workflows/                   # CI/CD pipelines

Documentation

Document	Description
Quick Start	Get running in 5 minutes
How Does It Work?	Plain-English guide to the platform
Demo Guide	Visual guide with workflow diagrams
UI Tour	Visual walkthrough with annotated screenshots
API Reference	Full REST API documentation
Architecture	System design and components
MCP Server	Model Context Protocol integration
Development Guide	Coding standards, testing, git workflow
Roadmap	What's shipped and what's next
Changelog	Full version history
Lessons Learned	Living error pattern registry

Contributing

Fork the repository
Create a feature branch: git checkout -b feat/amazing-feature
Make changes, write tests, update docs
Commit: git commit -m 'feat: add amazing feature' (Conventional Commits)
Push and open a Pull Request

Before submitting:

npm run lint && npm run typecheck && npm run test && npm run build

License

Apache License 2.0 — see LICENSE.

Built by Rotem Ayalon

Report a Bug · Request a Feature

If you find this project useful, give it a star!

Name		Name	Last commit message	Last commit date
Latest commit History 640 Commits
.github		.github
.husky		.husky
backend		backend
docs		docs
frontend		frontend
infra		infra
marketing		marketing
mcp-server		mcp-server
plans		plans
scripts		scripts
specs		specs
tests		tests
.dockerignore		.dockerignore
.env.example		.env.example
.env.production.example		.env.production.example
.gitignore		.gitignore
.health-baseline.json		.health-baseline.json
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
commitlint.config.js		commitlint.config.js
docker-compose.ghcr.yml		docker-compose.ghcr.yml
docker-compose.prod.yml		docker-compose.prod.yml
docker-compose.yml		docker-compose.yml
package-lock.json		package-lock.json
package.json		package.json
plan.md		plan.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TestOps Copilot

The Pitch

Get Running in 2 Minutes

What Makes v3 Different

Your AI Team, Not Just a Chatbot

Graduated Autonomy — You Set the Dial

Bring Your Own Provider

Test Intelligence That Learns

The Full Platform

Integrations

Production-Hardened

Demo Mode vs Production Mode

Tech Stack

Development

API Reference

Core

AI Copilot

Failure Knowledge Base

Project Structure

Documentation

Contributing

License

About

Uh oh!

Releases 35

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

TestOps Copilot

The Pitch

Get Running in 2 Minutes

What Makes v3 Different

Your AI Team, Not Just a Chatbot

Graduated Autonomy — You Set the Dial

Bring Your Own Provider

Test Intelligence That Learns

The Full Platform

Integrations

Production-Hardened

Demo Mode vs Production Mode

Tech Stack

Development

API Reference

Core

AI Copilot

Failure Knowledge Base

Project Structure

Documentation

Contributing

License

About

Topics

Resources

License

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 35

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages