User Story
As a manager
I want to test my agents before deploying them to real interviewees
So that I can verify agent behavior and catch issues early
Priority: Phase 2
Simulation testing is deferred to Phase 2 to focus on core architecture first.
Acceptance Criteria
Manual Role-Play Testing
Automated Simulation Scenarios (Phase 3)
Simulation Results
API Requirements
Technical Notes
- Manual role-play uses same Interview Agent infrastructure
- Automated simulations use LLM to play interviewee role
- Ground truth stored in simulation scenario
- See TESTING-EVALUATION-FRAMEWORK.md for evaluation criteria
Definition of Done
🤖 Generated with Claude Code
User Story
As a manager
I want to test my agents before deploying them to real interviewees
So that I can verify agent behavior and catch issues early
Priority: Phase 2
Simulation testing is deferred to Phase 2 to focus on core architecture first.
Acceptance Criteria
Manual Role-Play Testing
Automated Simulation Scenarios (Phase 3)
Simulation Results
API Requirements
POST /api/v1/design-sessions/{id}/simulate- Run automated simulationWebSocket /api/v1/design-sessions/{id}/roleplay/{agent_id}- Manual role-playTechnical Notes
Definition of Done
🤖 Generated with Claude Code