CLAUDE.md

This file provides guidance to Claude Code when working with code in this repository.

Project Overview

Clara is an AI-powered interview discovery platform that enables organizations to conduct structured discovery interviews at scale. A human manager orchestrates the process by configuring specialized AI interview agents via an Interview Blueprint, setting context, defining outcomes, and inviting interviewees.

Design Documents

All design specifications are located at /Users/mantiz/Clara-Analysis/:

Document	Purpose
`PRD.md`	Product requirements (v2.0, Phase 3 architecture)
`CLARA-DATA-MODEL.md`	Neo4j + PostgreSQL schema, entity resolution
`CLARA-UI-INTERACTION-FLOW.md`	AG-UI event flows, state models, UI components
`DESIGN-ASSISTANT-SDK-INTEGRATION.md`	Interview Blueprint schema, MCP integrations
`SYNTHESIS-PIPELINE.md`	4-stage extraction → synthesis pipeline
`ANALYSIS-TEMPLATES.md`	Project type analysis frameworks
`ADAPTIVE-INTERVIEW-UI.md`	Dynamic UI component triggers
`PYDANTIC-ECOSYSTEM-ARCHITECTURE.md`	Pydantic AI/Graph/Logfire/Gateway usage
`TESTING-EVALUATION-FRAMEWORK.md`	Eval methodology, LLM-as-judge
`AGENT-DESIGN-ASSISTANT.md`	Opus-powered design helper
`SECURITY-GOVERNANCE.md`	Security controls, data governance, threat model
`DEPLOYMENT-OPTIONS.md`	SaaS to air-gapped deployment

Tech Stack

Component	Technology
Backend	Python 3.12, FastAPI
Agent Framework	Pydantic AI with AG-UI adapter
Agent Coordination	A2A Protocol
Knowledge Graph	Neo4j 5.x
Relational DB	PostgreSQL 15+
Real-time	AG-UI Protocol (SSE)
Enterprise Integrations	MCP servers (Jira, Confluence)
File Storage	S3
Observability	Logfire
Frontend	React, TypeScript, @ag-ui/react, Tailwind CSS

Project Structure

ClaraMap/
├── src/
│   ├── backend/
│   │   ├── clara/                    # Main Python package
│   │   │   ├── main.py               # FastAPI app entry point
│   │   │   ├── config.py             # Settings/configuration
│   │   │   ├── api/                  # FastAPI routers
│   │   │   ├── agents/               # Pydantic AI agents
│   │   │   │   ├── interview_agent.py    # Dynamic agent configured by blueprint
│   │   │   │   ├── design_assistant.py   # Opus-powered blueprint designer
│   │   │   │   └── synthesis_agent.py    # A2A synthesis coordinator
│   │   │   ├── blueprints/           # Blueprint schema and loading
│   │   │   ├── services/             # Business logic
│   │   │   ├── models/               # Pydantic models
│   │   │   ├── graph/                # Neo4j operations
│   │   │   ├── db/                   # PostgreSQL/SQLAlchemy
│   │   │   ├── integrations/         # MCP, A2A, S3
│   │   │   └── security/             # Auth, sanitization, audit
│   │   │
│   │   ├── tests/
│   │   │   ├── unit/
│   │   │   ├── integration/
│   │   │   └── evaluation/           # LLM evaluation suite
│   │   │
│   │   └── pyproject.toml
│   │
│   └── frontend/
│       ├── src/
│       │   ├── components/
│       │   ├── pages/
│       │   └── hooks/                # AG-UI hooks
│       ├── tests/
│       └── package.json
│
├── docs/
├── docker/
├── README.md
└── CLAUDE.md

Development Commands

Backend

cd src/backend

# Install dependencies
uv sync

# Run development server
uv run uvicorn clara.main:app --reload --port 8000

# Run tests
uv run pytest

# Run specific test file
uv run pytest tests/unit/test_interview_agent.py -v

# Type checking
uv run mypy clara

# Linting
uv run ruff check clara

# Format
uv run ruff format clara

Frontend

cd src/frontend

# Install dependencies
pnpm install

# Run development server
pnpm dev

# Run tests
pnpm test

# Build
pnpm build

Docker

# Start all services (Neo4j, PostgreSQL, backend, frontend)
docker-compose up -d

# View logs
docker-compose logs -f backend

Key Architecture Concepts

Interview Agents are Dynamic

Interview agents are NOT statically defined. They are configured at runtime from an Interview Blueprint created by the Design Assistant. The blueprint specifies:

Agent persona (role, tone, expertise)
Interview topics and goals
Entity extraction schema
Adaptive UI triggers
Follow-up behavior

# Example: Creating an agent from a blueprint
from clara.agents.interview_agent import InterviewAgent
from clara.blueprints.loader import load_blueprint

blueprint = await load_blueprint(blueprint_id)
agent = InterviewAgent.from_blueprint(blueprint, agent_config_id)
app = agent.to_ag_ui(deps=StateDeps(InterviewState(...)))

AG-UI State Management

All real-time interview state flows through AG-UI protocol:

from pydantic_ai import Agent
from pydantic_ai.ag_ui import StateDeps
from ag_ui.core import StateSnapshotEvent, StateDeltaEvent, CustomEvent

# Interview state synchronized with frontend
class InterviewState(BaseModel):
    interview_id: str
    phase: InterviewPhase
    detected_entities: list[DetectedEntity]
    topics: list[TopicCoverage]
    active_components: list[ActiveUIComponent]

# Tools return AG-UI events
@agent.tool_plain
async def detect_entity(entity_type: str, name: str, context: str) -> list:
    return [
        StateDeltaEvent(type=EventType.STATE_DELTA, delta=[...]),
        CustomEvent(type=EventType.CUSTOM, name="clara:entity_detected", value={...})
    ]

Neo4j Knowledge Graph

All extracted entities link to evidence (interview quotes):

// Entity structure with evidence chain
(:System {id, project_id, name, vendor, owner})
  -[:SUPPORTED_BY]->(:Evidence {quote, timestamp, confidence})
  -[:FROM_INTERVIEW]->(:Interview {id})

// Always scope queries by project_id
MATCH (n:System {project_id: $project_id})

Entity Resolution

Uses blocking strategy (not O(n²)) and preserves relationship types during merges. See CLARA-DATA-MODEL.md for full implementation.

Security Requirements

Reference SECURITY-GOVERNANCE.md for full details.

Input Sanitization: All interviewee input through InputSanitizer
File Uploads: Malware scanning required before storage
Tool Policy: Design Assistant tools restricted via ToolPolicyEnforcer
PII Handling: Redact from logs, classify by level
Audit Logging: All data access logged to immutable store
Structured Rationales: Never expose raw chain-of-thought

Testing Strategy

Unit Tests: pytest for services, models, utilities
Integration Tests: Test against real Neo4j/PostgreSQL (Docker)
Evaluation Suite:
- Ground truth datasets for entity extraction
- LLM-as-judge for conversation quality
- Regression gates for prompt changes

Git Workflow & Branch-Based Development

Branch Strategy

IMPORTANT: All development MUST follow a branch-based workflow. Never commit directly to main.

Branch Naming Convention

Every story/task gets its own feature branch:

# Format: feature/{issue-number}-{brief-description}
feature/3-create-discovery-project
feature/9-design-assistant-conversation
feature/18-blueprint-core-schema

# For bug fixes
fix/{issue-number}-{brief-description}

# For documentation
docs/{brief-description}

Development Workflow

Create Feature Branch

# Always branch from main
git checkout main
git pull origin main
git checkout -b feature/{issue-number}-{description}

Make Commits During Development

Commit frequently (after each logical unit of work)
Every commit should have a clear, descriptive message
Reference the issue number in commits

# Good commit messages
git commit -m "feat(projects): Add project creation endpoint (#3)"
git commit -m "test(projects): Add unit tests for project service (#3)"
git commit -m "docs(projects): Update API documentation (#3)"

Commit Message Format

<type>(<scope>): <subject> (#issue)

Types:
- feat: New feature
- fix: Bug fix
- refactor: Code refactoring
- test: Adding tests
- docs: Documentation changes
- chore: Build/config changes
- style: Code style changes (formatting)

Examples:
feat(blueprint): Implement core schema models (#18)
test(blueprint): Add validation tests for AgentBlueprint (#19)
refactor(agents): Extract agent factory to separate module (#24)
fix(invitations): Handle duplicate email validation (#26)

Push and Create Pull Request

# Push feature branch
git push -u origin feature/{issue-number}-{description}

# Create PR via GitHub CLI
gh pr create --title "feat: {Feature name} (#{issue})" \
             --body "Closes #{issue-number}

## Changes
- List of changes

## Testing
- How to test

## Checklist
- [ ] Tests passing
- [ ] Documentation updated
- [ ] Code reviewed"

Merge and Cleanup

# After PR approved and merged
git checkout main
git pull origin main
git branch -d feature/{issue-number}-{description}

Commit Guidelines

Commit Frequently:

After implementing a function/class
After writing tests
After fixing a bug
After updating documentation
Before switching context

Atomic Commits:

Each commit should be a single logical change
Should be independently reviewable
Should not break the build

Commit Message Quality:

First line: Clear summary (50 chars max)
Reference issue number
Use imperative mood ("Add" not "Added")
Explain WHY, not just WHAT

Pull Request Requirements

Every PR must:

Reference the GitHub issue (Closes #X)
Pass all tests (unit, integration, type checks, linting)
Include tests for new functionality
Update documentation if needed
Have a clear description of changes
Be reviewed by at least one team member
Have all CI checks passing

Working on Multiple Stories

If working on multiple stories simultaneously:

# Switch between feature branches
git checkout feature/3-create-project
# ... make changes, commit ...

git checkout feature/4-project-dashboard
# ... make changes, commit ...

# Always commit before switching branches
git commit -m "WIP: Partial implementation"

Integration with Main

Never commit directly to main
All changes go through pull requests
Main branch is always deployable
Protect main branch with required PR reviews
Enable branch protection rules on GitHub

Example Full Workflow

# Starting work on issue #18 (Blueprint Core Schema)
git checkout main
git pull origin main
git checkout -b feature/18-blueprint-core-schema

# Make changes, commit frequently
git add src/backend/clara/models/blueprint.py
git commit -m "feat(blueprint): Add InterviewBlueprint base model (#18)"

git add src/backend/clara/models/project_context.py
git commit -m "feat(blueprint): Add ProjectContext schema (#18)"

git add tests/unit/test_blueprint_models.py
git commit -m "test(blueprint): Add validation tests for blueprint models (#18)"

# Run tests before pushing
cd src/backend && uv run pytest
uv run mypy clara
uv run ruff check clara

# Push and create PR
git push -u origin feature/18-blueprint-core-schema
gh pr create --title "feat: Blueprint Core Schema Definition (#18)" \
             --body "Closes #18

## Changes
- Implemented InterviewBlueprint base model
- Added ProjectContext schema with validation
- Added comprehensive unit tests
- All Pydantic models with strict validation

## Testing
- Unit tests: pytest tests/unit/test_blueprint_models.py
- Type checking: mypy clara/models/
- All tests passing ✓

## Checklist
- [x] Tests passing
- [x] Type checks passing
- [x] Documentation updated
- [x] Ready for review"

# After PR review and approval, merge via GitHub UI
# Then cleanup locally
git checkout main
git pull origin main
git branch -d feature/18-blueprint-core-schema

Common Patterns

Creating a New API Endpoint

# clara/api/interviews.py
from fastapi import APIRouter, Depends
from clara.services.interview_service import InterviewService

router = APIRouter(prefix="/interviews", tags=["interviews"])

@router.get("/{interview_id}")
async def get_interview(
    interview_id: str,
    service: InterviewService = Depends()
):
    return await service.get_interview(interview_id)

Adding a New Entity Type to Graph

Define Pydantic model in clara/models/entities.py
Add node creation in clara/graph/entities.py
Update extraction schema in blueprint templates
Add to entity resolution service

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CLAUDE.md

Project Overview

Design Documents

Tech Stack

Project Structure

Development Commands

Backend

Frontend

Docker

Key Architecture Concepts

Interview Agents are Dynamic

AG-UI State Management

Neo4j Knowledge Graph

Entity Resolution

Security Requirements

Testing Strategy

Git Workflow & Branch-Based Development

Branch Strategy

Branch Naming Convention

Development Workflow

Commit Guidelines

Pull Request Requirements

Working on Multiple Stories

Integration with Main

Example Full Workflow

Common Patterns

Creating a New API Endpoint

Adding a New Entity Type to Graph

FilesExpand file tree

CLAUDE.md

Latest commit

History

CLAUDE.md

File metadata and controls

CLAUDE.md

Project Overview

Design Documents

Tech Stack

Project Structure

Development Commands

Backend

Frontend

Docker

Key Architecture Concepts

Interview Agents are Dynamic

AG-UI State Management

Neo4j Knowledge Graph

Entity Resolution

Security Requirements

Testing Strategy

Git Workflow & Branch-Based Development

Branch Strategy

Branch Naming Convention

Development Workflow

Commit Guidelines

Pull Request Requirements

Working on Multiple Stories

Integration with Main

Example Full Workflow

Common Patterns

Creating a New API Endpoint

Adding a New Entity Type to Graph