ClawMetry Architecture — How It Works

A human-friendly guide to how ClawMetry sees everything your AI agents do.

The Big Picture

┌─────────────────────────────────────────────────────────┐
│                    Your Machine                          │
│                                                          │
│  ┌──────────────┐     reads      ┌──────────────────┐   │
│  │              │ ──────────────► │                  │   │
│  │   OpenClaw   │   filesystem   │    ClawMetry     │   │
│  │   Gateway    │ ◄──────────────│    Dashboard     │   │
│  │              │   WebSocket    │   (Flask app)    │   │
│  │  port 18789  │    JSON-RPC    │   port 8900      │   │
│  └──────┬───────┘                └────────┬─────────┘   │
│         │                                 │              │
│         │  spawns/manages                 │  serves UI   │
│         ▼                                 ▼              │
│  ┌──────────────┐                ┌──────────────────┐   │
│  │  AI Agents   │                │  Your Browser    │   │
│  │  (Claude,    │                │  http://...:8900 │   │
│  │   sessions)  │                └──────────────────┘   │
│  └──────────────┘                                        │
└─────────────────────────────────────────────────────────┘

ClawMetry is a read-only observer that sits alongside your OpenClaw gateway. It never modifies your agents or their data. It reads what OpenClaw already writes to disk and connects to the gateway's WebSocket API for real-time updates.

How ClawMetry Gets Its Data

ClawMetry has three data sources, all local to your machine:

1. Filesystem Reading (Primary)

OpenClaw stores everything as files. ClawMetry reads them directly:

What	Where	Format
Session transcripts	`~/.openclaw/agents/main/sessions/*.jsonl`	JSON Lines — one event per line
Gateway config	`~/.openclaw/openclaw.json`	JSON — model, channels, auth
Gateway logs	`/tmp/moltbot/moltbot-YYYY-MM-DD.log`	Structured JSON logs
Memory files	`{workspace}/memory/*.md`	Markdown — agent's daily notes
Cron state	Internal gateway state	Via WebSocket RPC

Session transcripts are the richest data source. Each .jsonl file contains every message, tool call, tool result, thinking block, and token count for a session. ClawMetry parses these to build timelines, calculate costs, and show what each agent decided.

2. Gateway WebSocket (Real-time)

ClawMetry connects to the OpenClaw gateway via WebSocket (ws://localhost:18789) using JSON-RPC:

Session list — which sessions are active right now
Cron jobs — scheduled tasks, their status, run history
Gateway config — model settings, channel config
Tool invocations — ClawMetry can invoke gateway tools (e.g., restart crons)

This is how the dashboard updates in real-time without polling.

3. OpenTelemetry Receiver (Optional)

ClawMetry can receive OTLP metrics and traces on:

POST /v1/metrics — Prometheus-style metrics in protobuf
POST /v1/traces — Distributed traces in protobuf

This allows external tools or custom instrumentation to feed data into ClawMetry.

Auto-Detection — Zero Config

When you run clawmetry, it automatically finds everything:

Workspace — Checks OPENCLAW_HOME, then ~/.openclaw, then common paths
Gateway port — Reads openclaw.json for the bind port (default 18789)
Gateway token — Reads auth token from config for API access
Log directory — Checks /tmp/moltbot/ for gateway logs
Sessions directory — Finds ~/.openclaw/agents/main/sessions/

No environment variables, no config files, no database setup. If OpenClaw is running, ClawMetry finds it.

The Dashboard — What You See

ClawMetry serves a single-page web app (embedded HTML/CSS/JS in the Python file) with these views:

Overview (`/api/overview`)

The main dashboard. Aggregates:

Active sessions — main + sub-agents, their status, last activity
Token usage — input/output/cache tokens, cost estimates per session
Cron status — running jobs, failures, next run times
System health — disk, memory, uptime, gateway version

Flow Visualization

An interactive graph showing how data flows through your system:

Channels (Telegram, etc.) → Gateway → Models (Claude, etc.) → Tools → Nodes

Each node shows real-time metrics (messages, tokens, calls).

Session Timeline (`/api/timeline`)

A chronological view of every action in a session:

User messages, assistant responses
Tool calls with arguments and results
Thinking blocks (when reasoning is enabled)
Token counts per turn

Transcripts (`/api/transcript/<id>`)

Full conversation history for any session. Supports:

Main sessions — direct user conversations
Sub-agent sessions — background tasks
Event filtering — show only tool calls, only messages, etc.

Sub-Agent Tracker (`/api/subagents`)

Real-time view of all spawned sub-agents:

What task they were given
Their current status (running, completed, failed)
Token consumption and runtime
Link to full transcript

Cost & Usage (`/api/usage`)

Token and cost analytics:

Per-model breakdown — which models consume the most
Per-session costs — find expensive sessions
Time series — cost trends over hours/days
Export — CSV download for billing

Cron Manager (`/api/crons`)

Full cron job management:

View all scheduled jobs with next run time
See run history and errors
Toggle enable/disable
Trigger manual runs
Create and edit jobs

System Health (`/api/system-health`)

Infrastructure monitoring:

Disk usage (warns at 85%+)
Memory consumption
Gateway uptime and version
Service port checks
GPU status (if available)

Budget & Alerts System

ClawMetry includes a built-in budget monitor:

Budget Config → Monitor Loop (every 60s) → Check Spend
                                              │
                              ┌────────────────┼────────────────┐
                              ▼                ▼                ▼
                        Under budget     Warning (80%)    Over budget
                          (no-op)        Send alert       Pause gateway

Daily/monthly budgets with configurable limits
Alert channels: Telegram, webhooks, email
Auto-pause: Can automatically pause the gateway when budget exceeded
Custom alert rules: Token spikes, error rates, session duration

Multi-Node Fleet Mode

For users running multiple OpenClaw instances:

┌──────────┐    ┌──────────┐    ┌──────────┐
│  Node A  │    │  Node B  │    │  Node C  │
│ (laptop) │    │  (Pi)    │    │ (server) │
└────┬─────┘    └────┬─────┘    └────┬─────┘
     │               │               │
     └───────────────┼───────────────┘
                     ▼
            ┌────────────────┐
            │   ClawMetry    │
            │  Fleet View    │
            │  (central)     │
            └────────────────┘

Nodes register via POST /api/nodes/register and send periodic metrics. The fleet view shows all nodes, their health, sessions, and aggregated costs.

Secured with CLAWMETRY_FLEET_KEY — nodes must provide the API key to register.

Technical Details

Single-File Architecture

ClawMetry is intentionally a single Python file (dashboard.py, ~11,600 lines). This makes it:

Easy to install (pip install clawmetry)
Easy to audit (one file to read)
Easy to deploy (no build step)
Portable (runs on a Raspberry Pi)

The HTML/CSS/JS dashboard is embedded as template strings inside the Python file.

Dependencies

Minimal by design:

Flask — Web server
No database — reads OpenClaw's files directly
Optional: opentelemetry-proto for OTLP support
Optional: history.py for time-series storage (SQLite-based)

History Module (`history.py`)

An optional companion that adds persistent time-series:

Stores snapshots every 5 minutes in SQLite
Enables historical charts (token usage over days/weeks)
Session history with cost trends
Cron execution history

Performance

Memory: ~30-80MB typical
CPU: Negligible (event-driven, no polling loops except health)
Disk: Zero (reads existing files, history.db is optional)
Startup: <2 seconds

Security

Gateway token auth — Dashboard requires the gateway token to access sensitive APIs
Local-only by default — Binds to 0.0.0.0:8900 but designed for LAN use
Read-only — Cannot modify agent behavior (except cron management via gateway RPC)
No external calls — Your data never leaves your machine

Data Flow Example

Here's what happens when you open ClawMetry and look at a running sub-agent:

Browser requests /api/subagents
ClawMetry reads ~/.openclaw/agents/main/sessions/sessions.json (session index)
ClawMetry identifies sub-agent sessions, reads each .jsonl transcript
For each session, it parses events to extract:
- Task description (from the spawn message)
- Current status (running if no completion event)
- Token counts (summed from all assistant turns)
- Tools used (from tool_use events)
- Runtime (first event timestamp to last)
ClawMetry also queries the gateway via WebSocket for live session state
Response sent to browser as JSON
Browser renders the sub-agent cards with live-updating metrics

All of this happens in <100ms for typical setups.

ClawMetry is open source under MIT License. See github.com/vivekchand/clawmetry

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ClawMetry Architecture — How It Works

The Big Picture

How ClawMetry Gets Its Data

1. Filesystem Reading (Primary)

2. Gateway WebSocket (Real-time)

3. OpenTelemetry Receiver (Optional)

Auto-Detection — Zero Config

The Dashboard — What You See

Overview (`/api/overview`)

Flow Visualization

Session Timeline (`/api/timeline`)

Transcripts (`/api/transcript/<id>`)

Sub-Agent Tracker (`/api/subagents`)

Cost & Usage (`/api/usage`)

Cron Manager (`/api/crons`)

System Health (`/api/system-health`)

Budget & Alerts System

Multi-Node Fleet Mode

Technical Details

Single-File Architecture

Dependencies

History Module (`history.py`)

Performance

Security

Data Flow Example

FilesExpand file tree

ARCHITECTURE.md

Latest commit

History

ARCHITECTURE.md

File metadata and controls

ClawMetry Architecture — How It Works

The Big Picture

How ClawMetry Gets Its Data

1. Filesystem Reading (Primary)

2. Gateway WebSocket (Real-time)

3. OpenTelemetry Receiver (Optional)

Auto-Detection — Zero Config

The Dashboard — What You See

Overview (/api/overview)

Flow Visualization

Session Timeline (/api/timeline)

Transcripts (/api/transcript/<id>)

Sub-Agent Tracker (/api/subagents)

Cost & Usage (/api/usage)

Cron Manager (/api/crons)

System Health (/api/system-health)

Budget & Alerts System

Multi-Node Fleet Mode

Technical Details

Single-File Architecture

Dependencies

History Module (history.py)

Performance

Security

Data Flow Example

Overview (`/api/overview`)

Session Timeline (`/api/timeline`)

Transcripts (`/api/transcript/<id>`)

Sub-Agent Tracker (`/api/subagents`)

Cost & Usage (`/api/usage`)

Cron Manager (`/api/crons`)

System Health (`/api/system-health`)

History Module (`history.py`)