Product engineer.
Working on:
- Lachesis (behavioral eval)
- Samsara (multi-agent sim)
Pinned Loading
-
opencode-dashboard
opencode-dashboard PublicOCD - OpenClaw Dashboard is a mini-kanban board for your OpenClaw agent. Handoff tasks, set cronjobs, and unblock your agents! them
-
trenchcoat-mpp
trenchcoat-mpp Publictell your agent to order you UberEats, using MPP
-
Samsara write-up: early results on s...
Samsara write-up: early results on strategic exploration, fog, and repeated play under pressure 1# How Do Models Approach Strategic Exploration Under Competitive Pressure?2## Early results from a behavioral reliability evaluation environment34**Date:** 2026-04-14
5**Project:** Samsara
-
Lachesis Eval: Silence Battery v2 wr...
Lachesis Eval: Silence Battery v2 writeup 1# Do Models Know When Not to Act?2## Results from the Lachesis Silence Battery34**Date:** 2026-04-13
5**Project:** Lachesis (behavioral evaluation research program)
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.



