AI coding agent with a persistent Monty-backed repl runtime or native tool-calling execution mode, hashline-safe file editing, and a host-friendly runtime API for TUI and headless frontends.
```sh
cat > ~/.local/bin/lash << 'SCRIPT'
#!/bin/bash
cargo build --release --manifest-path ~/code/lash/Cargo.toml 2>/dev/null
exec ~/code/lash/target/release/lash "$@"
SCRIPT
chmod +x ~/.local/bin/lash
```

lash/lash-core support two execution backends:

- `repl` (default): a persistent Monty-backed sandbox that executes agent-written Python-like code with host-call boundaries for tools.
- `native-tools`: provider-native tool calling without the REPL sandbox.
Both backends are driven by a single sans-IO state machine (TurnMachine in lash/src/sansio.rs). All protocol logic (prompt assembly, fence parsing, retry/backoff, context folding, turn limits) lives in the synchronous machine; the async Agent::run / run_native_tools methods are thin host drivers that fulfil I/O effects. This makes the core turn logic independently testable without a tokio runtime.
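To illustrate the sans-IO pattern described above (this is a toy sketch, not the real `TurnMachine` API; all names here are hypothetical), the synchronous core emits effect values and a host driver fulfills them:

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical effect types; the real TurnMachine's effect surface differs.
@dataclass
class CallModel:
    prompt: str

@dataclass
class Done:
    output: str

@dataclass
class ToyTurnMachine:
    """Synchronous core: no I/O, only state transitions and effect requests."""
    prompt: str
    _response: Optional[str] = None

    def next_effect(self):
        if self._response is None:
            return CallModel(self.prompt)   # ask the host to perform I/O
        return Done(self._response)         # terminal state

    def feed_model_response(self, text: str):
        self._response = text               # host reports the I/O result back

def drive(machine, model_fn):
    """Host driver: fulfills effects. Could equally be an async driver; the core doesn't care."""
    while True:
        effect = machine.next_effect()
        if isinstance(effect, Done):
            return effect.output
        machine.feed_model_response(model_fn(effect.prompt))

# The core is testable with a fake model and no async runtime:
result = drive(ToyTurnMachine("hi"), lambda p: f"echo:{p}")
```

Because all protocol logic lives in the synchronous machine, tests exercise it by substituting `model_fn`, exactly the property the real split is after.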
```sh
./dev.sh
cargo build -p lash-cli
cargo build -p lash-cli --release
```

```toml
[dependencies]
lash-core = { path = "../lash", default-features = false, features = ["full"] }
```

```sh
lash                                 # start interactive TUI session
lash -p "summarize this repo"        # headless mode: run one prompt and print result
lash --print "summarize this repo"   # same, long form
lash --model gpt-5.4                 # override model
lash --execution-mode native-tools   # use provider-native tool calling instead of the repl sandbox
lash --no-mouse                      # disable mouse (re-enables terminal text selection)
lash --provider                      # force provider setup flow
lash --reset                         # delete ~/.lash and ~/.cache/lash, then exit
```

Override system prompt sections at launch:

```sh
lash --prompt-replace "section=replacement text"
lash --prompt-replace-file "section=path/to/file.md"
lash --prompt-prepend "section=prepended text"
lash --prompt-prepend-file "section=path/to/file.md"
lash --prompt-append "section=appended text"
lash --prompt-append-file "section=path/to/file.md"
lash --prompt-disable "section"
```

Skills are modular directories that extend lash with specialized knowledge and workflows.
Skill directories: ~/.lash/skills/ (global) and .lash/skills/ (project-local, overrides global).
Each skill is a directory containing a SKILL.md with YAML frontmatter (name, description) and a markdown body. Supporting files (scripts, references, templates) are included alongside.
```
~/.lash/skills/
  my-skill/
    SKILL.md            # required: frontmatter + instructions
    scripts/foo.py      # optional supporting files
    references/bar.md
```
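Based on the description above, a minimal `SKILL.md` might look like this (illustrative only; `name` and `description` are the documented frontmatter keys, everything else here is a placeholder):

```markdown
---
name: my-skill
description: One-line summary shown when browsing skills and used for matching.
---

# My Skill

Instructions the agent should follow when this skill is invoked.
Reference supporting files relatively, e.g. scripts/foo.py.
```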
Use `/skills` to browse, `/<skill-name>` to invoke, or call `load_skill("name")` from REPL/native-tools execution.
Use scripts/run-terminalbench.sh to run Harbor + Terminal Bench 2 with the in-repo lash adapter.
By default it runs terminal-bench-sample@2.0, builds a glibc-compatible binary in target-bookworm/release/lash, and uses your local ~/.lash/config.json inside benchmark containers (so OAuth/OpenAI-generic config is reused).
```sh
scripts/run-terminalbench.sh --sample
scripts/run-terminalbench.sh --full --task "git-*"
scripts/run-terminalbench.sh --sample --task chess-best-move --model gpt-5.4
scripts/run-terminalbench.sh --sample --build-mode host   # use host binary instead
```

| Command | Description |
|---|---|
| `/clear`, `/new` | Reset conversation |
| `/controls` | Show keyboard shortcuts |
| `/model [name]` | Show current model or switch LLM model |
| `/mode [name]` | Show current execution mode or switch modes (`repl` or `native-tools`) |
| `/provider` | Open provider setup in-app |
| `/login` | Alias for `/provider` |
| `/logout` | Remove stored credentials from disk |
| `/retry` | Replay the previous turn payload exactly |
| `/resume [name]` | Browse/load previous sessions |
| `/skills` | Browse loaded skills |
| `/tools` | Inspect or edit dynamic tools |
| `/caps` | Inspect or edit dynamic capabilities |
| `/reconfigure` | Apply or inspect pending runtime reconfigure |
| `/help`, `/?` | Show help |
| `/exit`, `/quit` | Quit |
`/logout` only clears persisted config. The active session may continue with in-memory credentials until you switch provider or restart.

`repl` is the default execution mode and is always available. `native-tools` uses provider-native tool calling with the same lash tool definitions, but it does not preserve arbitrary REPL locals across turns.

Low-intelligence sub-agents spawned with `agent_call(..., intelligence="low")` always run in native-tools; medium/high sub-agents inherit the parent session's execution mode.
Context folding is batched and cache-friendly. Lash keeps the prompt stable until the hard watermark is hit, then folds old history back to the soft watermark with one stable archive marker instead of mutating prompt status every turn. Defaults are 50% soft and 60% hard. Override them with:
```sh
lash --context-fold-soft-pct 45 --context-fold-hard-pct 58
```

`batch` is native-tools-only. Use it for 2-25 independent tool calls when you already know the arguments up front; it runs those calls concurrently and returns a structured per-call result summary.
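The shape of that behavior can be sketched as follows (hypothetical result schema; the real `batch` tool's output is defined by lash, not by this example): run independent calls concurrently and report a structured per-call outcome instead of raising on the first failure.

```python
from concurrent.futures import ThreadPoolExecutor

def run_batch(calls):
    """calls: list of (fn, kwargs) pairs. Returns one result record per call, in input order."""
    def run_one(idx, fn, kwargs):
        try:
            return {"index": idx, "ok": True, "result": fn(**kwargs)}
        except Exception as exc:              # failures become records, not exceptions
            return {"index": idx, "ok": False, "error": str(exc)}

    with ThreadPoolExecutor(max_workers=8) as pool:
        futures = [pool.submit(run_one, i, fn, kw) for i, (fn, kw) in enumerate(calls)]
        return [f.result() for f in futures]  # preserves submission order

results = run_batch([
    (lambda x: x * 2, {"x": 21}),   # succeeds
    (lambda: 1 / 0, {}),            # fails; reported per-call
])
```

This is why `batch` wants independent calls with known arguments: there is no opportunity to feed one call's output into the next.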
In native-tools, lash persists structured tool call/result history and replays it back to providers as native tool-call turns. Multi-call assistant steps are replayed as grouped tool-calling turns where the provider supports it, and tool results are fed back as direct output/error content rather than wrapped in an extra lash-specific envelope.
Plan mode is a first-class runtime mode used for plan-then-execute workflows.

- The agent enters plan mode by calling `enter_plan_mode`.
- The TUI switches to plan mode and allocates a plan file in `.lash/plans/`.
- The runtime injects plan-mode guardrails (read/explore/design, no project file edits) and points the model at that plan file.
- `exit_plan_mode` returns plan content for user approval.
- After approval, lash clears message history for a fresh execution phase while preserving the active execution state where applicable (for example, REPL state in `repl` mode).
| Key | Action |
|---|---|
| `Esc` | Cancel running agent / dismiss prompt |
| `Shift+Enter` | Insert newline |
| `Ctrl+U` / `Ctrl+D` | Scroll half-page up / down |
| `PgUp` / `PgDn` | Scroll full page |
| `Ctrl+V` | Paste image as inline `[Image #n]` |
| `Ctrl+Shift+V` | Paste text only |
| `Ctrl+Y` | Copy last response |
| `Ctrl+O` | Cycle tool expansion level |
| `Alt+O` | Full expansion (code + stdout) |
Default root model by provider:
| Provider | Default model |
|---|---|
| `openai-generic` | `anthropic/claude-sonnet-4.6` |
| Claude | `claude-opus-4-6` |
| Codex | `gpt-5.4` |
| Google OAuth | `gemini-3.1-pro-preview` |
openai-generic defaults to https://openrouter.ai/api/v1 as its base URL.
Stored config lives at ~/.lash/config.json and uses this shape:
```json
{
  "provider": {
    "type": "openai-generic",
    "api_key": "...",
    "base_url": "https://openrouter.ai/api/v1"
  },
  "auxiliary_secrets": {
    "tavily_api_key": "..."
  },
  "agent_models": {
    "low": "...",
    "medium": "...",
    "high": "..."
  }
}
```

`auxiliary_secrets.tavily_api_key` is the supported Tavily location. The old top-level `tavily_api_key` field and legacy sub-agent tier aliases are no longer part of the documented config surface.
Default sub-agent tier mapping (low / medium / high):
| Provider | low | medium | high |
|---|---|---|---|
| `openai-generic` | `minimax/minimax-m2.5` | `z-ai/glm-5` | `anthropic/claude-sonnet-4.6` |
| Claude | `claude-haiku-4-5` | `claude-sonnet-4-6` | `claude-sonnet-4-6` |
| Codex | `gpt-5.3-codex-spark` | `gpt-5.4` (reasoning=medium) | `gpt-5.4` (reasoning=high) |
| Google OAuth | `gemini-3-flash-preview` | `gemini-3.1-pro-preview` | `gemini-3.1-pro-preview` |
lash-core exposes a unified turn input surface for third-party frontends (lash/src/runtime.rs).
- `TurnInput.items` is an ordered list of typed items.
- Supported item kinds: `text`, `file_ref`, `dir_ref`, `image_ref`, `skill_ref`.
- `TurnInput.image_blobs` carries raw image bytes keyed by `image_ref.id`.

This preserves interleaving and intent, e.g. text -> image -> text -> file reference in a single turn.
```json
{
  "items": [
    {"type": "text", "text": "Please update this"},
    {"type": "image_ref", "id": "img-1"},
    {"type": "text", "text": "and check"},
    {"type": "file_ref", "path": "/repo/src/main.rs"}
  ],
  "image_blobs": {
    "img-1": "<binary png bytes>"
  }
}
```

`RuntimeEngine` exposes two first-class host APIs:

- `stream_turn(input, sink, cancel)`:
  - Streams low-level, mode-specific `AgentEvent` values to an `EventSink`.
  - Returns the canonical `AssembledTurn` when the turn terminates.
- `run_turn_assembled(input, cancel)`:
  - Convenience path when the host only needs the high-level terminal result.
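A host consuming the streaming path looks roughly like this (the sink and event shapes below are illustrative stand-ins, not the real `AgentEvent`/`EventSink` types):

```python
class ListSink:
    """Minimal EventSink stand-in: collects low-level progress events."""
    def __init__(self):
        self.events = []

    def emit(self, event):
        self.events.append(event)

def fake_stream_turn(input_text, sink):
    """Stand-in for stream_turn: streams progress events, then returns the terminal result."""
    for ev in ({"type": "turn_started"}, {"type": "text_delta", "text": "ok"}):
        sink.emit(ev)                         # preview/progress only
    return {                                  # AssembledTurn-shaped terminal result
        "status": "completed",
        "assistant_output": {"safe_text": "ok"},
    }

sink = ListSink()
turn = fake_stream_turn("summarize this repo", sink)
# Final user-facing output comes from the assembled result, not from the deltas.
final_text = turn["assistant_output"]["safe_text"]
```

This mirrors the integration invariant below: streamed events are preview/progress, and the assembled result is the source of truth for terminal output.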
AssembledTurn is the canonical terminal result:
- `status`: `completed` | `interrupted` | `failed`
- `done_reason`: `model_stop` | `max_turns` | `user_abort` | `tool_failure` | `runtime_error`
- `execution.mode`: `repl` | `native_tools`
- `execution.had_tool_calls` and `execution.had_code_execution`: high-level execution summary
- `assistant_output.safe_text`: host-renderable output (always present, may be empty)
- `assistant_output.raw_text`: unsanitized terminal output (always present, may be empty)
- `assistant_output.state`: `usable` | `empty_output` | `traceback_only` | `sanitized` | `recovered_from_error`
- `tool_calls`, `code_outputs`, `errors`, `token_usage`, and updated `state`

High-level contract:

- `AssembledTurn` is stable across execution modes. `native-tools` is a subset of the `repl` result shape at this level.
- REPL-only detail is represented by `code_outputs`, which is empty for native tool-calling turns.
- The `batch` tool is an optimization at the execution layer only; batched work still folds back into the same high-level turn contract.
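For example, a headless frontend could dispatch on the assembled result like this (field names follow the list above; the dispatch function itself is hypothetical):

```python
def summarize_turn(turn: dict) -> str:
    """Map an AssembledTurn-shaped dict to a one-line host-side summary."""
    status = turn["status"]
    if status == "completed":
        text = turn["assistant_output"]["safe_text"]
        return text if text else "(empty output)"
    if status == "interrupted":
        return f"interrupted: {turn['done_reason']}"
    return f"failed: {turn['done_reason']}"

ok = summarize_turn({
    "status": "completed",
    "done_reason": "model_stop",
    "assistant_output": {"safe_text": "done", "state": "usable"},
})
```

Because the shape is stable across execution modes, the same dispatch works for `repl` and `native-tools` turns alike.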
Host/runtime policy knobs are configured via RuntimeConfig:
- `host_profile` (interactive/headless/embedded)
- `context_folding.soft_limit_pct` / `context_folding.hard_limit_pct`
- `base_dir` + optional custom `path_resolver`
- `sanitizer` and `termination` policies
The same folding policy is persisted in CLI config under:
```json
{
  "runtime": {
    "context_folding": {
      "soft_limit_pct": 50,
      "hard_limit_pct": 60
    }
  }
}
```

Integration invariants:

- Keep one `RuntimeEngine` instance per active session.
- Do not rebuild message history manually between turns while a runtime is alive.
- Treat streamed `AgentEvent`s as low-level preview/progress; render terminal user output from `AssembledTurn.assistant_output.safe_text`.
Tools define `inject_into_prompt: bool` (`lash/src/lib.rs`).

- `inject_into_prompt=true`: the tool appears directly in the LLM system prompt.
- `inject_into_prompt=false`: the tool is omitted from the prompt for brevity, but still callable at runtime.
- Tool identity is shared across execution modes, but tool `description` and `examples` are mode-scoped.
- Lash projects REPL-tagged metadata into the `repl` helper surface, and native-tools-tagged metadata into provider-native tool schemas and prompt docs.

Runtime discovery is available via:

- `list_tools(...)`
- `search_tools(query, mode="hybrid", ...)`
The repl executor exposes visible tools as global functions (for example read_file(...), search_tools(...)).
Use list_tools(...) and search_tools(...) to discover prompt-omitted tools at runtime.
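A hybrid mode like `search_tools(query, mode="hybrid")` typically blends keyword matching with fuzzier ranking. This toy scorer over name/description pairs is purely illustrative of why discovery works for prompt-omitted tools, not lash's actual implementation:

```python
def search_tools_toy(query, tools):
    """tools: {name: description}. Rank by query-term overlap in name + description."""
    terms = query.lower().split()
    scored = []
    for name, desc in tools.items():
        haystack = f"{name} {desc}".lower()
        score = sum(t in haystack for t in terms)   # crude keyword overlap
        if score:
            scored.append((score, name))
    # Highest score first; ties broken alphabetically for stable output.
    return [name for score, name in sorted(scored, key=lambda s: (-s[0], s[1]))]

hits = search_tools_toy("replace text file", {
    "find_replace": "replace text in a file",
    "shell": "run a shell command",
    "edit_file": "apply an edit to a file",
})
```

The point is that a tool omitted from the prompt (`inject_into_prompt=false`) is still discoverable by description at runtime.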
With the default capability profile, native-tools can call:
- Core read: `read_file`, `glob`, `grep`, `ls`, `list_tools`, `search_tools`
- Core write: `write_file`, `edit_file`, `find_replace`
- Shell: `shell`, `shell_wait`, `shell_read`, `shell_write`, `shell_kill`
- Tasks: `tasks`, `tasks_summary`, `get_task`, `create_task`, `update_task`, `delete_task`, `claim_task`
- Planning: `enter_plan_mode`, `exit_plan_mode`
- Delegation: `agent_call`, `agent_result`, `agent_output`, `agent_kill`
- History: `search_history`, `history_add_turn`, `history_export`, `history_load`
- Memory: `search_mem`, `mem_set_turn`, `mem_set`, `mem_get`, `mem_delete`, `mem_export`, `mem_load`
- Skills: `skills`, `load_skill`, `read_skill_file`, `search_skills`
- Web (when Tavily is configured): `search_web`, `fetch_url`
- Native-tools only: `batch`
This is the callable default surface, not the prompt-injected subset. Lash still injects only a smaller prompt-visible tool list and relies on list_tools(...) / search_tools(...) for discovery of omitted tools.
For delegation, low-tier sub-agents use this native-tools surface by default even when the parent session is in repl.
glob(...) and ls(...) return typed filesystem entries (PathEntry) rather than plain path/tree strings.
Each item includes:
- `path`
- `kind` (file/dir/symlink/other)
- `size_bytes`
- `lines` (`null` unless `with_lines=true`)
- `modified_at` (RFC3339 UTC)
Both tools support `limit` (`null` for uncapped) and expose truncation metadata on the returned object in REPL wrappers.
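The typed entries can be modeled roughly like this (a sketch of the fields listed above, not the actual `PathEntry` definition in lash), which shows why typed results beat parsing path strings:

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class PathEntry:
    path: str
    kind: str                     # "file" | "dir" | "symlink" | "other"
    size_bytes: int
    lines: Optional[int] = None   # populated only when with_lines=true
    modified_at: str = ""         # RFC3339 UTC timestamp

def only_files(entries, min_bytes=0):
    """Filter typed entries directly on kind and size, no string parsing needed."""
    return [e.path for e in entries if e.kind == "file" and e.size_bytes >= min_bytes]

entries = [
    PathEntry("src/main.rs", "file", 2048, modified_at="2031-01-01T00:00:00Z"),
    PathEntry("src", "dir", 0),
    PathEntry("README.md", "file", 100),
]
big = only_files(entries, min_bytes=1024)
```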
All rights reserved.