Browserbase

"A web browser for AI agents & applications."


Website	https://browserbase.com
Docs	https://docs.browserbase.com
GitHub	https://github.com/browserbase
MCP	https://www.browserbase.com/mcp
Classification	`agent-native`
Category	Browser & Web Execution Services
Compliance	SOC-2 Type I · HIPAA

Official Website

https://browserbase.com

Official Repo

https://github.com/browserbase/stagehand — Stagehand (open-source NL web automation SDK built on Browserbase)

https://github.com/browserbase/mcp-server-browserbase — Official MCP server

Agent Skills

Agent Skills are portable SKILL.md instruction sets that follow the AgentSkills open standard. They teach AI coding assistants (Claude Code, Cursor, Codex, Windsurf, etc.) how to use this service correctly.

Status: ✅ Official skills published at browserbase/skills

npx skills add browserbase/skills

Skill	What It Teaches the Agent
`browser`	Automate web interactions via CLI with remote Browserbase sessions, stealth mode, CAPTCHA solving, and residential proxy support
`functions`	Deploy serverless browser automation functions to Browserbase cloud

Compatibility: Claude Code, Cursor, Codex, Windsurf, and all AgentSkills-compatible tools.

Tip: After installation, set BROWSERBASE_API_KEY and BROWSERBASE_PROJECT_ID in your agent's environment settings.
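
As a minimal sketch of the tip above, an agent process can check both variables at startup and fail early if either is missing. The `BrowserbaseConfig` class and `load_config` helper are hypothetical names used for illustration; only the two environment variable names come from this page.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class BrowserbaseConfig:
    """Hypothetical container for the two credentials named in the tip."""
    api_key: str
    project_id: str


def load_config(env=None) -> BrowserbaseConfig:
    """Read both variables and raise if either is unset or empty."""
    env = os.environ if env is None else env
    required = ("BROWSERBASE_API_KEY", "BROWSERBASE_PROJECT_ID")
    missing = [name for name in required if not env.get(name)]
    if missing:
        raise RuntimeError("missing environment variables: " + ", ".join(missing))
    return BrowserbaseConfig(env["BROWSERBASE_API_KEY"], env["BROWSERBASE_PROJECT_ID"])
```

Calling `load_config()` with no argument reads from the real process environment. Passing a plain dict instead keeps the check easy to exercise in tests.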

MCP

Status: ✅ Available

Browserbase provides an official MCP server that allows LLMs to control remote browser sessions directly through the Model Context Protocol.

Detail	Value
MCP Page	https://www.browserbase.com/mcp
MCP Docs	https://docs.browserbase.com/integrations/mcp/introduction
MCP Repo	https://github.com/browserbase/mcp-server-browserbase
Transport	stdio / Streamable HTTP
Compatible Clients	Claude Desktop, Cursor, OpenAI Agents, any MCP-compatible client

What It Does

Browserbase provides managed, cloud-hosted headless browser infrastructure specifically for AI agents. Agents spin up isolated browser sessions via API, control them through Playwright, Puppeteer, or Selenium — or through the Stagehand SDK using natural-language web actions — and destroy them when done. The entire browser lifecycle is agent-controlled, with no human sitting at a keyboard.

Why It Is Agent-Native

Criterion	Evidence
Agent-first positioning	Homepage: "A web browser for AI agents & applications"; Stagehand SDK, MCP server, and agent integration guides are first-class products
Agent-specific primitive	Remote browser session with per-agent isolation; natural-language action via Stagehand (`"click the login button"`); session recording for agent trajectory audit
Autonomy-compatible control plane	Thousands of parallel sessions; stealth capabilities reduce the chance of bot detection; agents operate entirely without human intervention
M2M integration surface	REST API, Playwright/Puppeteer/Selenium drivers, Chrome DevTools Protocol (CDP), MCP server
Identity / delegation	Per-session proxy and fingerprint configuration; session IDs enable per-agent audit

Primary Primitives

Primitive	Description
Remote Browser Session	Isolated cloud browser instance created and destroyed via API
Stagehand Actions	Natural-language web commands (`act("click signup button")`, `extract(...)`)
Session Recording	Full replay of agent's browser actions for debugging and audit
Stealth Mode	Fingerprint randomization and proxy rotation to avoid bot detection
Parallel Sessions	Spin up thousands of simultaneous browser contexts in milliseconds
MCP Server	Direct LLM tool use via Model Context Protocol
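
The Parallel Sessions primitive suggests a fan-out pattern on the agent side. The following is a rough sketch of bounded concurrency, not Browserbase code: `run_in_session` is a hypothetical stub standing in for "create a session, drive the browser, release the session", and `max_parallel` is an assumed client-side limit.

```python
import asyncio


async def run_in_session(task: str) -> str:
    """Hypothetical stub for one task executed in its own remote session."""
    await asyncio.sleep(0)  # placeholder for real network and browser work
    return f"done:{task}"


async def run_all(tasks, max_parallel=10):
    """Run every task, allowing at most max_parallel to be in flight at once."""
    gate = asyncio.Semaphore(max_parallel)

    async def guarded(task):
        async with gate:
            return await run_in_session(task)

    # gather returns results in the same order as the input tasks
    return await asyncio.gather(*(guarded(t) for t in tasks))
```

A caller would invoke it as `asyncio.run(run_all(["a", "b", "c"], max_parallel=2))`. The semaphore keeps the client within whatever concurrency limit applies to the account, which this page does not specify.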

Autonomy Model

Agent calls POST /sessions → receives a remote browser session endpoint
Agent connects via Playwright/CDP or Stagehand
Agent issues actions (navigate, click, extract, screenshot) without human involvement
Session is automatically recorded for later inspection
Agent calls DELETE /sessions/{id} when done

There is no human browser window. The agent is the only actor in the session.
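
The first and last steps of the flow above can be sketched as plain HTTP requests. This is an illustration under stated assumptions, not the official client: the base URL, the `X-BB-API-Key` header name, and the `projectId` body field are assumptions to verify against the API reference, and the `DELETE` call simply mirrors the step as written on this page.

```python
import json
import urllib.request

# Assumptions for illustration only; confirm against the official API reference.
API_BASE = "https://api.browserbase.com/v1"
AUTH_HEADER = "X-BB-API-Key"


def create_session_request(api_key: str, project_id: str) -> urllib.request.Request:
    """Build the request for step 1: POST /sessions."""
    body = json.dumps({"projectId": project_id}).encode("utf-8")  # assumed field name
    return urllib.request.Request(
        f"{API_BASE}/sessions",
        data=body,
        method="POST",
        headers={AUTH_HEADER: api_key, "Content-Type": "application/json"},
    )


def delete_session_request(api_key: str, session_id: str) -> urllib.request.Request:
    """Build the request for the final step: DELETE /sessions/{id}, as listed above."""
    return urllib.request.Request(
        f"{API_BASE}/sessions/{session_id}",
        method="DELETE",
        headers={AUTH_HEADER: api_key},
    )
```

Both helpers only build the request. Passing the result to `urllib.request.urlopen` would send it. The middle steps (connecting and issuing actions) run through Playwright, CDP, or Stagehand against the endpoint returned by the create call.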

Identity and Delegation Model

Each session is isolated — no shared state between agent sessions
Proxy and fingerprint configuration is per-session, enabling agent-specific identity on the open web
Session IDs provide per-agent attribution in logs
API key authentication scopes access per project

Protocol Surface

Interface	Detail
REST API	Session lifecycle management (create, list, delete)
Playwright	Standard Playwright over CDP pointing to remote session
Puppeteer	Standard Puppeteer over CDP pointing to remote session
Selenium	Standard Selenium WebDriver pointing to remote session
Chrome DevTools Protocol	Low-level CDP access for custom tooling
Stagehand SDK	Natural-language action layer built on Browserbase
MCP Server	Model Context Protocol for direct LLM browser control
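
For the low-level CDP row, it may help to see what a command looks like on the wire. CDP commands are JSON objects carrying an `id`, a `method`, and `params`. The sketch below only builds that message; the `cdp_command` helper is a hypothetical name, and sending the message over the session's WebSocket connection is left out.

```python
import itertools
import json

_ids = itertools.count(1)  # each command gets a fresh id so replies can be matched


def cdp_command(method: str, **params) -> str:
    """Serialize one CDP command as a JSON string."""
    return json.dumps({"id": next(_ids), "method": method, "params": params})
```

For example, `cdp_command("Page.navigate", url="https://example.com")` produces a navigation command, and `cdp_command("Page.captureScreenshot")` requests a screenshot. Playwright and Puppeteer generate equivalent messages internally, so most agents never need to write them by hand.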

Human-in-the-Loop Support

None required by default. Session recordings can be reviewed by humans post-hoc for debugging. HumanLayer can be composed on top for approval gates on specific high-risk actions (e.g., before the agent submits a form with financial implications).
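
An approval gate of the kind described above could look roughly like the following. This is a generic sketch and does not use HumanLayer's actual API: the `HIGH_RISK` set, the action names, and the `approve` callback are all hypothetical.

```python
from typing import Callable

# Hypothetical action names that should pause for a human decision.
HIGH_RISK = {"submit_payment_form"}


def gated(action: str, run: Callable[[], str], approve: Callable[[str], bool]) -> str:
    """Run low-risk actions directly; ask the approver before high-risk ones."""
    if action in HIGH_RISK and not approve(action):
        return "rejected"
    return run()
```

In practice `approve` would block on a human response (chat message, email, dashboard click), while `run` would perform the browser action in the remote session. Everything outside `HIGH_RISK` keeps the fully autonomous default.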

Why Generic Alternatives Do Not Qualify

Alternative	Why It Fails
Local Playwright/Puppeteer	Runs on developer's machine; no cloud scaling, no per-agent session isolation, no stealth, no agent-optimized SDK
BrowserStack	Built for human QA engineers running test suites; not designed for agent-initiated autonomous navigation at scale
Selenium Grid	Human test infrastructure; no natural-language action interface, no agent-oriented session management
AWS Lambda + Chromium	DIY infra with no agent-specific primitives; developer builds all session management, stealth, and recording from scratch

Use Cases

Web automation — agents log in to third-party services, extract data, submit forms, take screenshots
Research agents — agents navigate multi-step research workflows across multiple sites
Testing agents — agents validate UI behavior as part of an agentic CI pipeline
E-commerce agents — agents monitor prices, fill carts, complete checkouts on behalf of users
Data extraction — agents navigate dynamic, JavaScript-rendered pages and extract structured information
