Agent Browser

⭐ 15,000•Rust•Browser Automation

Fast native Rust CLI for browser automation via CDP with accessibility-tree snapshots.

RustBrowser AutomationCDPElectronClaude CodeCursor

Overview

Agent Browser is a fast, native Rust CLI for browser automation designed specifically for AI agents. It provides Chrome/Chromium control via the Chrome DevTools Protocol with accessibility-tree snapshots and compact element references, enabling reliable web automation without Playwright or Puppeteer dependencies. Works with Claude Code, Cursor, Codex, Windsurf, and Gemini CLI.

Features

✓Native Rust CLI with sub-second startup
✓Chrome DevTools Protocol (CDP) connection
✓Accessibility-tree snapshots
✓Compact @eN element references
✓Session persistence and authentication vault
✓Built-in video recording
✓Electron app automation support
✓Slack workspace automation skill
✓Vercel Sandbox and AWS Bedrock integrations
✓Observability dashboard on port 4848

Installation

npm i -g agent-browser && agent-browser install

Pros

+Fast native Rust CLI with sub-second startup
+No Playwright/Puppeteer dependency — smaller footprint
+Accessibility-tree snapshots for semantic element understanding
+Works with any AI agent (Claude Code, Cursor, Codex, Windsurf)
+Specialized skills for Electron apps, Slack, and cloud browsers
+Built-in video recording and observability dashboard

Cons

−Requires Chrome/Chromium installation
−Relatively new (as of July 2026), ecosystem still growing
−Documentation is CLI-driven, may feel less discoverable
−Windows support may require additional setup

Alternatives

Playwright →Puppeteer →Anthropic Computer Use →UI-TARS →

Documentation

Agent Browser

Overview

Agent Browser is a fast, native Rust CLI for browser automation designed specifically for AI agents. It provides Chrome/Chromium control via the Chrome DevTools Protocol (CDP) with accessibility-tree snapshots and compact @eN element references, enabling reliable interaction with web pages without Playwright or Puppeteer dependencies.

Released in mid-2026, Agent Browser quickly gained traction as the go-to browser automation layer for coding agents like Claude Code, Cursor, Codex, Windsurf, and Gemini CLI. By the end of June 2026, it had achieved 5,000+ weekly active users and was integrated into the Claude Code skills ecosystem as a first-class plugin.

Features

Native Rust CLI: Sub-second startup, no Node.js runtime overhead
Chrome DevTools Protocol: Direct CDP connection for fast, reliable automation
Accessibility-tree snapshots: Semantic DOM understanding via browser accessibility tree
Compact element refs: @eN notation for precise element targeting
No Playwright/Puppeteer: Smaller footprint, faster execution
Session management: Persistent sessions, authentication vault, state persistence
Video recording: Built-in recording for debugging and audit trails
Multi-agent compatible: Works with Claude Code, Cursor, Codex, Continue, Windsurf, Gemini CLI
Electron app support: Specialized skills for automating desktop apps (VS Code, Slack, Discord, Figma)
Cloud browser support: Integrations with Vercel Sandbox and AWS Bedrock AgentCore

Installation

npm i -g agent-browser && agent-browser install

Requires Chrome or Chromium installed on the system.

Quick Start

# Start an interactive session
agent-browser

# Launch a headless session
agent-browser --headless

# Load a specialized skill
agent-browser skills get core
agent-browser skills get electron  # for desktop apps
agent-browser skills get slack     # for Slack automation

Core Concepts

Element References (`@eN`)

Agent Browser uses compact @eN references derived from the accessibility tree:

agent-browser click @e3       # click element with ref @e3
agent-browser type @e5 hello  # type text into element @e5

Accessibility-tree Snapshots

Instead of parsing raw DOM, Agent Browser queries the browser's accessibility tree, which is already optimized for screen readers and contains semantic meaning (button, link, heading, etc.).

Session Management

agent-browser session list      # list active sessions
agent-browser session save foo  # save current state as "foo"
agent-browser session load foo  # restore session

Advanced Features

Specialized Skills

Agent Browser ships with domain-specific automation skills:

Skill	Use Case
`electron`	Automating VS Code, Slack, Discord, Figma
`slack`	Workspace automation (search, send, reply)
`dogfood`	Exploratory testing, QA, bug hunts
`vercel-sandbox`	Running inside Vercel microVMs
`agentcore`	AWS Bedrock cloud browsers

Observability Dashboard

Agent Browser includes an observability dashboard running on port 4848, providing:

Real-time session status
Traffic streams
Video playback
Session history

Accessible at https://dashboard.agent-browser.localhost.

Video Recording

Built-in video recording captures browser sessions for debugging, auditing, and training data:

agent-browser --record recordings/session.mp4

Examples

Web Scraping

agent-browser "Open https://example.com, take a screenshot, extract all headings"

Form Automation

agent-browser "Go to login page, fill email and password fields, submit form"

Desktop App Automation (Electron)

agent-browser skills get electron
agent-browser "Open VS Code, open file settings.json, add new setting, save"

Slack Automation

agent-browser skills get slack
agent-browser "Search Slack for messages containing 'deployment', summarize results"

Pros

✅ Fast native Rust CLI with sub-second startup
✅ No Playwright/Puppeteer dependency — smaller footprint
✅ Accessibility-tree snapshots for semantic element understanding
✅ Works with any AI agent (Claude Code, Cursor, Codex, Windsurf, etc.)
✅ Specialized skills for domain-specific automation (Electron, Slack, etc.)
✅ Built-in video recording and observability dashboard
✅ Session persistence and authentication vault

Cons

❌ Requires Chrome/Chromium installation
❌ Relatively new (as of July 2026), ecosystem still growing
❌ Documentation is CLI-driven (skills get core), may feel less discoverable
❌ Windows support may require additional setup for some features

When to Use

Web automation for AI agents: When you need an agent to interact with web pages reliably
Testing and QA: Automating exploratory testing, bug reproduction, or UI regression checks
Desktop app automation: Controlling Electron apps like VS Code, Slack, or Figma
Cloud browser scenarios: Running browser automation in Vercel Sandbox or AWS Bedrock
Form automation: Filling out forms, logging in, navigating multi-step flows

Integration with Claude Code

Agent Browser is available as a first-class skill in Claude Code. When triggered by user requests involving browser interaction, Claude Code automatically loads the Agent Browser skill and executes commands through the CLI.

# Claude Code automatically handles:
# - Launching browser sessions
# - Taking accessibility-tree snapshots
# - Interacting with elements via @eN refs
# - Recording sessions for debugging

Resources

Published: July 3, 2026

Agent Browser

Overview

Features

Installation

Pros

Cons

Alternatives

Documentation

Agent Browser

Overview

Features

Installation

Quick Start

Core Concepts

Element References (@eN)

Accessibility-tree Snapshots

Session Management

Advanced Features

Specialized Skills

Observability Dashboard

Video Recording

Examples

Web Scraping

Form Automation

Desktop App Automation (Electron)

Slack Automation

Pros

Cons

When to Use

Integration with Claude Code

Resources

Element References (`@eN`)