AG

Agent Browser

15,000RustBrowser Automation

Fast native Rust CLI for browser automation via CDP with accessibility-tree snapshots.

RustBrowser AutomationCDPElectronClaude CodeCursor

Overview

Agent Browser is a fast, native Rust CLI for browser automation designed specifically for AI agents. It provides Chrome/Chromium control via the Chrome DevTools Protocol with accessibility-tree snapshots and compact element references, enabling reliable web automation without Playwright or Puppeteer dependencies. Works with Claude Code, Cursor, Codex, Windsurf, and Gemini CLI.

Features

  • Native Rust CLI with sub-second startup
  • Chrome DevTools Protocol (CDP) connection
  • Accessibility-tree snapshots
  • Compact @eN element references
  • Session persistence and authentication vault
  • Built-in video recording
  • Electron app automation support
  • Slack workspace automation skill
  • Vercel Sandbox and AWS Bedrock integrations
  • Observability dashboard on port 4848

Installation

npm i -g agent-browser && agent-browser install

Pros

  • +Fast native Rust CLI with sub-second startup
  • +No Playwright/Puppeteer dependency — smaller footprint
  • +Accessibility-tree snapshots for semantic element understanding
  • +Works with any AI agent (Claude Code, Cursor, Codex, Windsurf)
  • +Specialized skills for Electron apps, Slack, and cloud browsers
  • +Built-in video recording and observability dashboard

Cons

  • Requires Chrome/Chromium installation
  • Relatively new (as of July 2026), ecosystem still growing
  • Documentation is CLI-driven, may feel less discoverable
  • Windows support may require additional setup

Alternatives

Documentation

Agent Browser

Overview

Agent Browser is a fast, native Rust CLI for browser automation designed specifically for AI agents. It provides Chrome/Chromium control via the Chrome DevTools Protocol (CDP) with accessibility-tree snapshots and compact @eN element references, enabling reliable interaction with web pages without Playwright or Puppeteer dependencies.

Released in mid-2026, Agent Browser quickly gained traction as the go-to browser automation layer for coding agents like Claude Code, Cursor, Codex, Windsurf, and Gemini CLI. By the end of June 2026, it had achieved 5,000+ weekly active users and was integrated into the Claude Code skills ecosystem as a first-class plugin.

Features

  • Native Rust CLI: Sub-second startup, no Node.js runtime overhead
  • Chrome DevTools Protocol: Direct CDP connection for fast, reliable automation
  • Accessibility-tree snapshots: Semantic DOM understanding via browser accessibility tree
  • Compact element refs: @eN notation for precise element targeting
  • No Playwright/Puppeteer: Smaller footprint, faster execution
  • Session management: Persistent sessions, authentication vault, state persistence
  • Video recording: Built-in recording for debugging and audit trails
  • Multi-agent compatible: Works with Claude Code, Cursor, Codex, Continue, Windsurf, Gemini CLI
  • Electron app support: Specialized skills for automating desktop apps (VS Code, Slack, Discord, Figma)
  • Cloud browser support: Integrations with Vercel Sandbox and AWS Bedrock AgentCore

Installation

npm i -g agent-browser && agent-browser install

Requires Chrome or Chromium installed on the system.

Quick Start

# Start an interactive session
agent-browser

# Launch a headless session
agent-browser --headless

# Load a specialized skill
agent-browser skills get core
agent-browser skills get electron  # for desktop apps
agent-browser skills get slack     # for Slack automation

Core Concepts

Element References (@eN)

Agent Browser uses compact @eN references derived from the accessibility tree:

agent-browser click @e3       # click element with ref @e3
agent-browser type @e5 hello  # type text into element @e5

Accessibility-tree Snapshots

Instead of parsing raw DOM, Agent Browser queries the browser's accessibility tree, which is already optimized for screen readers and contains semantic meaning (button, link, heading, etc.).

Session Management

agent-browser session list      # list active sessions
agent-browser session save foo  # save current state as "foo"
agent-browser session load foo  # restore session

Advanced Features

Specialized Skills

Agent Browser ships with domain-specific automation skills:

SkillUse Case
electronAutomating VS Code, Slack, Discord, Figma
slackWorkspace automation (search, send, reply)
dogfoodExploratory testing, QA, bug hunts
vercel-sandboxRunning inside Vercel microVMs
agentcoreAWS Bedrock cloud browsers

Observability Dashboard

Agent Browser includes an observability dashboard running on port 4848, providing:

  • Real-time session status
  • Traffic streams
  • Video playback
  • Session history

Accessible at https://dashboard.agent-browser.localhost.

Video Recording

Built-in video recording captures browser sessions for debugging, auditing, and training data:

agent-browser --record recordings/session.mp4

Examples

Web Scraping

agent-browser "Open https://example.com, take a screenshot, extract all headings"

Form Automation

agent-browser "Go to login page, fill email and password fields, submit form"

Desktop App Automation (Electron)

agent-browser skills get electron
agent-browser "Open VS Code, open file settings.json, add new setting, save"

Slack Automation

agent-browser skills get slack
agent-browser "Search Slack for messages containing 'deployment', summarize results"

Pros

  • ✅ Fast native Rust CLI with sub-second startup
  • ✅ No Playwright/Puppeteer dependency — smaller footprint
  • ✅ Accessibility-tree snapshots for semantic element understanding
  • ✅ Works with any AI agent (Claude Code, Cursor, Codex, Windsurf, etc.)
  • ✅ Specialized skills for domain-specific automation (Electron, Slack, etc.)
  • ✅ Built-in video recording and observability dashboard
  • ✅ Session persistence and authentication vault

Cons

  • ❌ Requires Chrome/Chromium installation
  • ❌ Relatively new (as of July 2026), ecosystem still growing
  • ❌ Documentation is CLI-driven (skills get core), may feel less discoverable
  • ❌ Windows support may require additional setup for some features

When to Use

  • Web automation for AI agents: When you need an agent to interact with web pages reliably
  • Testing and QA: Automating exploratory testing, bug reproduction, or UI regression checks
  • Desktop app automation: Controlling Electron apps like VS Code, Slack, or Figma
  • Cloud browser scenarios: Running browser automation in Vercel Sandbox or AWS Bedrock
  • Form automation: Filling out forms, logging in, navigating multi-step flows

Integration with Claude Code

Agent Browser is available as a first-class skill in Claude Code. When triggered by user requests involving browser interaction, Claude Code automatically loads the Agent Browser skill and executes commands through the CLI.

# Claude Code automatically handles:
# - Launching browser sessions
# - Taking accessibility-tree snapshots
# - Interacting with elements via @eN refs
# - Recording sessions for debugging

Resources


Published: July 3, 2026