@nxavis/agent-browser-mcp
v0.1.1
MCP server integrating with Vercel's agent-browser for AI-driven browser automation
A Model Context Protocol (MCP) server that provides browser automation capabilities through Vercel's agent-browser. It lets LLMs interact with web pages through a fast Rust CLI with a Node.js fallback.
Quick Start
# 1. Install agent-browser CLI
npm install -g agent-browser && agent-browser install
# 2. Add to Claude Desktop (or your MCP client)
npx @nxavis/agent-browser-mcp
Then use tools like browser_navigate, browser_click, and browser_snapshot to control the browser from your AI agent.
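As a rough illustration of what an MCP client sends when an agent uses these tools, the sketch below builds JSON-RPC 2.0 `tools/call` requests for the three tools named above. The `tools/call` envelope follows the MCP specification; the argument names (`url`, `selector`), the example URL and selector, and the `buildToolCall` helper are assumptions made for illustration and are not taken from this package's tool schemas.

```typescript
// Shape of an MCP `tools/call` request (JSON-RPC 2.0).
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: string; arguments: Record<string, unknown> };
}

// Hypothetical helper: wrap a tool name and its arguments in a `tools/call` request.
function buildToolCall(
  id: number,
  name: string,
  args: Record<string, unknown> = {},
): ToolCallRequest {
  return { jsonrpc: "2.0", id, method: "tools/call", params: { name, arguments: args } };
}

// Assumed argument names (`url`, `selector`) and a hypothetical selector value;
// check the server's tool schemas before relying on them.
const quickStartCalls: ToolCallRequest[] = [
  buildToolCall(1, "browser_navigate", { url: "https://example.com" }),
  buildToolCall(2, "browser_snapshot"),
  buildToolCall(3, "browser_click", { selector: "#more-info" }),
];

// Print each request as the single-line JSON a client would put on the wire.
for (const call of quickStartCalls) {
  console.log(JSON.stringify(call));
}
```

In practice an MCP client such as Claude Desktop constructs these messages itself; the sketch only shows the shape of the traffic.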
Features
- AI-Optimized Browser Control - Semantic element locators using accessibility properties, text matching, and data attributes
- Session Isolation - Multiple isolated browser sessions with separate cookies, storage, and navigation history
- Comprehensive Automation - Navigation, form filling, clicking, scrolling, keyboard input, and more
- Data Extraction - Get text, HTML, attributes, accessibility snapshots, screenshots, and PDFs
- Cookie Management - Full control over browser cookies and storage
- JavaScript Execution - Run arbitrary scripts in the browser context
- Network Inspection - Monitor console messages and network requests
Installation
npm install @nxavis/agent-browser-mcp
Or run directly with npx:
npx @nxavis/agent-browser-mcp
Prerequisites
- Node.js 18 or newer
- agent-browser CLI installed:
# Install agent-browser globally
npm install -g agent-browser
# Download Chromium browser
agent-browser install
# On Linux, install system dependencies if needed:
# agent-browser install --with-deps
⚠️ Windows Note: agent-browser currently has known issues on Windows with native shells (PowerShell/CMD). For Windows users, we recommend using WSL (Windows Subsystem for Linux) until the upstream issue is resolved.
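The Node.js 18 requirement listed in the prerequisites can be checked up front, for example in a setup script. The following is a minimal sketch, not part of this package; `meetsNodeRequirement` is a hypothetical helper that parses a version string of the form Node.js reports.

```typescript
// Hypothetical helper: true when a Node.js version string such as "v18.19.0"
// has a major version of at least `minMajor` (18 by default, per the prerequisites).
function meetsNodeRequirement(version: string, minMajor = 18): boolean {
  const match = /^v?(\d+)\./.exec(version);
  if (match === null) return false; // unrecognized format: treat as unsupported
  return Number(match[1]) >= minMajor;
}

console.log(meetsNodeRequirement("v18.19.0")); // prints true
console.log(meetsNodeRequirement("v16.20.2")); // prints false
```

Inside a Node.js script, the value to pass would be `process.version`.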
Configuration
Claude Desktop
Add to your claude_desktop_config.json:
macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%\Claude\claude_desktop_config.json
{
  "mcpServers": {
    "agent-browser": {
      "command": "npx",
      "args": ["@nxavis/agent-browser-mcp"]
    }
  }
}
VS Code
Add to your VS Code settings (JSON):
{
  "mcp": {
    "servers": {
      "agent-browser": {
        "command": "npx",
        "args": ["@nxavis/agent-browser-mcp"]
      }
    }
  }
}
Antigravity
Add to your .mcp.json file in your project root or home directory:
{
  "mcpServers": {
    "agent-browser": {
      "command": "npx",
      "args": ["@nxavis/agent-browser-mcp"]
    }
  }
}
Or use the global configuration at ~/.mcp.json:
{
  "mcpServers": {
    "agent-browser": {
      "command": "npx",
      "args": ["@nxavis/agent-browser-mcp"],
      "env": {
        "AGENT_BROWSER_PATH": "/usr/local/bin/agent-browser"
      }
    }
  }
}
Custom agent-browser Path
If agent-browser is not in your PATH, specify its location:
{
  "mcpServers": {
    "agent-browser": {
      "command": "npx",
      "args": ["@nxavis/agent-browser-mcp"],
      "env": {
        "AGENT_BROWSER_PATH": "/path/to/agent-browser"
      }
    }
  }
}
Available Tools
Navigation
- browser_navigate - Navigate to a URL
- browser_go_back - Navigate back in browser history
- browser_go_forward - Navigate forward in browser history
- browser_reload - Reload the current page
Interaction
- browser_click - Click on an element
- browser_fill - Fill a text input field
- browser_type - Type text character by character
- browser_hover - Hover over an element
- browser_scroll - Scroll the page or a specific element
- browser_select - Select an option from a dropdown
- browser_check - Check a checkbox or radio button
- browser_uncheck - Uncheck a checkbox
- browser_press - Press a keyboard key
Data Extraction
- browser_get_text - Get text content from an element or page
- browser_get_html - Get HTML content
- browser_get_attribute - Get an attribute value
- browser_get_url - Get the current page URL
- browser_get_title - Get the current page title
- browser_snapshot - Get accessibility tree snapshot
Session Management
- browser_new_session - Create a new isolated browser session
- browser_close_session - Close a browser session
Screenshots & PDF
- browser_screenshot - Take a screenshot
- browser_pdf - Generate a PDF of the current page
For a complete list of tools and their parameters, see the original documentation.
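Because tool parameters are not listed here, one way to discover them is the standard MCP `tools/list` request, which returns each tool's name and JSON Schema for its inputs. The sketch below shows that request and a hypothetical helper, `parameterNames`, that summarizes the declared parameters. The `sampleTools` data is invented for illustration; the schemas this server actually returns may differ.

```typescript
// Minimal slice of an MCP `tools/list` result entry; only the fields used here.
interface ToolDescriptor {
  name: string;
  description?: string;
  inputSchema: { type: "object"; properties?: Record<string, unknown>; required?: string[] };
}

// JSON-RPC 2.0 request asking the server to enumerate its tools.
const listToolsRequest = { jsonrpc: "2.0", id: 1, method: "tools/list" } as const;

// Hypothetical helper: map each tool name to the parameter names in its input schema.
function parameterNames(tools: ToolDescriptor[]): Record<string, string[]> {
  const byTool: Record<string, string[]> = {};
  for (const tool of tools) {
    byTool[tool.name] = Object.keys(tool.inputSchema.properties ?? {});
  }
  return byTool;
}

// Invented response fragment for illustration only; not this server's real schemas.
const sampleTools: ToolDescriptor[] = [
  {
    name: "browser_navigate",
    inputSchema: { type: "object", properties: { url: { type: "string" } }, required: ["url"] },
  },
  { name: "browser_snapshot", inputSchema: { type: "object" } },
];

console.log(JSON.stringify(listToolsRequest));
console.log(parameterNames(sampleTools));
```

Reading the schema at runtime avoids hard-coding parameter names that may change between versions.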
Development
# Clone the repository
git clone https://github.com/nxavis/agent-browser-mcp.git
cd agent-browser-mcp
# Install dependencies
npm install
# Build
npm run build
# Run tests
npm test
# Watch mode
npm run dev
License
MIT
Credits
This package is based on agent-browser-mcp by minhlucvan, adapted for the @nxavis ecosystem.
