hanzi-in-chrome
v2.1.4
Published
Give your AI agent a real browser — click, type, fill forms, test workflows, post content, and read authenticated pages
Maintainers
Readme
Hanzi in Chrome — MCP Server
The MCP server exposes browser tools to MCP clients and forwards browser work to the Chrome extension over the local WebSocket relay.
Setup
cd mcp-server
npm install
npm run buildAdd to your MCP config (e.g., ~/.claude/claude_desktop_config.json):
{
"mcpServers": {
"browser": {
"command": "node",
"args": ["/path/to/hanzi-in-chrome/mcp-server/dist/index.js"]
}
}
}Prerequisites: The Chrome extension must be installed and running. See the main README for full setup.
How It Works
MCP client
-> mcp-server (stdio)
-> relay (WebSocket)
-> Chrome extension
-> browser agentThe extension is the browser executor. The MCP server should only manage MCP tool calls, local session bookkeeping, and blocking waits for completion.
Tools
browser_start
Start a browser task. Blocks until complete or timeout.
browser_start(
task: "Search for flights to Tokyo on Google Flights",
url: "https://flights.google.com", // optional starting URL
context: "Departing March 15, economy" // optional extra info
)
→ {
"session_id": "abc123",
"status": "complete",
"task": "Search for flights to Tokyo...",
"answer": "Found 3 flights: JAL $850, ANA $920, United $780",
"total_steps": 8,
"recent_steps": ["Opened Google Flights", "Set destination to Tokyo", ...]
}browser_message
Send follow-up instructions to an existing session. Also blocks until the agent finishes.
browser_message(session_id: "abc123", message: "Book the cheapest one")browser_status
Check known sessions and their latest status.
browser_status() // all active sessions
browser_status(session_id: "abc123") // specific sessionbrowser_stop
Stop a task.
browser_stop(session_id: "abc123")
browser_stop(session_id: "abc123", remove: true) // also delete sessionbrowser_screenshot
Capture the current browser state as an image.
browser_screenshot(session_id: "abc123")Examples
Research:
browser_start("Find the top 3 competitors for Acme Corp and summarize their pricing")Logged-in workflows:
browser_start("Go to Jira, find my open tickets, and summarize what needs attention this week")Multi-turn:
s = browser_start("Go to LinkedIn and find AI Engineer jobs in Montreal")
→ { session_id: "x1", answer: "Found: Applied AI Engineer at Cohere" }
browser_message("x1", "Click into that job and tell me the requirements")
→ { answer: "Requirements: 3+ years Python, ML experience..." }
browser_message("x1", "Apply to this job using my profile")
→ { answer: "Application submitted successfully" }Parallel execution:
browser_start("Check flight prices to Tokyo")
browser_start("Check hotel prices in Shibuya")
browser_start("Look up train pass costs")
// All three run simultaneouslyConfiguration
| Environment Variable | Default | Description |
|---|---|---|
| HANZI_IN_CHROME_MAX_SESSIONS | 5 | Max concurrent browser tasks |
| WS_RELAY_PORT | 7862 | WebSocket relay port |
Architecture
AI Tool (Claude Code, Cursor, etc.)
↓ MCP Protocol (stdio)
MCP Server
↓ WebSocket
Relay Server
↓ WebSocket
Chrome Extension
↓ Extension agent loop
Target WebsiteThe relay server starts automatically when the MCP server connects. It routes messages between the MCP server and the Chrome extension and briefly queues messages while the extension service worker is asleep.
Principle: Hanzi is for real browser work in your signed-in Chrome. Agents should prefer code, logs, APIs, and existing tools first. Use Hanzi when the job needs a real browser session.
Prompts
The server exposes MCP prompts that clients auto-discover as slash commands:
| Prompt | Description |
|--------|-------------|
| linkedin-prospector | Goal-driven LinkedIn outreach — networking, sales, partnerships, or hiring |
| e2e-tester | Test your app in a real browser — reports bugs with screenshots and code references |
| social-poster | Post across LinkedIn, Twitter, Reddit, HN — drafts per-platform, posts from your browser |
In Claude Code, use the built-in linkedin-prospector prompt from the MCP prompt list.
Skills CLI
hanzi-browser skills # list available skills
hanzi-browser skills install linkedin-prospector # install SKILL.md to your projectSkills are portable SKILL.md files for agents that don't support MCP prompts (Cline, Codex). Each skill follows the same principle: use existing tools first, Hanzi only for real browser steps.
