@cloudbrowser/mcp-server
v0.1.3
Published
MCP server for CloudBrowser.ai - Control remote cloud browsers with AI capabilities
Maintainers
Readme
CloudBrowser MCP Server
An MCP server implementation that provides tools for controlling remote cloud browsers through CloudBrowser.ai with AI-powered automation capabilities.
Features
- Remote Browser Management: Open, close, and control cloud browsers
- Direct Browser Control: Puppeteer-based automation via Chrome DevTools Protocol
- Session Management: Save and restore browser sessions with cookies/localStorage
- Remote Desktop Access: Integration for visual browser control
- AI-Powered Automation: Intelligent page content analysis and element interaction
- Dual Transport: Support for both stdio and HTTP communication
Configuration
Prerequisites
You need a CloudBrowser.ai API token. Get one from CloudBrowser.ai.
Usage with OpenAI Agents, n8n, and Make (HTTP method)
For OpenAI Agents, n8n, and Make integrations, use the HTTP endpoint:
- Endpoint:
https://mcp.cloudbrowser.ai - Authorization Header:
Bearer your_api_token_here
Usage with Claude Desktop (STDIO method)
Add this to your claude_desktop_config.json:
{
"mcpServers": {
"cloudbrowser": {
"command": "npx",
"args": ["@cloudbrowser/mcp-server"],
"env": {
"CLOUDBROWSER_API_TOKEN": "your_api_token_here"
}
}
}
}MCP Tools list
Browser Management
open_browser- Opens a new remote browser instance in the cloudclose_browser- Closes a specific browser instanceget_browsers- Lists all active browser instancesstart_remote_desktop- Starts VNC access to a browserstop_remote_desktop- Stops VNC access to a browserget_saved_sessions- Lists saved browser sessionsremove_saved_session- Removes a saved session
Browser Automation
connect_to_browser- Establishes direct control connection to a browsernavigate_to_url- Navigates to a specific URLget_page_content- Extracts comprehensive page content and structureclick_element- Clicks on page elements using CSS selectorstype_text- Types text into form fields and inputsget_elements- Gets information about elements matching a selectortake_screenshot- Captures page screenshotsevaluate_script- Executes JavaScript code on the pagedetect_captcha- Detects captcha widgets in current pagesolve_captcha- Solves captcha via CloudBrowserPublicApi/solveand injects token
License
MIT
Captcha Resolution Integration (CloudBrowserPublicApi)
This MCP resolves captchas through CloudBrowserPublicApi (/solve). The MCP does not call the captcha solver service directly.
Configure these env vars in the MCP deployment:
env:
- name: CLOUDBROWSER_PUBLICAPI_BASE_URL
value: "http://cloudbrowserpublicapi.browsers.svc.cluster.local"
- name: CLOUDBROWSER_PUBLICAPI_API_KEY
valueFrom:
secretKeyRef:
name: cloudbrowserpublicapi-secret
key: api-key
- name: CLOUDBROWSER_PUBLICAPI_SOLVE_PATH
value: "/solve"
- name: CLOUDBROWSER_PUBLICAPI_TIMEOUT_MS
value: "120000"
- name: CAPTCHA_AUTO_SOLVE
value: "true"Production interaction flow
open_browserconnect_to_browsernavigate_to_urlwithautoSolveCaptcha: true(or calldetect_captchaandsolve_captchamanually)- Continue with normal actions (
click_element,type_text, etc.)
Example tools/call for automatic solve:
{
"jsonrpc": "2.0",
"id": 7,
"method": "tools/call",
"params": {
"name": "navigate_to_url",
"arguments": {
"sessionId": "my-session",
"url": "https://example.com/login",
"autoSolveCaptcha": true,
"detectCaptcha": true
}
}
}