alterlab-mcp-server
v1.0.0
MCP server for AlterLab web scraping API — scrape, extract, screenshot any webpage
One-Line Install
Claude Code
claude mcp add alterlab -- npx -y alterlab-mcp-server

Then set your API key: export ALTERLAB_API_KEY=sk_live_... or add it to .claude.json (see full setup below).
Cursor
# Add to .cursor/mcp.json (see full config below)
Smithery
npx -y @smithery/cli install alterlab-mcp-server --client claude
Why AlterLab Instead of WebFetch or Browser MCP?
Claude's built-in WebFetch tool and open-source browser MCP servers fail on most real-world websites. They cannot bypass Cloudflare, render JavaScript SPAs, or extract structured data.
AlterLab replaces broken fetch tools with one MCP server that actually works:
| Capability | WebFetch / fetch() | Browser MCP | AlterLab MCP |
|---|---|---|---|
| Anti-bot bypass (Cloudflare, DataDome, Akamai) | No | Partial | Yes (automatic) |
| JavaScript rendering (React, Angular, Vue SPAs) | No | Yes (slow) | Yes (headless Chromium) |
| Structured data extraction (JSON, Schema.org) | No | No | Yes (built-in profiles) |
| Smart tier escalation (cheapest method first) | N/A | N/A | Yes (saves 60-80%) |
| Residential proxy rotation (195+ countries) | No | No | Yes |
| Screenshot and PDF capture | No | Screenshot only | Yes (both) |
| OCR text extraction from images | No | No | Yes |
| Cost per request | Free (but fails) | Free (but slow) | From $0.0002 |
How Does AlterLab Web Scraping Work?
AlterLab uses a multi-tier scraping architecture. It automatically selects the cheapest method capable of fetching each URL:
1. Curl ($0.0002/req): direct HTTP for static pages, RSS feeds, public APIs
2. HTTP ($0.0003/req): TLS fingerprint rotation for moderately protected sites
3. Stealth ($0.0005/req): browser impersonation for Cloudflare/DataDome-protected sites
4. Light JS ($0.0007/req): lightweight JS extraction from server-rendered HTML
5. Browser ($0.001/req): full headless Chromium for JavaScript-heavy SPAs
Auto mode starts at Tier 1 and escalates only when blocked. Most websites resolve at Tiers 1-2, so $1 gets you 1,000 to 5,000 scrapes depending on the sites you target.
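The escalation described above can be pictured as a loop over the tiers, cheapest first. The TypeScript below is only a sketch of that idea, not AlterLab's implementation: the `attempt` callback, the tier identifiers, and the assumption that only the successful tier is billed are hypothetical. The prices are the ones listed above.

```typescript
// Tier prices come from the list above; the identifiers are illustrative.
type Tier = { name: string; costUsd: number };

const TIERS: Tier[] = [
  { name: "curl", costUsd: 0.0002 },
  { name: "http", costUsd: 0.0003 },
  { name: "stealth", costUsd: 0.0005 },
  { name: "light_js", costUsd: 0.0007 },
  { name: "browser", costUsd: 0.001 },
];

// Hypothetical callback: returns the page body, or null when the tier was blocked.
type Attempt = (tier: Tier, url: string) => string | null;

type ScrapeOutcome = { tier: string; costUsd: number; body: string };

// Try each tier in order and stop at the first one that gets through.
function scrapeWithEscalation(url: string, attempt: Attempt): ScrapeOutcome | null {
  for (const tier of TIERS) {
    const body = attempt(tier, url);
    if (body !== null) {
      // Assumption: only the tier that succeeded is charged.
      return { tier: tier.name, costUsd: tier.costUsd, body };
    }
  }
  return null; // every tier was blocked
}
```

In this sketch a site that only yields at the Stealth tier costs $0.0005, while a static page resolves at the Curl tier for $0.0002.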
Installation
Install in Claude Desktop / Claude Code
Add to your Claude config file (~/.claude.json for Claude Code, or Settings for Claude Desktop):
{
  "mcpServers": {
    "alterlab": {
      "command": "npx",
      "args": ["-y", "alterlab-mcp-server"],
      "env": {
        "ALTERLAB_API_KEY": "sk_live_your_key_here"
      }
    }
  }
}

Install in Cursor
Add to .cursor/mcp.json in your project root:
{
  "mcpServers": {
    "alterlab": {
      "command": "npx",
      "args": ["-y", "alterlab-mcp-server"],
      "env": {
        "ALTERLAB_API_KEY": "sk_live_your_key_here"
      }
    }
  }
}

Install in Windsurf
Add to Windsurf MCP settings (~/.codeium/windsurf/mcp_config.json):
{
  "mcpServers": {
    "alterlab": {
      "command": "npx",
      "args": ["-y", "alterlab-mcp-server"],
      "env": {
        "ALTERLAB_API_KEY": "sk_live_your_key_here"
      }
    }
  }
}

Install via Smithery
npx -y @smithery/cli install alterlab-mcp-server --client claude

Get Your API Key
- Sign up free — $1 free balance on signup
- Go to Dashboard → API Keys and copy your key
- Paste it into the ALTERLAB_API_KEY field in your MCP config
Tools
alterlab_scrape — Scrape Any Webpage
Scrape a URL and return its content as markdown, text, HTML, or JSON. Automatically handles anti-bot protection with tier escalation. Returns markdown by default, optimized for LLM context windows.

"Scrape https://www.amazon.com/dp/B0BSHF7WHW and summarize the product"

| Parameter | Type | Default | Description |
|---|---|---|---|
| url | string | required | URL to scrape |
| mode | auto \| html \| js \| pdf \| ocr | auto | Scraping mode |
| formats | (text\|json\|html\|markdown)[] | ["markdown"] | Output formats |
| render_js | boolean | false | Use headless browser (+3 credits) |
| use_proxy | boolean | false | Premium proxy (+1 credit) |
| proxy_country | string | — | ISO country code for geo-targeting (e.g., US, DE) |
| wait_for | string | — | CSS selector to wait for before extraction |
| timeout | number | 90 | Timeout in seconds (1-300) |
| include_raw_html | boolean | false | Include raw HTML alongside formatted content |
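
The parameter table above can be read as a typed argument object. The following TypeScript is a minimal sketch under that reading: the `ScrapeArgs` interface and the `withScrapeDefaults` helper are hypothetical names, not part of the server, and only the field names, defaults, and the 1-300 second timeout range come from the table.

```typescript
type ScrapeMode = "auto" | "html" | "js" | "pdf" | "ocr";
type OutputFormat = "text" | "json" | "html" | "markdown";

// Field names mirror the alterlab_scrape parameter table.
interface ScrapeArgs {
  url: string;
  mode?: ScrapeMode;
  formats?: OutputFormat[];
  render_js?: boolean;
  use_proxy?: boolean;
  proxy_country?: string;
  wait_for?: string;
  timeout?: number;
  include_raw_html?: boolean;
}

// Hypothetical helper: fill in the documented defaults and check the timeout range.
function withScrapeDefaults(args: ScrapeArgs): ScrapeArgs {
  const timeout = args.timeout ?? 90;
  if (timeout < 1 || timeout > 300) {
    throw new RangeError("timeout must be between 1 and 300 seconds");
  }
  return {
    mode: "auto",
    formats: ["markdown"],
    render_js: false,
    use_proxy: false,
    include_raw_html: false,
    ...args, // caller-supplied values override the defaults
    timeout,
  };
}
```

For example, `withScrapeDefaults({ url: "https://example.com", render_js: true })` keeps `render_js: true` and fills in markdown output, auto mode, and a 90 second timeout.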
alterlab_extract — Extract Structured Data
Extract structured fields from any webpage using pre-built profiles or custom JSON Schema. Returns clean JSON, ready for databases, spreadsheets, or downstream processing.

"Extract the product name, price, and rating from this Amazon page"

| Parameter | Type | Default | Description |
|---|---|---|---|
| url | string | required | URL to extract from |
| extraction_profile | enum | auto | Profile: product, article, job_posting, faq, recipe, event |
| extraction_schema | object | — | Custom JSON Schema for structured output |
| extraction_prompt | string | — | Natural language extraction instructions |
| render_js | boolean | false | Use headless browser |
| use_proxy | boolean | false | Premium proxy |
Extraction profiles:
- Product — name, price, currency, rating, reviews, availability, images, description
- Article — title, author, published date, body text, featured image
- Job Posting — title, company, location, salary, description, requirements
- FAQ — question-answer pairs
- Recipe — ingredients, instructions, prep time, servings
- Event — name, date, location, description, organizer
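
When none of the profiles above fits, the `extraction_schema` parameter accepts a custom JSON Schema. The sketch below shows what such arguments might look like in TypeScript. The `ExtractArgs` interface, the `buildExtractArgs` helper, the example URL, and the schema fields are illustrative assumptions. How a custom schema interacts with `extraction_profile` is not stated in this README, so the sketch leaves the profile unset.

```typescript
// Field names mirror the alterlab_extract parameter table.
interface ExtractArgs {
  url: string;
  extraction_profile?: "auto" | "product" | "article" | "job_posting" | "faq" | "recipe" | "event";
  extraction_schema?: Record<string, unknown>;
  extraction_prompt?: string;
  render_js?: boolean;
  use_proxy?: boolean;
}

// A made-up JSON Schema describing the fields we want back.
const productSchema: Record<string, unknown> = {
  type: "object",
  properties: {
    name: { type: "string" },
    price: { type: "number" },
    rating: { type: "number" },
  },
  required: ["name", "price"],
};

// Hypothetical helper that pairs a URL with the schema and a plain-language hint.
function buildExtractArgs(url: string): ExtractArgs {
  return {
    url,
    extraction_schema: productSchema,
    extraction_prompt: "Extract the product name, price, and rating",
  };
}

const extractArgs = buildExtractArgs("https://example.com/product/123");
```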
alterlab_screenshot — Screenshot Any Page
Take a full-page screenshot of any URL. Returns a PNG image directly in the conversation: no URLs to copy, no files to download.

"Take a screenshot of our landing page at https://alterlab.io"

| Parameter | Type | Default | Description |
|---|---|---|---|
| url | string | required | URL to screenshot |
| wait_for | string | — | CSS selector to wait for before capture |
| wait_until | enum | networkidle | networkidle, domcontentloaded, or load |
alterlab_estimate_cost — Estimate Before You Scrape
Check how much a scrape will cost before running it. Returns the predicted tier, cost per request, and confidence level.

"How much would it cost to scrape linkedin.com?"

| Parameter | Type | Default | Description |
|---|---|---|---|
| url | string | required | URL to estimate |
| mode | enum | auto | Scraping mode |
| render_js | boolean | false | Include JS rendering cost |
| use_proxy | boolean | false | Include proxy cost |
alterlab_check_balance — Check Your Credits
Check your account balance, total deposited, and total spent. No parameters needed.

"Check my AlterLab balance"

What Can You Do with AlterLab MCP?
Research and Analysis
Ask Claude to scrape and analyze websites in real-time:
- "Scrape the top 5 results from this Google search and summarize them"
- "Extract all product prices from this Amazon category page"
- "Compare the pricing pages of these 3 competitors"
Code Generation with Real Data
Let Cursor or Windsurf fetch live data while building:
- "Scrape this API documentation page and generate TypeScript types from it"
- "Extract the color palette from this website and create a Tailwind config"
- "Screenshot this design and recreate it in React"
Content and SEO
Use Claude to analyze content at scale:
- "Scrape this blog post and suggest improvements for SEO"
- "Extract all FAQ entries from this help center and create a structured dataset"
- "Compare our landing page to the competitor's and identify gaps"
Monitoring and Alerts
Build agentic workflows that watch the web:
- "Check if this product is back in stock"
- "Scrape this page daily and alert me when the price drops below $50"
- "Monitor this job board for new senior engineering positions"
Pricing — Pay-As-You-Go Web Scraping
No subscriptions. No monthly minimums. Add balance and use it whenever you need it.
Base Scraping Costs
| Tier | Method | Cost per Request | Use Case |
|---|---|---|---|
| Curl | Direct HTTP | $0.0002 | Static pages, RSS feeds, public APIs |
| HTTP | TLS fingerprinting | $0.0003 | Sites with basic bot detection |
| Stealth | Browser impersonation | $0.0005 | Cloudflare, DataDome, PerimeterX protected sites |
| Light JS | JSON extraction | $0.0007 | Server-rendered pages needing structured data |
| Browser | Headless Chromium | $0.001 | Full JavaScript SPAs (React, Angular, Vue) |
Optional Add-Ons
| Add-On | Extra Cost | Description |
|---|---|---|
| JavaScript Rendering | +$0.0006 | Headless Chromium for dynamic content |
| Screenshot Capture | +$0.0002 | Full-page PNG screenshot |
| Premium Proxy | +$0.0002 | Geo-targeted residential proxy (195+ countries) |
| OCR Text Extraction | +$0.001 | Extract text from images on the page |
$1 = 5,000 Curl-tier scrapes at $0.0002 each. New accounts get $1 free balance on signup.
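
To budget a job, the base and add-on prices above can be combined arithmetically. The TypeScript below is a rough sketch that assumes add-ons are simply summed onto the base tier price. This README does not say how the Browser tier and the JavaScript Rendering add-on combine, so treat that as an assumption. The function names and identifiers are hypothetical. Prices are held as integer multiples of $0.0001 to avoid floating-point drift.

```typescript
type TierName = "curl" | "http" | "stealth" | "light_js" | "browser";
type AddOn = "js_rendering" | "screenshot" | "premium_proxy" | "ocr";

// Prices from the tables above, in units of $0.0001.
const BASE_UNITS: Record<TierName, number> = {
  curl: 2,
  http: 3,
  stealth: 5,
  light_js: 7,
  browser: 10,
};

const ADDON_UNITS: Record<AddOn, number> = {
  js_rendering: 6,
  screenshot: 2,
  premium_proxy: 2,
  ocr: 10,
};

const UNITS_PER_DOLLAR = 10000;

// Assumption: total cost = base tier price + each selected add-on.
function requestCostUnits(tier: TierName, addOns: AddOn[] = []): number {
  let total = BASE_UNITS[tier];
  for (const addOn of addOns) {
    total += ADDON_UNITS[addOn];
  }
  return total;
}

// Whole requests that fit into one dollar at the given per-request cost.
function scrapesPerDollar(costUnits: number): number {
  return Math.floor(UNITS_PER_DOLLAR / costUnits);
}

// Render a unit count as a dollar string with four decimals.
function formatUsd(costUnits: number): string {
  return "$" + (costUnits / UNITS_PER_DOLLAR).toFixed(4);
}
```

Under these assumptions a Curl-tier request gives 5,000 scrapes per dollar and a Browser-tier request gives 1,000, which matches the range quoted earlier.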
Environment Variables
| Variable | Required | Default | Description |
|---|---|---|---|
| ALTERLAB_API_KEY | Yes | — | Your API key (get one free) |
| ALTERLAB_API_URL | No | https://api.alterlab.io | API base URL (for self-hosted or development) |
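
A client could resolve these two variables roughly as follows. This is a sketch only: `resolveConfig` and `AlterLabConfig` are hypothetical names, and the server's actual startup code is not shown in this README. The default URL and the required/optional split come from the table above.

```typescript
interface AlterLabConfig {
  apiKey: string;
  apiUrl: string;
}

const DEFAULT_API_URL = "https://api.alterlab.io";

// Takes the environment as a plain object so the function is easy to test.
function resolveConfig(env: Record<string, string | undefined>): AlterLabConfig {
  const apiKey = env["ALTERLAB_API_KEY"];
  if (!apiKey) {
    throw new Error("ALTERLAB_API_KEY is required");
  }
  // Fall back to the documented default when no override is set.
  const apiUrl = env["ALTERLAB_API_URL"] || DEFAULT_API_URL;
  return { apiKey, apiUrl };
}
```

In Node.js this would typically be called as `resolveConfig(process.env)`.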
Frequently Asked Questions
How do I add web scraping to Claude, Cursor, or Windsurf?
Install the AlterLab MCP server. For Claude Code, run claude mcp add alterlab -- npx -y alterlab-mcp-server and set your ALTERLAB_API_KEY. For Claude Desktop, Cursor, or Windsurf, add the JSON config block to your MCP settings file. Once configured, your AI assistant can scrape any URL, extract structured data, and take screenshots directly in conversation.
Can Claude scrape websites that are behind Cloudflare or anti-bot protection?
Yes. AlterLab automatically handles Cloudflare, DataDome, PerimeterX, Akamai, and other anti-bot systems. It uses a multi-tier approach that starts with the cheapest method and escalates only when blocked. You don't need to configure anything — anti-bot bypass is fully automatic.
What is an MCP server and how does it work with Claude?
MCP (Model Context Protocol) is Anthropic's open standard for connecting AI assistants to external tools and data sources. An MCP server is a small program that exposes tools — like web scraping — that Claude, Cursor, or Windsurf can call during a conversation. The AlterLab MCP server gives your AI assistant 5 tools: scrape, extract, screenshot, estimate cost, and check balance.
How is AlterLab different from Firecrawl, ScrapingBee, or Apify MCP servers?
AlterLab starts at $0.0002 per request — 5-20x cheaper than most scraping APIs — because it only uses expensive browser rendering when a site actually requires it. Smart tier escalation means you pay for what each site needs, not the maximum. AlterLab also includes built-in structured data extraction with pre-built profiles (product, article, job posting, etc.) at no extra cost.
Can I scrape Amazon, Walmart, and other e-commerce sites from Claude?
Yes. AlterLab handles all major e-commerce anti-bot protection. Use the alterlab_extract tool with extraction_profile: "product" to get structured JSON: product name, price, currency, rating, review count, availability, and images — ready for analysis, comparison, or data pipelines.
Can Cursor scrape documentation and generate code from it?
Yes. With AlterLab MCP installed in Cursor, you can ask it to scrape API documentation, library docs, or any reference page and generate TypeScript types, API clients, or component code from the live content. This is more reliable than relying on the LLM's training data, which may be outdated.
Does AlterLab MCP work with JavaScript-heavy sites (React, Angular, Vue)?
Yes. Use render_js: true or set mode: "js" to enable full headless Chromium rendering. AlterLab renders the complete page including all JavaScript, waits for dynamic content to load, then extracts content from the fully rendered DOM. This works for React, Angular, Vue, Next.js, and any other JavaScript framework.
What output format is best for AI and LLM context windows?
Use markdown (the default). It preserves document structure — headings, tables, lists, links — while being 60-80% smaller than raw HTML. Claude, GPT-4, and other LLMs process markdown significantly better than HTML. AlterLab's markdown output is specifically optimized for LLM context windows.
Is there rate limiting?
Free-tier accounts have rate limits. Adding any balance removes rate limits. The MCP server includes automatic retry with exponential backoff for transient rate limit errors (429).
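Retry with exponential backoff generally means doubling the wait after each 429 response, up to a cap. The TypeScript below sketches that pattern. The base delay, cap, retry count, and the `Result` shape are assumptions chosen for illustration, since the server's actual values are not documented here.

```typescript
// Delay before retry number `attempt` (0-based): base * 2^attempt, capped.
function backoffDelayMs(attempt: number, baseMs: number = 500, capMs: number = 8000): number {
  return Math.min(capMs, baseMs * Math.pow(2, attempt));
}

// Hypothetical response shape; only the status code matters for retrying.
type Result<T> = { status: number; value?: T };

// Call once, then retry while the response is 429 and retries remain.
async function withRetry<T>(
  call: () => Promise<Result<T>>,
  maxRetries: number = 3
): Promise<Result<T>> {
  let result = await call();
  for (let attempt = 0; attempt < maxRetries && result.status === 429; attempt++) {
    const delay = backoffDelayMs(attempt);
    await new Promise<void>((resolve) => setTimeout(resolve, delay));
    result = await call();
  }
  return result; // may still be a 429 if every retry was rate limited
}
```

With these illustrative defaults the waits are 500 ms, 1,000 ms, and 2,000 ms before the caller sees a final 429.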
Can I use this MCP server for large-scale scraping?
Yes. The MCP server processes one request at a time through the conversation interface, but you can build agentic workflows that scrape many URLs sequentially. For batch processing, use the AlterLab API directly or the n8n integration.
Error Handling
The MCP server returns helpful error messages with suggested next actions:
| Error | What Happens | Suggested Action |
|---|---|---|
| 401 Unauthorized | Invalid API key | Check ALTERLAB_API_KEY is set correctly |
| 402 Insufficient Credits | Balance too low | Run alterlab_check_balance, add funds |
| 403 Forbidden | Site blocked the request | Try render_js: true + use_proxy: true |
| 429 Rate Limited | Too many requests | Automatic retry with backoff |
| 504 Gateway Timeout | Scrape took too long | Increase timeout, simplify request |
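
Two rows of the table above suggest retrying with different arguments. A caller could encode that as a small helper, sketched below in TypeScript. `adjustArgsAfterError` and `RetryableArgs` are hypothetical names, and doubling the timeout on a 504 is an assumption. The 90 second default and the 300 second ceiling come from the `alterlab_scrape` parameter table.

```typescript
interface RetryableArgs {
  url: string;
  render_js?: boolean;
  use_proxy?: boolean;
  timeout?: number;
}

// Returns adjusted arguments for a retry, or null when changing arguments will not help.
function adjustArgsAfterError(status: number, args: RetryableArgs): RetryableArgs | null {
  switch (status) {
    case 403:
      // Site blocked the request: retry with a headless browser and a proxy.
      return { ...args, render_js: true, use_proxy: true };
    case 504: {
      // Scrape took too long: raise the timeout, up to the documented maximum.
      const current = args.timeout ?? 90;
      if (current >= 300) {
        return null;
      }
      return { ...args, timeout: Math.min(300, current * 2) };
    }
    default:
      // 401 and 402 need user action; 429 is retried by the server itself.
      return null;
  }
}
```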
Contributing
git clone https://github.com/RapierCraft/alterlab-mcp-server.git
cd alterlab-mcp-server
npm install
npm run build