leapfrog-mcp
v0.7.4
Published
Multi-session browser MCP for AI agents — 36 tools, stealth, persistent auth, code-first scripts, API sniffer, agent intelligence
Maintainers
Readme
The Problem
Playwright MCP sends ~14,000 tokens for a content-heavy page like Hacker News. Most of that is noise. Your context window fills up. Your agent gets confused. You pay for it.
Leapfrog sends ~1,400 tokens. Same page. Same information. Up to 10x less noise.
┌─────────────────────────────────────────────────────┐
│ Playwright MCP │
│ ████████████████████████████████████████ ~14,000 │
│ │
│ Leapfrog │
│ █████ ~1,400 │
└─────────────────────────────────────────────────────┘
tokens per page (Hacker News, real test)Savings range from 2-10x depending on page complexity. Content-heavy pages see the biggest wins. Dense forms see the smallest. The median across real-world sites is ~4-5x.
Quick Start
npx leapfrog-mcp --doctor # verify everything works
npx leapfrog-mcp --stealth-audit # test all 19 stealth patches
npx leapfrog-mcp --config # print MCP config to pasteAdd to ~/.mcp.json (Claude Code) or your editor's MCP config:
{
"leapfrog": {
"command": "npx",
"args": ["-y", "leapfrog-mcp"],
"env": {
"LEAP_MAX_SESSIONS": "15",
"LEAP_TILE": "true",
"LEAP_HUD": "true",
"LEAP_AUTO_CONSENT": "true"
}
}
}Leapfrog uses playwright-core (15MB) instead of playwright (1.6GB) and does not bundle a browser. Either:
- Set
LEAP_CHANNEL=chrometo use your installed Chrome/Chromium (recommended) - Or run
npx playwright-core install chromiumto install the bundled Chromium binary - Or set
LEAP_CDP_ENDPOINTto connect to an already-running Chrome instance
Feature Matrix
| | Leapfrog | Playwright MCP | agent-browser | |---|:---:|:---:|:---:| | Tokens per page | ~1,200-2,500 | ~3,800-15,000 | ~300 | | Parallel sessions | 15 | 1 | 1 | | Session isolation | Yes | No | No | | Multi-tab / popups | Yes | No | No | | Network intercept | Yes | No | No | | Console capture | Yes | Yes | No | | Stealth / anti-bot | Yes | No | No | | Smart wait (5 types) | Yes | Basic | No | | Crash recovery | Yes | No | No | | Batch actions (100/call) | Yes | No | No | | Init script injection | Yes | Yes | No | | Drag / upload / resize | Yes | Yes | No | | Per-session proxy | Yes | No | No | | Humanization (opt-in) | Yes | No | No | | Auth profile reuse | Yes | No | No | | Cookie persistence | Yes | No | No | | Page classification (18) | Yes | No | No | | Session memory | Yes | No | No | | API intelligence | Yes | No | No | | Adaptive wait + auto-retry | Yes | No | No | | CAPTCHA auto-resolve | Yes | No | No | | Self-improvement (9 dims) | Yes | No | No | | Record / replay | Yes | No | No | | Pagination extraction | Yes | No | No | | Incremental snapshots (diff) | Yes | No | No | | Stealth self-test CLI | Yes | No | No | | SSRF protection | Yes | No | No |
Stealth
Leapfrog ships 19 anti-detection patches enabled by default (LEAP_STEALTH=true). Four modes:
true(default) — all 19 patches activepassive— removes automation signals only (webdriver, HeadlessChrome). Does NOT fake identity (WebGL, fonts, audio). Better for sites where trust matters more than evasion.auto— per-domain EXP3 bandit selects the optimal stealth configuration based on what's worked beforefalse— no stealth patches
These cover the vectors that fingerprint services like CreepJS and fingerprint-pro actually check:
- Client Hints brands (strips HeadlessChrome)
navigator.webdriverforced toundefined- WebGL vendor/renderer (replaces SwiftShader with real GPU strings)
- Connection RTT (non-zero)
- Alert dismiss timing (human-speed delay)
- Window outer/inner height offset
- MIME type array population
- Platform inference from user agent
chrome.appemulation- iframe
contentWindowprotection - Media codec spoofing (
canPlayType) document.hasFocus()override- Source URL comment stripping
- Custom UA + stealth coexistence (custom user agents no longer disable stealth context)
- CDP
Runtime.enabledetection (Error.prepareStackTracefilter) - Permissions API spoofing (20+ permission types)
- AudioContext fingerprint noise (
getChannelData/getFloatFrequencyData) - WebRTC IP leak prevention (ICE candidate filtering)
- Font enumeration fingerprint spoofing
Per-session stealth control: pass stealth: false in session_create to disable for a specific session.
Humanization (Experimental)
Set LEAP_HUMANIZE=true to enable human-like browser interaction. This is opt-in and adds latency in exchange for more realistic behavior. Six modules:
- Mouse — Bezier curve paths with Fitts's Law timing and micro-tremor jitter
- Typing — Log-normal inter-key delays (200ms median), key dwell time, bigram-aware speed, rollover typing
- Scroll — Inertial simulation with ramp-up and momentum decay (touchpad/mouse-wheel physics)
- Pause — Inter-action "think" delays that simulate cognitive gaps between actions
- Fingerprint — Coherent browser fingerprint generation (platform, device memory, GPU, timezone)
- Utils — Shared math primitives (Box-Muller gaussian, distributions)
Page Classification
Every navigate and snapshot call automatically classifies the page type using weighted signal scoring (no LLM required). 18 types:
login · search-results · product · product-list · checkout · article · dashboard · form · error · challenge · landing · documentation · profile · media · feed · qa · ecommerce · unknown
Classification drives smarter snapshot extraction — login pages surface form fields, articles surface content, dashboards surface interactive elements.
Harness Intelligence
The harness tracks every action in a session and classifies outcomes:
- Action outcome classification —
SUCCESS,SILENT_CLICK,NAVIGATION,WRONG_ELEMENT,BLOCKED,ERROR,PENDING - Bot redirect detection — detects when a site redirects to a challenge or block page after an action
- Loop detection — warns when the agent is stuck clicking the same element, ping-ponging between URLs, or repeating actions
- Session memory —
session_memorytool recalls actions after context window compression
Cookie Persistence
Persistent browser profiles now use context.cookies() + addCookies() instead of storageState(), which returns empty on persistent contexts. Auth state survives across sessions.
Adaptive Wait + Stealth Escalation
Navigate automatically retries with fallback strategies when pages fail to load:
- Try
load(fastest) — if empty, retry withnetworkidle(10s cap) - If
networkidletimes out (Amazon, ad-heavy sites), fall back todomcontentloaded - If blocked/challenged, escalate stealth: random delays → wait for JS challenge → rotate session with fresh fingerprint
- Profile sessions (auth'd) never have their session destroyed — hard-capped at Level 2
Opt-out with autoRetry: false on navigate. Control max escalation with maxRetryLevel (0-5, default 3).
Record / Replay
Export a session's action history as a replayable recording, then replay it in new sessions:
session_export— creates parameterized JSON or Playwright script from session history.@eNrefs resolved to stable CSS selectors. Auto-detects emails, passwords, URLs as{{placeholders}}.session_replay— replays a recording with parameter overrides. SupportsonError: 'stop'or'skip'.
Turn one-off agent workflows into reusable automations.
Pagination Extraction
Extract data across multiple pages in a single tool call:
- Click-next — auto-detects "Next" buttons, pagination links, "Load more" buttons
- Infinite scroll — scrolls and waits for new content via DOM hash comparison
- URL pattern — increments
?page={page}or custom patterns
Replaces 3-4 tool calls per page. Cap: 50 pages, 100K total chars. Stops on: no next button, empty page, duplicate content, or bot detection.
Incremental Snapshots
The diff tool returns only what changed since the last snapshot — additions, removals, changes. Massive token savings for monitoring and polling workflows.
HUD Overlays (LEAP_HUD=true)
When running headed, Leapfrog overlays visual feedback on every session:
- Click ripple — expanding green circle at click coordinates (agent actions only)
- Zoom-to-target — browser zooms to 1.15x on the clicked element briefly so agents (and humans) can visually track what's happening in tiled windows
- Scroll-to-target — scrollIntoView before clicks so you can see what the agent is about to click
Minimal by design. No borders, no status bars, no cursor overlay — just the feedback that matters.
Multi-Terminal Tiling (LEAP_TILE=true)
Multiple Leapfrog instances share the screen via file-based coordination. Each instance tracks its own windows and a TilesCoordinator assigns global grid slots — no overlap, no manual arrangement. Set LEAP_TILE=true (or LEAP_TILE=master for the primary instance). Padding between tiles is configurable with LEAP_TILE_PADDING (default 8px).
Human Intervention
Leapfrog auto-detects situations that need a human — CAPTCHAs, login forms, OAuth redirects, Cloudflare challenges — and tries to self-resolve before pausing.
- Auto-resolves first: clicks reCAPTCHA checkboxes, Cloudflare verify buttons, generic verify/continue buttons, then a second-pass retry — all before asking for help
- External solvers: set
LEAP_CAPTCHA_PROVIDER+LEAP_CAPTCHA_API_KEYfor CapSolver, 2Captcha, or NopeCHA integration - Learns what works: remembers which resolution method succeeded per domain and tries the known-good method first on revisit
- Detects reCAPTCHA, hCaptcha, Turnstile, login forms, OAuth redirects, Cloudflare challenges
- Tab title changes to "NEEDS HUMAN" when intervention is needed
wait_for_humantool — agent calls when stuck, blocks until you resolve it or navigate past
Cookie Consent Auto-Dismiss (LEAP_AUTO_CONSENT=true)
Automatically dismisses cookie consent banners across 10 frameworks (OneTrust, CookieBot, TrustArc, Quantcast, Didomi, Cookielaw, Osano, Usercentrics, + generic) plus text-matching fallback. Per-domain selector caching for instant replay on revisit.
Tracing (LEAP_TRACE=true)
Per-session Playwright tracing with screenshots + DOM snapshots. Export ZIP files viewable at trace.playwright.dev via the session_export_trace tool. Auto-saves on session destroy.
Self-Improvement
Leapfrog learns from every visit. Per-domain knowledge persists at ~/.leapfrog/domains/{domain}.json — 9 dimensions, all automatic:
| # | Dimension | What it does |
|---|---|---|
| 1 | Wait strategies | Learns optimal wait method per domain (networkidle vs domcontentloaded vs load) + running average timing |
| 2 | Stealth tiers | Auto-escalates 0→3 when blocks are detected (2+ blocks in 1 hour). Starts at learned tier on revisit |
| 3 | Consent selectors | Remembers cookie banner dismiss selectors, auto-clicks on revisit |
| 4 | Challenge resolution | Records which CAPTCHA method worked (reCAPTCHA checkbox, Cloudflare verify, etc.), tries known-good method first |
| 5 | Stable element suppression | Identifies nav/footer/sidebar elements seen 3+ visits, suppresses from snapshots (30-40% token savings on mature domains) |
| 6 | Selector healing | Remembers element fingerprints → selectors, heals broken refs across visits |
| 7 | API endpoint caching | Discovered API endpoints persist across sessions |
| 8 | Interaction heat maps | Tracks which elements agents actually use, suppresses untouched elements (coming) |
| 9 | Strategy selection | Adversarial bandit (EXP3) for stealth config optimization. Use LEAP_STEALTH=auto to enable. |
LRU eviction at 500 domains. Inspect with the domain_knowledge tool.
SSRF Hardening
URL validation blocks hex-encoded IPs (0x7f000001), octal notation (0177.0.0.1), CGNAT ranges (100.64.0.0/10), and redirect chains that resolve to internal addresses. Localhost and 127.0.0.0/8 are allowed by default for local dev workflows — set LEAP_BLOCK_LOCALHOST=true to block them.
The Ecosystem
Leapfrog uses pond metaphors to keep things memorable. Your agent is the frog.
| Concept | Leapfrog term | What it means |
|---|---|---|
| Sessions | Ponds | Isolated browser contexts — cookies, storage, state |
| Tabs | Lily pads | Where the frog lands within a pond |
| Navigate | Leap | Jump to a URL, get a compact snapshot back |
| Snapshots | Surface | What you see on the surface — interactive @eN refs |
| Network traffic | Ripple | HTTP requests flowing under the surface |
| Console errors | Croak | Something went wrong in the browser |
| Stealth mode | Camouflage | Anti-bot evasion patches |
All 36 Tools
Pond Management (11)
| Tool | What it does |
|---|---|
| session_create | Open a new pond — isolated cookies, state, viewport, locale, timezone, stealth, proxy |
| session_destroy | Drain a pond and free the slot |
| session_list | See all active ponds with URLs and idle times |
| session_save_profile | Save auth state to disk for future ponds |
| session_list_profiles | List saved auth profiles |
| pool_status | Pool stats, memory, uptime |
| session_health | Is the pond healthy? Browser connected, page responsive? |
| profile_list | List saved persistent browser profiles |
| profile_delete | Delete a saved persistent browser profile and its data |
| profile_import_from_chrome | Import cookies and state from an installed Chrome profile |
| profile_warm | Pre-warm a profile by loading key URLs to establish cookies/state |
Navigation & Snapshots (12)
| Tool | What it does |
|---|---|
| navigate | Leap to a URL, return a compact @eN snapshot. Adaptive wait + stealth escalation built in. |
| snapshot | Re-read the surface (scope with CSS selector) |
| diff | Incremental snapshot — returns only what changed since last snapshot |
| act | Click, fill, type, check, select, press, scroll, hover, mousemove, drag, upload, resize, back, forward |
| batch_actions | Up to 100 sequential actions in one MCP call — eliminates round-trip overhead |
| paginate | Extract data across multiple pages in one call (click-next, scroll, URL pattern) |
| add_init_script | Inject JS that runs before every page load, persists across navigations |
| wait_for | Wait for element / text / network idle / navigation / JS expression |
| screenshot | Capture PNG (full page or element) |
| extract | Pull text, HTML, title, URL, or evaluate JS |
| session_memory | Recall actions performed in this session — recovers context after compression |
| session_export | Export session history as a replayable JSON recording or Playwright script |
Tab Management (3)
| Tool | What it does |
|---|---|
| tabs_list | List all pads in a pond |
| tab_switch | Hop to another pad (-1 for most recent popup) |
| tab_close | Close a pad (can't close the last one) |
Agent Intelligence (3)
| Tool | What it does |
|---|---|
| wait_for_human | Pause for human intervention — blocks until user clicks Done on the @..@ overlay |
| domain_knowledge | Inspect what Leapfrog has learned about a domain (wait strategies, stealth tiers, endpoints) |
| session_export_trace | Export a Playwright trace ZIP — viewable at trace.playwright.dev |
Network & API Intelligence (7)
| Tool | What it does |
|---|---|
| network_log | See HTTP traffic — filter by URL, method, status, content-type |
| console_log | Read browser console output, filtered by level |
| network_intercept | Block, mock, or log requests by URL pattern |
| api_discover | List JSON APIs the page has called, classified by category (data, tracking, auth, cdn, ads) |
| api_export | Generate an OpenAPI v3 spec from observed API traffic |
| execute | Run a Playwright script in a sandboxed environment — replaces 5-20 sequential MCP round trips |
| session_replay | Replay a recording in the current session with parameter overrides |
Environment Variables
| Variable | Default | Description |
|---|---|---|
| LEAP_MAX_SESSIONS | 15 | Max concurrent sessions |
| LEAP_IDLE_TIMEOUT | 1800000 | Session idle timeout in ms (30 min). Set 0 to disable. |
| LEAP_HEADLESS | true | Set false to watch the browser |
| LEAP_HEADED | false | Set true to watch the browser (positive alternative to LEAP_HEADLESS) |
| LEAP_CHANNEL | (bundled chromium) | Set chrome to use your installed Chrome |
| LEAP_CDP_ENDPOINT | (none) | Connect to a running Chrome instance (e.g. http://localhost:9222) |
| LEAP_EXTENSIONS | (none) | Comma-separated paths to browser extensions to load |
| LEAP_ALLOW_JS | true | Allow JS evaluation in extract and wait_for |
| LEAP_STEALTH | true | Stealth mode: true | passive | auto | false. See Stealth section. |
| LEAP_STEALTH_PROFILES | false | Enable stealth patches on profile (auth'd) sessions |
| LEAP_CDP_STEALTH | true | CDP detection evasion (Runtime.enable filtering) |
| LEAP_HUMANIZE | false | Experimental. Human-like mouse movement, typing cadence, and scroll behavior. |
| LEAP_ALLOW_EXECUTE | true | Allow the execute tool (sandboxed Playwright scripts) |
| LEAP_BLOCK_LOCALHOST | false | Block localhost/127.x.x.x (allowed by default for local dev) |
| LEAP_PROFILES_DIR | ~/.leapfrog/chrome-profiles | Directory for persistent browser profiles |
| LEAP_TILE | false | Tile sessions in a grid (true | master | false) |
| LEAP_TILE_PADDING | 8 | Padding between tiled windows (px) |
| LEAP_MULTI_TILE | false | Multi-terminal tiling coordination across Leapfrog instances |
| LEAP_SCREEN_WIDTH | (auto) | Explicit screen width for tiling calculations |
| LEAP_SCREEN_HEIGHT | (auto) | Explicit screen height for tiling calculations |
| LEAP_HUD | false | Click ripple, zoom-to-target, scroll-to-target on agent actions |
| LEAP_AUTO_CONSENT | true | Auto-dismiss cookie consent banners (10 frameworks + fallback) |
| LEAP_TRACE | false | Per-session Playwright tracing (screenshots + DOM snapshots) |
| LEAP_RECORD | false | Session recording (action history export) |
| LEAP_REBROWSER | false | Enable Rebrowser integration |
| LEAP_AUTO_WARM | false | Auto-warm profiles by loading key URLs on session create |
| LEAP_CAPTCHA_PROVIDER | (none) | External CAPTCHA solver: capsolver | 2captcha | nopecha |
| LEAP_CAPTCHA_API_KEY | (none) | API key for the configured CAPTCHA provider |
| LEAP_MAX_SESSIONS_PER_CLIENT | (none) | Per-client session pool limit |
| LEAP_LOG_LEVEL | info | debug / info / warn / error |
Tests
769 passing across 31 suitesSession management, snapshot engine, snapshot differ, network intelligence, tab management, security, SSRF protection, stealth patches (19), stealth enhanced, humanization (mouse, typing, scroll), page classification, harness intelligence, API intelligence, script executor, extended actions, HUD overlays, human intervention, cookie consent, domain knowledge, selector healing, stable elements, tile manager, bug regression, integration smoke, stress tests, benchmarks.
npm testRequirements
- Node.js >= 20
- Chromium — use system Chrome (
LEAP_CHANNEL=chrome) or install vianpx playwright-core install chromium
License
MIT
