supapup

v0.1.31

Published

6 months ago

⚡ Lightning-fast MCP browser dev tool. Navigate → Get instant structured data. No screenshots needed! Puppeteer: 📸 → CSS selectors → JS eval. Supapup: semantic IDs ready to use. 10x faster, 90% fewer tokens.

0High
0Medium
0Low

onepointfour-packs

mcp model-context-protocol puppeteer web-automation browser-automation ai-agents web-scraping debugging network-monitoring responsive-testing qa-testing visual-testing

Supapup 🐾

🎬 Watch Demo Video

See Supapup in action - automated web testing with AI agents

🚀 See the Difference

❌ With Regular Puppeteer MCP

$ puppeteer navigate https://example.com
✓ Navigated to https://example.com

$ puppeteer screenshot
✓ Screenshot taken (800x600)
[📸 High token cost image data...]

$ # Now what? Need to inspect elements, write selectors, execute JS...

✅ With Supapup

$ supapup navigate https://example.com
✅ Navigation successful
📍 URL: https://example.com

AGENT PAGE VIEW
==============================
Found 42 interactive elements

📝 FORMS:
  • form: login-form
    - email: form-login-email
    - password: form-login-password  
    - submit: form-login-submit

🎛️ CONTROLS:
  • link: sign-up → signup-link
  • link: forgot-password → forgot-password-link
  • button: get-started → get-started-button
  
✨ Ready to interact! No screenshots needed.
Example: execute_action({actionId: "form-login-email", params: {value: "[email protected]"}})

The Result?

⚡ 10x faster - Instant structured data vs screenshots + manual inspection
💰 90% fewer tokens - Text-based output instead of images
🎯 Zero complexity - Semantic IDs ready to use, no CSS selectors needed
🔄 Automatic DOM Remapping - After every action, Supapup automatically detects DOM changes and returns an updated agent page with fresh element IDs. No stale references!

🎥 See it in Action - Watch Demo

Supapup is an intelligent web automation tool that bridges the gap between AI agents and web browsers. It wraps Puppeteer with Model Context Protocol (MCP) support, providing a structured, predictable interface for programmatic web interaction.

✨ Features

🤖 Agent-Aware Web Pages

Semantic Element IDs: Instead of brittle CSS selectors, use meaningful IDs like form-login-email
Structured Representation: Web pages are presented as organized, actionable interfaces
Automatic Element Detection: Intelligently identifies interactive elements on any page
Visual Element Mapping: Number-based element identification for easy interaction
Unique ID Generation: Handles duplicate elements intelligently - multiple "Add to Cart" buttons get unique IDs with product context

🔄 Automatic DOM Remapping (Key Feature!)

Zero Manual Updates: After every execute_action, Supapup automatically detects DOM changes
Fresh Element IDs: All elements are re-mapped with new data-mcp-id attributes
Handles Dynamic Content: Perfect for SPAs, AJAX updates, and reactive frameworks
Updated Agent Page: The response always includes the latest page state
Example: Fill a search box → autocomplete appears → new dropdown elements are automatically mapped and returned

🔍 Advanced Debugging

JavaScript Breakpoints: Set conditional breakpoints and step through code
Variable Inspection: Examine local variables and expressions during debugging
Automated Function Debugging: Trigger and debug specific functions automatically

📊 Network Monitoring

Request/Response Logging: Capture all network traffic with headers and payloads
API Request Replay: Replay requests with modified parameters
Request Interception: Modify requests on-the-fly with custom rules
Network Throttling: Simulate slow connections (slow-3g, fast-3g, offline) for testing
Performance Metrics: Track page load times and resource usage

🎯 Smart Navigation

CAPTCHA Detection: Automatically identifies CAPTCHA pages
Dynamic Content Handling: Waits for AJAX/DOM changes after actions
Chunked Screenshots: Handles large pages by splitting screenshots automatically
Browser Tab Management: Open content directly in new tabs

📦 Installation

Install with npm

npm install -g supapup

Using with Claude CLI

First, ensure you have Claude CLI installed:

npm install -g @anthropic/claude-cli

Configure Supapup as an MCP server:

claude mcp add supapup "supapup"

Start using Supapup in Claude:

claude "Navigate to example.com and find all buttons"

For more Claude CLI MCP configuration:

claude mcp help

Using with Gemini CLI

Install Gemini CLI:

npm install -g @google/gemini-cli

Configure Supapup in your Gemini config:

{
  "mcpServers": {
    "supapup": {
      "command": "npx",
      "args": ["supapup"]
    }
  }
}

As a Library

npm install supapup

From Source

git clone https://github.com/jaspergreen/supapup.git
cd supapup
npm install
npm run build

🔧 Configuration

Role-Based Tool Sets

Supapup supports role-based tool filtering to optimize for different use cases:

# Full tool set (61 tools - backward compatible)
supapup

# Testing & QA (19 tools - headless by default)
supapup --role tester

# Development with debugging (57 tools - visible browser)  
supapup --role developer

# Minimal automation (8 tools - headless by default)
supapup --role automation

Available Roles:

tester: Essential tools for QA testing, responsive design, form testing, network monitoring (perfect for headless servers)
developer: Full toolkit with debugging, DevTools modification, storage management
automation: Minimal set for basic web automation tasks

Environment Variables

Supapup supports environment variables for configuration:

SUPAPUP_ROLE - Set role: 'tester', 'developer', or 'automation'
SUPAPUP_HEADLESS - Set to 'true' for headless mode (default: 'false' - shows browser window)
SUPAPUP_DEBUG_PORT - Chrome remote debugging port (default: 9222)
SUPAPUP_DEVTOOLS - Set to 'true' to open DevTools (default: false)

MCP Configuration Examples

Full Tool Set (Default):

{
  "mcpServers": {
    "supapup": {
      "command": "supapup"
    }
  }
}

Testing/QA Role (Headless Server):

{
  "mcpServers": {
    "supapup-tester": {
      "command": "supapup",
      "args": ["--role", "tester", "--headless"]
    }
  }
}

Development Role (Full Debugging):

{
  "mcpServers": {
    "supapup-dev": {
      "command": "supapup",
      "args": ["--role", "developer"]
    }
  }
}

Environment Variable Approach:

{
  "mcpServers": {
    "supapup": {
      "command": "supapup",
      "env": {
        "SUPAPUP_ROLE": "tester",
        "SUPAPUP_HEADLESS": "true"
      }
    }
  }
}

Headless Server Deployment

Supapup works out-of-the-box on headless Linux servers thanks to bundled Chromium:

✅ No manual Chrome installation required
✅ No X11/display server needed
✅ Automatic fallback to headless mode
✅ Works in Docker containers

🚀 Quick Start

With Claude CLI

# Add full Supapup (61 tools)
claude mcp add supapup "supapup"

# Add testing-focused Supapup (19 tools, headless)
claude mcp add supapup-tester "supapup --role tester --headless"

# Use it in a conversation
claude "Navigate to https://example.com and test the responsive design"

# Or start an interactive session
claude --interactive

With Gemini CLI

# Configure in gemini-config.json then:
gemini "Use Supapup to navigate to example.com and find all forms"

🛠️ MCP Tools

Browser Management

navigate - Navigate to a URL and get agent-friendly page representation
screenshot - Capture screenshots with automatic chunking for large pages
close_browser - Close the browser instance

Element Interaction

execute_action - Execute actions on elements using semantic IDs
discover_actions - Find all available actions on the current page
get_page_state - Get current page state and element count

Visual Element Mapping

devtools_visual_element_map - Create numbered visual map of page elements
JavaScript helpers for numbered elements:
- window.__AGENT_PAGE__.clickElement(1) - Click element 1
- window.__AGENT_PAGE__.fillElement(25, "text") - Fill element 25
- window.__AGENT_PAGE__.highlightElement(100) - Highlight element 100
- window.__AGENT_PAGE__.getElementByNumber(1) - Get element DOM reference

Debugging

set_breakpoint - Set JavaScript breakpoints with optional conditions
debug_continue - Resume execution after breakpoint
debug_step_over - Step over current line
debug_evaluate - Evaluate expressions in debug context
debug_get_variables - Get local variables at breakpoint

Network Analysis

get_network_logs - Get all network requests
get_api_logs - Get detailed API request/response logs
replay_api_request - Replay requests with modifications
intercept_requests - Set up request interception rules
network_throttle - Control network speed to simulate slow connections (slow-3g, fast-3g, offline, no-throttling)

Page Analysis

get_performance_metrics - Get page load and runtime metrics
get_accessibility_tree - Get page accessibility structure
inspect_element - Get detailed element properties
evaluate_script - Execute JavaScript in page context

Content Reading

agent_read_content - Extract readable page content in markdown format for articles, search results, or any page text
- Supports pagination for large content (Wikipedia, long articles)
- Professional HTML-to-markdown conversion
- Handles pages with thousands of elements efficiently

📚 Examples

Testing a Website

claude "Test the login flow on https://myapp.com - try logging in with test credentials and tell me if it works"

claude "Navigate to https://shop.example.com and add 3 different products to cart, then check if the cart total is calculated correctly"

Debugging API Errors

claude "Go to https://myapp.com/dashboard and check the network logs for any failed API calls. Tell me what's causing the 404 errors"

claude "Monitor the API requests on https://example.com/form when I submit the form. Check if the auth token is being sent correctly"

Visual Debugging

claude "Why does the submit button on https://mysite.com/contact look broken? Take a screenshot and inspect its CSS"

claude "The layout on https://myapp.com is messed up on mobile. Can you check how it looks at 375px width and tell me what's wrong?"

Finding Elements

claude "I can't find the logout button on https://app.example.com. Can you help me locate it?"

claude "Show me all the forms on https://example.com and tell me which fields are required"

Performance Testing

claude "Check the performance metrics for https://mysite.com and tell me what's making it load slowly"

claude "Test https://heavysite.com and identify which resources are taking the longest to load"

claude "Set network throttling to slow-3g and test how https://myapp.com performs on a slow connection"

Content Reading

claude "Read the Wikipedia article at https://en.wikipedia.org/wiki/Python_(programming_language) and summarize the key points"

claude "Extract the main content from https://news.ycombinator.com and tell me what the top stories are about"

claude "Go to https://docs.python.org/3/tutorial/ and read through the first few sections to help me understand Python basics"

Automated Testing

claude "Run through the checkout process on https://shop.com: add item to cart, go to checkout, fill in the form with test data, but stop before placing the order"

claude "Test all the navigation links on https://mysite.com and tell me if any are broken"

🏗️ Architecture

Supapup consists of several specialized modules:

Agent Page Generator: Creates structured representations of web pages
Element Detector: Automatically identifies interactive elements
ID Generator: Creates unique, semantic IDs incorporating context from parent forms, headings, and element properties
Debugging Tools: Full Chrome DevTools Protocol integration
Network Tools: Comprehensive request/response monitoring
Page Analysis: Performance metrics and accessibility analysis
DevTools Elements: Visual element mapping and manipulation

ID Generation Strategy

Supapup ensures every element gets a unique, meaningful ID by:

Extracting context from parent containers (forms, sections, headings)
Including semantic meaning (label text, placeholder, button text)
Adding element type suffixes (button, link, input, checkbox)
Appending unique indices to guarantee no duplicates
Handling dynamic content with stable ID regeneration

🤝 Contributing

We welcome contributions! Please see our Contributing Guidelines for details.

Development Setup

# Clone the repository
git clone https://github.com/jaspergreen/supapup.git
cd supapup

# Install dependencies
npm install

# Run in development mode
npm run dev

# Build for production
npm run build

Running Tests

# Run test scripts
node test-agent-generator.js
node test-complex-page.cjs
node test-supapup-flow.cjs

📋 Changelog

v0.1.25 (Latest)

🖼️ Improved Screenshot Handling
- Increased screenshot size limit from 8.8KB to 45KB for better usability
- Viewport screenshots now auto-chunk when too large instead of failing
- Added detailed logging to show exact screenshot sizes
- Better error messages with actionable suggestions
🎯 Fixed Visual Element Mapping
- Removed references to non-existent helper functions
- Integrated with existing agent page system using data-mcp-id attributes
- Provides correct MCP tool usage examples
- Visual element map now properly guides interaction with execute_action()
📦 Agent Page Chunking
- Implemented missing get_agent_page_chunk functionality
- Large pages (>20K tokens) now automatically chunk
- Wikipedia and other large sites now work without token limit errors
- Clear instructions provided for retrieving subsequent chunks
🔧 Developer Experience
- Form fill now provides success feedback with filled fields list
- Debugging tools show helpful setup instructions when not paused
- Browser visible by default (was headless) for better debugging
- Added browser visibility control via MCP tools

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

Built on top of Puppeteer
Implements Model Context Protocol (MCP)
Inspired by the need for better AI-browser interaction

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

Supapup 🐾

🎬 Watch Demo Video

🚀 See the Difference

❌ With Regular Puppeteer MCP

✅ With Supapup

🎥 See it in Action - Watch Demo

✨ Features

🤖 Agent-Aware Web Pages

🔄 Automatic DOM Remapping (Key Feature!)

🔍 Advanced Debugging

📊 Network Monitoring

🎯 Smart Navigation

📦 Installation

Install with npm

Using with Claude CLI

Using with Gemini CLI

As a Library

From Source

🔧 Configuration

Role-Based Tool Sets

Environment Variables

MCP Configuration Examples

Headless Server Deployment

🚀 Quick Start

With Claude CLI

With Gemini CLI

🛠️ MCP Tools

Browser Management

Element Interaction

Visual Element Mapping

Debugging

Network Analysis

Page Analysis

Content Reading

📚 Examples

Testing a Website

Debugging API Errors

Visual Debugging

Finding Elements

Performance Testing

Content Reading

Automated Testing

🏗️ Architecture

ID Generation Strategy

🤝 Contributing

Development Setup

Running Tests

📋 Changelog

v0.1.25 (Latest)

📄 License

🙏 Acknowledgments

🔗 Links