aieye

v1.3.0

Published

a month ago

MCP server that gives AI agents visual access to web applications — screenshot, inspect, click, type, visual diff, and more

0High
0Medium
0Low

spra03

mcp mcp-server model-context-protocol claude ai browser-automation playwright cdp ai-eye aieye source-mapping developer-tools

aieye

MCP (Model Context Protocol) server for AI Eye. Exposes browser automation and inspection tools to AI agents like Claude.

Features

The MCP server provides 13 tools for AI-driven visual development:

🔍 Inspection Tools

aieye_inspect_element - Get detailed element information with source mapping
aieye_get_source - Get source file location for any element
aieye_get_structure - Get page DOM structure with source mapping
aieye_find_elements - Find elements by text, role, or CSS selector

🎯 Interaction Tools

aieye_click - Click elements
aieye_type - Type text into inputs
aieye_scroll - Scroll the page

📸 Debugging Tools

aieye_screenshot - Capture page screenshots
aieye_get_console_logs - Retrieve console logs
aieye_clear_console_logs - Clear captured logs
aieye_get_hmr_status - Check HMR status

🌐 Navigation Tools

aieye_navigate - Navigate to URLs
aieye_get_current_url - Get current page URL

Installation

The MCP server is automatically built as part of the AI Eye workspace:

# Build all packages
npx pnpm build

# Or build just the MCP server
cd packages/mcp-server
npx pnpm build

Usage with Claude Desktop

Configuration

Add to your Claude Desktop config at ~/Library/Application Support/Claude/claude_desktop_config.json:

{
  "mcpServers": {
    "aieye": {
      "command": "node",
      "args": ["/Users/yourname/path/to/vscode-eye/packages/mcp-server/dist/index.js"]
    }
  }
}

Important: Replace /Users/yourname/path/to/vscode-eye with the actual path to your AI Eye project.

Starting the Preview

Before using the MCP tools, make sure your dev server is running:

cd examples/react-vite-app
npx pnpm dev

Note the port number (e.g., http://localhost:5176).

Using with Claude

Once configured, you can ask Claude to interact with your UI:

Example prompts:

"Take a screenshot of the current page"
- Claude will call aieye_screenshot
- Returns base64-encoded image
"What button elements are on the page?"
- Claude will call aieye_find_elements with query "button"
- Returns array of buttons with selectors and source locations
"Click the increment button"
- Claude will call aieye_find_elements to find button
- Then aieye_click to click it
"Where is the h1 element defined in the source code?"
- Claude will call aieye_get_source with selector "h1"
- Returns file path, line, and column
"Show me the page structure"
- Claude will call aieye_get_structure
- Returns DOM tree with source mapping

How It Works

STDIO Transport: MCP server communicates with Claude via standard input/output
Playwright Connection: Connects to browser using Playwright
Bridge API: Calls window.__AIEYE__ APIs injected by Vite plugin
Source Mapping: All elements include source file locations

Architecture

Claude Desktop
    ↕ (STDIO)
MCP Server
    ↕ (Playwright/CDP)
Browser (Chromium)
    ↕ (window.__AIEYE__)
Your React App

Development

Build

pnpm build

Watch Mode

pnpm dev

Test Locally

# Start dev server in one terminal
cd examples/react-vite-app && npx pnpm dev

# Test MCP server (requires browser to be running)
cd packages/mcp-server
node dist/index.js

The server will wait for STDIO input in MCP format.

Tool Reference

aieye_screenshot

Capture a screenshot of the current page.

Parameters:

format (optional): "jpeg" or "png" (default: "jpeg")
quality (optional): 0-100 (default: 80, only for JPEG)

Returns: Base64-encoded image data

aieye_inspect_element

Inspect an element and get detailed information.

Parameters:

selector (required): CSS selector (e.g., "button.primary")

Returns:

{
  "tagName": "BUTTON",
  "sourceFile": "src/App.tsx",
  "sourceLine": "41",
  "sourceCol": "12",
  "boundingBox": { "x": 100, "y": 200, "width": 80, "height": 32 },
  "computedStyles": { "color": "rgb(255, 255, 255)", ... },
  "attributes": { "class": "primary" }
}

aieye_get_source

Get source location for an element.

Parameters:

selector (required): CSS selector

Returns:

{
  "file": "src/App.tsx",
  "line": "41",
  "col": "12"
}

aieye_click

Click an element.

Parameters:

selector (required): CSS selector

Returns:

{
  "success": true
}

aieye_type

Type text into an input element.

Parameters:

selector (required): CSS selector
text (required): Text to type

Returns:

{
  "success": true
}

aieye_find_elements

Find elements by text, role, or CSS selector.

Parameters:

query (required): Search query
mode (optional): "text", "role", or "css" (default: "css")

Returns: Array of matching elements with selectors and source locations

aieye_get_structure

Get page DOM structure.

Parameters:

maxDepth (optional): Maximum traversal depth (default: 5)

Returns: Tree structure with source mapping

aieye_get_console_logs

Retrieve captured console logs.

Parameters:

level (optional): Filter by "log", "warn", "error", or "info"
limit (optional): Maximum number of logs to return

Returns: Array of console logs with timestamps

Troubleshooting

"Failed to connect to browser"

Ensure your dev server is running (npx pnpm dev)
Check the port number matches (default: 5176)
Try closing other Chrome/Chromium instances

"Bridge not available"

Verify the Vite plugin is enabled in vite.config.ts
Restart the dev server to pick up plugin changes
Check browser console for bridge initialization messages

"Command not found: node"

Ensure Node.js is installed and in PATH
Use absolute path to node in Claude Desktop config

For AI Agents

Overview

This MCP server is your primary interface to AI Eye. As an AI agent, you communicate with this server through the Model Context Protocol (MCP) to interact with the user's browser and application.

You have access to 14 tools (see list at top of this README) that allow you to:

See the UI (screenshots)
Inspect elements (get source locations)
Interact with UI (click, type, scroll)
Debug issues (console logs, HMR status)

How Communication Works

You (AI Agent)
  ↕ MCP Protocol (JSON-RPC over STDIO)
This MCP Server
  ↕ Playwright/CDP
Browser (Running user's app)
  ↕ window.__AIEYE__
User's Application (with AI Eye plugin)

Key points:

You don't talk to the browser directly - you talk to this MCP server
This server handles browser automation using Playwright
The browser has AI Eye bridge injected for source mapping
All responses are JSON (tool results)

Connection Modes

The MCP server supports three connection modes:

Playwright Launch (default)
- MCP server launches its own browser
- Most reliable and feature-complete
- Higher memory usage (~200MB)
CDP Connect
- Connects to existing Chrome with remote debugging
- Lower memory usage
- Requires user to start Chrome with --remote-debugging-port=9222
CDP-Only (fallback)
- No bridge available
- Limited functionality (screenshots, clicking)
- Falls back when AI Eye plugin not configured

Tool Usage Patterns

Pattern 1: Visual Debugging

User: "The button looks weird"

You:
1. aieye_screenshot           → See the button
2. aieye_find_elements        → Find button selector
3. aieye_inspect_element      → Get styles & source location
4. [Read source file at src/Button.tsx:45]
5. [Identify CSS issue]
6. [Fix code]
7. aieye_screenshot           → Verify fix

Pattern 2: Feature Request

User: "Add a dark mode toggle"

You:
1. aieye_screenshot           → See current UI
2. aieye_get_structure        → Understand layout
3. [Create new component]
4. aieye_get_console_logs     → Check for errors
5. aieye_screenshot           → Show new feature

Pattern 3: Bug Investigation

User: "Something's broken but I don't know what"

You:
1. aieye_get_console_logs     → Check for errors
2. aieye_screenshot           → See error state
3. aieye_inspect_element      → Find problematic element
4. aieye_get_source           → Get code location
5. [Analyze & fix]
6. aieye_get_console_logs     → Verify no errors

Tool Selection Guide

For seeing UI:

aieye_screenshot - Takes ~1-2 seconds, use sparingly

For finding elements:

aieye_find_elements - Search by text, role, or CSS
Use this when you don't know the exact selector

For inspecting:

aieye_get_source - Fast, just returns file location
aieye_inspect_element - Slower, returns everything (styles, attributes, location)
Tip: Use get_source if you only need the file location

For interacting:

aieye_click - Click buttons, links, etc.
aieye_type - Fill forms
aieye_scroll - Access elements below fold

For debugging:

aieye_get_console_logs - Check for errors (use filter: "error")
aieye_get_hmr_status - Check if HMR is working
aieye_clear_console_logs - Clear logs before testing

Common Errors and Solutions

Error: "Not connected to browser"

What it means: No browser session active

AI should:

Check if user's dev server is running
Tell user to start dev server: npm run dev
Verify URL with user
Try again after confirmation

Error: "Element not found: button.primary"

What it means: CSS selector didn't match anything

AI should:

Take screenshot to see what's actually there
Use aieye_find_elements with text search:
```
{"query": "Submit", "mode": "text"}
```
Get list of possible matches
Choose correct selector from results

Error: "Bridge not available"

What it means: AI Eye Vite/Babel plugin not configured

AI should:

Explain plugin is required for source mapping
Guide user to install: npm install @ai-eye/vite-plugin --save-dev
Guide user to configure vite.config.ts
Tell user to restart dev server

Error: "Source mapping not available for this element"

What it means: Element is from third-party library (in node_modules)

AI should:

Explain this is expected for library components
Say you can still inspect styles/attributes
But can't navigate to source (it's not user's code)

Configuration for Users

When setting up AI Eye with users, guide them to add this to their Claude Desktop config:

macOS:

{
  "mcpServers": {
    "aieye": {
      "command": "node",
      "args": ["/absolute/path/to/ai-eye/packages/mcp-server/dist/index.js"],
      "env": {
        "AIEYE_URL": "http://localhost:5173",
        "AIEYE_MODE": "playwright-launch"
      }
    }
  }
}

Environment variables:

AIEYE_URL - Dev server URL (default: http://localhost:5173)
AIEYE_MODE - Connection mode:
- playwright-launch (default) - Launch own browser
- cdp-connect - Connect to existing Chrome
- cdp-only - CDP without bridge (fallback)

Best Practices for AI Agents

DO:

✅ Always start with aieye_screenshot to see current state
✅ Use aieye_get_source for quick source lookups
✅ Check aieye_get_console_logs after interactions
✅ Clear console logs before testing: aieye_clear_console_logs
✅ Use aieye_find_elements to search by description
✅ Take before/after screenshots to verify changes

DON'T:

❌ Don't take screenshots repeatedly without changes (slow)
❌ Don't guess CSS selectors - use find_elements instead
❌ Don't use inspect_element if you only need file location
❌ Don't forget to check console for errors after clicking/typing

Performance Tips

Screenshots are slow (~1-2s) - use sparingly
get_source is fast (~50ms) - prefer over inspect_element when possible
find_elements is efficient - better than guessing selectors
get_structure gets multiple elements at once (efficient)

Verification Checklist

Before telling user "AI Eye is set up correctly", verify:

□ 1. Dev server is running
     [User confirms or you check with them]

□ 2. Navigate to dev server
     aieye_navigate: "http://localhost:5173"

□ 3. Page loads successfully
     aieye_get_current_url
     Should match the navigate URL

□ 4. Can take screenshot
     aieye_screenshot
     Should return image data

□ 5. Bridge is available (source mapping works)
     aieye_inspect_element: "body"
     Check if sourceFile/sourceLine exist

□ 6. Console logs work
     aieye_get_console_logs
     Should return array (may be empty)

□ 7. HMR tracking works
     aieye_get_hmr_status
     Should show connected: true

If all checks pass → AI Eye is fully operational ✅

Debugging MCP Server Issues

When things aren't working:

Check 1: Is MCP server running?

Should see it in Claude Desktop's server list
Check Claude Desktop logs for startup errors

Check 2: Can it connect to browser?

aieye_get_current_url

Success → Connected ✅
Error → Not connected ❌

Check 3: Is Playwright installed?

MCP server uses Playwright
User may need to: npx playwright install chromium

Check 4: Is bridge available?

aieye_inspect_element: "body"

Has sourceFile → Bridge working ✅
No sourceFile → Bridge not available ❌

Example AI Workflow

User: "Help me debug why my app won't load"

AI Response:

Let me investigate using AI Eye:

1. Taking screenshot of current state...
   [calls aieye_screenshot]

   I can see a blank page with an error overlay.

2. Checking console for errors...
   [calls aieye_get_console_logs with level: "error"]

   Found error: "Uncaught TypeError: Cannot read property 'map' of undefined"
   at App.tsx:45

3. Let me inspect the problematic component...
   [reads src/App.tsx:45]

   Issue: You're calling .map() on 'items' but it's undefined.
   This happens when data hasn't loaded yet.

4. Fix: Add a check before mapping:
   {items?.map(...)} or {items && items.map(...)}

5. Verifying fix...
   [calls aieye_get_console_logs again]

   No errors! ✅

6. Taking screenshot to confirm...
   [calls aieye_screenshot]

   App is now loading correctly!

Tool Capabilities Summary

| Tool | Speed | Use When | |------|-------|----------| | screenshot | Slow (1-2s) | Need to see UI | | navigate | Medium (500ms) | Changing pages | | inspect_element | Medium (200ms) | Need full element details | | get_source | Fast (50ms) | Only need file location | | get_structure | Medium (300ms) | Need component tree | | find_elements | Fast (100ms) | Searching for elements | | click | Fast (100ms) | Testing interactions | | type | Fast (150ms) | Filling forms | | scroll | Fast (50ms) | Accessing below fold | | get_console_logs | Fast (10ms) | Checking for errors | | clear_console_logs | Fast (5ms) | Resetting before test | | get_hmr_status | Fast (10ms) | Checking HMR | | clear_hmr_errors | Fast (5ms) | Resetting HMR logs | | get_current_url | Fast (10ms) | Verifying navigation |

License

MIT

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

aieye

Features

🔍 Inspection Tools

🎯 Interaction Tools

📸 Debugging Tools

🌐 Navigation Tools

Installation

Usage with Claude Desktop

Configuration

Starting the Preview

Using with Claude

How It Works

Architecture

Development

Build

Watch Mode

Test Locally

Tool Reference

aieye_screenshot

aieye_inspect_element

aieye_get_source

aieye_click

aieye_type

aieye_find_elements

aieye_get_structure

aieye_get_console_logs

Troubleshooting

"Failed to connect to browser"

"Bridge not available"

"Command not found: node"

For AI Agents

Overview

How Communication Works

Connection Modes

Tool Usage Patterns

Tool Selection Guide

Common Errors and Solutions

Configuration for Users

Best Practices for AI Agents

Performance Tips

Verification Checklist

Debugging MCP Server Issues

Example AI Workflow

Tool Capabilities Summary

License