@stackone/testing-agent

v0.0.2

Published

5 months ago

Automated testing agent for MCP servers

0High
0Medium
0Low

s1guillaume

jorge.stackone

stackone-publisher

mcp modelcontextprotocol testing claude-code automation

MCP Testing Agent

Automated testing agent for model context protocol (MCP) servers. Test your MCP servers with just a URL.

Overview

This tool provides automated testing for MCP servers using the HTTP streaming transport. It leverages Claude Code SDK to systematically test all available tools exposed by an MCP server, generating comprehensive test reports.

The agent automatically discovers available tools from the MCP server by connecting and listing all tools before testing begins.

Features

Auto-discovery: Automatically discovers and tests all available MCP tools
AI-powered testing: Uses Claude to generate realistic test data and validate responses
Comprehensive reports: Generates detailed JSON reports with test results
Clean testing: Attempts to clean up created resources after testing
Authentication support: Supports custom headers for authenticated endpoints

Installation

NPM

npx @stackone/testing-agent --transport http --url http://localhost:3000/mcp

Bun

bunx @stackone/testing-agent --transport http --url http://localhost:3000/mcp

From Source

git clone https://github.com/your-org/mcp-connectors.git
cd apps/testing-agent
bun install
bun start --transport http --url http://localhost:3000/mcp

Usage

Prerequisites

Set your Anthropic API key in environment variables:

export ANTHROPIC_API_KEY=your-api-key

Or create a .env file (Bun will load it automatically):

ANTHROPIC_API_KEY=your-api-key

Basic Usage

Test an MCP server running locally:

bunx @stackone/testing-agent --transport http --url http://localhost:3000/mcp

With Authentication

Include authentication headers:

bunx @stackone/testing-agent \
  --transport http \
  --url https://api.example.com/mcp \
  --headers '{"Authorization": "Bearer your-token"}'

How It Works

The testing agent follows this process:

Connect to MCP server: Establishes connection using the MCP client SDK
Discover tools: Automatically discovers all available tools by calling listTools() on the official MCP client SDK
Configure Claude Code: Sets up Claude Code SDK with the discovered tools
Test each tool: Systematically tests each discovered tool with realistic data
Generate report: Creates a detailed JSON report with test results

Test Process

For each discovered tool, the agent:

Generates realistic input data using AI
Calls the tool with the generated data
Validates the response
Documents success/failure status
Provides improvement suggestions when applicable
Attempts cleanup of any created resources

Output

Test results are saved to .agent/results_<server>_<timestamp>.json with the following structure:

{
  "timestamp": "2025-01-09T10:30:00.000Z",
  "server_url": "http://localhost:3000/mcp",
  "tool_count": 2,
  "tools_tested": [
    {
      "name": "search",
      "input": {
        "query": "python programming tutorial",
        "maxResults": 5
      },
      "expected_output": "list of search results with urls and summaries",
      "actual_output": { ... },
      "success": true,
      "suggestions": "consider adding pagination support"
    }
  ]
}

Architecture

The testing agent consists of:

CLI interface (src/cli.ts): Parses command-line arguments
Tool discovery (src/discover-tools.ts): Connects to MCP server and lists tools
Test orchestrator (src/index.ts): Main testing logic and Claude Code integration
Prompt generator (src/prompt.ts): Creates testing prompts for Claude
Test data generator (src/test-data-mcp-server.ts): Internal MCP server for generating test data

Development

Prerequisites

Node.js 18+ or Bun
TypeScript
An Anthropic API key

Setup

# Clone the repository
git clone https://github.com/your-org/mcp-connectors.git
cd apps/testing-agent

# Install dependencies
bun install

# Run tests
bun test

# Build for production
bun run build

Limitations

Currently only supports HTTP streaming transport
Requires an Anthropic API key
Testing is limited to 50 turns to prevent infinite loops
Large response outputs may be truncated in console logs

License

MIT

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

MCP Testing Agent

Overview

Features

Installation

NPM

Bun

From Source

Usage

Prerequisites

Basic Usage

With Authentication

How It Works

Test Process

Output

Architecture

Development

Prerequisites

Setup

Limitations

License