@stackone/testing-agent
v0.0.2
Published
Automated testing agent for MCP servers
Readme
MCP Testing Agent
Automated testing agent for model context protocol (MCP) servers. Test your MCP servers with just a URL.
Overview
This tool provides automated testing for MCP servers using the HTTP streaming transport. It leverages Claude Code SDK to systematically test all available tools exposed by an MCP server, generating comprehensive test reports.
The agent automatically discovers available tools from the MCP server by connecting and listing all tools before testing begins.
Features
- Auto-discovery: Automatically discovers and tests all available MCP tools
- AI-powered testing: Uses Claude to generate realistic test data and validate responses
- Comprehensive reports: Generates detailed JSON reports with test results
- Clean testing: Attempts to clean up created resources after testing
- Authentication support: Supports custom headers for authenticated endpoints
Installation
NPM
npx @stackone/testing-agent --transport http --url http://localhost:3000/mcpBun
bunx @stackone/testing-agent --transport http --url http://localhost:3000/mcpFrom Source
git clone https://github.com/your-org/mcp-connectors.git
cd apps/testing-agent
bun install
bun start --transport http --url http://localhost:3000/mcpUsage
Prerequisites
Set your Anthropic API key in environment variables:
export ANTHROPIC_API_KEY=your-api-keyOr create a .env file (Bun will load it automatically):
ANTHROPIC_API_KEY=your-api-keyBasic Usage
Test an MCP server running locally:
bunx @stackone/testing-agent --transport http --url http://localhost:3000/mcpWith Authentication
Include authentication headers:
bunx @stackone/testing-agent \
--transport http \
--url https://api.example.com/mcp \
--headers '{"Authorization": "Bearer your-token"}'How It Works
The testing agent follows this process:
- Connect to MCP server: Establishes connection using the MCP client SDK
- Discover tools: Automatically discovers all available tools by calling
listTools()on the official MCP client SDK - Configure Claude Code: Sets up Claude Code SDK with the discovered tools
- Test each tool: Systematically tests each discovered tool with realistic data
- Generate report: Creates a detailed JSON report with test results
Test Process
For each discovered tool, the agent:
- Generates realistic input data using AI
- Calls the tool with the generated data
- Validates the response
- Documents success/failure status
- Provides improvement suggestions when applicable
- Attempts cleanup of any created resources
Output
Test results are saved to .agent/results_<server>_<timestamp>.json with the following structure:
{
"timestamp": "2025-01-09T10:30:00.000Z",
"server_url": "http://localhost:3000/mcp",
"tool_count": 2,
"tools_tested": [
{
"name": "search",
"input": {
"query": "python programming tutorial",
"maxResults": 5
},
"expected_output": "list of search results with urls and summaries",
"actual_output": { ... },
"success": true,
"suggestions": "consider adding pagination support"
}
]
}Architecture
The testing agent consists of:
- CLI interface (
src/cli.ts): Parses command-line arguments - Tool discovery (
src/discover-tools.ts): Connects to MCP server and lists tools - Test orchestrator (
src/index.ts): Main testing logic and Claude Code integration - Prompt generator (
src/prompt.ts): Creates testing prompts for Claude - Test data generator (
src/test-data-mcp-server.ts): Internal MCP server for generating test data
Development
Prerequisites
- Node.js 18+ or Bun
- TypeScript
- An Anthropic API key
Setup
# Clone the repository
git clone https://github.com/your-org/mcp-connectors.git
cd apps/testing-agent
# Install dependencies
bun install
# Run tests
bun test
# Build for production
bun run buildLimitations
- Currently only supports HTTP streaming transport
- Requires an Anthropic API key
- Testing is limited to 50 turns to prevent infinite loops
- Large response outputs may be truncated in console logs
License
MIT
