@dotsetlabs/bellwether
v2.1.0
Published
The open-source MCP testing tool. Structural drift detection and behavioral documentation for Model Context Protocol servers.
Maintainers
Readme
Bellwether
The open-source MCP testing tool. Catch drift before your users do.
What is MCP? MCP (Model Context Protocol) is how AI assistants like Claude connect to external tools—read files, query databases, call APIs. When those tool schemas change, AI workflows break silently.
Why Bellwether?
MCP servers expose tools with JSON schemas. When those schemas change—a parameter renamed, a type modified, a tool removed—AI agents break silently. Bellwether catches these changes before they reach production.
| Problem | Solution | |:--------|:---------| | Breaking changes slip into production | Drift detection catches schema changes in CI | | No standard for MCP testing | Native MCP support understands tools, prompts, resources | | Manual testing misses edge cases | Automated exploration covers what humans miss | | Documentation gets stale | CONTRACT.md generated from actual behavior |
Quick Start
npm install -g @dotsetlabs/bellwether
bellwether init npx @mcp/your-server
bellwether checkThat's it. No API keys. No LLM costs. Runs in seconds.
Two Modes
| Mode | Purpose | Cost | When to Use |
|:-----|:--------|:-----|:------------|
| check | Schema drift detection | Free | CI/CD, every PR |
| explore | LLM-powered behavioral testing | LLM API costs | Local dev, deep analysis |
Most users only need check. It's deterministic, fast, and catches the changes that break AI agents.
CI/CD Workflow
Store your baseline in git. Run checks in CI. No account needed.
# 1. Initialize and save baseline (one-time setup)
bellwether init npx @mcp/your-server
bellwether check
bellwether baseline save
git add bellwether.yaml bellwether-baseline.json
git commit -m "Add Bellwether baseline"# 2. Add to CI (.github/workflows/bellwether.yml)
name: MCP Drift Detection
on: [pull_request]
jobs:
check:
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v4
- run: npx @dotsetlabs/bellwether check --fail-on-driftWhat It Detects
| Change | Example | Severity |
|:-------|:--------|:---------|
| Tool added/removed | delete_file appears or disappears | Breaking |
| Schema changed | Parameter path becomes required | Breaking |
| Parameter renamed | path to file_path | Breaking |
| Description changed | Tool help text updated | Warning |
| Performance regression | Latency increased >10% | Warning |
| Tool annotations changed | readOnlyHint flipped to false | Warning |
| Output schema changed | Return type structure modified | Warning |
| Entity title changed | Tool/prompt/resource title updated | Info |
| Task support changed | Execution mode switched to async | Warning |
| Server instructions changed | Server-level instructions updated | Info |
| Prompt added/removed | Prompt template appears or disappears | Breaking |
| Resource changed | Resource URI or MIME type modified | Warning |
Comparisons are protocol-version-aware — version-specific fields (annotations, titles, output schemas, etc.) are only compared when both baselines support the relevant MCP protocol version.
Commands
Essential Commands
bellwether init <server-command> # Create config
bellwether check # Detect drift (free, deterministic)
bellwether baseline save # Save baseline to compare against
bellwether baseline compare # Compare current vs saved baselineExplore Command (Optional)
bellwether init --preset local npx @mcp/server # Uses Ollama (free)
bellwether explore # LLM-powered testingRequires LLM (Ollama for free local, or OpenAI/Anthropic). Generates AGENTS.md with behavioral documentation.
All Commands
| Command | Purpose |
|:--------|:--------|
| init | Create bellwether.yaml config |
| check | Schema drift detection (free) |
| explore | LLM behavioral testing |
| baseline save | Save test results as baseline |
| baseline compare | Compare against baseline |
| baseline show | Display baseline contents |
| baseline accept | Accept drift as intentional |
| baseline diff | Compare two baselines |
| discover | Show server capabilities |
| watch | Continuous checking on file changes |
| registry | Search MCP Registry |
| golden | Golden output regression testing |
| contract | Contract validation (generate/validate/show) |
| auth | Manage LLM provider API keys |
| validate-config | Validate bellwether.yaml without running tests |
CI/CD Exit Codes
| Code | Meaning | Suggested Action |
|:-----|:--------|:-----------------|
| 0 | No changes | Pass |
| 1 | Info-level changes | Pass or warn |
| 2 | Warning-level changes | Warn |
| 3 | Breaking changes | Fail |
| 4 | Runtime error | Fail |
| 5 | Low confidence metrics | Warn or fail |
GitHub Action
- uses: dotsetlabs/[email protected]
with:
version: '2.0.0'
server-command: 'npx @mcp/your-server'
baseline-path: './bellwether-baseline.json'
fail-on-severity: 'warning'Configuration
All settings live in bellwether.yaml. Create one with presets:
bellwether init npx @mcp/your-server # Default (free, fast)
bellwether init --preset ci npx @mcp/server # Optimized for CI/CD
bellwether init --preset local npx @mcp/server # Local Ollama (free)Environment Variables
| Variable | Description |
|:---------|:------------|
| OPENAI_API_KEY | OpenAI API key (explore only) |
| ANTHROPIC_API_KEY | Anthropic API key (explore only) |
| OLLAMA_BASE_URL | Ollama URL (default: http://localhost:11434) |
Documentation
docs.bellwether.sh — Full reference for configuration and commands.
Community
- GitHub Discussions - Questions and ideas
- GitHub Issues - Bug reports
- Contributing - How to contribute
Development
git clone https://github.com/dotsetlabs/bellwether
cd bellwether
npm install
npm run build
npm testLicense
MIT License - see LICENSE for details.
