@physics91/openrouter-mcp
v1.3.1
Published
A Model Context Protocol (MCP) server for OpenRouter API with Collective Intelligence - Multi-model consensus, ensemble reasoning, and collaborative problem solving
Maintainers
Readme
OpenRouter MCP Server
🚀 A powerful Model Context Protocol (MCP) server that provides seamless access to multiple AI models through OpenRouter's unified API.
✨ Features
- 🧠 Collective Intelligence System: Advanced multi-model collaboration and consensus building
- 5 specialized MCP tools for ensemble reasoning and intelligent decision-making
- Multi-model consensus with automated agreement analysis and quality scoring
- Ensemble reasoning using specialized models for different task aspects
- Adaptive model selection based on task context, requirements, and performance metrics
- Cross-model validation for content quality assurance and accuracy verification
- Collaborative problem-solving through iterative multi-model interaction
- 🤖 Multi-Model Access: Chat with GPT-4o, Claude 3.5, Llama 3.3, Gemini 2.5, and 200+ other AI models
- 🖼️ Vision/Multimodal Support: Analyze images and visual content with vision-capable models
- Support for base64-encoded images and image URLs
- Automatic image resizing and optimization for API limits
- Compatible with GPT-4o, Claude 3.5, Gemini 2.5, Llama Vision, and more
- 🚀 Latest Models (Jan 2025): Always up-to-date with the newest models
- OpenAI o1, GPT-4o, GPT-4 Turbo
- Claude 3.5 Sonnet, Claude 3 Opus
- Gemini 2.5 Pro/Flash (1M+ context)
- DeepSeek V3, Grok 2, and more
- ⚡ Intelligent Caching: Smart model list caching for improved performance
- Dual-layer memory + file caching with configurable TTL
- Automatic model metadata enhancement and categorization
- Advanced filtering by provider, category, capabilities, and performance tiers
- Statistics tracking and cache optimization
- 🏷️ Rich Metadata: Comprehensive model information with intelligent extraction
- Automatic provider detection (OpenAI, Anthropic, Google, Meta, DeepSeek, XAI, etc.)
- Smart categorization (chat, image, audio, embedding, reasoning, code, multimodal)
- Advanced capability detection (vision, functions, tools, JSON mode, streaming)
- Performance tiers (premium/standard/economy) and cost analysis
- Version parsing with family identification and latest model detection
- Quality scoring system (0-10) based on context length, pricing, and capabilities
- 🔄 Streaming Support: Real-time response streaming for better user experience
- 📊 Advanced Model Benchmarking: Comprehensive performance analysis system
- Side-by-side model comparison with detailed metrics (response time, cost, quality, throughput)
- Category-based model selection (chat, code, reasoning, multimodal)
- Weighted performance analysis for different use cases
- Multiple report formats (Markdown, CSV, JSON)
- Historical benchmark tracking and trend analysis
- 5 MCP tools for seamless integration with Claude Desktop
- 💰 Usage Tracking: Monitor API usage, costs, and token consumption
- 🛡️ Enterprise Security: Multi-layered security with defense-in-depth architecture
- API keys NEVER stored in config files (environment variables only)
- Privacy-preserving logging with automatic data sanitization
- Response redaction in error messages to prevent data leaks
- Opt-in verbose mode with explicit consent warnings
- OWASP Top 10 compliant security controls
- 🛡️ Error Handling: Robust error handling with detailed logging
- 🔧 Easy Setup: One-command installation with
npx - 🖥️ Claude Desktop Integration: Seamless integration with Claude Desktop app
- 📚 Full MCP Compliance: Implements Model Context Protocol standards
🚀 Quick Start
Option 1: Using npx (Recommended)
# Initialize configuration
npx @physics91/openrouter-mcp init
# Start the server
npx @physics91/openrouter-mcp startOption 2: Global Installation
# Install globally
npm install -g @physics91/openrouter-mcp
# Initialize and start
openrouter-mcp init
openrouter-mcp start📋 Prerequisites
- Node.js 16+: Required for CLI interface
- Python 3.9+: Required for the MCP server backend
- OpenRouter API Key: Get one free at openrouter.ai
🛠️ Installation & Configuration
1. Get Your OpenRouter API Key
- Visit OpenRouter
- Sign up for a free account
- Navigate to the API Keys section
- Create a new API key
2. Initialize the Server
npx @physics91/openrouter-mcp initThis will:
- Prompt you for your OpenRouter API key
- Create a
.envconfiguration file - Optionally set up Claude Desktop integration
3. Start the Server
npx @physics91/openrouter-mcp startThe server will start on localhost:8000 by default.
🎯 Usage
Available Commands
# Show help
npx openrouter-mcp --help
# Initialize configuration
npx openrouter-mcp init
# Start the server
npx openrouter-mcp start [options]
# Check server status
npx openrouter-mcp status
# Configure Claude Desktop integration
npx openrouter-mcp install-claude
# Configure Claude Code CLI integration
npx openrouter-mcp install-claude-codeStart Server Options
# Custom port and host
npx openrouter-mcp start --port 9000 --host 0.0.0.0
# Enable verbose logging
npx openrouter-mcp start --verbose
# Enable debug mode
npx openrouter-mcp start --debug🤖 Claude Desktop Integration
Automatic Setup
npx openrouter-mcp install-claudeThis automatically configures Claude Desktop to use OpenRouter models.
Manual Setup
Add to your Claude Desktop config file:
macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%/Claude/claude_desktop_config.json
Linux: ~/.config/claude/claude_desktop_config.json
{
"mcpServers": {
"openrouter": {
"command": "npx",
"args": ["openrouter-mcp", "start"],
"env": {
"OPENROUTER_API_KEY": "your-openrouter-api-key"
}
}
}
}Then restart Claude Desktop.
💻 Claude Code CLI Integration
Automatic Setup
npx openrouter-mcp install-claude-codeThis automatically configures Claude Code CLI to use OpenRouter models.
Manual Setup
Add to your Claude Code CLI config file at ~/.claude/claude_code_config.json:
{
"mcpServers": {
"openrouter": {
"command": "npx",
"args": ["openrouter-mcp", "start"],
"env": {
"OPENROUTER_API_KEY": "your-openrouter-api-key"
}
}
}
}Usage with Claude Code CLI
Once configured, you can use OpenRouter models directly in your terminal:
# Chat with different AI models
claude "Use GPT-4 to explain this complex algorithm"
claude "Have Claude Opus review my Python code"
claude "Ask Llama 2 to suggest optimizations"
# Model discovery and comparison
claude "List all available AI models and their pricing"
claude "Compare GPT-4 and Claude Sonnet for code generation"
# Usage tracking
claude "Show my OpenRouter API usage for today"
claude "Which AI models am I using most frequently?"For detailed setup instructions, see Claude Code CLI Integration Guide.
🛠️ Available MCP Tools
Once integrated with Claude Desktop or Claude Code CLI, you'll have access to these tools:
1. chat_with_model
Chat with any available AI model.
Parameters:
model: Model ID (e.g., "openai/gpt-4o", "anthropic/claude-3.5-sonnet")messages: Conversation historytemperature: Creativity level (0.0-2.0)max_tokens: Maximum response lengthstream: Enable streaming responses
Example:
{
"model": "openai/gpt-4o",
"messages": [
{"role": "system", "content": "You are a helpful assistant."},
{"role": "user", "content": "Explain quantum computing"}
],
"temperature": 0.7
}2. list_available_models
Get comprehensive information about all available models with enhanced metadata.
Parameters:
filter_by: Optional filter by model nameprovider: Filter by provider (openai, anthropic, google, etc.)category: Filter by category (chat, image, reasoning, etc.)capabilities: Filter by specific capabilitiesperformance_tier: Filter by tier (premium, standard, economy)min_quality_score: Minimum quality score (0-10)
Returns:
- Model IDs, names, descriptions with enhanced metadata
- Provider and category classification
- Detailed pricing and context information
- Capability flags (vision, functions, streaming, etc.)
- Performance metrics and quality scores
- Version information and latest model indicators
3. get_usage_stats
Track your API usage and costs.
Parameters:
start_date: Start date (YYYY-MM-DD)end_date: End date (YYYY-MM-DD)
Returns:
- Total costs and token usage
- Request counts
- Model-specific breakdowns
4. chat_with_vision 🖼️
Chat with vision-capable models by sending images.
Parameters:
model: Vision-capable model ID (e.g., "openai/gpt-4o", "anthropic/claude-3-opus", "google/gemini-pro-vision")messages: Conversation history (supports both text and image content)images: List of images (file paths, URLs, or base64 strings)temperature: Creativity level (0.0-2.0)max_tokens: Maximum response length
Image Format Support:
- File paths:
/path/to/image.jpg,./image.png - URLs:
https://example.com/image.jpg - Base64: Direct base64 strings (with or without data URI prefix)
Example - Multiple Images:
{
"model": "openai/gpt-4o",
"messages": [
{"role": "user", "content": "Compare these images and describe the differences"}
],
"images": [
{"data": "/home/user/image1.jpg", "type": "path"},
{"data": "https://example.com/image2.png", "type": "url"},
{"data": "...", "type": "base64"}
]
}Features:
- Automatic image format detection and conversion
- Image resizing for API size limits (20MB max)
- Support for JPEG, PNG, GIF, and WebP formats
- Batch processing of multiple images
5. list_vision_models 🖼️
Get all vision-capable models.
Parameters: None
Returns:
- List of models that support image analysis
- Model capabilities and pricing information
- Context window sizes for multimodal content
Example Vision Models:
openai/gpt-4o: OpenAI's latest multimodal modelopenai/gpt-4o-mini: Fast and cost-effective vision modelanthropic/claude-3-opus: Most capable Claude vision modelanthropic/claude-3-sonnet: Balanced Claude vision modelgoogle/gemini-pro-vision: Google's multimodal AImeta-llama/llama-3.2-90b-vision-instruct: Meta's vision-capable Llama model
6. benchmark_models 📊
Compare multiple AI models with the same prompt.
Parameters:
models: List of model IDs to benchmarkprompt: The prompt to send to each modeltemperature: Temperature setting (0.0-2.0)max_tokens: Maximum response tokensruns_per_model: Number of runs per model for averaging
Returns:
- Performance metrics (response time, cost, tokens)
- Model rankings by speed, cost, and reliability
- Individual responses from each model
7. compare_model_categories 🏆
Compare the best models from different categories.
Parameters:
categories: List of categories to compareprompt: Test promptmodels_per_category: Number of top models per category
Returns:
- Category-wise comparison results
- Best performers in each category
8. get_benchmark_history 📚
Retrieve historical benchmark results.
Parameters:
limit: Maximum number of results to returndays_back: Number of days to look backmodel_filter: Optional model ID filter
Returns:
- List of past benchmark results
- Performance trends over time
- Summary statistics
9. export_benchmark_report 📄
Export benchmark results in different formats.
Parameters:
benchmark_file: Benchmark result file to exportformat: Output format ("markdown", "csv", "json")output_file: Optional custom output filename
Returns:
- Exported report file path
- Export status and summary
10. compare_model_performance ⚖️
Advanced model comparison with weighted metrics.
Parameters:
models: List of model IDs to compareweights: Metric weights (speed, cost, quality, throughput)include_cost_analysis: Include detailed cost analysis
Returns:
- Weighted performance rankings
- Cost-effectiveness analysis
- Usage recommendations for different scenarios
🧠 Collective Intelligence Tools
The following advanced tools leverage multiple AI models for enhanced accuracy and insights:
11. collective_chat_completion 🤝
Generate chat completion using collective intelligence with multiple models to reach consensus.
Parameters:
prompt: The prompt to process collectivelymodels: Optional list of specific models to usestrategy: Consensus strategy ("majority_vote", "weighted_average", "confidence_threshold")min_models: Minimum number of models to use (default: 3)max_models: Maximum number of models to use (default: 5)temperature: Sampling temperature (default: 0.7)system_prompt: Optional system prompt for all models
Returns:
consensus_response: The agreed-upon responseagreement_level: Level of agreement between modelsconfidence_score: Confidence in the consensusparticipating_models: List of models that participatedindividual_responses: Responses from each modelquality_metrics: Accuracy, consistency, and completeness scores
12. ensemble_reasoning 🎯
Perform ensemble reasoning using specialized models for different aspects of complex problems.
Parameters:
problem: Problem to solve with ensemble reasoningtask_type: Type of task ("reasoning", "analysis", "creative", "factual", "code_generation")decompose: Whether to decompose the problem into subtasksmodels: Optional list of specific models to usetemperature: Sampling temperature (default: 0.7)
Returns:
final_result: The combined reasoning resultsubtask_results: Results from individual subtasksmodel_assignments: Which models handled which subtasksreasoning_quality: Quality metrics for the reasoning processprocessing_time: Total processing timestrategy_used: Decomposition strategy used
13. adaptive_model_selection 🎛️
Intelligently select the best model for a given task using adaptive routing.
Parameters:
query: Query for adaptive model selectiontask_type: Type of task ("reasoning", "creative", "factual", "code_generation", "analysis")performance_requirements: Performance requirements (accuracy, speed thresholds)constraints: Task constraints (max cost, timeout, etc.)
Returns:
selected_model: The chosen model IDselection_reasoning: Why this model was selectedconfidence: Confidence in the selection (0-1)alternative_models: Other viable options with scoresrouting_metrics: Performance metrics used in selectionexpected_performance: Predicted performance characteristics
14. cross_model_validation ✅
Validate content quality and accuracy across multiple models for quality assurance.
Parameters:
content: Content to validate across modelsvalidation_criteria: Specific validation criteria (e.g., "factual_accuracy", "technical_correctness")models: Optional list of models to use for validationthreshold: Validation threshold (0-1, default: 0.7)
Returns:
validation_result: Overall validation result ("VALID" or "INVALID")validation_score: Numerical validation score (0-1)validation_issues: Issues found by multiple modelsmodel_validations: Individual validation results from each modelrecommendations: Suggested improvementsconfidence: Confidence in the validation result
15. collaborative_problem_solving 🤖
Solve complex problems through collaborative multi-model interaction and iterative refinement.
Parameters:
problem: Problem to solve collaborativelyrequirements: Problem requirements and constraintsconstraints: Additional constraints (budget, time, resources)max_iterations: Maximum number of iteration rounds (default: 3)models: Optional list of specific models to use
Returns:
final_solution: The collaborative solutionsolution_path: Step-by-step solution developmentalternative_solutions: Alternative approaches consideredcollaboration_quality: Quality metrics for the collaborationcomponent_contributions: Individual model contributionsconvergence_metrics: How the solution evolved over iterations
🔒 Security Best Practices
This project implements Defense in Depth security architecture with multiple layers of protection. For complete security documentation, see SECURITY.md.
✅ Secure API Key Management
CRITICAL: API keys must NEVER be stored in configuration files.
Recommended Approach: Environment Variables
Windows:
set OPENROUTER_API_KEY=sk-or-v1-your-api-key-hereLinux/macOS:
export OPENROUTER_API_KEY=sk-or-v1-your-api-key-here
# Make permanent:
echo 'export OPENROUTER_API_KEY=sk-or-v1-your-api-key-here' >> ~/.bashrc
source ~/.bashrcThe system actively prevents insecure API key storage:
- ❌ API keys in config files are rejected with security errors
- ⚠️ Warnings issued if API key is provided as CLI argument
- ✅ Validation ensures keys are set as environment variables only
🔐 Privacy-Preserving Benchmarking
Default: Privacy Mode (Recommended)
By default, all user prompts and model responses are redacted from logs and saved results:
# ✅ SECURE: Content is redacted
benchmark_models(
models=["openai/gpt-4", "anthropic/claude-3.5-sonnet"],
prompt="Your sensitive prompt here"
# Results saved with: "<REDACTED: 245 chars>"
)Opt-In: Verbose Logging (Debug Only)
To include actual content (use only for debugging):
# ⚠️ WARNING: This logs actual prompt/response content
benchmark_models(
models=["openai/gpt-4"],
prompt="Debug prompt",
include_prompts_in_logs=True # Explicit consent required
)A privacy warning will be logged:
PRIVACY WARNING: Logging prompt content is enabled.
Prompts may contain sensitive or personal information.🛡️ Automatic Data Sanitization
The system automatically sanitizes sensitive data:
- API Keys: Masked in all logs (
sk-or...***MASKED***) - Request Headers: Authorization tokens redacted
- Payloads: Only metadata logged by default (message count, length)
- Responses: Content truncated in error messages (100 char max)
- Errors: No sensitive data in exception messages
📊 Security Features Summary
| Feature | Status | Protection | |---------|--------|------------| | API Key Storage | ✅ Enforced | Environment variables only | | Config File Validation | ✅ Active | Rejects keys in configs | | Privacy-Preserving Logs | ✅ Default | Content redacted by default | | Error Message Redaction | ✅ Active | Response bodies truncated | | Data Sanitization | ✅ Automatic | All logs sanitized | | Opt-In Verbose Mode | ⚠️ Available | With explicit warnings |
🔍 Security Compliance
This project implements controls aligned with:
- OWASP Top 10 (2021): A01, A02, A09
- CWE-798: Hard-coded credentials prevention
- CWE-532: Information exposure through logs prevention
- NIST SP 800-53: AC-3, IA-5, AU-2
📖 Additional Security Resources
- Complete Security Policy - Comprehensive security documentation
- Threat Model - Threats mitigated and residual risks
- Advanced Secrets Management - OS keychain and cloud integration
⚠️ Security Checklist
Before deploying to production:
- [ ] API key set as environment variable (never in config)
- [ ] Privacy mode enabled (default) for all benchmarks
- [ ] Verbose logging disabled (or used only for debugging)
- [ ] Config directory permissions restricted (
chmod 700 ~/.claude) - [ ] API key rotation policy implemented (90 days recommended)
- [ ] Logs reviewed for sensitive data before sharing
🔧 Configuration
Environment Variables
Create a .env file in your project directory:
# OpenRouter API Configuration
OPENROUTER_API_KEY=your-api-key-here
OPENROUTER_APP_NAME=openrouter-mcp
OPENROUTER_HTTP_REFERER=https://localhost
# Server Configuration
HOST=localhost
PORT=8000
LOG_LEVEL=info
# Cache Configuration
CACHE_TTL_HOURS=1
CACHE_MAX_ITEMS=1000
CACHE_FILE=openrouter_model_cache.jsonConfiguration Options
| Variable | Description | Default |
|----------|-------------|---------|
| OPENROUTER_API_KEY | Your OpenRouter API key | Required |
| OPENROUTER_APP_NAME | App identifier for tracking | "openrouter-mcp" |
| OPENROUTER_HTTP_REFERER | HTTP referer header | "https://localhost" |
| HOST | Server bind address | "localhost" |
| PORT | Server port | "8000" |
| LOG_LEVEL | Logging level | "info" |
| CACHE_TTL_HOURS | Model cache TTL in hours | "1" |
| CACHE_MAX_ITEMS | Max items in memory cache | "1000" |
| CACHE_FILE | Cache file path | "openrouter_model_cache.json" |
📊 Popular Models
Here are some popular models available through OpenRouter:
OpenAI Models
openai/gpt-4o: Most capable multimodal GPT-4 model (text + vision)openai/gpt-4o-mini: Fast and cost-effective with vision supportopenai/gpt-4: Most capable text-only GPT-4 modelopenai/gpt-3.5-turbo: Fast and cost-effective text model
Anthropic Models
anthropic/claude-3-opus: Most capable Claude model (text + vision)anthropic/claude-3-sonnet: Balanced capability and speed (text + vision)anthropic/claude-3-haiku: Fast and efficient (text + vision)
Open Source Models
meta-llama/llama-3.2-90b-vision-instruct: Meta's flagship vision modelmeta-llama/llama-3.2-11b-vision-instruct: Smaller vision-capable Llamameta-llama/llama-2-70b-chat: Meta's text-only flagship modelmistralai/mixtral-8x7b-instruct: Efficient mixture of expertsmicrosoft/wizardlm-2-8x22b: High-quality instruction following
Specialized Models
google/gemini-pro-vision: Google's multimodal AI (text + vision)google/gemini-pro: Google's text-only modelcohere/command-r-plus: Great for RAG applicationsperplexity/llama-3-sonar-large-32k-online: Web-connected model
Use list_available_models to see all available models and their pricing.
🐛 Troubleshooting
Common Issues
1. Python not found
# Check Python installation
python --version
# If not installed, download from python.org
# Make sure Python is in your PATH2. Missing Python dependencies
# Install manually if needed
pip install -r requirements.txt
# For multimodal/vision features
pip install Pillow>=10.0.03. API key not configured
# Re-run initialization
npx openrouter-mcp init4. Port already in use
# Use a different port
npx openrouter-mcp start --port 90005. Claude Desktop not detecting server
- Restart Claude Desktop after configuration
- Check config file path and format
- Verify API key is correct
Debug Mode
Enable debug logging for detailed troubleshooting:
npx openrouter-mcp start --debugStatus Check
Check server configuration and status:
npx openrouter-mcp status🧪 Development
Running Tests
# Install development dependencies
pip install -r requirements-dev.txt
# Run tests
npm run test
# Run tests with coverage
npm run test:coverage
# Lint code
npm run lint
# Format code
npm run formatProject Structure
openrouter-mcp/
├── bin/ # CLI scripts
│ ├── openrouter-mcp.js # Main CLI entry point
│ └── check-python.js # Python environment checker
├── src/openrouter_mcp/ # Python MCP server
│ ├── client/ # OpenRouter API client
│ │ └── openrouter.py # Main API client with vision support
│ ├── handlers/ # MCP tool handlers
│ │ ├── chat.py # Text-only chat handlers
│ │ ├── multimodal.py # Vision/multimodal handlers
│ │ └── benchmark.py # Model benchmarking handlers
│ └── server.py # Main server entry point
├── tests/ # Test suite
│ ├── test_chat.py # Chat functionality tests
│ ├── test_multimodal.py # Multimodal functionality tests
│ └── test_benchmark.py # Benchmarking functionality tests
├── examples/ # Usage examples
│ └── multimodal_example.py # Multimodal usage examples
├── docs/ # Documentation
├── requirements.txt # Python dependencies (includes Pillow)
└── package.json # Node.js package config📚 Documentation
Quick Links
- Documentation Index - Complete documentation overview
- Installation Guide - Detailed setup instructions
- API Reference - Complete API documentation
- Troubleshooting - Common issues and solutions
- FAQ - Frequently asked questions
Integration Guides
- Claude Desktop Integration - Desktop app setup
- Claude Code CLI Integration - Terminal workflow
Feature Guides
- Multimodal/Vision Guide - Image analysis capabilities
- Benchmarking Guide - Model performance comparison
- Model Metadata Guide - Enhanced filtering system
- Model Caching - Cache optimization
Development
- Architecture Overview - System design documentation
- Testing Guide - TDD practices and test suite
- Contributing Guide - Development guidelines
External Resources
- OpenRouter API Docs - Official OpenRouter documentation
- MCP Specification - Model Context Protocol standard
🤝 Contributing
We welcome contributions! Please see our Contributing Guide for details.
- Fork the repository
- Create a feature branch
- Make your changes
- Add tests
- Submit a pull request
📄 License
This project is licensed under the MIT License - see the LICENSE file for details.
🔗 Links
- OpenRouter - Get your API key
- Claude Desktop - Download Claude Desktop app
- Model Context Protocol - Learn about MCP
- FastMCP - The MCP framework we use
🙏 Acknowledgments
- OpenRouter for providing access to multiple AI models
- FastMCP for the excellent MCP framework
- Anthropic for the Model Context Protocol specification
Made with ❤️ for the AI community
Need help? Open an issue or check our documentation!
