npm package discovery and stats viewer.

Discover Tips

  • General search

    [free text search, go nuts!]

  • Package details

    pkg:[package-name]

  • User packages

    @[username]

Sponsor

Optimize Toolset

I’ve always been into building performant and accessible sites, but lately I’ve been taking it extremely seriously. So much so that I’ve been building a tool to help me optimize and monitor the sites that I build to make sure that I’m making an attempt to offer the best experience to those who visit them. If you’re into performant, accessible and SEO friendly sites, you might like it too! You can check it out at Optimize Toolset.

About

Hi, 👋, I’m Ryan Hefner  and I built this site for me, and you! The goal of this site was to provide an easy way for me to check the stats on my npm packages, both for prioritizing issues and updates, and to give me a little kick in the pants to keep up on stuff.

As I was building it, I realized that I was actually using the tool to build the tool, and figured I might as well put this out there and hopefully others will find it to be a fast and useful way to search and browse npm packages as I have.

If you’re interested in other things I’m working on, follow me on Twitter or check out the open source projects I’ve been publishing on GitHub.

I am also working on a Twitter bot for this site to tweet the most popular, newest, random packages from npm. Please follow that account now and it will start sending out packages soon–ish.

Open Software & Tools

This site wouldn’t be possible without the immense generosity and tireless efforts from the people who make contributions to the world and share their work via open source initiatives. Thank you 🙏

© 2026 – Pkg Stats / Ryan Hefner

delphi-mcp

v2.2.2

Published

Delphi MCP Server - Multi-model AI consensus for complex questions. Query Claude, GPT-5, Gemini, DeepSeek simultaneously and synthesize diverse perspectives.

Readme


The Problem

Complex questions rarely have simple answers. A single AI model gives you one perspective shaped by its training data and architecture. For nuanced topics—technical trade-offs, research questions, multi-faceted decisions—one viewpoint isn't enough.

The insight: Different AI models reason differently. When multiple models independently arrive at the same conclusion, you can trust it. When they disagree, you've found genuine complexity worth exploring.


Requirements

  • Node.js 18+
  • OpenRouter API key — Get one at openrouter.ai/keys ($5-10 credit is plenty to start)
  • Claude Desktop or any MCP-compatible client

Quick Start

npm install -g delphi-mcp

Add to Claude Desktop config:

macOS: ~/Library/Application Support/Claude/claude_desktop_config.json Windows: %APPDATA%\Claude\claude_desktop_config.json

{
  "mcpServers": {
    "delphi": {
      "command": "delphi-mcp",
      "env": { "OPENROUTER_API_KEY": "sk-or-v1-your-key" }
    }
  }
}

Restart Claude Desktop. Done.


How It Works

  1. Independent responses — Each model answers without seeing others
  2. Revision rounds — Models see the synthesis and can revise or challenge
  3. Convergence detection — Stops when 85% agreement is reached
  4. Hallucination flagging — Claims from only one model get flagged

Example Output

Question: Should we use microservices or a monolith for a new e-commerce platform?

Consensus (87% agreement after 3 rounds)

Round 1 — Initial Positions: | Model | Position | |-------|----------| | Claude | Monolith first, extract services later | | GPT-5 | Microservices for scalability from day one | | Gemini | Depends on team size and experience | | DeepSeek | Modular monolith as middle ground |

Round 2 — After seeing each other's reasoning:

  • GPT-5 revised: "Agreed that premature microservices add complexity. Team size matters."
  • Claude maintained position but acknowledged: "Microservices make sense if team is 50+ engineers"
  • All models converged on team size as the key factor

Round 3 — Final Synthesis:

| Claim | Strength | Agreement | |-------|----------|-----------| | Start with monolith for teams < 20 engineers | unanimous | 5/5 | | Modular boundaries enable future extraction | unanimous | 5/5 | | Microservices add 3-5x operational overhead | strong | 4/5 | | Extract services only when team/traffic demands | strong | 4/5 | | Kubernetes required for microservices | disputed | 2/5 |

Key Disagreement Surfaced:

"Kubernetes required for microservices" — Claude and DeepSeek disagreed, noting alternatives like ECS, Nomad, or even simple VM deployments. This flags an area where the "conventional wisdom" may be overconfident.

Control Drift: 45% — A single model would have given a more opinionated answer without surfacing the team-size nuance or the Kubernetes debate.


Presets

| Preset | Tier | Rounds | Grounding | Cost | Use Case | |--------|------|--------|-----------|------|----------| | quick | fast | 2 | off | ~$0.04 | Quick checks | | balanced | standard | 4 | off | ~$0.20 | General queries | | research | premium | 6 | on | ~$0.50 | Deep analysis | | factcheck | standard | 3 | on | ~$0.25 | Verify claims |


When to Use Delphi

Use Delphi for:

  • Complex technical decisions with trade-offs
  • Research questions with multiple valid perspectives
  • High-dimensional problems (many factors to weigh)
  • Topics where experts genuinely disagree
  • Validating important conclusions before acting

Skip Delphi for:

  • Simple factual lookups → single model is fine
  • Creative writing → diversity unhelpful
  • Real-time chat → too slow
  • Well-defined problems with clear answers

Decision rule: If the question has genuine complexity and the answer matters, use Delphi.


Features

  • Multi-Model Consensus — Claude, GPT-5, Gemini, DeepSeek working together
  • Dynamic Convergence — Iterates until 85% agreement or surfaces disagreement
  • Claim Strength — See which points are unanimous vs genuinely disputed
  • Revision Rounds — Models can challenge and refine each other's reasoning
  • Expert Personas — Frame panelists as domain experts for deeper analysis
  • Diverse Panel Mode — Assign complementary expert roles within a domain
  • Web Grounding — Optionally verify claims against live sources
  • Budget Controls — Token and cost limits for predictable spend
  • Multiple Formats — Markdown, JSON, HTML, plain text

Expert Personas

Like a real Delphi study, you can frame panelists as domain experts:

{
  "question": "What are the security implications of storing JWTs in localStorage?",
  "expertise": "security"
}

Available domains: | Domain | Expert Type | |--------|-------------| | security | Security Engineer (15+ years, penetration testing, secure development) | | finance | Financial Analyst (investment banking, risk management) | | medical | Medical Researcher (clinical medicine, evidence-based medicine) | | legal | Legal Expert (corporate law, IP, regulatory compliance) | | engineering | Software Engineer (system design, architecture patterns) | | data-science | Data Scientist (ML, statistical analysis) | | economics | Economist (micro/macro economics, policy analysis) | | architecture | Systems Architect (distributed systems, cloud platforms) | | devops | DevOps Engineer (CI/CD, infrastructure automation) | | product | Product Manager (strategy, user research, go-to-market) |

Diverse Panel Mode

Add diversePersonas: true to give each panelist a different complementary role within the domain — just like assembling a real expert panel:

{
  "question": "Should we migrate to microservices?",
  "expertise": "architecture",
  "diversePersonas": true
}

For architecture, this creates a panel of:

  • Cloud architect (AWS/GCP/Azure best practices)
  • Platform architect (internal developer platforms)
  • Data architect (data modeling, warehousing)
  • Integration architect (APIs, messaging)
  • Security architect (zero-trust, identity management)
  • Solutions architect (customer requirements)

Auto-Expertise Mode

For the most authentic Delphi experience, let the administrator automatically determine what experts are needed based on your question:

{
  "question": "Should we implement rate limiting at the API gateway or application layer?",
  "autoExpertise": true
}

The administrator analyzes your question and dynamically generates an optimal expert panel:

| Expert | Focus | Perspective | |--------|-------|-------------| | API Gateway Architect | Rate limiting patterns, edge vs origin | Infrastructure scalability | | Security Engineer | DDoS protection, abuse prevention | Defensive, assumes adversarial users | | Backend Developer | Application-level implementation | Developer experience, maintainability | | SRE/Platform Engineer | Observability, failure modes | Operational reliability |

Why auto-expertise?

  • Mimics how real Delphi studies select experts based on the question
  • No need to guess which domain fits best
  • Gets complementary perspectives without manual configuration
  • Shows rationale for why each expert was chosen

API

| Tool | Description | |------|-------------| | delphi_query | Multi-model consensus query | | delphi_factcheck | Fact-check a specific claim | | delphi_list_models | List available models | | delphi_estimate_cost | Estimate before running |


Documentation

For full technical documentation including:

  • All configuration options
  • Test results & insights
  • Architecture internals
  • Cost analysis
  • Safety features

See docs/TECHNICAL.md


License

MIT — Built by Thor Matthiasson