npm package discovery and stats viewer.

Discover Tips

  • General search

    [free text search, go nuts!]

  • Package details

    pkg:[package-name]

  • User packages

    @[username]

Sponsor

Optimize Toolset

I’ve always been into building performant and accessible sites, but lately I’ve been taking it extremely seriously. So much so that I’ve been building a tool to help me optimize and monitor the sites that I build to make sure that I’m making an attempt to offer the best experience to those who visit them. If you’re into performant, accessible and SEO friendly sites, you might like it too! You can check it out at Optimize Toolset.

About

Hi, 👋, I’m Ryan Hefner  and I built this site for me, and you! The goal of this site was to provide an easy way for me to check the stats on my npm packages, both for prioritizing issues and updates, and to give me a little kick in the pants to keep up on stuff.

As I was building it, I realized that I was actually using the tool to build the tool, and figured I might as well put this out there and hopefully others will find it to be a fast and useful way to search and browse npm packages as I have.

If you’re interested in other things I’m working on, follow me on Twitter or check out the open source projects I’ve been publishing on GitHub.

I am also working on a Twitter bot for this site to tweet the most popular, newest, random packages from npm. Please follow that account now and it will start sending out packages soon–ish.

Open Software & Tools

This site wouldn’t be possible without the immense generosity and tireless efforts from the people who make contributions to the world and share their work via open source initiatives. Thank you 🙏

© 2025 – Pkg Stats / Ryan Hefner

@waldzellai/scientific-method

v0.1.3

Published

MCP server for diagrammatic thinking and spatial representation

Downloads

37

Readme

Scientific Method MCP Server

Motivation

Language models often struggle with applying rigorous scientific reasoning. While they can describe the scientific method, they frequently:

  1. Jump to conclusions without systematic hypothesis testing
  2. Fail to explicitly identify assumptions underlying their reasoning
  3. Conflate correlation with causation in explanatory claims
  4. Neglect alternative explanations for observed phenomena
  5. Show inconsistency in evaluating evidence across different hypotheses
  6. Make predictions without clear falsifiability criteria

The Scientific Method Server addresses these limitations by providing an external framework that guides models through formal scientific reasoning processes. By externalizing the scientific method, models can engage in more rigorous, transparent, and self-correcting inquiry.

Technical Specification

Tool Interface

interface HypothesisData {
  // Core hypothesis components
  statement: string;
  variables: Array<{
    name: string;
    type: "independent" | "dependent" | "controlled" | "confounding";
    operationalization?: string;
  }>;
  assumptions: string[];
  
  // Hypothesis metadata
  hypothesisId: string;
  confidence: number; // 0.0-1.0
  domain: string;
  iteration: number;
  
  // Relationships
  alternativeTo?: string[]; // IDs of competing hypotheses
  refinementOf?: string; // ID of parent hypothesis
  
  // Current status
  status: "proposed" | "testing" | "supported" | "refuted" | "refined";
}

interface ExperimentData {
  // Core experiment components
  design: string;
  methodology: string;
  predictions: Array<{
    if: string;
    then: string;
    else?: string;
  }>;
  
  // Experiment metadata
  experimentId: string;
  hypothesisId: string;
  controlMeasures: string[];
  
  // Results (if conducted)
  results?: string;
  outcomeMatched?: boolean;
  unexpectedObservations?: string[];
  
  // Evaluation
  limitations?: string[];
  nextSteps?: string[];
}

interface ScientificInquiryData {
  // Process stage
  stage: "observation" | "question" | "hypothesis" | "experiment" | "analysis" | "conclusion" | "iteration";
  
  // Content for current stage
  observation?: string;
  question?: string;
  hypothesis?: HypothesisData;
  experiment?: ExperimentData;
  analysis?: string;
  conclusion?: string;
  
  // Process metadata
  inquiryId: string;
  iteration: number;
  
  // Next steps
  nextStageNeeded: boolean;
}

Process Flow

sequenceDiagram
    participant Model
    participant SciServer as Scientific Method Server
    participant State as Scientific State
    
    Model->>SciServer: Submit observation (stage=observation)
    SciServer->>State: Store observation
    SciServer-->>Model: Return inquiry state
    
    Model->>SciServer: Formulate question (stage=question)
    SciServer->>State: Store question
    SciServer-->>Model: Return inquiry state
    
    Model->>SciServer: Propose hypothesis (stage=hypothesis)
    SciServer->>State: Store hypothesis
    SciServer-->>Model: Return inquiry state
    
    Model->>SciServer: Design experiment (stage=experiment)
    SciServer->>State: Store experiment design
    SciServer-->>Model: Return inquiry state
    
    Model->>SciServer: Analyze results (stage=analysis)
    SciServer->>State: Update with analysis
    SciServer-->>Model: Return inquiry state
    
    Model->>SciServer: Draw conclusion (stage=conclusion)
    SciServer->>State: Store conclusion
    SciServer-->>Model: Return final state
    
    Model->>SciServer: Refine hypothesis (stage=iteration)
    SciServer->>State: Create new iteration
    SciServer-->>Model: Return updated inquiry state

Key Features

1. Structured Scientific Process

The server enforces a structured scientific inquiry process:

  • Observation: Making and recording observations about phenomena
  • Question: Formulating specific, testable questions
  • Hypothesis: Creating falsifiable hypotheses with variables
  • Experiment: Designing controlled tests with predictions
  • Analysis: Evaluating results against predictions
  • Conclusion: Drawing warranted conclusions
  • Iteration: Refining hypotheses based on results

2. Hypothesis Management

Hypotheses must be explicitly formulated with:

  • Statement: Clear, testable proposition
  • Variables: Identified and categorized (independent, dependent, etc.)
  • Assumptions: Explicit underlying assumptions
  • Alternatives: Competing explanations for same phenomena

3. Experimental Design

The server guides rigorous experimental design:

  • Methodology: Clear procedural steps
  • Predictions: Explicit if-then statements for expected outcomes
  • Controls: Measures to eliminate confounding variables
  • Limitations: Acknowledged constraints of the design

4. Evidence Evaluation

Evidence is systematically evaluated:

  • Confirmatory: Evidence supporting hypotheses
  • Disconfirmatory: Evidence challenging hypotheses
  • Unexpected: Observations not predicted by hypotheses

5. Iteration Tracking

The server tracks how scientific understanding evolves:

  • History of hypothesis refinements
  • Changing confidence levels based on evidence
  • Alternative explanations explored and rejected

Usage Examples

Causal Analysis

When attempting to determine cause-effect relationships, the model can systematically work through alternative explanations and evidence evaluation.

Technical Troubleshooting

For diagnosing problems, the model can generate competing hypotheses about failure causes and design tests to differentiate between them.

Literature Review

When synthesizing research findings, the model can systematically evaluate evidence quality and competing explanations.

Health Diagnosis

For medical reasoning, the model can track hypothesis confidence for different conditions based on symptoms and test results.

Implementation

The server is implemented using TypeScript with:

  • A core ScientificMethodServer class
  • JSON schema validation for scientific process structures
  • Visualization for the scientific inquiry process
  • Relationship tracking between hypotheses and evidence
  • Standard MCP server connection via stdin/stdout

This server provides significant enhancement to model reasoning in domains requiring causal analysis, hypothesis testing, and evidence evaluation - essentially any context where rigorous scientific thinking would benefit human reasoning as well.