@crack-break-make/mcp-grimoire

v1.0.0-rc.0

Published

6 days ago

Your spellbook for MCP servers - intelligent orchestration with lazy loading and expert incantations

0High
0Medium
0Low

crack-break-make

mcp model-context-protocol ai tools grimoire spellbook orchestrator lazy-loading

Your intelligent spellbook for MCP servers - Lazy loading orchestration with 97% token savings

🎯 What is MCP Grimoire?

MCP Grimoire is an intelligent orchestrator for Model Context Protocol (MCP) servers. It acts as a smart gateway between AI agents (like Claude Desktop, GitHub Copilot) and your MCP tools, solving critical performance and usability problems in AI-powered development workflows.

The Problem

Traditional MCP implementations suffer from three critical issues:

1. Context Overload (Token Waste) 💸

Loading 50+ tools at startup consumes 40,000+ tokens
Degrades AI performance and increases API costs
Results in slower responses and confused tool selection

2. Missing Domain Expertise 🤷

MCP tools lack contextual guidance and best practices
Users must manually prompt for security patterns
Leads to vulnerabilities and inconsistent usage

3. Plugin Development Complexity 🔧

No standardized patterns for creating MCP plugins
Difficult to maintain and extend
Fragmented ecosystem

The Solution

MCP Grimoire achieves 97% token reduction through:

✅ Lazy Loading - Spawns MCP servers only when needed, not all at startup ✅ Intent-Driven Discovery - Matches queries to tools via hybrid keyword + semantic search ✅ Aggressive Cleanup - Kills inactive servers after 5 turns of inactivity ✅ Steering Injection - Embeds best practices directly into tool descriptions ✅ Transparent Operation - Claude doesn't know about the complexity

Result: From 40,000 tokens → 1,166 tokens average (~$0.20 → ~$0.006 per query)

🚀 Quick Start

1. Configure MCP Server (in Claude Desktop / GitHub Copilot)

Add to your claude_desktop_config.json:

macOS: ~/Library/Application Support/Claude/claude_desktop_config.json Windows: %APPDATA%\Claude\claude_desktop_config.json

{
  "mcpServers": {
    "grimoire": {
      "command": "npx",
      "args": ["-y", "@crack-break-make/mcp-grimoire"]
    }
  }
}

Restart Claude Desktop - Grimoire MCP server is now running!

2. Create Spells (in Terminal)

Open a terminal and run CLI commands to manage spells:

# Create a new spell (interactive wizard)
# Note: Arguments trigger CLI mode via dual-mode detection
npx @crack-break-make/mcp-grimoire create

# List installed spells
npx @crack-break-make/mcp-grimoire list

# Validate a spell configuration
npx @crack-break-make/mcp-grimoire validate ~/.grimoire/postgres.spell.yaml

Key Point:

🖥️ MCP Server runs inside Claude Desktop (configured via mcp.json with -y flag)
💻 CLI Commands run in your terminal with command arguments (create, list, etc.)
Same package, two modes - automatically detected based on stdin and arguments!

3. Use in Claude

Ask Claude to interact with your tools:

Show me all users from the database

Grimoire will automatically activate the right MCP server based on your query!

🎭 Dual-Mode Operation

MCP Grimoire intelligently detects how it's being called using a 3-step detection strategy:

| Mode | How It's Called | Purpose | Example | | -------------- | -------------------------------- | --------------------------------- | ------------------------------------------- | | MCP Server | From mcp.json with stdio pipes | Runs as MCP gateway for AI agents | Claude Desktop spawns it automatically | | CLI Tool | From terminal with arguments | Manage spell configurations | npx @crack-break-make/mcp-grimoire create |

You only install once - the entry point (dist/index.js) detects the mode automatically:

Detection Logic (matches MCP best practices)

Args First: If CLI arguments provided (e.g., --help, create) → CLI Mode
TTY Check: If no args but running in terminal (TTY) → CLI Mode (shows help)
Default: If stdin is piped from MCP client → MCP Server Mode

This prevents the server from hanging if you accidentally run npx mcp-grimoire with no arguments.

GitHub Copilot (VS Code)

Add to .vscode/mcp.json or project mcp.json:

{
  "mcpServers": {
    "grimoire": {
      "command": "npx",
      "args": ["-y", "@crack-break-make/mcp-grimoire"]
    }
  }
}

Then use CLI in terminal to create spells (same as above).

📦 How It Works

High-Level Flow

User Query → Claude analyzes intent → resolve_intent(query) →
Grimoire matches keywords/semantics → Spawns relevant MCP server →
Injects steering → tools/list_changed → Claude sees tools + guidance →
Executes with best practices → After 5 turns idle → Kill server

Architecture Diagram

┌──────────────────────────────────────┐
│      Claude Desktop / Copilot        │
│  Maintains conversation state        │
└──────────────┬───────────────────────┘
               │ stdio (MCP Protocol)
               │
┌──────────────▼───────────────────────┐
│      GRIMOIRE GATEWAY SERVER         │
│  - Intent Resolution (hybrid)        │
│  - Process Lifecycle Management      │
│  - Tool Routing                      │
│  - Steering Injection                │
└──────┬──────────────┬────────────────┘
       │ stdio        │ stdio
       │              │
┌──────▼─────┐  ┌────▼──────┐
│  Postgres  │  │  Stripe   │  ... (spawned on-demand)
│ MCP Server │  │ MCP Server│
└────────────┘  └───────────┘

Multi-Tier Intent Resolution

Grimoire uses a confidence-based approach to decide when to auto-spawn vs ask for clarification:

| Tier | Confidence | Behavior | Example | | ---------- | ---------- | -------------------------- | --------------------------------------------- | | High | ≥ 0.85 | Auto-spawn immediately | "query postgres" → Instant activation | | Medium | 0.50-0.84 | Return alternatives | "check database" → [postgres, mysql, mongodb] | | Low | 0.30-0.49 | Weak matches | "analyze data" → 5 weak matches | | None | < 0.30 | Not found | "launch rocket" → Error + available spells |

70% of queries hit high confidence (zero-friction UX) 20% of queries hit medium confidence (AI agent picks from context) 10% of queries need clarification

🧙 Creating Your First Spell

A "spell" is a YAML configuration file that tells Grimoire how to spawn and use an MCP server.

Interactive Creation (Recommended)

# Run without installation (recommended)
npx @crack-break-make/mcp-grimoire create

# OR install globally first, then use short command
npm install -g @crack-break-make/mcp-grimoire
grimoire create

The wizard will guide you through:

Spell name (e.g., postgres)
Transport type (stdio, SSE, or HTTP)
Server configuration (command, args, or URL)
Environment variables (for authenticated servers like GitHub, Stripe, etc.)
Server validation (optional but recommended)

With server probing (auto-generates steering and keywords):

npx @crack-break-make/mcp-grimoire create --probe

Manual Creation

Create ~/.grimoire/postgres.spell.yaml:

name: postgres
version: 1.0.0
description: PostgreSQL database operations

# MCP Server Configuration
server:
  transport: stdio
  command: npx
  args:
    - '-y'
    - '@modelcontextprotocol/server-postgres'
  env:
    DATABASE_URL: postgresql://user:pass@localhost/db

# Intent Matching Keywords (minimum 3)
keywords:
  - database
  - sql
  - query
  - postgres
  - tables
  - users

# Expert Guidance (Optional but Recommended)
steering: |
  # Database Schema
  Tables:
    - users (id uuid, email string, created_at timestamp)

  # Security Rules
  ALWAYS use parameterized queries:
    ✓ query_database('SELECT * FROM users WHERE id = $1', [id])
    ✗ 'SELECT * FROM users WHERE id = ' + id  (SQL INJECTION!)

  # Performance Tips
  - Use LIMIT to avoid scanning millions of rows
  - created_at is indexed, use for date filtering

Environment Variables:

For servers requiring authentication (GitHub, Stripe, databases, etc.), use the ${VAR_NAME} syntax to reference shell environment variables:

server:
  env:
    GITHUB_PERSONAL_ACCESS_TOKEN: ${GITHUB_PERSONAL_ACCESS_TOKEN}
    DATABASE_URL: ${DATABASE_URL}
    API_KEY: ${MY_API_KEY}

The interactive wizard will prompt you for environment variables and suggest the ${VAR} syntax by default. You can also provide literal values directly.

Spell File Location (all platforms):

~/.grimoire/ (follows Claude Code convention)
- macOS: /Users/username/.grimoire/
- Windows: C:\Users\username\.grimoire\
- Linux: /home/username/.grimoire/

🔮 Supported MCP Transports

✅ Stdio (Fully Supported)

For local MCP servers spawned as child processes (most common):

server:
  transport: stdio
  command: npx
  args:
    - '-y'
    - '@modelcontextprotocol/server-postgres'

Examples:

@modelcontextprotocol/server-postgres
@modelcontextprotocol/server-github
@cap-js/mcp-server

✅ SSE (Fully Supported)

For real-time MCP servers using Server-Sent Events:

server:
  transport: sse
  url: http://localhost:3000/sse

✅ HTTP (Fully Supported)

For REST-like MCP servers:

server:
  transport: http
  url: http://localhost:7777/mcp

Note: Only local MCP servers (spawned by Grimoire) are supported in Phase 1. Remote servers (via mcp-remote) require different architecture and are planned for Phase 2.

⚡ When Servers Are Spawned and Killed

Spawn Triggers

1. High Confidence Match (≥0.85)

User: "query my postgres database"
→ resolve_intent matches "postgres" with 0.94 confidence
→ Immediate spawn + return tools
→ Time: 200-300ms

2. Manual Activation

User: "check my database"
→ resolve_intent returns alternatives: [postgres, mysql, mongodb]
→ Claude (or user) calls: activate_spell({ name: "postgres" })
→ Spawn specified spell
→ Time: 200-300ms

3. Already Active

→ Just update usage tracking
→ Time: ~5ms (no spawn overhead)

Kill Triggers (5-Turn Inactivity)

After every tool call, Grimoire checks:

If (currentTurn - lastUsedTurn) >= 5:
  → Kill process
  → Unregister tools
  → Send tools/list_changed notification

Real-World Example (E-commerce workflow):

| Turn | Action | Active Spells | Event | | ---- | ---------------- | ---------------------------- | --------------------------------- | | 1-3 | Database queries | [postgres] | ✅ Postgres spawned | | 4-7 | Process payments | [postgres, stripe] | ✅ Stripe spawned | | 8 | Deploy CAP app | [postgres, stripe, cap-js] | ✅ Cap-js spawned | | 9 | CAP deployment | [stripe, cap-js] | ❌ Postgres killed (6 turns idle) | | 14 | CAP testing | [cap-js] | ❌ Stripe killed (7 turns idle) |

Result: 3 spells → 1 spell (67% token reduction from peak)

🛠️ CLI Commands (Run in Terminal)

Important: CLI commands run in your terminal, not in Claude Desktop. The MCP server runs inside Claude automatically.

`npx @crack-break-make/mcp-grimoire create`

Create new spell configurations with interactive wizard:

# Interactive mode (guided) - RECOMMENDED
npx @crack-break-make/mcp-grimoire create

# With server validation (auto-generates steering)
npx @crack-break-make/mcp-grimoire create --probe

# Non-interactive mode
npx @crack-break-make/mcp-grimoire create \
  -n postgres \
  -t stdio \
  --command npx \
  --args "-y" "@modelcontextprotocol/server-postgres"

# With environment variables (for authenticated servers)
npx @crack-break-make/mcp-grimoire create \
  -n github \
  -t stdio \
  --command npx \
  --args "-y" "@modelcontextprotocol/server-github" \
  --env "GITHUB_PERSONAL_ACCESS_TOKEN=${GITHUB_PERSONAL_ACCESS_TOKEN}"

Optional: Install globally for shorter command:

npm install -g @crack-break-make/mcp-grimoire

# Now use short form:
grimoire create
grimoire list

Features:

Validates MCP server works before creating config
Auto-generates keywords from tool names
Creates intelligent steering instructions
Supports environment variables for authenticated servers
Supports all transport types (stdio, SSE, HTTP)

`npx @crack-break-make/mcp-grimoire list`

List all installed spells:

# Simple list
npx @crack-break-make/mcp-grimoire list

# Verbose output with details
npx @crack-break-make/mcp-grimoire list -v

Output:

📚 Spells in ~/.grimoire

  🔮 postgres                    [stdio ] (8 keywords)
  🔮 stripe                      [stdio ] (12 keywords)
  🔮 github-api                  [stdio ] (15 keywords)

✓ Total: 3 spells

`npx @crack-break-make/mcp-grimoire validate`

Validate spell configuration:

npx @crack-break-make/mcp-grimoire validate ~/.grimoire/postgres.spell.yaml

Checks:

Required fields (name, keywords, server.command/url)
Field types and formats
Minimum 3 keywords
Transport-specific requirements

`npx @crack-break-make/mcp-grimoire example`

Generate example templates:

# Output to stdout
npx @crack-break-make/mcp-grimoire example stdio

# Output to file
npx @crack-break-make/mcp-grimoire example stdio -o ~/.grimoire/myspell.spell.yaml

🎨 Using with AI Agents

Claude Desktop

How It Works:

User asks: "Show users from database"
Claude sees resolve_intent tool (always available)
Claude calls: resolve_intent({ query: "show users from database" })
Grimoire spawns postgres, injects steering, returns tools
Claude receives tools/list_changed notification
Claude calls: query_database({ query: "SELECT * FROM users" })
After 5 turns idle → Grimoire kills postgres automatically

Key Insight: Claude doesn't know about Grimoire's complexity - it just sees tools appearing/disappearing via MCP protocol notifications.

GitHub Copilot (VS Code)

Same workflow as Claude Desktop. Add to settings.json:

{
  "servers": {
    "grimoire": {
      "command": "npx",
      "args": ["-y", "@crack-break-make/mcp-grimoire"]
    }
  }
}

Note: Copilot support depends on VS Code's MCP implementation maturity.

📊 Token Savings Breakdown

Traditional MCP (Baseline)

All 50 servers spawned at startup:
- postgres tools (8 tools × 200 tokens) = 1,600 tokens
- stripe tools (12 tools × 200 tokens) = 2,400 tokens
- github tools (15 tools × 200 tokens) = 3,000 tokens
- ... 47 more servers
= ~40,000 tokens per conversation

Cost: ~$0.20 per query (GPT-4 pricing)

Grimoire (Multi-Tier Strategy)

Weighted Average Calculation:

High confidence (70%):   1,000 tokens (selected tools only)
Medium confidence (20%): 1,500 tokens (3 alternatives + tools)
Low confidence (8%):     2,000 tokens (5 weak matches + tools)
No match (2%):             300 tokens (error + available spells)

Average = 0.70×1000 + 0.20×1500 + 0.08×2000 + 0.02×300
        = 700 + 300 + 160 + 6
        = 1,166 tokens

Savings: (40,000 - 1,166) / 40,000 = 97.1% 🎉

Cost: ~$0.006 per query (33× cheaper!)

🤝 Contributing

We welcome contributions! Whether you're fixing bugs, adding features, or improving documentation, your help is appreciated.

Getting Started

1. Fork & Clone

# Fork on GitHub, then clone
git clone https://github.com/YOUR_USERNAME/mcp-grimoire.git
cd mcp-grimoire

# Install dependencies
pnpm install

2. Create a Branch

git checkout -b feature/my-awesome-feature

3. Make Changes

Follow our coding principles:

YAGNI: Implement only what's needed now
DRY: Don't repeat yourself
SRP: Single Responsibility Principle
SOLID: Follow SOLID principles

See CONTRIBUTING.md for comprehensive development guidelines.

4. Run Tests

# Run all tests
pnpm test

# Run with coverage
pnpm test:coverage

5. Commit Changes

We use Conventional Commits:

# Format
type(scope): short description

# Examples
feat(intent): add semantic search with embeddings
fix(lifecycle): prevent orphaned child processes
docs(readme): add contributing section
test(gateway): add multi-tier resolution tests

6. Submit Pull Request

git push origin feature/my-awesome-feature

Then open a PR on GitHub with:

Clear description of changes
Link to related issues
Screenshots/examples if applicable

Development Commands

# Development server (hot reload)
pnpm dev

# Build TypeScript
pnpm build

# Linting
pnpm lint          # Check for issues
pnpm lint:fix      # Auto-fix issues

# Formatting
pnpm format        # Format all files with Prettier

# Type checking
pnpm type-check    # Check TypeScript types

Project Structure

mcp-grimoire/
├── src/
│   ├── core/                    # Domain models (types, configs)
│   ├── application/             # Business logic (intent, lifecycle)
│   ├── infrastructure/          # External systems (file, embeddings)
│   ├── presentation/            # Gateway server, tool routing
│   ├── cli/                     # CLI commands, templates
│   └── utils/                   # Shared utilities
├── tests/
│   └── fixtures/                # Test spell configurations
├── docs/
│   ├── adr/                     # Architecture Decision Records
│   ├── architecture.md          # System architecture
│   └── CLAUDE.md                # Development guide
└── examples/                    # Example spell configs

Creating Architecture Decision Records (ADRs)

For significant architectural decisions, create an ADR:

# Use the adr-generator skill
/adr-generator --title "Use Hybrid Intent Resolution" --status proposed

See docs/adr/README.md for guidelines.

Running Integration Tests

# Requires test servers to be available
pnpm test:integration

# Run specific integration test
pnpm test src/presentation/__tests__/gateway-real-workflow.integration.test.ts

Code Quality Standards

We maintain high code quality through:

✅ 80%+ test coverage (unit + integration)
✅ Strict TypeScript (strict: true)
✅ ESLint + Prettier formatting
✅ No any types (enforced by linter)
✅ Comprehensive error handling

Need Help?

❓ FAQ & Troubleshooting

AI Agent Not Showing `resolve_intent` Tool

Problem: GitHub Copilot (VS Code) or other AI agents cache tools aggressively. After Grimoire spawns and registers new tools, the AI agent may not see them immediately, including the critical resolve_intent tool.

Solution: Explicitly prompt the AI agent to refresh its tool list:

Please call the tools/list API to refresh available tools, then use the resolve_intent tool to search for [your query].

Why this happens:

MCP clients cache tool lists for performance
The tools/list_changed notification may not trigger immediate refresh in all clients
This is a known limitation of some MCP client implementations (not a Grimoire bug)

Alternative approach: Restart the AI agent (e.g., reload VS Code window) to force tool cache refresh.

📖 Documentation

📝 License

🔗 Links

GitHub: crack-break-make/mcp-grimoire
npm: @crack-break-make/mcp-grimoire
Issues: Report bugs or request features
Discussions: Join the community

Made with ❤️ by Mohan Sharma

Special thanks to the MCP community and all contributors!

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

🎯 What is MCP Grimoire?

The Problem

The Solution

🚀 Quick Start

1. Configure MCP Server (in Claude Desktop / GitHub Copilot)

2. Create Spells (in Terminal)

3. Use in Claude

🎭 Dual-Mode Operation

Detection Logic (matches MCP best practices)

GitHub Copilot (VS Code)

📦 How It Works

High-Level Flow

Architecture Diagram

Multi-Tier Intent Resolution

🧙 Creating Your First Spell

Interactive Creation (Recommended)

Manual Creation

🔮 Supported MCP Transports

✅ Stdio (Fully Supported)

✅ SSE (Fully Supported)

✅ HTTP (Fully Supported)

⚡ When Servers Are Spawned and Killed

Spawn Triggers

Kill Triggers (5-Turn Inactivity)

🛠️ CLI Commands (Run in Terminal)

npx @crack-break-make/mcp-grimoire create

npx @crack-break-make/mcp-grimoire list

npx @crack-break-make/mcp-grimoire validate

npx @crack-break-make/mcp-grimoire example

🎨 Using with AI Agents

Claude Desktop

GitHub Copilot (VS Code)

📊 Token Savings Breakdown

Traditional MCP (Baseline)

Grimoire (Multi-Tier Strategy)

🤝 Contributing

Getting Started

Development Commands

Project Structure

Creating Architecture Decision Records (ADRs)

Running Integration Tests

Code Quality Standards

Need Help?

❓ FAQ & Troubleshooting

AI Agent Not Showing resolve_intent Tool

📖 Documentation

📝 License

🔗 Links

`npx @crack-break-make/mcp-grimoire create`

`npx @crack-break-make/mcp-grimoire list`

`npx @crack-break-make/mcp-grimoire validate`

`npx @crack-break-make/mcp-grimoire example`

AI Agent Not Showing `resolve_intent` Tool