@cachly-dev/mcp-server

v0.10.115

Published

2 days ago

Your AI forgets everything between sessions. cachly fixes that permanently. Persistent memory + causal root-cause analysis for Claude Code, Cursor, Copilot & Windsurf. Learns from every bug fix, git commit, and CI run automatically. Arrives pre-briefed. P

🧠 cachly AI Brain — MCP Server

Your AI is brilliant for one session. Then it forgets you.
Every morning you re-explain your architecture, your deploy process, the bug you already fixed last week. cachly gives your AI — and your whole team — a permanent, shared brain that gets smarter with every commit.

The story you already live every day

You are a good engineer. You want to ship, not babysit a forgetful assistant.

But every session starts at zero. Your AI doesn't remember the race condition you chased for three hours on Tuesday. It doesn't know your deploy gotchas. It can't tell you that Carol already solved this exact bug in March — because Carol's knowledge lives in Carol's head, and yours in yours.

So you re-explain. You re-research. Your team makes the same mistake in five different branches. And when someone leaves, their hard-won knowledge walks out the door with them.

The villain isn't your AI. It's amnesia. Context death between sessions, and knowledge silos between people. The average developer loses ~45 minutes a day re-establishing context that should already exist.

You don't need a smarter model. You need a memory that doesn't reset — and one that your whole team shares.

Meet your guide

cachly is the brain layer that sits under whatever AI you already use. We've watched hundreds of teams lose the same knowledge the same way, and we built the fix:

It learns automatically — from every commit, every fix, every session. No extra calls.
It arrives pre-briefed — your AI opens each session already knowing your stack.
It's shared — one engineer's solved bug becomes the whole team's reflex.
It's provable — quality-aware recall beats raw text search by +22.2 % Precision@1 (see the benchmark). A claim without a number is marketing; this is the number.
It's neutral — speaks MCP, so it works with Claude, Cursor, Copilot, Windsurf, Cline, Zed. Switch models anytime — your brain stays.

We're not the hero of this story. You are. cachly is the thing that makes you the engineer whose AI never forgets and whose team compounds knowledge instead of losing it.

Taste it first — no account, no risk

npx @cachly-dev/mcp-server@latest demo

Run it in any project folder. It reads YOUR git history and shows what your AI would know — your bugs fixed, your patterns, your past decisions. Nothing leaves your machine.

┌─────────────────────────────────────────────────────────────┐
│  Brain Preview — What your AI would know                    │
├─────────────────────────────────────────────────────────────┤
│  Commits: 847   Lessons: 634   Contributors: 7              │
│  Date range: 2024-01-12 → 2026-05-14                        │
├─────────────────────────────────────────────────────────────┤
│  Security fixes your AI would know:                         │
│  • fix(auth): JWT expiry check before signature validation  │
│  • security: sanitize webhook payload before JSON.parse     │
├─────────────────────────────────────────────────────────────┤
│  Bug fixes your AI would remember:                          │
│  • fix: Redis pub/sub race condition under high concurrency │
│  • fix: k8s readinessProbe threshold too low for cold start │
│  • fix: Stripe idempotency_key missing on retry path        │
├─────────────────────────────────────────────────────────────┤
│  With cachly, your AI arrives pre-briefed every session.    │
└─────────────────────────────────────────────────────────────┘

Like what you see? Make it permanent in the next step.

Brain-first — Semantic Cache as Proof-Point

cachly is not a semantic cache with a brain bolt-on. The Brain is the product. The Semantic Cache is the proof-point — it shows ROI in dollars from day one, with zero trust required. It opens the door. The Brain is why teams never leave.

| | Wedge — Land | Moat — Retain | |---|---|---| | Feature | Semantic Cache | AI Brain (Lessons, Recall, Team-Sharing) | | Value | Measurable cost savings from day one | Compounding team intelligence | | Metric | Cache-hit rate, $/month saved | Lessons retained, WoW trend, recall quality | | Analogy | Datadog APM (surfaces the problem) | Stripe (becomes critical infrastructure) |

The org-level advantage: Brain lessons and cache hits are shared across the whole team — one person's fix becomes every agent's reflex. Anthropic Projects Memory is per-user and model-locked. cachly is team-wide and model-neutral. That's the structural moat no first-party tool can build.

Setup — one command

npx @cachly-dev/mcp-server@latest autopilot

Autopilot does everything in a single command: it auto-detects every AI editor you use, writes the MCP config, signs you in via browser device-flow (one click, no password, no credit card), and bootstraps your brain from git history. Restart your editor and your AI arrives pre-briefed — every session, automatically.

Already inside Claude / Cursor / Copilot? Paste this to your AI and it configures everything itself:
Set up cachly for this project. Run: npx @cachly-dev/mcp-server@latest autopilot
It gives my AI persistent memory across sessions. Follow the browser login
(one click, no credit card), then restart the editor.

Our agreement with you: Free forever tier. GDPR, EU servers. No model lock-in — leave anytime and take your data. No code content is ever stored.

What changes the moment you turn it on

| The moment | Without cachly | With cachly | |-----------|----------------|-------------| | Session start | "What's your architecture again?" | "Ready. 23 lessons. Last session: deployed API." | | A known bug returns | Re-researches from scratch | "You fixed this March 12 — here's the exact command." | | You open an unfamiliar file | Cold start | "Carol fixed 3 bugs here. Related: fix:stripe-retry." | | A teammate leaves | Their knowledge leaves too | Their lessons stay, attributed, searchable | | New hire, day one | Weeks to onboard | setup → full team context instantly | | Pre-deploy | Hope nothing breaks | Brain predicts failure risks from past patterns |

This is the transformation: from the engineer who re-explains everything every morning → to the team whose collective brain never forgets and gets sharper with every commit.

cachly vs. Claude's built-in memory

Anthropic now ships memory for Claude — and it's genuinely good for one developer, using only Claude, alone. That's not the game we're playing. Here's the honest map:

| | cachly | Claude built-in memory | |--|------------|----------------------------| | Works across teams | ✅ one engineer's fix → everyone's reflex | ❌ per-user / per-agent only | | Works across models & tools | ✅ MCP — Claude, Cursor, Copilot, Windsurf, Zed… | ❌ Claude + Anthropic API only | | Structured knowledge | ✅ topic · outcome · severity · causal graph | ⚠️ flat text files, read linearly | | Causal root-cause (causal_trace) | ✅ problem → chain → proven fix | ❌ | | Provable recall quality | ✅ +22.2 % Precision@1 vs. BM25 (benchmark) | ❌ no public metric | | Governance (review, attribution, audit) | ✅ team_confirm, roles, audit trail | ❌ | | Self-hosting / BYOK / VPC | ✅ data stays in your infra | ❌ Anthropic-hosted | | Survives a model switch | ✅ your brain is yours | ❌ memory is gone or fragmented | | Zero-setup for one solo user | ⚠️ ~1 command | ✅ built in |

The honest takeaway: if you're a solo dev who only ever uses Claude, the built-in memory is great — use it. If you work on a team, switch tools, care about proof, or need governance and data residency, that's a gap Anthropic structurally can't close without breaking its own lock-in. That gap is where cachly wins. (Full strategic analysis: STRATEGY.md.)

vs. other memory tools

| | cachly | mem0 | MemGPT / Letta | Plain CLAUDE.md | |--|--------|------|----------------|-----------------| | Persistent memory | ✅ | ✅ | ✅ | Manual | | MCP server (no code changes) | ✅ | ✅ | ❌ | ✅ | | Causal root cause analysis | ✅ | ❌ | ❌ | ❌ | | Fully automatic (no explicit calls) | ✅ | ❌ | ❌ | ❌ | | Team knowledge graph + attribution | ✅ | Paid | ❌ | ❌ | | Provable recall lift (published) | ✅ | ❌ | ❌ | ❌ | | Git-ambient learning | ✅ | ❌ | ❌ | ❌ | | GDPR / EU servers | ✅ | ❌ | ❌ | ✅ | | Free tier forever | ✅ | Limited | ❌ | ✅ |

The standout moves

| Capability | What it does | |---------|-------------| | causal_trace | Root-cause analysis through memory: problem → causal chain → the fix that worked, with date and commands. No other system builds and queries a causal graph. | | brain_who_knows | "Who on my team knows about Kubernetes deploys?" → ranked experts 🥇🥈🥉, built automatically from authorship. | | brain_file_map | Before you touch a file: who's worked on it and which lessons reference it. | | team_expertise_map | The whole team's skills matrix in one table — onboarding and bus-factor insurance. | | brain_collab_pairs | Person↔Person Collaboration Graph — "Frag X und Y, die haben das zusammen gelöst." Bus-factor alerts included. | | brain_portability | W9 Model-Neutrality — config for 7 clients (Claude, Cursor, Copilot, Windsurf, Cline, Zed, Continue). "Same Brain, any model." | | brain_from_git | Reads your entire git history and populates the team knowledge graph (people + files + lessons) — zero setup, retroactively. | | brain_coverage / skill_gaps | A 0–100 health score for your knowledge + a ranked list of blind spots to fix. | | brain_predict | Predicts likely failures before they happen, from past incident patterns. | | Ambient Git | A git hook auto-extracts lessons from every commit. Zero extra calls. |

causal_trace in action:

causal_trace(problem="auth breaks after restart")

→ Root: k8s:namespace-terminating
→ Via:  keycloak:jwks-race
→ Fix:  PollUntilContextTimeout 3min  ← used this March 12, worked

30 minutes of git blame in one call.

What runs automatically after setup

| Trigger | What the Brain does — no prompting | |---------|----------------------------------| | First tool call | Session starts; project indexed in background | | Before every task | Recalls relevant past lessons | | During debugging | Traces root causes through causal memory | | Before deploys | Predicts failure risks from past patterns | | After every fix | Stores the lesson with commands + file paths + author | | Every git commit | Hook extracts a lesson from the commit | | Editor closes | Session summary saved for next time |

CLI Commands

npx @cachly-dev/mcp-server@latest autopilot # One command — signs in, configures every editor, bootstraps from git
npx @cachly-dev/mcp-server@latest demo      # Preview your Brain (no account needed)
npx @cachly-dev/mcp-server@latest bench     # Recall quality vs flat-file memory (no auth required)
npx @cachly-dev/mcp-server@latest autosetup # Interactive variant — pick editors yourself
npx @cachly-dev/mcp-server@latest health    # Check token, API, editors, git hook
npx @cachly-dev/mcp-server@latest digest    # Weekly Brain summary — shareable
npx @cachly-dev/mcp-server@latest share     # Generate a shareable stats card + tweet
npx @cachly-dev/mcp-server@latest publish   # Publish your Brain as an importable link (--public)
npx @cachly-dev/mcp-server@latest badge     # Get a live README badge for your Brain
npx @cachly-dev/mcp-server@latest invite    # Invite a teammate to share your Brain
npx @cachly-dev/mcp-server@latest index .   # Index a project's code into the Brain (CI-friendly)
npx @cachly-dev/mcp-server@latest learn-git # Auto-learn lessons from recent git commits

Tip — auto-learn on every merged PR: run learn-git in CI via the cachly-brain-setup GitHub Action with mode: learn. Each merged PR teaches your Brain automatically.

CI integration — your pipeline teaches the Brain

Every CI run is a lesson: a red→green transition is a proven fix, a green→red one is a known cause. Ready-to-paste templates live in src/ci-integration/:

GitHub Actions — copy brain-from-ci-action.yml into .github/workflows/. It triggers on workflow_run (completed) and pushes the outcome to your Brain. Requires CACHLY_JWT + CACHLY_BRAIN_INSTANCE_ID secrets.
GitLab CI — copy brain-from-ci-gitlab.yml into your pipeline: two .post jobs (on_success / on_failure) with allow_failure: true.
Anything else — push-ci-outcome.mjs is a standalone Node.js helper with zero dependencies. It always exits 0 — your CI never fails because of a Brain push.

Already have months of CI history? Backfill it in one call with the brain_from_ci MCP tool — bulk-ingests past outcomes the same way brain_from_git ingests commits.

MCP Tools (136 total)

🧠 Session & Memory (most used)

| Tool | What it does | |------|-------------| | session_start | Full briefing: last session, open failures, recent lessons, brain health | | session_end | Save what you built; auto-extract lessons from summary + git log | | learn_from_attempts | Store structured lessons after any fix, deploy, or discovery (with author, visibility) | | recall_best_solution | Best known solution for a topic — with success/failure history | | smart_recall | Hybrid BM25 + semantic + causal-graph search — 11 languages, quality-reranked | | remember_context | Cache architecture findings, decisions, file summaries | | compact_recover | Full context recovery after hitting the context-window limit |

👥 Team Brain & Org Knowledge Graph

| Tool | What it does | |------|-------------| | team_learn / team_recall | Share lessons across the team with author attribution | | team_confirm | A reviewer confirms a lesson (🛡️ senior / ✔️ peer) → ranks higher in recall · reviewer-gated | | team_assign_role / team_roster / team_whoami | Roles (👑 admin · 🛡️ reviewer · ✏️ contributor · 👁️ viewer) — enforced once an admin is set | | team_audit | Immutable, admin-only governance trail: every role change & lesson confirmation | | brain_who_knows | Find your team's experts on any topic — ranked 🥇🥈🥉 | | brain_file_map | Experts + lessons per file, before you touch it | | team_expertise_map | Full team skills matrix in one table | | brain_collab_pairs | Person↔Person Collaboration Graph — who collaborates with whom, bus-factor alerts | | brain_portability | Config snippets for 7 MCP clients — proves model-neutrality, same Brain everywhere | | skill_gaps | Knowledge blind spots: unresolved failures, missing attribution | | brain_coverage | 0–100 knowledge-health score for your codebase | | madc_deliberate | Specialist AI agents vote to resolve contradictory lessons | | memory_crystalize | Distill all lessons into a Crystal for instant team context | | team_crystallize | Team Crystal — fixes that 2+ teammates independently converged on (the cross-person, causal layer) |

🧬 Causal Intelligence

| Tool | What it does | |------|-------------| | causal_trace | Root-cause analysis through the Causal Knowledge Graph | | brain_predict / brain_predict_failures | Predict likely failures before they happen | | brain_from_git | Bootstrap people + files + lessons from git history — incremental | | brain_from_ci | Bulk-ingest CI outcomes: red→green becomes a fix lesson + causal fixes edge, green→red a causes edge — brain_from_git for CI logs | | memory_consolidate | Detect contradictions, merge duplicates, expire stale lessons | | ckg_inspect | Inspect the causal graph around any concept |

🌐 Shareable & Public Brains

| Tool | What it does | |------|-------------| | brain_seed_starter | Seed 16 universal lessons so your first smart_recall hits — auto-runs on a fresh repo | | brain_share | Export a Brain snapshot as a shareable link (public or unlisted) | | brain_import | Import any shared Brain into yours — topic_prefix, min_confidence, dry_run | | brain_share_list / brain_unshare | List your shares · revoke a share (link goes dead) | | brain_discover | Search the Brain marketplace for ready-made knowledge bases |

🌍 Knowledge Commons · ⚙️ Infrastructure · 📋 Roadmap

| Tool | What it does | |------|-------------| | syndicate / fedbrain_search | Contribute to / search the global Knowledge Commons | | brain_marketplace / brain_install | Browse + install curated Domain Brains (Kubernetes, Auth, DB…) into your Brain | | cache_get / cache_set / semantic_search / index_project | Cache + semantic ops — pass org_id on cache_get/cache_set to share the cache org-wide (writes mirror to org:{org_id}:sem, reads fall back to it on miss) | | cache_stats / cache_org_stats | Tokenmaxxing ROI: hits, estimated USD saved + monthly projection — per instance or aggregated across your whole org. Zero hits yet? You get a day-1 ROI projection instead. | | list_instances / create_instance / delete_instance | Manage Brain instances | | roadmap_add / roadmap_next | Persistent project roadmap stored in the Brain |

…and ~70 more. Run health to see what's wired up in your editor.

FAQ

Does my AI need to call session_start manually? No. Sessions start and end automatically on the first tool call and when the editor closes.

How is this different from Claude's built-in memory? Claude's memory is per-user, Claude-only, flat-file, and unbenchmarked. cachly is team-shared, model-neutral (any MCP client), structured + causal, governed, and has a published recall benchmark. See the comparison table above.

Can my whole team share one Brain? Yes — that's the point. team_learn / team_recall, or npx @cachly-dev/mcp-server@latest invite [email protected].

Is my code sent to cachly servers? No code content is stored. cachly stores lesson text, commit messages, session summaries, and key-value context. All data on EU servers, GDPR-compliant.

What is causal_trace and why is it unique? Given any error, it walks the Causal Knowledge Graph to find root cause, intermediate causes, and the exact fix that worked — including date and commands. No other memory system builds or queries a causal graph.

What if I hit the context-window limit mid-session? Call compact_recover. It reconstructs full context from Memory Crystal + recent sessions + WIP registry — typically one tool call.

Editor support matrix

npx @cachly-dev/mcp-server@latest autopilot auto-detects and configures all of the following. Manual snippets are in the Manual Setup section below.

| Editor / Client | Auto-setup | Config file written | Global config | Notes | |---|---|---|---|---| | Claude Code | ✅ | ~/.claude/mcp.json + .mcp.json | ✅ global always | Runtime device-flow sign-in on first tool call | | Cursor | ✅ detected via .cursor/ | .cursor/mcp.json | — | Project-level; restart Cursor after setup | | Windsurf | ✅ detected via .windsurf/ | .windsurf/mcp.json | — | Project-level; restart Windsurf after setup | | VS Code + Copilot | ✅ detected via .vscode/ | .vscode/mcp.json | — | Requires VS Code MCP extension or Copilot chat | | Cline | ✅ detected via VS Code | .vscode/mcp.json | — | Shares config with Copilot; restart VS Code | | Continue.dev | ✅ detected via .continue/ | .continue/config.json | — | Uses modelContextProtocolServers key | | Zed | ✅ detected via .zed/ | .zed/settings.json | — | Uses context_servers key | | Windsurf (global) | autosetup --editor windsurf | ~/.windsurf/mcp.json | ✅ | Pass --editor to target global config | | Any other MCP client | autosetup --editor claude | .mcp.json | — | Standard mcpServers stdio format |

Which sign-in path each editor uses:

| Scenario | Path | |---|---| | autosetup from a real terminal (TTY) | OAuth device-flow → browser click → API key saved automatically | | autosetup from VSCode task / CI (non-TTY) | Auto-detects non-interactive, opens browser with step-by-step guide, prints CACHLY_JWT=... autosetup instruction | | First tool call from Claude Code (no JWT yet) | Inline device-flow: MCP returns URL + code, browser opens automatically, next call proceeds | | CACHLY_JWT=cky_live_xxx npx ... autosetup | Skips auth step entirely, uses provided key |

Tip — fastest per-project setup from inside Claude Code:
Set up cachly for this project: npx @cachly-dev/mcp-server@latest autopilot
Claude runs it and restarts automatically.

Manual Setup

{
  "mcpServers": {
    "cachly": {
      "command": "npx",
      "args": ["-y", "@cachly-dev/mcp-server@latest"]
    }
  }
}

On the first tool call your AI will prompt you to sign in — takes 10 seconds.

{
  "mcpServers": {
    "cachly": {
      "type": "stdio",
      "command": "npx",
      "args": ["-y", "@cachly-dev/mcp-server@latest"]
    }
  }
}

{
  "context_servers": {
    "cachly": {
      "command": {
        "path": "npx",
        "args": ["-y", "@cachly-dev/mcp-server@latest"]
      }
    }
  }
}

Self-hosting & BYOK

cachly is bring-your-own-key and self-host friendly out of the box — no enterprise contract required to keep data in your own infra.

Bring your own embedding key (BYOK). Semantic search runs on the embedding provider you choose. Set one env var and cachly auto-detects it; no key needed if you prefer cachly's server-side embeddings (uses your JWT):

| Provider | Env var | Model | |---|---|---| | OpenAI | OPENAI_API_KEY | text-embedding-3-small | | Google Gemini | GEMINI_API_KEY | text-embedding-004 | | Mistral | MISTRAL_API_KEY | mistral-embed | | Cohere | COHERE_API_KEY | embed-english-v3.0 | | Ollama (local, free) | OLLAMA_BASE_URL | nomic-embed-text | | cachly (server-side) | (none — uses JWT) | managed |

Force a specific one with CACHLY_EMBED_PROVIDER=openai. Run npx @cachly-dev/mcp-server@latest health to confirm which provider is active.

Point at your own backend (self-hosting). Every cachly install can talk to a private backend instead of api.cachly.dev:

# One-shot: wire up the wizard against your self-hosted backend
npx @cachly-dev/mcp-server@latest autopilot --api-url https://cachly.mycorp.internal

# Or non-interactively
npx @cachly-dev/mcp-server@latest autosetup \
  --instance-id <uuid> --api-key <cky_live_...> \
  --api-url https://cachly.mycorp.internal

autosetup bakes CACHLY_API_URL into the editor config only when it differs from the default cloud — so default installs stay clean, and self-hosted installs keep talking to your backend on every editor launch. All data stays in your infra.

Pricing

| Tier | RAM | Price | Best for | |------|-----|-------|----------| | Free | 25 MB | €0/mo forever | Dev & side projects | | Dev | 200 MB | €19/mo | Individual developers | | Pro | 900 MB | €49/mo | Teams | | Speed | 900 MB + Dragonfly | €79/mo | AI-heavy workloads | | Business | 7 GB | €199/mo | Scale-ups |

✅ All plans: EU servers · GDPR-compliant · 99.9% SLA · No credit card for Free

Environment Variables

| Variable | Default | Description | |----------|---------|-------------| | CACHLY_JWT | — | API token (set by wizard automatically) | | CACHLY_BRAIN_INSTANCE_ID | — | Default instance UUID (optional — auto-resolved) | | CACHLY_API_URL | https://api.cachly.dev | Override for self-hosted | | CACHLY_NO_TELEMETRY | unset | Set to 1 to disable anonymous usage pings |

🧠 Brain v3 — what's new

| Feature | Tool | What it does | |---|---|---| | Autonomous hygiene | brain_hygiene | Sweeps stale lessons, flags provisional, archives orphans | | PR risk scan | brain_plan + cachly-action scan | Predicts CI failures before they run, posts PR comment | | Multi-agent arbitration | brain_conflicts · brain_resolve_conflict | Detects + resolves conflicting lessons across agents | | Plans dashboard | brain_plan | Persistent plans in the UI with step tracking and brain-viz overlay | | Privacy federation | brain_contribute_signal · brain_import_meta | Share patterns without sharing data — k-anonymous global commons |

🛠️ Ecosystem & Docs

One brain, wherever you work. Start with the MCP server, or drop the same memory straight into your editor — your lessons follow you across all of them.

| Package | What it does | |---------|-------------| | @cachly-dev/mcp-server | ← you are here · works with Claude, Cursor, Copilot, Windsurf, Cline, Zed | | Cachly Brain for VS Code | One-click memory in the editor — status bar, lessons view, ambient learning. No terminal needed. | | Cachly Brain for JetBrains | Same brain for IntelliJ / PyCharm / GoLand / WebStorm / Rider — status bar, brain health, lessons view. | | @cachly-dev/openclaw | Cut LLM costs 60–90% in JS/TS apps | | cachly-dev/cachly-action | GitHub Action: auto-setup, PR risk scan, auto-learn from merged PRs, weekly hygiene |

🌐 cachly.dev — Dashboard & free signup
📖 Docs — Full documentation
🗺️ Public Roadmap — what's coming next
💬 GitHub Issues — Bugs & feature requests

Stop re-explaining yourself to your own tools. Give your AI — and your team — a brain that remembers, learns, and gets sharper with every commit.
npx @cachly-dev/mcp-server@latest autopilot