@davesheffer/hunch

v0.19.1

Published

12 minutes ago

Hunch — an Engineering Memory OS: a persistent, git-native reasoning graph over a codebase, exposed to Claude Code via MCP.

0High
0Medium
0Low

davesheffer

claude-code mcp engineering-memory knowledge-graph code-intelligence ai developer-tools

🧠 Hunch — Engineering Memory OS

Git stores what the code is. Hunch stores why it is that way — a persistent, git-native reasoning graph over your codebase, surfaced to Claude Code at reasoning time so the AI stops re-deriving understanding and stops undoing intentional design.

⚡ 60-second start

npm i -g @davesheffer/hunch
cd your-repo && hunch init && hunch backfill --since 90d
hunch why src/some/file.ts     # …or just ask Claude Code: "why is X built this way?"

Works with Claude Code, Cursor, Copilot & Windsurf from one shared graph.

📚 Read the full documentation → hunch-pi.vercel.app/docs

The docs site is the complete reference — setup, every CLI command and MCP tool, the guards, troubleshooting. This README is the tour. Jump to: Install · MCP setup · Firmness · CLI reference · Troubleshooting

The problem

Every AI coding session starts from zero. The model re-reads your code, re-guesses the intent, and happily "fixes" the thing you deliberately did last month — because the reasoning behind the code lives in PRs, Slack, and people's heads, not in the repo.

Hunch captures that reasoning as a byproduct of normal work — commits and test failures — stores it as a git-tracked graph next to your code, and feeds it back to Claude Code so every session is grounded in the decisions, bugs, and invariants that came before. Local-first, no documentation toil, no SaaS.

How it works

   commit / test failure              .hunch/  (git-tracked JSON)            Claude Code
 ┌───────────────────────┐         ┌──────────────────────────┐         ┌──────────────┐
 │ post-commit hook   ───┼────────▶│ Decisions  (why a change) │────────▶│ MCP tools     │
 │ record-bug         ───┼────────▶│ Bugs       (root causes)  │  read   │ /hunch-* cmds │
 │ structured diff +     │  write  │ Constraints(invariants)   │◀────────│ CLAUDE.md     │
 │ Claude (or heuristic) │         │ Components / Symbols/Edges │         │ CLI           │
 └───────────────────────┘         └──────────────────────────┘         └──────────────┘

Index (no LLM): tree-sitter parses your repo into a symbol/dependency graph — functions, call edges, imports, components — plus churn and fan-in metrics.
Learn: each commit becomes a structured Decision (an ADR); a failing test becomes a Bug with a ranked suspect list; recurring or severe bugs are promoted into Constraints (do-not-break invariants) and raise a component's fragility.
Ground: any MCP assistant reads it through an MCP server, an auto-maintained CLAUDE.md, and slash commands — every answer cites provenance (source + confidence + evidence), so nothing is a blind assertion.

→ Concepts in depth: the reasoning graph · provenance · time-travel

Getting started

npm install -g @davesheffer/hunch   # Node ≥ 20; puts `hunch` on your PATH
cd your-repo
hunch init                          # scaffold .hunch/, index, install hooks, wire up assistants
hunch backfill --since 90d          # cold start: seed decisions from recent git history
hunch why src/auth/session.ts       # …then ask your assistant: "why is X built this way?"

hunch init scaffolds .hunch/, indexes the repo, installs the git hooks + merge driver, writes .mcp.json + slash commands + an auto-maintained CLAUDE.md, and wires up every detected assistant (Claude Code, Cursor, VS Code/Copilot, Windsurf, Codex) to the same graph — merging idempotently into existing files. Reload your assistant in the repo afterward to pick up the hunch_* tools. Each teammate runs hunch init once; the .hunch/ content is shared via git.

Synthesis is billed to your coding-assistant subscription (Claude/Codex/Cursor CLI), never a pay-per-token API key — and falls back to a deterministic heuristic if no CLI is present. Details: Synthesis & billing.
Deep Synthesis (backfill --deep / sync --deep): reconcile multiple independent drafts into one — fan out across every signed-in CLI, or, with a single CLI, sample it N times for self-consistency (--samples, default 2). Confidence is agreement-weighted (capped below the enforcement threshold, so it stays advisory). Add --verify (auto under --deep) for a Critic pass that audits each draft against its commit — pruning unsupported rejected-alternatives before they become tripwires and down-weighting weak grounding; it only ever lowers confidence, never arming enforcement. Subscription-only, never on the guard path; degrades to the single-provider draft when no CLI is available. On Windows, prefer hunch init over a global claude mcp add; if tools don't appear, hunch doctor heals it (why).

Full walkthrough → Getting started · MCP & assistants · MCP tools · slash commands · the 22-command CLI

Enforcement: memory that holds the line

Hunch isn't just recall — it's a set of guards that stop the AI (and you) from undoing intentional design. All ride the same rails: the pre-edit hook, hunch check, the hunch_merge_verdict MCP tool, and the CI Constraint Guard. How hard they push is one committed knob — firmness (off → advisory → firm → strict).

Never Twice — corrections become enforced invariants

You tell the agent "no, never call the pay-per-token API here," it complies once, and next session it does it again — because the feedback was stored as advisory text. Hunch closes that loop: a correction is captured as a first-class Constraint (human_confirmed) via hunch_record_correction, and from then on the same hook + CI guard hold every assistant to it. → docs

Causal Merge Verdict — does this change re-open a closed bug?

A diff-only reviewer sees what changed; it can't see that the line you're deleting is the fix for an incident. Hunch can — hunch_merge_verdict replays a diff against the graph and returns a cited BLOCK / WARN / PASS:

VERDICT: ⛔ BLOCK — this change breaks a recorded invariant or re-opens a known bug.

⛔ pay() must verify the session before charging — con_pay
   🧠 why: "Charge must verify the session first" (dec_pay)
   🐞 guards against: Double-charge on unverified session (bug_…)

Deterministic (no LLM), and safe as a merge gate — it blocks only on a direct, high-confidence, non-stale blocking invariant. → docs

Decision Guard (Veto) — re-introducing a rejected approach is blocked

The most expensive reversal is re-adding an approach a decision rejected (latency, a forbidden dependency) — code that never existed, so a diff reviewer is blind to it. A decision carries machine-checkable tripwires; re-introduce one and Hunch blocks it with the receipt of what you rejected and why. → docs

Redundancy Guard — "this already exists"

An agent works from a local context window, so it re-implements a helper that already lives three modules over, or re-adds a dependency the codebase already has — sprawl a diff-only reviewer can't see, but Hunch's symbol graph can. Add a function or class already defined elsewhere and hunch check / the CI guard / hunch_merge_verdict flag it with the existing location. Deterministic and advisory — it never blocks; tuned to stay quiet (stopword + length filters, scoped to the change's own project root, move-aware so a refactor isn't mistaken for a duplicate). → docs

Plus the Regression Guard (re-adding deliberately-retired code) and the CI Constraint Guard (hunch ci — a PR gate that comments the affected con_/dec_ ids and fails on a blocking one).

Working as a team

The .hunch/ JSON is the source of truth — diffable, reviewable in PRs, synced for free over git push / pull. hunch init registers a git merge driver so concurrent edits merge by record id (human-confirmed beats auto, then confidence, then recency). The graph is OS-agnostic: paths are stored in POSIX form and an installed Hunch registers its MCP server by package name, so Windows / macOS / Linux teammates share one memory without per-machine fixups. → docs

Private memory (public repo, private context)

Open-source your code without open-sourcing your reasoning. hunch private sets up a separate private store in one command — Hunch unions it into every query and guard locally (MCP and the pre-edit hook see your sensitive decisions/bugs/constraints) while your public .hunch/ stays clean. It writes a gitignored .hunch/local.json so it's auto-detected — no env var, no shell-profile edit (and HUNCH_PRIVATE_DIR still overrides per-shell). Opt-in, default-off (no config → fully inert), and leak-safe by construction: committed files and the CI PR comment render public-only, so a private record can't reach a public surface. Record sensitive items with private: true (hunch_record_decision / hunch_record_correction); post-commit synthesis can route there too, and hunch private --auto-commit (opt-in) auto-commits + pushes each capture to the private repo — recursion-safe, staging only .hunch/. → docs

Continuous learning (CI)

The decision half of the loop is automatic (the post-commit hook). Light up the bug/constraint half by wrapping your test run — it captures failures as Bugs (recurrences auto-promote Constraints) and resolves fixed ones, preserving the runner's exit code:

hunch test                  # runs `npm test`; any runner: hunch test -- pytest -q

Drop npx hunch test into CI, and hunch ci to scaffold the PR merge gate. → docs

Semantic search (optional)

hunch query uses fast keyword search out of the box. For recall on paraphrases, opt into local embeddings (npm i -g @huggingface/transformers && hunch embed) — local, free, and opt-in. Vectors live in the derived SQLite index and never drift from the JSON source of truth.

VS Code

A companion VS Code extension (on Open VSX — VS Code / Cursor / Windsurf / VSCodium) brings the graph into the editor: a tree of decisions / invariants / bugs / bug-lineage / fragility / stale records, CodeLens summaries, hover with bug history, invariants in the Problems panel, an interactive component graph, and a status-bar invariant counter. It reads the committed .hunch/ JSON directly; writes delegate to the hunch CLI.

Architecture

src/
├─ core/         types (Zod schema), ids, paths, glob, schema migration, atomic file I/O
├─ store/        JSON source of truth ←→ SQLite/FTS5 derived index; merge driver; compaction
├─ extractors/   tree-sitter parse, git introspection, the indexer
├─ synthesis/    write path: subscription CLI (Claude/Codex/Cursor) or deterministic fallback
├─ mcp/          MCP stdio server (the hunch_* tools)
├─ integrations/ post-commit hook, CLAUDE.md writer, .mcp.json + slash commands, merge driver
└─ cli/          commander entrypoint

Everything lives under .hunch/ as git-tracked JSON (the source of truth); SQLite is a throwaway derived index. → storage layout · the docs for the full conceptual model.

Notable engineering decisions

Subscription-billed synthesis, never the API. The write path drives your Claude subscription via the claude CLI; ANTHROPIC_API_KEY / ANTHROPIC_AUTH_TOKEN are stripped from the child env to force subscription auth. A deterministic, no-LLM fallback means the loop never hard-requires credentials.
Native tree-sitter (0.21.1), not web-tree-sitter — the prebuilt WASM grammars have an incompatible ABI; the native bindings ship Node-20 prebuilds and a simpler synchronous API.
better-sqlite3 pinned to 12.9.0 — 12.10.x ships no Node-20 prebuild.
Atomic, durable writes. All .hunch/ writes go through temp-file + rename; an interrupted write can't truncate the index, and put/delete refuse to rewrite a corrupt one.
OS-agnostic by construction. Paths are canonicalized to POSIX before comparison, and committed configs reference Hunch by package name, never a machine-local path.

Develop

npm run dev -- <args>   # run the CLI from source via tsx (no build step)
npm run typecheck       # strict tsc — the gate
npm test                # node:test suite
npm run build           # compile to dist/ (the published artifact)

Hunch is pure TypeScript ESM, Node ≥ 20, licensed Apache-2.0. See CONTRIBUTING.md and the full developer docs.