@justethales/casp

v0.3.2

Published

3 days ago

CASP — the Coding-Agent State Protocol. The deterministic gate that validates your coding agent's project state against git and blocks the push the moment it drifts — across sessions, branches and a team. The complement to long-running autonomous models:

CASP — the Coding-Agent State Protocol

The model holds the context. CASP proves the state is true — against git. The new models run your whole roadmap for hours, even days, without losing the thread — which is exactly why state drift matters more, not less: the more an agent does between your checkpoints, the more its recorded state can quietly stop matching git. Point any coding agent at a CASP repo and it executes phase after phase across sessions, branches and a team — writing its own next-session prompt, logging every session — and casp check blocks the push the moment the state drifts from git. Everyone stores context; CASP proves it. The complement to long-running, autonomous models — with Claude Code today, and every model that ships next. MIT, zero telemetry, no SaaS.

npm i -g @justethales/casp        # or: npx @justethales/casp <command>
casp init            # scaffold the casp/ layer in any repo
casp status          # one-screen "where am I"
casp check           # validate the state against git — exits 1 on drift

Coding-Agent · State · Protocol. Works with Claude Code, Cursor, Aider, Continue, or any agent that can run a CLI. Node ≥ 20. No account, no telemetry, nothing leaves your machine.

Built by Thales (Juste Gnimavo) — a solo CEO running seven production products with Claude as the only engineer. CASP is the layer that keeps months of AI-driven sessions from collapsing into drift.

Pre-flight check + black box for AI coding sessions.

01 · The thread you keep losing

You come back to a project after a week — or you juggle five at once. The agent reads a state file that no longer matches reality, confidently starts work that already shipped, and you burn an afternoon undoing it.

Boards, cards and spreadsheets don't save you: reconstructing context is manual, and the agent can't read any of it. The state needs to be machine-readable, git-native — and provably true.

// casp/state.json  ● DRIFTED
{
  "phase":       "13 — camera streaming",
  "next_prompt": "phases/14-camera.md",   // already shipped in v13.4
  "last_commit": "a1f3c9",                // not in git history
  "migrations":  ["0001"…"0007"]          // git stops at 0006
}

A stale state file makes your agent confidently wrong. CASP gives every project one thread that survives across sessions — and can't drift silently.

02 · The wedge — everyone stores context, CASP validates it

The adjacent space — Mem0, Letta, Zep, the new git-native "memory" projects — all store what happened. Almost none verify that the stored state still matches git reality.

That verification is casp check — and it's mandatory before every push. It catches:

next-prompt drift — your next_prompt points at a file that's already shipped, or doesn't exist. CASP refuses to start the wrong session.
git ground-truth — last_commit not in history, migrations list out of sync, uncommitted state — checked against git itself, not a guess.
push, blocked — no fuzzy similarity scores. A hard, repeatable pass/fail gate that stops the push while the state is lying. casp check exits non-zero on drift, so it works as a real CI status check, not a decorative log.

03 · Beside your existing stack

CASP replaces nothing in your workflow. It fills the one gap nothing else covers — the validated present tense of a project, in a form your agent can read and act on.

| Layer | Tool | What it holds | The gap | |---|---|---|---| | Intent | Jira · Linear | What you plan to do | Drifts from reality, lives in the cloud, your agent can't reliably read it. | | Validated present | CASP | Where the project stands now — and the exact next move, proven against git | (this is the gap nothing else fills) | | History & verification | git · PR · CI | What changed · is it reviewed · does it build | A perfect record of the past — silent about what comes next. |

Git, PRs and CI don't know what ships next. CASP does.

04 · Three files. One thread.

No database. No service. No vector store. Three plain files an agent can read on the first line of any session, scaffolded by casp init into casp/:

| File | Role | What it holds | |---|---|---| | state.json | source of truth | Machine-readable, per project: current phase, next phase, the exact next-prompt to execute, phases shipped, migrations applied, last commit, last session id. | | now.md | for humans | The one-screen "where am I right now." Open it, get the thread back in five seconds — no archaeology. | | roadmap.md | what ships next | The Next-3 to ship plus a phase scoreboard. The agent always knows the order of work. |

Templates are gates, not guidance. Canonical session-prompt, session-log and audit-brief templates mean every session — human or agent — produces the same-shaped artifacts. Structure is enforced, not suggested. Draft them with casp new prompt|log — don't copy-paste a prior one.

05 · Built for big roadmaps

A real product isn't one feature. It's dozens of phases across API, web client and mobile, shipped over weeks by rotating sessions and agents. CASP keeps a single validated order across all of it — so any agent knows which phase is next, and never re-ships a shipped one.

roadmap.md — phase scoreboard            13 shipped ▰▰▰▰▱▱ 22 total

  10  api     Realtime sync engine        shipped
  11  mobile  Push notifications          shipped
  12  mobile  Offline-first cache         shipped
  13  web     Team permissions            shipped
  14  web     Analytics dashboard ◂ NEXT  active
  15  api     Per-seat billing            queued
  16  mobile  Biometric login             queued

One ordered thread across forty phases — web and mobile.

06 · State, not memory

CASP is not an AI memory layer. Memory tools remember who you are. CASP tracks where your project stands — and proves it. Different artifact, different operation, different failure it prevents.

| | CASP | Memory layers · Mem0 / Letta / git-native "soul" | |---|---|---| | What it holds | Project execution state | User facts & preferences | | Core operation | Validates against git | Stores & recalls | | On conflict | Deterministic check vs ground-truth | Fuzzy similarity guess | | When it runs | Synchronous gate — blocks the push | Async / eventual recall | | Leaves your machine | Never · zero telemetry | Varies / cloud |

07 · The command deck

Five verbs. Trivially typed — one syllable, no homographs, the same in English, French or Spanish.

| Command | What it does | |---|---| | casp init | Scaffold the continuity layer (casp/) into any repo. Idempotent — re-running never overwrites existing files. | | casp status | One-screen snapshot: phase, next, what's shipped, last 10 commits. --plain strips ANSI. | | casp check | The drift validator. Validates state.json against the filesystem and git. Exits 1 on drift. --quiet only prints on FAIL (CI-friendly); --no-git skips git-dependent checks; --json emits a machine-readable report with a stable schema (docs/check-json.md) — same checks, same exit code. | | casp next | Print the next session's prompt straight from state.next_prompt — pipe-friendly, exits non-zero when there's no actionable prompt. | | casp new prompt --slug X | Generate a gated session-prompt from the canonical template into docs/plan/sessions/. | | casp new log --slug X | Open a session-log in the shape every session shares, into session-logs/. |

08 · In your editor

CASP ships Claude Code slash-commands so the state lives where you already work. Drop them in once:

cp -r node_modules/@justethales/casp/skills/casp ~/.claude/skills/
cp -r node_modules/@justethales/casp/skills/next ~/.claude/skills/

| Command | What it does | |---|---| | /casp | Read-only status — the agent reads the current thread before it writes a single line. | | /next | Auto-start the next session straight from state.next_prompt. No copy-paste, no guessing. |

Works with Claude Code · Cursor · Aider · Continue — anything that reads files. The CLI is the contract; the slash-commands are an optional convenience.

09 · For engineering orgs

One agent doing the wrong thing costs an afternoon. A hundred agents doing it across a hundred repos costs a quarter. CASP is the deterministic guardrail you drop into the automation loop — the same shape in every project.

A required CI status check. casp check sits in the same slot as lint and tests. A state that lies can't merge — drift is blocked at the org level, not left to anyone's discipline.
A guardrail for agent fleets. Autonomous agents multiply mistakes. CASP hands every one of them the same validated thread to read and the same hard gate before it pushes. Automation without the duplicate-work tax.
An audit trail, for free. Every state transition is a git commit. A complete, diffable, revertable record of how each project moved — git log is your compliance trail.
Passes infosec by design. Local-only, zero telemetry, no cloud, no account. Nothing to vet, nothing to exfiltrate. The security review is one line: it never leaves the machine.

# .github/workflows/ci.yml
jobs:
  state-check:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
        with: { fetch-depth: 0 }   # casp checks against full git history
      - run: npx @justethales/casp check        # ✗ fails the build the moment state drifts

One protocol, every repo. The same validated shape, org-wide.

10 · The contract

A protocol earns adoption by being predictable. These don't bend.

Validate state, not intent. CASP checks what your repo is, never what you meant to do. Facts against git, every time.
Templates are gates. Canonical artifacts are enforced, not suggested. Every session comes out the same shape.
check before every push. The validator is not optional. A lying state never reaches your remote.
Nothing leaves your machine. Deterministic, git-native, local-only. Zero telemetry. No cloud, no account, no bill.

Quickstart

npm i -g @justethales/casp
cd my-project
casp init                 # scaffold the layer
casp status               # where am I right now
casp check                # prove the state is true (exits 1 on drift)

casp init creates:

casp/
├── state.json          # machine-readable session state (the validator reads this)
├── now.md              # current focus (human-readable, one paragraph)
├── roadmap.md          # Next-3 to ship + phase scoreboard
├── README.md           # the protocol, in your repo
└── templates/
    ├── session-prompt.md
    ├── session-log.md
    └── audit-brief.md

Edit casp/now.md, casp/roadmap.md and casp/state.json to describe your project, draft your first session prompt with casp new prompt --slug phase-1-first-slice, and run casp check before every push. That is the whole loop.

What the validator catches

$ npx @justethales/casp check

casp:check · 22 PASS · 2 WARN · 1 FAIL
──────────────────────────────────────────────────────────────────────
  FAIL  next_prompt is already SHIPPED · docs/plan/sessions/PHASE-1-AUTH.md has status: shipped
        → either update state.json.next_prompt to the real next slice, or re-execute it explicitly
  WARN  last_commit is in history but not at HEAD · state=abc1234 HEAD=def5678
        → bump state.last_commit to def5678
  ...

✗ 1 drift detected. Push blocked — fix before push.

Nine check categories, each with a one-line → fix hint so the agent can resolve without re-reading docs:

state.json.next_prompt points at a missing file.
state.json.next_prompt points at a prompt with status: shipped. (The exact bug CASP was built to catch.)
state.json.last_session_id does not map to a session-log file.
state.json.last_commit not in git log.
state.json.phases_shipped[] has duplicates.
state.json.migrations_applied[] does not match the migrations directory.
A session prompt has status: shipped but session_log: pending.
Uncommitted changes in casp/, docs/plan/sessions/, or session-logs/.
A state claim whose backing directory is missing — claimed migrations, shipped phases, or a session id the validator cannot verify FAIL; a check that cannot find what it needs never reports green.

The exit-code contract — clean → exit 0, drift → exit 1 — is covered by npm test, so the CI gate stays real.

Need the report as data instead of text? casp check --json emits the same findings as structured PASS/WARN/FAIL with a stable, documented schema — for CI annotations, webhooks, and roll-ups. See docs/check-json.md.

What this is NOT

Not a task tracker. Use Linear, Jira, GitHub Issues. CASP is about which session ships next, not which issues are open.
Not an AI memory / RAG layer. It validates project state against git; it doesn't store user facts or do similarity recall.
Not a CI tool. It runs locally. You can wire casp check into CI as a pre-push gate, but it doesn't replace your test runner.
Not opinionated about your code. Rust, Python, TypeScript, Go — CASP cares about session state, not your stack.
Not a replacement for CLAUDE.md. That's your project's constitution (rules, conventions). CASP is the operating state on top of it.

Roadmap

0.4 — casp install-hook: wire casp check into your pre-push hook in one command. The validator stops being optional.
0.5 — Pre-session gate on casp next: refuse to start a session on a drifted state (--no-check escape hatch). Both boundaries gated — start and push.
0.6 — Configurable paths (sessions_dir / logs_dir state keys) so non-standard layouts validate against the right ground truth.
0.7 — casp status --json: the structured session handoff, and the substrate for a multi-repo roll-up (status --all).
Later — casp verify <commit> + casp state diff: validate a past state; inspect how the state evolved between two commits.
Demand-gated — native binaries, a narrow casp rollback (state mutation only, never code), a CI status-check installer, a generic webhook notifier (user-owned outbound, off by default).

Cut from earlier drafts, deliberately: casp lint (an LLM verb inside the CASP binary — even advisory — would undercut the deterministic claim; prose-vs-reality checking belongs in your agent, which can read casp/ today for free).

Vote on the roadmap with GitHub issues / reactions.

FAQ

Does this require Claude Code? No. The CLI works standalone with any agent that runs shell commands. The /casp and /next slash-commands are an optional bundle for Claude Code users.

Is CASP an AI memory product? No — that's the wedge. Memory tools (Mem0, Letta, Zep) store and recall. CASP validates project execution state against git and blocks the push on drift. Different artifact, different operation.

Does it work for Cursor, Aider, Continue? Yes. The CLI is just a CLI; any agent that runs shell commands and reads files can use it.

What about Python, Rust, Go projects? The CLI is Node-only (TypeScript via tsx). Use it via npx without committing Node as a project dependency. A binary distribution is on the roadmap.

Is any data sent anywhere? No. Zero network calls. Zero telemetry. CASP reads your filesystem and your git. Nothing leaves your machine.

Why "CASP"? Coding-Agent State Protocol — the name says exactly what the artifact is. An earlier name for this tool collided with a well-known Red Hat project of the same name; CASP relaunches it as a protocol, the way MCP did for model context.

Where do I report bugs or request features? github.com/ThalesGnimavo/casp/issues

License

MIT. Use it, fork it, ship it.

If CASP saves you 30 minutes a week, that's enough payback. If you want to say thanks: star the GitHub repo, or say hi on X.

CASP — the Coding-Agent State Protocol. The model holds the context; CASP proves the state is true — against git.