@agent-undo/core
v0.2.0
Published
Ctrl-Z for AI agents. A universal time machine that records every side effect an autonomous agent produces — file writes, shell commands, git state, HTTP mutations — and reverses them with one command.
Downloads
371
Maintainers
Readme
The thing stopping people from running agents in full-auto isn't intelligence — it's fear. An agent edits 15 files, deletes a folder, runs a migration, sends an email, fires off an API call. If it screws up, the files are maybe recoverable. The deleted folder, the DB row, the sent email, the network call? No undo exists anywhere.
undo is that undo. Act freely, because everything is reversible.
$ undo watch # arm it — now any agent's changes are reversible
... agent wipes a secret, deletes auth.ts, dumps junk, POSTs a charge ...
$ undo diff # see exactly what it did
$ undo rollback # rewind all of it
✓ rewound to cp001Works with any AI agent
undo is not tied to any model, vendor, or IDE. Every agent does one thing in common — it changes files on disk — so undo meets it at whichever layer is convenient:
| Your setup | Turn it on | Covers |
|---|---|---|
| Anything — Cursor, Copilot, Windsurf, Aider, custom scripts, even you | undo watch | Snapshots, then watches the filesystem. Reversible no matter what made the change. |
| Any CLI agent | undo run -- <agent-cmd> | Wraps the command; snapshots first, reversible after. |
| Any MCP client | add the MCP server | The agent calls undo_checkpoint / undo_track / undo_rollback itself. |
| Claude Code | undo protect | Native PreToolUse hook — auto-checkpoints every session, zero effort. |
Install
The CLI works on macOS, Linux, and Windows — no Node required:
cargo install undo-core # via crates.io (installs the `undo` binary)
brew install tathagat22/tap/undo # via Homebrew
curl -fsSL https://raw.githubusercontent.com/tathagat22/agent-undo/main/packaging/install.sh | shThe MCP server (for MCP clients like Cursor / Claude):
npx -y @agent-undo/coreWhat it reverses
One consistent model — record a change with its inverse, replay the inverse on rollback — across every domain. Anything that touches the outside world is dry-run gated: undo shows you what it would do and never fires blindly.
📁 Files — byte-perfect, crash-safe
Create / modify / delete / directories / symlinks / permissions, all restored exactly. A content-addressed blob store means even large and binary files come back byte-for-byte. Plus redo, and selective per-file revert.
undo rollback # rewind everything since the checkpoint
undo revert src/auth.ts # ...or just one file
undo redo # ...changed your mind🔍 undo diff — review before you trust
A PR-style view of exactly what the agent changed, built from undo's own before-snapshots:
src/auth.ts modified +2 -2
-const KEY = "prod-secret"
+const KEY = ""
src/new.ts new +1 -0
+export const added = true
2 file(s) changed, +3 -2🌐 Network calls — actually reversed
When the agent records a mutation with a compensator (the request that reverses it), undo runs it:
agent: POST /v1/charges → records a refund as the compensator
undo_compensate → preview: "WOULD send the refund"
undo_compensate execute=true → fires it, most-recent-first✉️ Email — honest hold-and-release
No tool can recall a delivered email — the recipient has a copy nothing can touch. So undo does the one honest thing that works: it holds the email as a draft that has gone nowhere.
undo_email_stage to=… subject=… body=… # held, NOT sent
undo_email_cancel # delete the draft → it never existed
undo_email_release # ...or actually deliver itBefore release: cancel is a true unsend. After delivery: it's gone, and undo says so plainly — the most it can do then is trash your copy. We don't pretend to reach into other people's inboxes.
☁️ Cloud & databases — any tool
undo doesn't hardcode AWS or Postgres. The agent records the command that reverses what it did, and undo runs it (dry-run gated):
undo_record_reversal description="created S3 bucket assets-prod" command="aws s3 rb s3://assets-prod --force"
undo_record_reversal description="ran migration 042" command="psql $DB -f rollback_042.sql"
undo_compensate execute=trueWorks with any cloud, database, or CLI. (For DB UPDATE/DELETE, you record the inverse — undo runs what you give it.)
CLI
undo init set up undo in this directory
undo checkpoint [label] mark a point you can rewind to
undo track <path>... capture a path before the agent changes it
undo status what's changed since the last checkpoint
undo diff a PR-style diff of everything the agent changed
undo rollback [checkpoint] rewind everything since a checkpoint
undo revert <path> selectively undo just one file
undo redo undo the last rollback
undo watch snapshot + watch the filesystem (any agent)
undo run -- <command> snapshot, then run any command reversibly
undo protect / unprotect install / remove the Claude Code auto-capture hookMCP server
Add to your MCP client's config (e.g. .mcp.json):
{ "mcpServers": { "undo": { "command": "npx", "args": ["-y", "@agent-undo/core"] } } }16 tools: undo_init · undo_checkpoint · undo_track · undo_status · undo_diff · undo_log · undo_rollback · undo_revert · undo_redo · undo_record_http · undo_record_reversal · undo_compensate · undo_email_stage · undo_email_release · undo_email_cancel · undo_email_pending
Architecture
A polyglot system with a real native boundary:
┌─────────────────────────────┐
│ TypeScript (agent surface) │ MCP server · compensation · email · reversals
├─────────────────────────────┤ ↕ NAPI-RS (in-process, no subprocess)
│ Rust (the engine) │ Effect · Journal · blob store · rollback · diff
│ crates/undo-core │ + the standalone `undo` CLI
└─────────────────────────────┘Rust owns the part that touches your filesystem and has to be fast and trustworthy; TypeScript owns the agent-facing surface; NAPI-RS bridges them in-process.
Why you can trust it
A universal undo is only worth anything if it's correct under pressure:
- Crash-safe — journal/state written write-temp-then-rename (atomic on POSIX).
- Rollback integrity — if any step fails, the journal is left intact and it's safe to retry; never reports success while leaving files unrestored.
- Concurrency-safe — an exclusive lock, so an agent and a human can't corrupt the journal.
- Sandboxed — refuses paths outside the project, never captures
.undo, auto-gitignores snapshots so secrets aren't committed.
This is tested, not asserted: unit tests per property, a property test that runs dozens of randomized mutation sequences and asserts byte-for-byte round-trips, a concurrency test that hammers one journal from many threads, and Node suites that drive real HTTP/Gmail/command reversals against mock servers. The engine suite runs in CI on Linux, macOS, and Windows.
Platform note: the engine is verified on all three OSes. On Windows, content + structure + mtime restore exactly; unix permission bits and symlink fidelity are POSIX-only (they no-op rather than fail).
License
MIT © Tathagat Maitray
