@swarmclawai/agentbox

v0.4.1

Local-first black box recorder for AI agent terminal runs: replay terminal output, file diffs, MCP calls, and risk flags.

Downloads

630

agentbox

Black box footage for your AI agent. agentbox records terminal-based agent runs into a local, self-contained replay with terminal output, file diffs, MCP tool calls, redactions, and risk flags.

Why this exists

Coding agents now touch terminals, files, tools, and MCP servers. When a run succeeds, fails, or does something surprising, the evidence is usually scattered across scrollback, Git diff, and tool logs.

agentbox keeps that evidence local and replayable. It does not ask you to send traces to a cloud service, adopt a framework, or change agents. Wrap the command you already run, then inspect what happened.

30-second demo

npx @swarmclawai/agentbox@latest demo
npx @swarmclawai/agentbox@latest demo --scenario failure
npx @swarmclawai/agentbox@latest demo --scenario mcp-risk

Each demo writes a replay, Markdown report, and redacted export zip:

.agentbox/runs/<run-id>/agentbox-run.html
.agentbox/runs/<run-id>/REPORT.md
.agentbox/runs/<run-id>/agentbox-<run-id>.zip

The replay includes:

  • terminal playback stored as asciicast v2
  • file diffs from before/after the run
  • MCP tools/list and tools/call events when using agentbox mcp-proxy
  • conservative secret redaction
  • risk flags for suspicious tool output

Install

pnpm add -g @swarmclawai/agentbox

Or run without installing:

npx @swarmclawai/agentbox@latest record -- <agent command>

Commands

| Command | Purpose |
|---|---|
| agentbox demo --scenario success\|failure\|mcp-risk | Create a deterministic replay, report, and export zip |
| agentbox record -- <command...> | Record a terminal-based agent run |
| agentbox export <run\|latest> | Create a redacted zip for sharing |
| agentbox report <run\|latest> | Create a Markdown report for local review or CI |
| agentbox list | List local runs with status filters |
| agentbox library | Generate a searchable local HTML run index |
| agentbox open <run\|latest\|library> | Open a replay or the local run library |
| agentbox compare <base> <head> | Compare two recorded runs |
| agentbox clean | Safely delete old run directories |
| agentbox inspect <run> | Summarize a recorded run |
| agentbox render <run> | Regenerate agentbox-run.html |
| agentbox mcp-proxy --name <server> -- <server-command...> | Log MCP stdio tools/list and tools/call |
| agentbox install --platform <name> | Install platform instructions and lightweight hooks |
| agentbox --help-agents | Print the machine-readable command catalog |

Every data-returning command supports --json and emits one JSON line on stdout.
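Because each --json invocation prints a single JSON line, a wrapper script can parse stdout directly. The following is a minimal Python sketch of that idea. The payload fields shown (`ok`, `runs`) are hypothetical and are not taken from agentbox's actual output schema.

```python
import json


def parse_json_line(stdout: str) -> dict:
    """Parse the single JSON line that a --json command prints on stdout."""
    lines = [line for line in stdout.splitlines() if line.strip()]
    if len(lines) != 1:
        raise ValueError(f"expected exactly one JSON line, got {len(lines)}")
    payload = json.loads(lines[0])
    if not isinstance(payload, dict):
        raise ValueError("expected a JSON object")
    return payload


# Hypothetical payload: these field names are illustrative only.
sample_stdout = '{"ok": true, "runs": []}\n'
result = parse_json_line(sample_stdout)
print(result["ok"], len(result["runs"]))
```

In practice the `stdout` string would come from something like `subprocess.run([...], capture_output=True, text=True).stdout`.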

Recording an agent

agentbox record -- codex "add tests for the parser"
agentbox record -- claude "fix the failing build"
agentbox record -- goose run "summarize this repo"

By default, agentbox captures terminal output and resize events. It does not store typed input unless you explicitly pass:

agentbox record --capture-input -- <agent command>
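The recording is stored as asciicast v2, which is a JSON header line followed by one `[time, code, data]` array per event. In that format `o` marks output, `i` marks input, and `r` marks a resize. The Python sketch below counts events by code, which is one way to check that a recording holds no input events. It assumes the file follows the public asciicast v2 layout; the sample content is made up for illustration.

```python
import json
from collections import Counter


def summarize_cast(text: str) -> tuple[dict, Counter]:
    """Return the asciicast v2 header and a count of events per event code."""
    lines = [line for line in text.split("\n") if line.strip()]
    if not lines:
        raise ValueError("empty recording")
    header = json.loads(lines[0])
    if header.get("version") != 2:
        raise ValueError("not an asciicast v2 recording")
    counts: Counter = Counter()
    for line in lines[1:]:
        _time, code, _data = json.loads(line)
        counts[code] += 1
    return header, counts


# Made-up sample: two output events and one resize, no input events.
sample = "\n".join([
    json.dumps({"version": 2, "width": 80, "height": 24}),
    json.dumps([0.1, "o", "hello\r\n"]),
    json.dumps([0.5, "r", "100x30"]),
    json.dumps([0.9, "o", "done\r\n"]),
])
header, counts = summarize_cast(sample)
print(header["width"], dict(counts))
```

A recording made without --capture-input would be expected to show a zero count for `i`.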

Sharing a run

Create a local, redacted zip from the latest run:

agentbox export latest

The zip includes agentbox-run.html, run.json, terminal.cast, events.jsonl, diffs.json, SHARE.md, and manifest.json with checksums. Review the bundle before posting it publicly.
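One way to review a bundle before sharing it is to recompute the checksums listed in manifest.json. The Python sketch below does that under a stated assumption: the manifest is taken to be an object of the form `{"files": {"<name>": "<sha256 hex>"}}`. That layout and the SHA-256 algorithm are hypothetical, so adjust both to whatever the real manifest contains.

```python
import hashlib
import io
import json
import zipfile


def verify_bundle(zip_bytes: bytes) -> list[str]:
    """Return the names of files whose digest differs from the manifest."""
    mismatched = []
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as bundle:
        manifest = json.loads(bundle.read("manifest.json"))
        # ASSUMPTION: manifest layout is {"files": {name: sha256-hex}}.
        for name, expected in manifest["files"].items():
            actual = hashlib.sha256(bundle.read(name)).hexdigest()
            if actual != expected:
                mismatched.append(name)
    return mismatched


def build_demo_bundle(run_json: bytes, recorded_digest: str) -> bytes:
    """Build a tiny in-memory zip so the sketch can run without agentbox."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as bundle:
        bundle.writestr("run.json", run_json)
        bundle.writestr(
            "manifest.json", json.dumps({"files": {"run.json": recorded_digest}})
        )
    return buffer.getvalue()


good = build_demo_bundle(b"{}", hashlib.sha256(b"{}").hexdigest())
print(verify_bundle(good))
```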

Create a Markdown summary for the latest run:

agentbox report latest
agentbox report latest --out agentbox-report.md

Reports include the command, exit code, duration, changed files, MCP/tool counts, risk flags, redactions, local replay path, export zip path, and artifact URL when supplied.

Managing runs

List recent runs:

agentbox list
agentbox list --status risky --limit 5

Open the latest replay or a searchable local library:

agentbox open latest
agentbox library --open

Compare two runs by id, unique prefix, path, or latest:

agentbox compare 20260423120000-a1b2 20260423123000-c3d4
agentbox compare 20260423120000-a1b2 latest --out agentbox-compare.md
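Resolving a run reference by exact id, unique prefix, or `latest` can be sketched as follows. This is an illustrative Python version, not agentbox's implementation. It assumes run ids begin with a sortable timestamp, as the example ids above suggest, and it leaves out path references.

```python
def resolve_run(ref: str, run_ids: list[str]) -> str:
    """Resolve `latest`, an exact id, or a unique id prefix to one run id."""
    if not run_ids:
        raise LookupError("no recorded runs")
    if ref == "latest":
        # ASSUMPTION: ids start with a timestamp, so the largest is the newest.
        return max(run_ids)
    if ref in run_ids:
        return ref
    matches = [run_id for run_id in run_ids if run_id.startswith(ref)]
    if len(matches) == 1:
        return matches[0]
    if not matches:
        raise LookupError(f"no run matches {ref!r}")
    raise LookupError(f"prefix {ref!r} is ambiguous: {matches}")


runs = ["20260423120000-a1b2", "20260423123000-c3d4"]
print(resolve_run("latest", runs))
print(resolve_run("20260423120", runs))
```

Rejecting ambiguous prefixes, rather than picking the first match, keeps a comparison from silently using the wrong run.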

Cleanups are conservative. Preview first, then confirm:

agentbox clean --keep 10 --dry-run
agentbox clean --keep 10 --yes
agentbox clean --before 30d --yes
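The preview-then-confirm flow amounts to computing a deletion list first and acting on it only after confirmation. The Python sketch below shows one plausible selection rule. The semantics are assumptions: `keep` retains the N newest runs, `older_than` selects runs that started before a cutoff, and the start time is read from the timestamp prefix of the run id. The function and parameter names are hypothetical.

```python
from datetime import datetime, timedelta
from typing import Optional


def started_at(run_id: str) -> datetime:
    # ASSUMPTION: ids look like 20260423120000-a1b2 (timestamp, dash, suffix).
    return datetime.strptime(run_id.split("-", 1)[0], "%Y%m%d%H%M%S")


def select_for_cleanup(
    run_ids: list[str],
    keep: Optional[int] = None,
    older_than: Optional[timedelta] = None,
    now: Optional[datetime] = None,
) -> list[str]:
    """Return the run ids that a cleanup would delete, newest first."""
    if keep is None and older_than is None:
        return []  # no criteria given, so nothing is selected
    newest_first = sorted(run_ids, key=started_at, reverse=True)
    candidates = newest_first[keep:] if keep is not None else newest_first
    if older_than is not None:
        cutoff = (now or datetime.now()) - older_than
        candidates = [r for r in candidates if started_at(r) < cutoff]
    return candidates


runs = ["20260423120000-a1b2", "20260423123000-c3d4", "20260301000000-e5f6"]
# Dry run: print the selection instead of deleting anything.
for run_id in select_for_cleanup(runs, keep=2):
    print("would delete", run_id)
```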

Agent integrations

Install lightweight instructions and hooks for your agent platform:

agentbox install --platform codex
agentbox claude install
agentbox opencode install
agentbox gemini install

Supported platforms mirror the SwarmVault/Graphify coverage: Claude Code, Codex, OpenCode, GitHub Copilot CLI, VS Code Copilot Chat, Aider, OpenClaw, Factory Droid, Trae, Trae CN, Cursor, Gemini CLI, Hermes, Kiro, Google Antigravity, and Windows Claude Code.

Hook-capable platforms log tool payload summaries into the active run when AGENTBOX_RUN_DIR is set. Platforms without hook support receive always-on project instructions that point users at agentbox record, agentbox inspect latest, and agentbox export latest.

MCP logging

Wrap an MCP stdio server with agentbox mcp-proxy inside a recorded run. The proxy forwards JSON-RPC unchanged and logs only tools/list and tools/call summaries.

{
  "mcpServers": {
    "filesystem": {
      "command": "agentbox",
      "args": [
        "mcp-proxy",
        "--name",
        "filesystem",
        "--",
        "npx",
        "-y",
        "@modelcontextprotocol/server-filesystem",
        "."
      ]
    }
  }
}

Then run your agent through agentbox record. MCP events appear in the replay's MCP tab.
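To illustrate the "forward everything, log only tool traffic" behaviour, the Python sketch below inspects one JSON-RPC message and returns a summary only for tools/list and tools/call requests. The method names and the `params.name` / `params.arguments` fields come from the MCP protocol. The shape of the summary dictionary is hypothetical and is not agentbox's event format.

```python
import json
from typing import Optional

LOGGED_METHODS = {"tools/list", "tools/call"}


def summarize_message(raw_line: str) -> Optional[dict]:
    """Return a small summary for tool requests, or None for anything else."""
    try:
        message = json.loads(raw_line)
    except json.JSONDecodeError:
        return None  # not JSON: a proxy would still forward it unchanged
    if not isinstance(message, dict):
        return None
    method = message.get("method")
    if method not in LOGGED_METHODS:
        return None
    summary = {"method": method, "id": message.get("id")}
    if method == "tools/call":
        params = message.get("params") or {}
        summary["tool"] = params.get("name")
        # Record argument names only, so argument values stay out of the log.
        summary["argument_keys"] = sorted((params.get("arguments") or {}).keys())
    return summary


request = json.dumps({
    "jsonrpc": "2.0",
    "id": 7,
    "method": "tools/call",
    "params": {"name": "read_file", "arguments": {"path": "README.md"}},
})
print(summarize_message(request))
```

This sketch covers requests only. A real proxy would also need to match responses to request ids, which is omitted here.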

Artifact layout

.agentbox/runs/<run-id>/
  run.json
  terminal.cast
  events.jsonl
  diffs.json
  agentbox-run.html

The HTML replay is self-contained and can be opened locally without a dev server.
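A quick sanity check on a run directory is to confirm that the files in the layout above are present. The Python sketch below does only that. The file names come from the layout; the function name and the temporary demo directory are illustrative.

```python
import tempfile
from pathlib import Path

EXPECTED_ARTIFACTS = (
    "run.json",
    "terminal.cast",
    "events.jsonl",
    "diffs.json",
    "agentbox-run.html",
)


def missing_artifacts(run_dir: Path) -> list[str]:
    """Return the expected artifact names that are absent from run_dir."""
    return [name for name in EXPECTED_ARTIFACTS if not (run_dir / name).is_file()]


# Demo against a temporary directory that holds only two of the files.
with tempfile.TemporaryDirectory() as tmp:
    demo_dir = Path(tmp)
    (demo_dir / "run.json").write_text("{}")
    (demo_dir / "terminal.cast").write_text("")
    print(missing_artifacts(demo_dir))
```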

GitHub Actions

Use the bundled action to preserve a replay for CI jobs and write an Agentbox report to the workflow summary:

- uses: swarmclawai/[email protected]
  with:
    command: pnpm test
    artifact-name: agentbox-test-run

The action records the command, exports a redacted zip, uploads it with actions/upload-artifact, and preserves the recorded command's exit code after reporting.
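Preserving the exit code after reporting means the wrapped command's status, not the reporting step's, decides whether the job passes. The Python sketch below shows that pattern in general terms and is not the action's source. Swallowing reporting errors is a design choice made here so that a failed upload cannot mask the test result.

```python
import subprocess
import sys
from typing import Callable


def run_then_report(command: list[str], report: Callable[[int], None]) -> int:
    """Run command, call report with its exit code, then return that code."""
    completed = subprocess.run(command)
    try:
        report(completed.returncode)
    except Exception as error:  # reporting must not change the outcome
        print(f"reporting failed: {error}", file=sys.stderr)
    return completed.returncode


# A child process that exits with status 3 stands in for a failing test run.
failing = [sys.executable, "-c", "import sys; sys.exit(3)"]
code = run_then_report(failing, lambda rc: print(f"command exited with {rc}"))
print(code)
```

A CI entry point would typically finish with `sys.exit(code)` so the job status mirrors the recorded command.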

PR comments are opt-in so default workflows do not need write-token permissions:

permissions:
  contents: read
  pull-requests: write

steps:
  - uses: actions/checkout@v5
  - uses: swarmclawai/[email protected]
    with:
      command: pnpm test
      comment-pr: "true"

Risk gates are also opt-in:

- uses: swarmclawai/[email protected]
  with:
    command: pnpm test
    fail-on-risk: high

Use fail-on-risk: medium or fail-on-risk: low for stricter gates. Set newline-separated redact-patterns to apply custom redaction during recording and export.
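The "stricter gate" wording implies a threshold: a lower fail-on-risk value fails on more flags. The Python sketch below is one way to express that, assuming severities are ordered low < medium < high and that a flag at or above the threshold fails the job. The function name and the input shape are hypothetical.

```python
from typing import Iterable, Optional

SEVERITY_ORDER = {"low": 1, "medium": 2, "high": 3}


def should_fail(flag_severities: Iterable[str], fail_on: Optional[str]) -> bool:
    """Return True when any flag is at or above the fail_on threshold."""
    if fail_on is None:
        return False  # gate not enabled: risk flags never fail the job
    threshold = SEVERITY_ORDER[fail_on]
    return any(SEVERITY_ORDER[s] >= threshold for s in flag_severities)


flags = ["low", "medium"]
for level in ("high", "medium", "low"):
    print(level, should_fail(flags, level))
```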

Redaction

agentbox redacts common API key and token patterns before writing artifacts. Add your own patterns with:

agentbox record --redact-pattern 'MY_SECRET_[A-Z0-9]+' -- <agent command>

Redaction is a safety net, not a guarantee. Review artifacts before sharing them publicly.
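To try out a custom pattern before recording with it, the Python sketch below applies a list of regular expressions to a string and counts the replacements. It only shows the general technique. The `[REDACTED]` placeholder and the function name are assumptions, and agentbox's own regex dialect and replacement text may differ.

```python
import re


def redact(text: str, patterns: list[str], placeholder: str = "[REDACTED]") -> tuple[str, int]:
    """Replace every match of each pattern and return (text, replacement count)."""
    total = 0
    for pattern in patterns:
        text, replaced = re.subn(pattern, placeholder, text)
        total += replaced
    return text, total


line = "export TOKEN=MY_SECRET_ABC123 && run"
clean, count = redact(line, [r"MY_SECRET_[A-Z0-9]+"])
print(clean, count)
```

Testing a pattern against representative log lines is a cheap way to catch expressions that match too little or too much.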

How it compares

| | agentbox | LLM observability SaaS | asciinema |
|---|---:|---:|---:|
| Works with any terminal agent | Yes | Usually no | Yes |
| Local-first HTML artifact | Yes | Usually no | Partial |
| Captures file diffs | Yes | Usually no | No |
| Captures MCP tool calls | Yes | Sometimes | No |
| Requires app/framework instrumentation | No | Often | No |

Development

pnpm install
pnpm run build
pnpm run typecheck
pnpm run test

License

MIT