npm package discovery and stats viewer.

Discover Tips

  • General search

    [free text search, go nuts!]

  • Package details

    pkg:[package-name]

  • User packages

    @[username]

Sponsor

Optimize Toolset

I’ve always been into building performant and accessible sites, but lately I’ve been taking it extremely seriously. So much so that I’ve been building a tool to help me optimize and monitor the sites that I build to make sure that I’m making an attempt to offer the best experience to those who visit them. If you’re into performant, accessible and SEO friendly sites, you might like it too! You can check it out at Optimize Toolset.

About

Hi, 👋, I’m Ryan Hefner  and I built this site for me, and you! The goal of this site was to provide an easy way for me to check the stats on my npm packages, both for prioritizing issues and updates, and to give me a little kick in the pants to keep up on stuff.

As I was building it, I realized that I was actually using the tool to build the tool, and figured I might as well put this out there and hopefully others will find it to be a fast and useful way to search and browse npm packages as I have.

If you’re interested in other things I’m working on, follow me on Twitter or check out the open source projects I’ve been publishing on GitHub.

I am also working on a Twitter bot for this site to tweet the most popular, newest, random packages from npm. Please follow that account now and it will start sending out packages soon–ish.

Open Software & Tools

This site wouldn’t be possible without the immense generosity and tireless efforts from the people who make contributions to the world and share their work via open source initiatives. Thank you 🙏

© 2026 – Pkg Stats / Ryan Hefner

@tuel/code-oz

v0.21.0-alpha.0

Published

CI-style gates for AI coding agents — local-first governed delivery loop

Downloads

486

Readme

code-oz

CI-style gates for AI coding agents.

code-oz runs coding agents through a repo-local delivery loop:

DEFINE → PLAN → BUILD → VERIFY → REVIEW → SHIP

Use it when direct AI coding is too unconstrained and you want every change to pass through inspectable artifacts, approval gates, verification evidence, and independent review before it ships.

AI agents are fast. code-oz makes their work auditable. It is for risky repos, not fastest-loop coding.

Tests Release npm version Homebrew License: MIT Platform Tests passing

macOS note: code-oz binaries are not yet Apple-Developer-signed (signing + notarization deferred to v0.x stable). Gatekeeper may prompt on first launch; the install script applies xattr -d com.apple.quarantine as a workaround, and brew install handles this automatically.

What you get

  • file-based phase gates you can inspect in the repo
  • approvals bound to exact artifact SHA-256s
  • isolated worktrees for agent changes
  • an events.jsonl ledger for reconstructing what happened
  • cross-family review so the builder and reviewer are not the same model family

Status: public alpha. The deterministic demo uses FakeProvider so you can inspect the lifecycle without spending tokens. FakeProvider proves lifecycle gates and ledger determinism, not model quality.

Install

Three channels deliver the same single binary, verified against the same checksums.txt.

# curl | sh
curl -fsSL https://github.com/omerakben/code-oz/releases/download/v0.20.1-alpha.0/install.sh \
  | sh -s -- --version v0.20.1-alpha.0

# npm (scoped under the TUEL AI publisher; binary still runs as `code-oz`)
npm install -g @tuel/code-oz

# Homebrew
brew tap omerakben/code-oz
brew install omerakben/code-oz/code-oz

Platform support: macOS arm64, macOS x64, Linux x64, Linux arm64. Windows and Scoop are deferred to a future distribution milestone.

If npm install -g @tuel/code-oz fails with a 404 or authentication error, your ~/.npmrc is likely overriding the @tuel scope — see docs/TRUST.md § Install gotchas for the fix.

Why not just Claude Code or Codex?

Use Claude Code, Codex, Cursor, Gemini CLI, OpenCode, Roo Code, or Aider directly when you want the fastest possible agent loop.

Use code-oz when you want a governed loop.

Direct-agent workflow:

  1. Ask an agent to make a change.
  2. Inspect the result.
  3. Hope the prompt, tests, and review were enough.

code-oz workflow:

  1. Define the task as an artifact.
  2. Approve the artifact by SHA-256.
  3. Build in an isolated worktree.
  4. Verify evidence before review.
  5. Require independent review.
  6. Write a ledger of what happened.

code-oz is not trying to be a smarter coding model. It is the control layer around coding models.

What is real today?

| Area | Status | | ---------------- | -------------------------------------------------------------------- | | CLI commands | init, run, approve, doctor | | Lifecycle | DEFINE → PLAN → BUILD → VERIFY → REVIEW → SHIP for greenfield runs | | Gates | File-based gates with schema validation | | Approvals | SHA-256-bound approval artifacts | | Isolation | Worktree-per-run isolation | | Ledger | events.jsonl audit trail | | Demo provider | Deterministic FakeProvider | | Live providers | Claude CLI, Codex CLI, and xAI HTTP adapter | | xAI auth | XAI_API_KEY env var | | Install channels | curl script, npm package, Homebrew tap | | Platforms | macOS arm64, macOS x64, Linux arm64, Linux x64 | | Tests | 3395 offline tests in CI |

The provider contract is intentionally narrow. The alpha is about proving governed delivery, not supporting every agent on day one.

What is simulated or not ready yet?

| Area | Current state | | ------------------------------ | ----------------------------------------------------------- | | FakeProvider demo | Simulated model responses, real gates/artifacts/ledger | | Gemini | Stub provider in v0.1; not a working invocation adapter | | OpenCode / Roo Code | Future adapter candidates, not v0.1 providers | | Brownfield AUDIT runtime | Shipped in v0.21 (M17); proven by a deterministic full-cycle e2e, live brownfield smoke pending credentials | | Windows / Scoop | Deferred | | Apple signing / notarization | Deferred to v0.x stable; macOS may show Gatekeeper prompts | | GPG/Sigstore-signed checksums | Deferred to v0.x stable | | Full benchmark proof | Runner shipped in v0.21; code-oz Fake column measured, direct-agent and live columns pending credentials | | Broad multi-agent consultation | Deferred | | Cloud IAM adapters | Deferred |

Do not use the alpha as proof that one model writes better code than another. Use it to inspect whether the governed lifecycle works.

How is this different?

| Tool | Best for | What code-oz adds | | -------------------------- | ------------------------------------------ | --------------------------------------------------------------------------- | | Claude Code | Fast terminal coding with Claude | Repo-local gates, approvals, worktree isolation, ledger, independent review | | Codex CLI | Fast terminal coding with OpenAI models | Same governed lifecycle around Codex output | | Cursor | AI-native IDE workflow | External lifecycle governance outside the editor | | Aider | Terminal-native git-integrated pair coding | Multi-phase artifacts and cross-family review | | Gemini CLI / OpenCode / Roo Code | Direct AI coding loop | Future adapter candidates; not supported in v0.1 | | Qodo / Sonar | PR/code quality review | Earlier lifecycle gates before the PR review stage | | HivePipe / Devin / Factory | Managed agentic SDLC or agent workforce | Local-first, source-visible CLI runtime for owned repos |

Full footnote-sourced comparison: docs/comparisons/ai-coding-agents.md. Benchmark protocol and first measured rows (the deterministic code-oz Fake column): docs/benchmarks/agent-gate-bench.md. Direct-agent and live-provider columns require local credentials and are not yet measured.

code-oz is not a replacement for coding agents. It is a governed delivery loop around them.

Quick demo

git clone https://github.com/omerakben/code-oz.git
cd code-oz
bun install
bun run demo:todo-cli                # default (balanced)
bun run demo:todo-cli --effort lite  # multiplier 0.4

The demo runs one full lifecycle:

DEFINE → PLAN → BUILD → VERIFY → REVIEW → SHIP

Then inspect:

ls docs/demo/01-todo-cli/output/
cat docs/demo/01-todo-cli/output/balanced/events.jsonl | tail

The demo uses FakeProvider, so it is deterministic and token-free. The value is not that the fake model is smart. The value is that the same gates, approvals, worktree flow, and ledger mechanics are exercised every time.

Failure demo

Governance only matters if bad runs get blocked. The failure-gates demo runs five deterministic scenarios:

bun run demo:failure-gates

| Failure | Expected result | | ------------------------------------------------ | ------------------------------------------------ | | Tampered approved artifact | Gate refuses because SHA-256 no longer matches | | BUILD output changes files outside allowed scope | Mutation gate blocks the run | | VERIFY evidence fails | Run restarts or writes NEEDS_INTERVENTION.json | | REVIEW uses same provider family as BUILD | Cross-family review check refuses it | | Reviewer finds risky change | Run routes back to revision instead of SHIP |

Walkthrough: docs/demo/02-failure-gates/.

This is the demo to watch before trusting the tool.

Who is this for?

Use code-oz if:

  • you already use AI coding agents
  • you work in repos where mistakes matter
  • you want approval artifacts instead of chat transcripts
  • you want to compare builder and reviewer model families
  • you want a reproducible audit trail for AI-generated changes

Do not use code-oz yet if:

  • you want the fastest possible one-shot code generation
  • you need Windows support today
  • you need a polished enterprise SaaS dashboard
  • you need every provider family supported today
  • you are not willing to run an alpha

Provider setup

First-run CLI defaults to FakeProvider when no live provider is configured, so code-oz init && code-oz run can complete without spending provider tokens. Live Claude and Codex use their upstream CLI login sessions; xAI uses XAI_API_KEY. The separate code-oz-gui helper uses GEMINI_API_KEY for an in-app AI assistant; that key is not used by the CLI.

See docs/PROVIDER_SETUP.md for the single provider setup table and docs/contracts/PROVIDERS.md for the live / stub / future-candidate matrix.

Try it from source

git clone https://github.com/omerakben/code-oz.git
cd code-oz && bun install && bun test
bun run build:binary

mkdir /tmp/code-oz-smoke && cd /tmp/code-oz-smoke
~/Projects/code-oz/dist/code-oz init
~/Projects/code-oz/dist/code-oz doctor tools

Trust and security

  • Vulnerability reporting and artifact trust posture: SECURITY.md
  • Data boundaries, install trust, and what is and is not logged: docs/TRUST.md

Contributing

Roadmap

Public summary at docs/design/ROADMAP.md#now-next-later. The detailed milestone inventory follows in the same file.

  • Now (v0.20.1-alpha.0): first-run polish, truth-corrected provider claims, failure demo, security and community files, agent-gate benchmark protocol.
  • Next (v0.21.0-alpha.0): M17 AUDIT runtime for brownfield repos.
  • Later: signed checksums, broader provider adapters, Windows/Scoop, hosted launch artifacts.

Architecture and historical context

For the dense architecture (hybrid phase-graph, agentic sub-orchestration spine, 23 non-negotiable rules), product thesis, influence library, and the project's historical "AI software company" framing: docs/ABOUT.md.

Star this repo if...

Star code-oz if you think AI coding agents need more than clever prompts:

  • inspectable specs and plans
  • test evidence before review
  • independent model-family review
  • tamper-evident approvals
  • a ledger of what the agent did

Direct agents are fast. Governed agents are auditable.

License

MIT. See LICENSE.