agent-duet

v0.1.3

Published

2 months ago

Two AI coding agents (Claude + GitHub Copilot) in a peer-review loop on a shared project.

Downloads

0High
0Medium
0Low

nyzzi

ai claude copilot code-review agent cli llm

agent-duet

Two AI coding agents in a peer-review loop on a shared project.

agent-duet orchestrates a turn-by-turn exchange between any two of Claude (via the Claude Agent SDK), GitHub Copilot (via the copilot CLI, optionally as a Squad agent), and OpenAI Codex (via the codex CLI). One agent is the reviewer (read-only by default), the other is the implementer (read + write). The orchestrator pipes each turn's output into the next turn's prompt and writes a single Markdown transcript of the whole exchange.

Why

If you've ever fed Claude's review notes to Copilot by hand, then handed Copilot's reply back to Claude, then repeated that two more times — this automates that loop.

Prerequisites

Node.js ≥ 20
Anthropic credentials for the Claude Agent SDK — either ANTHROPIC_API_KEY in env, or an authenticated Claude Code install on the same machine (required if Claude is one of the two agents)
GitHub Copilot CLI installed and authenticated (copilot --version) — required if Copilot is one of the two agents
OpenAI Codex CLI installed and authenticated (codex --version) — required if Codex is one of the two agents
(Optional) Squad installed if you want --squad

Install

git clone <this repo>
cd agent-duet
npm install
npm run build

For local invocation without publishing:

node dist/index.js --help
# or
npm link    # then `agent-duet --help` from anywhere

Usage

Minimal — relies on defaults for everything except project and task:

agent-duet \
  --project ./my-app \
  --task "Review this codebase for quality and security issues."

Or directly from the build output:

node dist/index.js \
  --project ./my-app \
  --task "Review this codebase for quality and security issues."

Full form with every knob:

agent-duet \
  --project ./my-app \
  --task "Review this codebase for quality and security issues. Reviewer flags issues with file:line. Implementer fixes accepted issues, pushes back on disagreements." \
  --reviewer claude \
  --reviewer-mode read \
  --implementer copilot \
  --implementer-mode write \
  --squad \
  --max-rounds 6 \
  --stop-token APPROVED

Defaults: --reviewer claude --implementer copilot --reviewer-mode read --implementer-mode write --max-rounds 6 --stop-token APPROVED. Roles must be different agents.

Resuming a run

Pass --resume with the transcript file from a previous run; project, task, and other settings are recovered from its frontmatter:

node dist/index.js --resume transcripts\<run file name>.md --squad --yolo

What happens each round

Round 1 (reviewer turn): reviewer reads project, writes findings.
Round 2 (implementer turn): implementer receives the findings, makes edits, summarizes.
Round 3 (reviewer): reviewer validates the changes, flags anything still missing.
...continues until --max-rounds, the stop token appears on its own line at the end of an output, or a turn exits non-zero.

Read vs. write enforcement

Claude (read mode): the SDK is invoked with disallowedTools: ["Edit", "Write", "MultiEdit", "NotebookEdit"].
Claude (write mode): permissionMode: "acceptEdits", edit tools allowed.
Copilot (read mode): spawned as copilot --deny-tool write -p "...".
Copilot (write mode): spawned as copilot --allow-all-tools -p "...".
Codex (read mode): spawned as codex exec --ask-for-approval never --sandbox read-only - (prompt on stdin).
Codex (write mode): spawned as codex exec --ask-for-approval never --sandbox workspace-write -.

With --squad, the Copilot invocation becomes copilot --agent squad ....

Output

A single Markdown file at ./transcripts/run-<timestamp>.md (override with --transcript):

# agent-duet run

- Started: 2026-05-06T...
- Project: `./my-app`
- Reviewer: **claude** (read)
- Implementer: **copilot** (write)
- ...

## Task

...

## Round 1 — reviewer (claude, read)

<reviewer output>

## Round 2 — implementer (copilot, write)

<implementer output>

The same content is also streamed to stdout while the run is in progress.

Customizing role instructions

The defaults are tuned for a security/perf peer review, but you can override either side:

agent-duet \
  --project ./my-app \
  --task "Migrate Express middlewares to Fastify equivalents." \
  --reviewer-instructions "You are a Fastify expert. Identify each Express middleware and propose a Fastify replacement with the exact API mapping. Do not modify code." \
  --implementer-instructions "Apply the proposed migrations one at a time, running the test suite after each. Report what passed and what regressed."

Project layout

src/
  index.ts            CLI entry (commander)
  orchestrator.ts     the alternating-turn loop
  prompt.ts           per-turn prompt template
  transcript.ts       Markdown transcript writer
  adapters/
    types.ts          Adapter interface
    claude.ts         Claude Agent SDK adapter
    copilot.ts        copilot CLI subprocess adapter
    codex.ts          codex CLI subprocess adapter

Status

Fixed roles per run (reviewer / implementer). Supported agents: claude, copilot, codex. Roadmap: per-round role swapping, stricter tool filtering.

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

agent-duet

Why

Prerequisites

Install

Usage

Resuming a run

What happens each round

Read vs. write enforcement

Output

Customizing role instructions

Project layout

Status