airunx

v0.1.4

Published

a month ago

Vendor-neutral Agent Orchestrator for developer workflows with LangGraph, AGENTS.md, and multi-agent pipelines

0High
0Medium
0Low

jasonspalace

agent orchestrator langgraph cli workflow automation dev-tools mcp

AIRunX

Vendor-neutral Agent Orchestrator for developer workflows with LangGraph, AGENTS.md, and multi-agent pipelines

Overview

AIRunX is a thin, powerful CLI tool that orchestrates multi-agent development workflows. It executes end-to-end development tasks from GitHub issues, PRDs, or developer prompts using intelligent agent pipelines.

Key Features

🤖 Autonomous Pipelines: 4 execution pipelines with clear deliverables → Pull request
🔧 Execution Pipelines: thin, standard, feature, mission-critical
💓 Heartbeat Mode: Continuous agent that polls GitHub Issues as a task queue
🌳 Git Worktrees: Spawn parallel PRs from a single machine
📊 Dashboard: Real-time status, runtime, tokens used, heartbeat
🔌 Vendor-Neutral: Works with Claude Code, Codex, Cursor
💰 Execution Fidelity: Balance quality vs. cost with 4 fidelity levels (fast, standard, thorough, ultra)

Installation

npm install -g airunx

Or clone and install locally:

git clone https://github.com/digitalpyro/airunx.git
cd airunx
npm install
npm run build
npm link

Quick Start

Prerequisites

Install at least one CLI provider:

Option A: Claude Code (Recommended)

npm install -g @anthropic-ai/claude-code

Authentication options:

Subscription: claude login (uses your Claude Pro/Team quota)
API Key: export ANTHROPIC_API_KEY=sk-ant-... (pay-per-token)

Option B: OpenAI Codex

npm install -g @openai/codex

Authentication options:

Subscription: codex login (uses your ChatGPT Plus/Team quota)
API Key: export OPENAI_API_KEY=sk-... (pay-per-token)

Option C: Cursor CLI (experimental — least tested backend)

curl https://cursor.com/install -fsS | bash

Authentication options:

Subscription: cursor-agent login (uses your Cursor plan quota)
API Key: export CURSOR_API_KEY=... (pay-per-token)

Billing Note: Subscription auth uses your existing plan quota. API keys use pay-per-token billing to your respective API account.

GitHub Access (required for heartbeat mode and issue-based workflows):

Option A: gh CLI (recommended for local development)

brew install gh      # macOS
# or: sudo apt install gh  # Debian/Ubuntu
gh auth login

Option B: Token only (recommended for servers/CI — no gh CLI needed)

export GITHUB_TOKEN=ghp_...
# or load via --dotenv flag:
airunx heartbeat start --dotenv /path/.env --repo owner/repo

When gh CLI is not installed, AIRunX automatically falls back to the GitHub REST API using GITHUB_TOKEN or GH_TOKEN. All heartbeat operations (polling, labels, comments, issue assignment) work in token-only mode.

Token Permissions:

| Token Type | Required Scopes | | ---------------------- | --------------------------------------------- | | Classic token | repo (full control of private repositories) | | Fine-grained token | See permissions table below |

Fine-grained personal access token permissions (set per-repository or organization):

| Permission | Access Level | Used For | | ----------------- | -------------- | -------------------------------------------------------- | | Issues | Read and write | Poll tasks, assign/unassign, update labels, close issues | | Pull requests | Read and write | Create PRs, add labels, update descriptions | | Contents | Read and write | Clone repos, push branches, read files | | Metadata | Read-only | Repository info (granted automatically) |

Note: GH_TOKEN takes precedence over GITHUB_TOKEN, matching gh CLI behavior. For heartbeat server deployments, a fine-grained token scoped to specific repositories is recommended over a classic token for security.

Setup

Initialize your project:

airunx init

# If using API key auth (not subscription), load your .env first:
airunx init --dotenv .env

This will:

Detect available backends
Let you choose a default backend
Create .airunx/settings.json with configuration pointers
Create .airunx/config.yml with your backend configuration
Optionally create local AGENTS.md and pipelines.yaml for customization

Check system health:

airunx doctor

Configure GitHub authentication:

Option 1: GitHub CLI (recommended for local development)

gh auth login

Option 2: Environment Variable (recommended for CI/CD and servers)

export GITHUB_TOKEN=ghp_your_token_here

Required scopes: repo, read:org Create a token at: https://github.com/settings/tokens/new?scopes=repo,read:org

When both are configured, GITHUB_TOKEN takes precedence.

Configure other credentials (if needed):

cp .env.example .env
# Edit .env with your API keys

Run a workflow:

# From GitHub issue
airunx run https://github.com/user/repo/issues/123

# From PRD file
airunx run --prd ./specs/feature.md

# From prompt
airunx run "Add user authentication with JWT"

# With fidelity control (quality vs. cost)
airunx run --fidelity fast "Quick README update"
airunx run --fidelity ultra "Critical security patch"

# With runtime configuration flags
airunx run --prd ./specs/feature.md                    # External PRD file
airunx run --prd https://example.com/prd.md            # Remote PRD URL
airunx run --project ~/projects/new-app "Add auth"     # Target different project
airunx run --context ./extra-context.md "Add feature"  # Additional context

# Combine flags
airunx run --prd ./prd.md --project ~/projects/app --context ./design-context.md

Check status:

airunx status

Server Deployment

For server/production deployments without interactive setup:

Create settings file:

mkdir -p ~/.airunx
cat > ~/.airunx/settings.json << 'EOF'
{
  "workspace_location": "/opt/airunx/repos",
  "env_file_location": "/etc/airunx/.env",
  "default_backend": "claude-code"
}
EOF

Create env file:

sudo mkdir -p /etc/airunx
sudo cat > /etc/airunx/.env << 'EOF'
GITHUB_TOKEN=ghp_your_production_token
ANTHROPIC_API_KEY=sk-ant-your_production_key
EOF
sudo chmod 600 /etc/airunx/.env

Run with GitHub issue URL:

airunx run https://github.com/owner/repo/issues/123
# Repo will be auto-cloned to /opt/airunx/repos/owner/repo

No airunx init required - just copy settings into place.

Run heartbeat mode:

airunx heartbeat start --repo owner/repo
# Repos auto-clone when processing issues

Clean up stale repos:

airunx cleanup workspace --older-than 30 --dry-run
airunx cleanup workspace --older-than 30

CLI Reference

Global Options

airunx --version    # Show version number
airunx --help       # Show help
airunx --debug      # Enable debug output (secrets are redacted)
airunx --quiet      # Suppress non-essential output (errors always shown)

--debug and --quiet are mutually exclusive and available on all subcommands.

Commands

`airunx run`

Execute an orchestrated workflow from a GitHub issue, PRD, or prompt.

airunx run [input] [options]

Arguments: | Argument | Description | |----------|-------------| | input | GitHub issue URL, file path, or inline prompt (optional if --prd provided) |

Options: | Option | Description | Default | |--------|-------------|---------| | -m, --mode <mode> | Orchestration mode: pipeline, raw | pipeline | | -w, --worktree <name> | Git worktree name | Auto-generated | | -b, --base-branch <branch> | Base branch for worktree | Auto-detected | | -f, --fidelity <level> | Execution fidelity: fast, standard, thorough, ultra | standard | | -p, --pipeline <name> | Pipeline to use from pipelines.yaml | standard | | --project <path> | Target project directory (or project name from folders) | Auto-detected | | --prd <path> | Path or URL to PRD file | - | | --context <path> | Path to additional context file | - | | --backend <backend> | Override configured backend: claude-code, cursor, codex | From config | | --format <format> | Output format: human, json | human | | --no-dashboard | Disable dashboard output | - | | --skip-version-check | Skip checking if AIRunX is up-to-date with remote | - | | --keep-worktree | Keep the worktree after execution (do not clean up) | - | | --no-sync | Skip syncing source branch with remote before worktree creation | - | | --from-branch <branch> | Use specified branch as the source for the new worktree, overriding the default base branch | - | | --workspace <path> | Directory for cloning repos (server deployments) | ~/.airunx/repos | | --dotenv <path> | Path to .env file for environment variables | Auto-detected | | --no-pr | Skip PR creation even on successful completion | false | | -v, --verbose | Show detailed execution progress (stages, timing, spawn counts) | false | | --ci-verification-gate | After PR creation, poll GitHub Actions and iterate on CI failures (up to 3 attempts). Overrides ci_verification_gate setting | false |

Examples:

# From GitHub issue
airunx run https://github.com/user/repo/issues/123

# From PRD file
airunx run --prd ./specs/feature.md

# With fidelity control
airunx run --fidelity ultra "Critical security patch"

# Skip version check for faster startup
airunx run --skip-version-check "Quick fix"

# Keep worktree after execution for investigation
airunx run --keep-worktree "Debug this issue"

# Skip syncing with remote (offline work or when local is current)
airunx run --no-sync "Quick local test"

# Work from a feature branch instead of main
airunx run --from-branch feature/api-v2 "Add new endpoint"

# Combine: use feature branch without syncing
airunx run --from-branch feature/wip --no-sync "Continue work"

# Full example with multiple options
airunx run --prd ./prd.md --project ~/projects/app --fidelity thorough --skip-version-check

`airunx init`

Initialize a project with AIRunX configuration.

airunx init [options]

Options: | Option | Description | |--------|-------------| | -f, --force | Overwrite existing configuration files | | -r, --reset | Clear cache and state files before initialization | | --workspace <path> | Set workspace location for cloning repos (server deployments) | | --from-workspace <path> | Import folders from VSCode .code-workspace file | | --dotenv <path> | Load environment file before backend detection (for API key auth) |

What it does:

Detects available backends (Claude Code, Cursor, Codex)
Creates .airunx/settings.json with configuration pointers
Creates .airunx/config.yml with backend configuration
Optionally creates local AGENTS.md and pipelines.yaml for customization

`airunx status`

Show status of running workflows.

airunx status [options]

Options: | Option | Description | |--------|-------------| | -a, --all | Show all workflows including completed |

`airunx doctor`

Check backend availability and system configuration.

airunx doctor

What it checks:

Node.js version (>= 18.0.0 required)
Environment variables
Backend availability (Claude Code, Cursor, Codex)
GitHub CLI authentication
Git repository and main branch
System configuration validity
Pipeline configuration
Fidelity cost estimates

`airunx circuit`

Manage circuit breaker state for backends. The circuit breaker prevents repeated calls to failing backends.

airunx circuit [command]

Subcommands: | Command | Description | |---------|-------------| | status | Show circuit breaker status for all backends (default) | | reset <backend> | Reset circuit breaker for a specific backend |

Examples:

# View circuit breaker status
airunx circuit
airunx circuit status

# Reset a tripped circuit breaker
airunx circuit reset claude-code

`airunx cleanup`

Clean up AIRunX resources including worktrees, debug files, and state.

airunx cleanup [command] [options]

Subcommands: | Command | Description | |---------|-------------| | run | Run cleanup of AIRunX resources (default) | | list | List debug files | | workspace | Clean up cloned repos in workspace directory |

Options for cleanup run: | Option | Description | |--------|-------------| | --worktrees | Clean up orphan worktrees | | --debug-files [days] | Clean up debug files older than N days (default: 7) | | --workflow-state | Clean up workflow state files | | --todos | Clean up todo files | | --all | Clean up all resource types | | -f, --force | Force cleanup even with uncommitted changes | | --workflow-id <id> | Clean up specific workflow by ID | | --project <path> | Target project directory for cleanup | | --dry-run | Preview what would be cleaned without making changes |

Examples:

# Clean up all resources
airunx cleanup --all

# Clean up only old debug files
airunx cleanup --debug-files 14

# Preview cleanup without making changes
airunx cleanup --all --dry-run

# Force cleanup of orphan worktrees
airunx cleanup --worktrees --force

# List debug files
airunx cleanup list

`airunx validate`

Validate configuration files against their schemas.

airunx validate [files...] [options]

Arguments: | Argument | Description | |----------|-------------| | files | Specific files to validate (optional, validates all if omitted) |

Options: | Option | Description | |--------|-------------| | -l, --lenient | Use lenient mode (warnings instead of errors) | | -j, --json | Output results in JSON format (for CI/CD) | | -a, --all | Validate all project configuration files | | -g, --global | Include global configuration files | | -t, --type <type> | Force file type: settings, pipelines, agents, system-config | | --schema | Show schema documentation for config files |

Examples:

# Validate all project config files
airunx validate --all

# Validate specific file
airunx validate .airunx/settings.json

# Validate with lenient mode (warnings instead of errors)
airunx validate --lenient .airunx/pipelines.yaml

# JSON output for CI/CD integration
airunx validate --all --json

# Include global config files
airunx validate --all --global

# Show schema documentation
airunx validate --schema
airunx validate --schema --type pipelines

What it validates:

settings.json - Configuration pointers and settings
pipelines.yaml - Pipeline definitions with agent roles
AGENTS.md - Agent role definitions (markdown format)
config.yml - System configuration with backend routing

Validation modes:

Strict (default): Errors cause validation to fail
Lenient (--lenient): Invalid enum values become warnings with suggestions

`airunx agents-validate`

Interactive AGENTS.md validation with model checking and typo suggestions.

airunx agents-validate [path] [options]

Arguments: | Argument | Description | |----------|-------------| | path | Path to AGENTS.md file (optional, auto-detected) |

Options: | Option | Description | |--------|-------------| | --fix | Automatically apply suggested fixes | | --non-interactive | Run without prompts (for CI/CD) | | --no-cost-warnings | Disable cost warnings for expensive models | | -l, --lenient | Use lenient mode (warnings instead of errors) | | -j, --json | Output results in JSON format |

Examples:

# Validate auto-detected AGENTS.md
airunx agents-validate

# Validate specific file
airunx agents-validate ./custom/AGENTS.md

# Auto-fix typos in model names
airunx agents-validate --fix

# CI/CD mode with JSON output
airunx agents-validate --non-interactive --json

What it validates:

Agent role names (with typo suggestions)
Provider field values (claude-code, cursor, codex)
Model field values (opus, sonnet, haiku, gpt-4o, o1, etc.)
Required fields (Purpose)

Model validation features:

Detects typos using Levenshtein distance (e.g., sonnett → sonnet)
Interactive prompts to fix invalid values
Cost warnings for expensive models (opus, o1)
Auto-fix mode for CI/CD pipelines

`airunx heartbeat`

Manage the heartbeat execution mode. Heartbeat mode continuously polls GitHub Issues with the airunx:pending label and dispatches each task to airunx run as an isolated child process.

airunx heartbeat <command> [options]

Subcommands: | Command | Description | |---------|-------------| | start | Start the heartbeat process | | stop | Stop the running heartbeat process | | status | Show heartbeat status | | recover | Recover orphaned tasks (tasks stuck in airunx:running) |

Options for heartbeat start: | Option | Description | Default | |--------|-------------|---------| | -r, --repo <repo> | Target repository (owner/repo format) | Auto-detected | | --backend <backend> | Backend to use: claude-code, cursor, codex | From config | | --interval <ms> | Polling interval in milliseconds | 10000 | | --pipeline <name> | Default pipeline for tasks | standard | | -f, --fidelity <level> | Execution fidelity: fast, standard, thorough, ultra | From config | | --dotenv <path> | Path to .env file (loaded before all operations) | Auto-detected | | --context <path> | Path to context file or directory (forwarded to each run) | From config | | --idle-cycles <n> | Idle cycles before entering pause mode | 6 | | --idle-pause <ms> | Pause duration when idle (ms) | 60000 | | --ci-verification-gate | After PR creation, poll GitHub Actions and iterate on CI failures up to 3 attempts (forwarded to each run) | false |

Options for heartbeat status: | Option | Description | Default | |--------|-------------|---------| | --audit | Show recent audit log entries | - | | --limit <n> | Number of audit entries to show | 10 |

Options for heartbeat recover: | Option | Description | Default | |--------|-------------|---------| | -r, --repo <repo> | Target repository (owner/repo format) | Auto-detected | | --force-unlock | Force remove stale lockfile | - |

Examples:

# Start heartbeat in current repo
airunx heartbeat start

# Start with specific backend and polling interval (60 seconds)
airunx heartbeat start --backend claude-code --interval 60000

# Target a specific repository
airunx heartbeat start --repo user/my-project

# Server deployment with env file, fidelity, and context
airunx heartbeat start --repo owner/repo --dotenv /var/app/.env --fidelity standard --context ./context.md

# Enable CI verification gate (polls GH Actions and iterates on PHPCS/PHPUnit failures)
airunx heartbeat start --repo owner/repo --dotenv /var/app/.env --fidelity thorough --ci-verification-gate

# Check if heartbeat is running
airunx heartbeat status

# Stop the heartbeat
airunx heartbeat stop

# Recover tasks stuck in running state
airunx heartbeat recover

How it works:

Polls GitHub Issues with airunx:pending label
Claims task using optimistic concurrency (assign-then-verify)
Transitions label to airunx:running
Posts heartbeat timestamps to issue comments
Spawns airunx run <issue-url> --format json as an isolated child process
Enforces child process timeout (default 300 min, configurable via AIRUNX_CHILD_TIMEOUT_MS) with SIGTERM → SIGKILL escalation
On completion with PR: labels airunx:completed, posts summary with cost/token/runtime metrics (issue closes when PR is merged via Closes keyword)
On completion without PR: labels airunx:failed, keeps issue open (prevents false completions)
On failure: labels airunx:failed, posts error details with cost/token/runtime metrics

Task Queue Labels: | Label | State | |-------|-------| | airunx:pending | Ready for pickup | | airunx:running | Currently executing | | airunx:completed | Successfully finished | | airunx:failed | Execution failed |

Priority Labels (optional): | Label | Priority | |-------|----------| | airunx:priority:p1 | Critical — processed first | | airunx:priority:p2 | High | | airunx:priority:p3 | Normal (default when no label) |

Creating Tasks: Create a GitHub Issue with the airunx:pending label. Optionally specify a pipeline:

<!-- airunx:pipeline:feature -->

## Task Description

Add user authentication with JWT support...

Opt-in Documentation Generation: To have AIRunX generate documentation (README updates, changelogs, API docs) for an issue, add a checkbox to the issue body:

- [ ] Generate documentation

If this checkbox is absent, the docs-generator stage is skipped. This keeps bug fixes and small tasks fast. The checkbox is case-insensitive and works in any pipeline that includes the document stage (standard, feature, mission-critical).

Automatic Checkbox Marking

When a task completes, AIRunX automatically checks off - [ ] items in the source GitHub issue that match completed work. The judge agent evaluates which checkboxes were satisfied during implementation.

This is enabled by default. Disable globally via the mark_issue_checkboxes setting.

Pipelines

AIRunX uses an autonomous pipeline architecture where each pipeline runs end-to-end and produces a concrete deliverable.

Pipeline Types

| Pipeline | Deliverable | Fidelity | Max Iterations | Timeout | Use Case | | -------------------- | ------------ | -------- | -------------- | ------- | -------------------------- | | thin | Pull Request | fast | 2 | 30 min | Quick fixes, prototypes | | standard | Pull Request | standard | 2 | 60 min | Regular development | | feature | Pull Request | thorough | 5 | 120 min | Important features | | mission-critical | Pull Request | ultra | 15 | 480 min | Security, critical systems |

Pipelines vs. Fidelity: Pipelines define which stages run and how many iterations are allowed. Fidelity controls how deeply each stage works (model selection, verification depth, review rigor). A pipeline's max_iterations caps the fidelity's review iteration count. For example, the standard pipeline (max 2 iterations) with thorough fidelity (8 review iterations) will run at most 2 iterations.

Execution Pipelines

Execution pipelines transform issues into pull requests:

GitHub Issue/PRD/Prompt
    ↓
Orchestrator → Plans implementation
    ↓
Dev (Strategic) → Designs architecture
    ↓
Test Creator → Designs test strategy
    ↓
Reviewer → Quality assurance
    ↓
Dev (Implementation) → Writes code
    ↓
Static Analyzer → Lint, type check
    ↓
Judge → ITERATE or PROCEED decision
    ↓
Docs → Documentation (opt-in via checkbox, standard+ only)
    ↓
Pull Request (commit & PR created post-graph)

Choosing a Pipeline:

thin: For small changes where speed matters. Limited iterations prevent over-engineering.
standard: Default choice for most development work. Balanced quality and speed.
feature: For production-ready features. Enhanced review cycles ensure quality.
mission-critical: For security patches and critical infrastructure. Maximum quality gates.

Usage:

# Quick fix with thin pipeline
airunx run "Fix typo in README" --pipeline thin

# Important feature with feature pipeline
airunx run https://github.com/user/repo/issues/456 --pipeline feature

# Critical security fix with mission-critical pipeline
airunx run "Fix SQL injection vulnerability" --pipeline mission-critical

Default Pipeline Stages

All pipelines build on the thin base pipeline. The developer agent runs in two stages — first as strategist, then as implementer — with review in between.

Thin Pipeline (fast fidelity, max 2 iterations):

orchestrate → strategize → test_design → review → implement → analyze → judge
     │            │             │           │          │          │        │
orchestrator  developer    test-creator  code-     developer  static-  code-judge
                                        reviewer             analyzer

Standard Pipeline (standard fidelity, max 2 iterations):

Same as thin + docs-generator (document) stage after judge

Feature Pipeline (thorough fidelity, max 5 iterations):

Same as standard + code-reviewer (enhanced_review) after analyze
                  + static-analyzer (static_analysis) after enhanced_review

Mission-Critical Pipeline (ultra fidelity, max 15 iterations):

Same as feature + code-reviewer (security_review) after enhanced_review

Iteration behavior: On ITERATE, the judge clears stages from implement onward. The strategize, test_design, and review outputs are retained — the implement stage sees reviewer feedback on each iteration without re-running earlier stages.

Stage Skip Conditions

Use skip_condition in pipelines.yaml to control when stages execute based on iteration context:

| Pattern | Description | | --------------------- | -------------------------------------------------------------- | | iteration.interim | Skip on iterations 2 through N-1 (runs on first and last only) | | iteration.first | Skip on first iteration | | !iteration.first | Skip unless it is the first iteration | | iteration.last | Skip on final iteration | | !iteration.last | Skip unless final iteration | | !iteration.interim | Skip on first and last iterations (runs on interim only) | | stage_name.success | Skip if named stage succeeded | | !stage_name.success | Skip if named stage failed |

Example: Skip static analysis on interim iterations to save tokens:

# pipelines.yaml
stages:
  - name: static-analysis
    agent: static-analyzer
    skip_condition: 'iteration.interim'

This is useful for expensive operations that provide diminishing returns when run repeatedly. The stage runs on:

Iteration 1 (first): Catch initial issues
Iteration N (last): Final verification before PR

Iterations 2 through N-1 skip the stage to reduce token usage.

Heartbeat Execution Mode

Heartbeat mode enables continuous, autonomous operation where AIRunX monitors GitHub Issues as a task queue. This is ideal for CI/CD integration, team workflows, and always-on development environments.

Architecture

GitHub Issues (airunx:pending)
         ↓
    Heartbeat Process
         ↓
┌────────────────────────┐
│  Poll → Checkout →     │
│  Execute → Complete    │
│         ↻              │
└────────────────────────┘
         ↓
    GitHub Issues (airunx:completed)

Task Lifecycle

| Stage | Label | Description | | ----------- | ------------------ | -------------------------- | | Queued | airunx:pending | Task ready for pickup | | Claimed | airunx:running | Agent assigned, executing | | Done | airunx:completed | Task finished successfully | | Error | airunx:failed | Task failed with error |

Optimistic Concurrency

Multiple heartbeat instances can run safely across machines. Task checkout uses an assign-then-verify pattern:

Optimistically assign issue to self
Verify assignment succeeded (another agent may have won)
Update labels atomically
Rollback on any failure

This prevents race conditions without requiring external locks.

Process Lock

Only one heartbeat instance runs per machine. A lockfile (.airunx-state/heartbeat.lock) contains the PID and is used to:

Detect stale locks from crashed processes
Prevent duplicate instances
Enable graceful shutdown via airunx heartbeat stop

Audit Logging

All heartbeat operations are logged to .airunx-state/audit/audit-YYYY-MM-DD.jsonl (daily rotation) for debugging and compliance:

{"timestamp":"2026-03-26T10:00:00Z","event":"heartbeat_start","details":{"config":{...}}}
{"timestamp":"2026-03-26T10:00:30Z","event":"task_claimed","workflowId":"heartbeat-...","details":{"taskId":"123","title":"..."}}
{"timestamp":"2026-03-26T10:05:00Z","event":"task_completed","workflowId":"heartbeat-...","details":{"taskId":"123","prUrl":"..."}}
{"timestamp":"2026-03-26T10:05:00Z","event":"task_completed_no_pr","workflowId":"heartbeat-...","details":{"taskId":"456"}}

Agent-Native Tools

Heartbeat mode provides tools that agents can use to control their own workflow:

| Tool | Purpose | | ------------------- | ----------------------------------------------------------------------- | | complete_task | Signal task completion with summary and deliverables | | list_pipelines | Discover available pipelines and their configurations | | query_audit | Query past execution events for learning from history | | iteration_history | Get history of previous iterations to avoid repeating failed approaches |

These tools shift control from rigid orchestration to agent-driven decisions, enabling more intelligent task handling.

iteration_history

Query previous iteration attempts to avoid repeating failed approaches.

Parameters: None

Returns:

List of failed approaches with their gaps
Recurring gaps (issues that appeared 2+ times)

Usage: Agents should call this tool at the start of an iteration (after iteration 1) to understand what has been tried and what gaps persist. Recurring gaps should be addressed first as they indicate systemic issues.

Circuit Breaker

The heartbeat includes a circuit breaker to handle backend failures gracefully:

Closed: Normal operation
Open: Backend failing, skip requests
Half-Open: Testing if backend recovered

Backoff uses exponential delay with jitter to prevent thundering herd.

Compound Engineering

AIRunX uses the Compound Engineering plugin by Every directly. The CE plugin provides 17+ specialized review agents, workflow skills, and multi-agent delegation for Claude Code sessions. AIRunX also reimplements CE's sub-agent delegation patterns in its own adapter layer (CompoundEngineeringAdapter) so they work within automated pipelines.

The core philosophy: each unit of engineering work should make subsequent units easier — not harder. See the CE plugin repo for full documentation on the methodology.

How AIRunX Applies CE Patterns

Research-Driven Planning — Before implementation, the system conducts multi-layered research:

Codebase analysis: Identifies existing patterns and conventions
Documentation review: Analyzes framework guidance and best practices
Pattern extraction: Generates context matching your repository's style

In AIRunX, scoping and planning are handled externally via Claude Code's planning workflows or PRDs, which are then fed into execution pipelines as GitHub issues.

Todo System — AIRunX implements a filesystem-based task tracker in .airunx-state/todos/:

pending → in_progress → completed
                     ↘ blocked

Review findings become trackable todos, each iteration captures learnings, and completed work informs future planning. Features include hierarchical task decomposition, thread-safe file locking, rollup generation for PR descriptions, and metadata tracking.

Iteration History — The orchestrator records each iteration attempt so agents avoid repeating failed approaches:

Agent executes implementation
Judge evaluates and returns {decision, reason, gaps}
IterationHistory.record() captures the attempt
On subsequent iterations, agents query history via the iteration_history tool
Recurring gaps are surfaced for priority resolution

Sub-Agent Delegation — When using the Claude Code backend, agents can delegate research to specialized sub-agents (architecture-strategist, pattern-recognition-specialist, framework-docs-researcher). This is configurable per-agent via Provider-Config in AGENTS.md. Disable globally with disable_compound_engineering: true in settings.json.

Git Worktrees — Develop features in isolated branches and run parallel PR workflows from a single machine:

airunx run "Add feature" --worktree feature-auth

Architecture

Agent Pipeline

Orchestrator ← Plans implementation
    ↓
Dev (Strategic) ← Problem solving, architecture
    ↓
Test Creator ← Test strategy design
    ↓
Reviewer ← Quality assurance
    ↓
Dev (Implementation) ← Code changes
    ↓
Static Analyzer ← Linting, type checking
    ↓
Judge ← Quality evaluation, iterate decision
    ↓
Docs ← Documentation updates (standard+ only)
    ↓
PR Creation

Technology Stack

LangGraph: Orchestration backbone (DAG/state management)
AGENTS.md: Convention-based agent configuration
TypeScript: Type-safe implementation
Commander: CLI framework
Vitest: Testing framework
MCP: Model Context Protocol integrations

Execution Fidelity

AIRunX supports 4 fidelity levels to balance quality with cost:

| Level | Cost vs. Standard | Use Case | Review Iterations | Verification | | ------------ | ----------------- | ----------------------------------- | ----------------- | ------------------------------- | | fast | -70% | Quick changes, docs, prototyping | 1 | Disabled | | standard | Baseline | Regular development, bug fixes | 5 | Enabled | | thorough | +50% | Important features, production code | 8 | Enabled | | ultra | +200% | Critical infrastructure, security | 15 | Enabled + Multi-model consensus |

Note: Review Iterations is the maximum number of review passes the fidelity level allows. The actual iteration count is capped by the pipeline's max_iterations setting. For example, the standard pipeline (max 2 iterations) with thorough fidelity (8 review iterations) will run at most 2 iterations.

Usage

# Use fast fidelity for quick iterations
airunx run --fidelity fast "Fix typo in README"

# Use ultra for critical changes
airunx run --fidelity ultra "Update payment processing"

# View cost estimates
airunx doctor

Fidelity can be configured globally via default_fidelity in settings.json or per-run with --fidelity.

Approval Modes

AIRunX supports 3 approval modes to control CLI adapter permission handling:

| Mode | Description | Adapter Flags | | ---------- | ----------------------------------- | -------------------------------------------------------------------------------------------------------------------- | | manual | Interactive prompts for all actions | No auto-approval flags | | auto | Auto-approve all actions (default) | Codex: --full-auto, Claude Code: --dangerously-skip-permissions, Cursor: No flags | | yolo | Skip all approvals and sandboxing | Codex: --dangerously-bypass-approvals-and-sandbox, Claude Code: --dangerously-skip-permissions, Cursor: No flags |

Configuration

Set in settings.json:

{
  "approval_mode": "auto"
}

Security Controls

AIRunX uses different enforcement mechanisms depending on the backend. No single mechanism covers all backends — each relies on the controls its CLI provides.

Claude Code:

--disallowedTools glob patterns block git writes and gh mutations at the tool level before execution
Agents get AGENT_DISALLOWED_TOOLS (git writes + gh writes blocked); orchestrators get ORCHESTRATOR_DISALLOWED_TOOLS (git writes only, gh access retained for PR/issue management)
In pipeline mode, all roles get full restrictions — the pipeline executor handles PR creation after stages complete
Pattern syntax: Bash(git*push*) matches any bash command starting with git containing push
Known limitation: glob patterns can be evaded via shell wrappers (e.g., bash -c "git push"). For defense in depth, install a Claude Code pre-tool-call hook that performs semantic command parsing

Codex:

--sandbox workspace-write restricts filesystem access in pipeline mode
No --disallowedTools equivalent exists — Codex has no tool-level command blocking
A pre-push git hook is installed in worktrees to block agent-initiated pushes (bypassed by the pipeline executor via AIRUNX_ALLOW_PUSH=1)

Cursor:

Prompt-level restrictions only — agents are instructed not to run git/gh write commands (not enforced at tool level, though environment-level git hooks still apply)
No sandbox, no native hook system, no --disallowedTools equivalent

All backends:

Pre-push git hooks in worktrees block direct git push by agents
DANGEROUS_FILE_PATTERNS filters sensitive files at staging time (complementary to runtime command blocking)
categorizeError() classifies blocked commands as NonRetryableError so the orchestrator does not retry them

Security Recommendations

Local Development: Use auto mode (default) — provides automation with workspace sandboxing
CI/CD Pipelines: Use auto mode with containerization
Never use yolo: Unless in completely isolated, disposable environments
Claude Code users: Consider adding a pre-tool-call hook for evasion-resistant command blocking
Codex users: Always run in pipeline mode to get sandbox restrictions

Responsible Disclosure

If you discover a security vulnerability, please report it privately via GitHub Security Advisories rather than opening a public issue. We aim to acknowledge reports within 48 hours and provide a fix or mitigation within 7 days.

PR Customization

Customize how PR titles and labels are generated through settings.json.

PR Title Rules

Configure keyword-to-commit-type mappings for conventional commit title generation:

{
  "gh_pr_title_rules": {
    "fix": ["fix", "bug", "broken", "issue", "error"],
    "feat": ["add", "implement", "create", "feature", "new"],
    "docs": ["doc", "readme", "documentation"],
    "refactor": ["refactor", "restructure", "reorganize"],
    "test": ["test", "spec", "coverage"],
    "chore": ["chore", "config", "deps", "ci", "build"],
    "perf": ["perf", "performance", "optimize", "speed"],
    "style": ["style", "format", "lint"],
    "default": "feat"
  }
}

| Key | Description | | ---------------------------------- | ------------------------------------------------------ | | Commit types (e.g., fix, feat) | Array of keywords that trigger that commit type | | default | Fallback type when no keywords match (default: feat) |

If gh_pr_title_rules is null, uses built-in defaults.

PR Label Rules

Configure which labels are automatically applied to PRs:

{
  "gh_pr_label_rules": {
    "always": ["ai-generated"],
    "include_backend": true,
    "include_fidelity": true,
    "custom": ["needs-review"]
  }
}

| Field | Type | Default | Description | | ------------------ | ---------- | ------------------ | ---------------------------- | | always | string[] | ["ai-generated"] | Labels always applied to PRs | | include_backend | boolean | true | Add backend-{name} label | | include_fidelity | boolean | true | Add fidelity-{level} label | | custom | string[] | [] | Additional labels to apply |

If gh_pr_label_rules is null, uses built-in defaults.

PR Body Template

Customize the PR body using a Handlebars template file. AIRunX looks for templates in this priority order:

Project: .airunx/github_pull_request_template.md
Global: ~/.airunx/github_pull_request_template.md
Default: Built-in template

Available Template Variables

| Variable | Type | Description | | ------------------- | --------- | ----------------------------------------------------- | | taskDescription | string | Full task description from issue/PRD/prompt | | taskTitle | string | Short task title | | fidelityLevel | string | Fidelity level used (fast, standard, thorough, ultra) | | backend | string | Primary backend used (claude-code, cursor, codex) | | iterationCount | number | Number of iterations completed | | maxIterations | number | Maximum iterations allowed | | runtime | string | Total execution time (formatted) | | totalTokens | string | Total tokens used (formatted with commas) | | totalCost | string | Estimated cost (numeric string, e.g., "1.23") | | todos | string | Completed todos as markdown checkboxes | | testCommands | string | Auto-detected test commands for the project | | testPlan | string? | Custom test plan if provided by agent | | stagePerformance | array? | Per-stage metrics (see below) | | hasProviderInfo | boolean | Whether stages have provider info | | providerBreakdown | array? | Per-provider aggregated costs (multi-provider only) | | iterationHistory | string? | Iteration summary |

Stage Performance Data

When stagePerformance is present, each entry contains:

| Field | Type | Description | | ---------- | --------- | -------------------------------------------------- | | name | string | Stage name | | provider | string? | Backend that executed the stage | | tokens | string | Tokens used (formatted) | | cost | string | Cost for this stage (numeric string, e.g., "2.50") | | runtime | string | Stage duration (e.g., "2m 30s") | | status | string | Success indicator (✅ or ❌) |

Provider Breakdown Data

When multiple providers are used, providerBreakdown contains aggregated stats:

| Field | Type | Description | | ---------- | -------- | ----------------------------------------- | | provider | string | Provider name | | stages | number | Number of stages using this provider | | tokens | string | Total tokens (formatted) | | cost | string | Total cost (numeric string, e.g., "5.00") |

Example Template

## Summary

{{taskDescription}}

--- ## Execution Metrics | Metric | Value | |--------|-------| | **Backend** |
{{backend}}
| | **Fidelity** |
{{fidelityLevel}}
| | **Iterations** |
{{iterationCount}}/{{maxIterations}}
| | **Runtime** |
{{runtime}}
| | **Tokens** |
{{totalTokens}}
| | **Estimated Cost** | \${{totalCost}}* |

<sub>*Cost uses actual token breakdown when available, otherwise estimates 50/50
  input/output split</sub>

{{#if stagePerformance}}
  ### Stage Performance

  {{#if hasProviderInfo}}
    | Stage | Provider | Tokens | Cost | Runtime | Status |
    |-------|----------|--------|------|---------|--------|
    {{#each stagePerformance}}
      |
      {{this.name}}
      |
      {{this.provider}}
      |
      {{this.tokens}}
      | \${{this.cost}}
      |
      {{this.runtime}}
      |
      {{this.status}}
      |
    {{/each}}
  {{else}}
    | Stage | Tokens | Cost | Runtime | Status |
    |-------|--------|------|---------|--------|
    {{#each stagePerformance}}
      |
      {{this.name}}
      |
      {{this.tokens}}
      | \${{this.cost}}
      |
      {{this.runtime}}
      |
      {{this.status}}
      |
    {{/each}}
  {{/if}}
{{/if}}

{{#if providerBreakdown}}
  ### Provider Breakdown | Provider | Stages | Tokens | Cost |
  |----------|--------|--------|------|
  {{#each providerBreakdown}}
    |
    {{this.provider}}
    |
    {{this.stages}}
    |
    {{this.tokens}}
    | \${{this.cost}}
    |
  {{/each}}
{{/if}}

--- ## Todos Completed

{{todos}}

--- [![AIRunX Powered](https://img.shields.io/badge/AIRunX-Powered-black?style=flat-square&labelColor=1E3A8A)](https://github.com/digitalpyro/airunx) _Generated by AIRunX_

The signature badge is controlled by the airunx_signature setting (default: true).

Template Tips

Use {{#if variable}}...{{/if}} for conditional sections
Use {{#each array}}...{{/each}} to iterate over arrays
Use {{{variable}}} (triple braces) for raw HTML/markdown output
Use \$ to escape dollar signs in markdown tables

Multi-Provider Architecture

AIRunX supports multiple LLM providers with intelligent per-role routing. This prevents "model self-deception" where a single provider reviews its own outputs.

Supported Providers

| Provider | Strengths | | ------------------------------- | ----------------------------------------- | | Claude Code (claude-code) | Reasoning, planning, tool use | | Cursor CLI (cursor) | Code understanding, IDE integration | | OpenAI Codex (codex) | Fast code generation, independent reviews |

Per-Role Provider Configuration

Configure providers in AGENTS.md for each role:

### orchestrator

- Purpose: Coordinate the pipeline, make decisions on workflow mode
- Provider: claude-code
- Fallback-Provider: codex

### code-reviewer

- Purpose: Code review and quality assurance
- Provider: codex
- Provider-Rationale: Independent perspective prevents self-bias

Or configure globally in config.yml:

agent_routing:
  orchestrator: claude-code
  developer: cursor
  code-reviewer: codex

backends:
  claude-code:
    type: cli
    executable: claude
    model: sonnet # Default model for this backend
    timeout: 300
  cursor:
    type: cli
    executable: cursor-agent
    model: gpt-4o # Default model for this backend
    timeout: 300
  codex:
    type: cli
    executable: codex
    model: gpt-4o # Default model for this backend
    timeout: 300

Provider Routing Priority

1. AGENTS.md `Provider` and `Fallback-Provider` fields (highest)
2. config.yml `agent_routing`
3. config.yml `fallback_backend` (lowest)

Model Resolution Priority

1. config.yml backend `model` field (highest - enforces org-wide policies)
2. AGENTS.md `Model` field (per-agent preferences)
3. Adapter default model (lowest)

Runtime Configuration Flags

AIRunX supports runtime configuration flags to work on external projects and use PRDs from various sources.

--project

Override the default project location to work on external projects:

# Work on a different project
airunx run --project ~/projects/my-app "Add user authentication"

# Combined with other flags
airunx run --project /path/to/project --prd ./specs/auth-prd.md

Priority: CLI --project > default_project_location in settings.json > current directory

--prd

Specify an external PRD (Product Requirements Document) file or URL:

# Local file
airunx run --prd ./specs/feature-prd.md

# Remote URL
airunx run --prd https://example.com/prd.md

# GitHub blob URL (automatically converted to raw URL)
airunx run --prd https://github.com/org/repo/blob/main/docs/prd.md

Supported sources:

Local file paths (absolute or relative)
HTTP/HTTPS URLs
GitHub blob URLs (converted to raw.githubusercontent.com)

--context

Inject additional context for agents without modifying project configuration:

# Add runtime context
airunx run --context ./design/theme-context.md "Add dark mode"

# The context is merged with project context from settings.context_location

The runtime context is appended to project context with a separator.

Dynamic Context Resolution (@mentions)

AIRunX supports dynamic context resolution using @ mentions in prompts, similar to Cursor's @ mention system. This allows agents to automatically load referenced files during execution.

Supported patterns:

| Pattern | Description | Example | | -------------- | -------------------------------------- | --------------------------- | | @file:path | Load a specific file | @file:src/utils/config.ts | | @folder:path | Load all files in a directory | @folder:src/utils | | @path | Shorthand (infers type from extension) | @src/utils/config.ts |

Examples:

# Reference specific files in your prompt
airunx run "Update the validation in @src/utils/validator.ts to match @src/types/user.ts"

# Reference entire folders for broader context
airunx run "Refactor all utilities in @folder:src/utils to use the new logging pattern"

# Mixed references
airunx run "Add error handling to @src/api/client.ts based on patterns in @folder:src/errors"

How it works:

When a prompt is processed, the context resolver parses for @ references
Referenced files/folders are loaded and cached (60-second TTL)
Content is injected into the agent's context as additionalContext.dynamicContext
Failed references are logged but don't block execution

Folder loading:

Loads .md, .txt, .ts, .js, .tsx, .jsx, .py, .rb, .go, .rs files
Traverses up to 3 levels deep
Skips node_modules, .git, .worktrees, dist, build, __pycache__
Files over 1MB are skipped

Smart filtering:

URLs are ignored (@https://...)
Short usernames are ignored (@user, @team)
Duplicate references are deduplicated

Configuration

AIRunX uses a hierarchical configuration system that allows defaults to be overridden at global or project level.

Configuration Hierarchy

Priority (highest to lowest):
─────────────────────────────────────────────────────────────
1. CLI Arguments
   └─ airunx run --fidelity ultra --backend cursor

2. Project Config (.airunx/)
   └─ Checked in to repo, shared with team

3. Global User Config (~/.airunx/)
   └─ Personal preferences across all projects

4. Package Defaults (config/default/)
   └─ Sensible starting point

Directory Structure

your-project/
├── .airunx/                    # Project configuration (commit this)
│   ├── settings.json           # Configuration pointers
│   ├── config.yml              # Backend routing & fidelity
│   ├── AGENTS.md               # Custom agent definitions (optional)
│   └── pipelines.yaml          # Custom pipelines (optional)
├── .airunx-state/              # Runtime state (git-ignored)
│   ├── workflows/              # Workflow execution state
│   └── todos/                  # Task tracking
├── .env                        # Credentials (git-ignored)
└── .worktrees/                 # Git worktrees (git-ignored)

settings.json

The main configuration file with pointers to other resources:

{
  "$schema": "./settings.schema.json",
  "context_location": null,
  "mcp_json_location": null,
  "agents_md_location": "./AGENTS.md",
  "pipelines_yaml_location": "./pipelines.yaml",
  "default_project_location": null,
  "folders": [],
  "default_fidelity": "standard",
  "default_backend": "claude-code",
  "skip_version_check": false,
  "skip_sync": false,
  "approval_mode": "auto",
  "disable_compound_engineering": false,
  "verbose_progress": true,
  "airunx_signature": true,
  "workspace_location": null,
  "env_file_location": null,
  "ci_verification_gate": false,
  "mark_issue_checkboxes": true,
  "gh_pr_title_rules": null,
  "gh_pr_label_rules": null,
  "tool_configs": null
}

Core Settings

| Field | ------------------------------ | context_location | mcp_json_location | agents_md_location | pipelines_yaml_location | default_fidelity | default_backend | skip_version_check | default_base_branch | skip_sync | disable_compound_engineering | approval_mode | verbose_progress | airunx_signature | workspace_location | env_file_location | ci_verification_gate | mark_issue_checkboxes | gh_pr_title_rules | gh_pr_label_rules | Type | Default | Description | | ---------------- | --------------- | ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | | string \| null | null | Path to context file (e.g., .ai/context.md). Relative paths resolved from settings file location. | | string \| null | null | Path to MCP servers configuration file (mcp.json). | | string \| null | null | Path to AGENTS.md file. If null, uses the default AGENTS.md. | | string \| null | null | Path to pipelines.yaml file. If null, uses the default pipelines.yaml. | | string | "standard" | Default execution fidelity: fast, standard, thorough, ultra | | string | "claude-code" | Default backend: claude-code, cursor, codex | | boolean | false | Skip checking if AIRunX is up-to-date on startup. | | string \| null | null | Override base branch for git worktrees (e.g., main, master, develop). Auto-detected if omitted. | | boolean | false | Skip syncing source branch with remote before worktree creation. Equivalent to always using --no-sync. | | boolean | false | Disable Compound Engineering sub-agent delegation (use raw Claude Code instead). | | string | "auto" | Approval mode for CLI adapters: manual, auto, yolo. Controls permission prompts. Note: yolo requires AIRUNX_ALLOW_YOLO=1 env var; without it, falls back to auto with a warning. | | boolean | true | Enable verbose progress reporting during workflow execution. | | boolean | true | Include AIRunX signature badge in PR descriptions. | | string \| null | null | Directory for cloning repos when not found locally (useful for server deployments). | | string \| null | null | Explicit path to .env file for environment loading. Automatically set when using init --dotenv. | | boolean | false | After PR creation, poll GitHub Actions and iterate on CI failures (up to 3 attempts). Fetches failed logs via gh run view --log-failed, injects into workflow input, re-runs from implement stage, commits the fix (updates existing PR), and polls again. Requires GH Actions workflows in the repo; no-ops otherwise. Can be overridden per-run with --ci-verification-gate. | | boolean | true | Mark completed - [ ] items on source GitHub issues after pipeline completion. The judge agent identifies which checkboxes were satisfied. | | object \| null | null | Custom PR title type inference rules. See PR Title & Label Customization. | | object \| null | null | Custom PR label generation rules. See PR Title & Label Customization. |

Project Configuration

| Field | Type | Default | Description | | -------------------------- | ---------------- | ------- | ------------------------------------------------------------------------------------------------------------------------ | | default_project_location | string \| null | null | Default project directory when no project is detected. | | folders | array | [] | Array of project folders for multi-project workspace support. See Multi-Project Workspaces. |

Tool Configuration Overrides

The tool_configs object allows overriding default tool configuration paths:

{
  "tool_configs": {
    "linter_config": "./custom-eslint.config.js",
    "type_checker_config": "./tsconfig.custom.json",
    "test_runner_config": "./vitest.custom.config.ts",
    "coverage_config": "./coverage.config.js",
    "formatter_config": "./.prettierrc.custom",
    "security_scanner_config": "./security.config.json",
    "docs_location": "./docs"
  }
}

| Field | Description | | ------------------------- | ---------------------------------------------------------------------------------------------------------------------------------------------------------------- | | linter_config | Path to

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

AIRunX

Overview

Key Features

Table of Contents

Installation

Quick Start

Prerequisites

Setup

Server Deployment

CLI Reference

Global Options

Commands

airunx run

airunx init

airunx status

airunx doctor

airunx circuit

airunx cleanup

airunx validate

airunx agents-validate

airunx heartbeat

Automatic Checkbox Marking

Pipelines

Pipeline Types

Execution Pipelines

Default Pipeline Stages

Stage Skip Conditions

Heartbeat Execution Mode

Architecture

Task Lifecycle

Optimistic Concurrency

Process Lock

Audit Logging

Agent-Native Tools

iteration_history

Circuit Breaker

Compound Engineering

How AIRunX Applies CE Patterns

Architecture

Agent Pipeline

Technology Stack

Execution Fidelity

Usage

Approval Modes

Configuration

Security Controls

Security Recommendations

Responsible Disclosure

PR Customization

PR Title Rules

PR Label Rules

PR Body Template

Available Template Variables

Stage Performance Data

Provider Breakdown Data

Example Template

Template Tips

Multi-Provider Architecture

Supported Providers

Per-Role Provider Configuration

Provider Routing Priority

Model Resolution Priority

Runtime Configuration Flags

--project

--prd

--context

Dynamic Context Resolution (@mentions)

Configuration

Configuration Hierarchy

Directory Structure

settings.json

Core Settings

Project Configuration

Tool Configuration Overrides

`airunx run`

`airunx init`

`airunx status`

`airunx doctor`

`airunx circuit`

`airunx cleanup`

`airunx validate`

`airunx agents-validate`

`airunx heartbeat`