codex-sdk

v0.2.9

Published

4 months ago

A TypeScript SDK for building AI-powered terminal applications with interactive workflows. [Originally forked from OpenAI's codex.]

Downloads

167

0High
0Medium
0Low

melnikov-s

Codex SDK

A TypeScript SDK for building AI-powered terminal applications with interactive workflows. [Originally forked from OpenAI's codex.]

When to use this

High-level AI coding agents (like Claude Code) work well for most general coding tasks. This library is for building small, focused workflows that are easily shareable and reproducible.

Instead of one large coding agent, you create specific workflows: a deploy dashboard, project setup assistant, code review helper, or database migration tool. Each workflow handles one job well and can be shared with your team or the community.

The key architectural difference: coding agents typically offer two modes - a high-level interface where the agent manages everything, or an SDK that gives you programmatic control but loses the human-in-the-loop interaction (no text box, no approval flows). This library gives you both: YOU manage the context and call the model (using ai-sdk) while keeping the human-in-the-loop interface intact. You get programmatic control over the entire workflow AND the collaborative terminal UI.

In summary:

Claude Code: High abstraction, lower flexibility
Codex SDK: Low abstraction, high flexibility

When it fits

Developer tools: Deploy dashboards, code review helpers, repo analyzers, CLI copilots that need approvals and quick iteration
Ops/SRE automation: Human-in-the-loop workflows with safe command whitelists and visible command history
Team-shareable tools: Reproducible, focused assistants that ship as terminal apps with good UX

This gives you infrastructure for human-in-the-loop AI workflows with built-in UI, approval policies, and tool execution. Works with any AI provider via ai-sdk.

When it doesn't fit

IDE-native assistants only: Prefer MCP tools inside Claude or Cursor to keep everything in-editor

🚀 Declarative API

Responsibility Boundaries

You control:

Full control of model context
State management (messages, loading, queues, task lists)
AI model calls (when to call, what context to include)
Business logic and workflow coordination
Data persistence and external integrations

Library handles:

Multi-workflow orchestration and switching
Terminal UI rendering and user interactions
Tool execution with approval policies
File system integration and command sandboxing

Workflow Lifecycle

Your workflow factory returns an object with these methods:

initialize() - Called once when workflow is created
message(userInput) - Called each time user sends input
stop() - Called when user stops current processing
terminate() - Called when workflow is closed/destroyed

Declarative State

Your workflow receives state (current values) and setState (updater function). Changes are immediately reflected in the UI:

// Read current state
const { loading, messages, queue, taskList } = state;

// Update state declaratively
await setState({
  loading: true,
  messages: [...state.messages, { role: "user", content: "Hello" }],
  statusLine: "Processing your request...",
});

// Or use functional updates
await setState((prev) => ({
  ...prev,
  queue: [...prev.queue, "new task"],
  taskList: prev.taskList.map((task) =>
    task.label === "Deploy" ? { ...task, completed: true } : task,
  ),
}));

Action Shortcuts

The actions API provides convenient shortcuts for common state updates:

// These actions are equivalent to setState calls:
actions.setLoading(true); // setState({ loading: true })
actions.addMessage(userMessage); // setState({ messages: [...messages, userMessage] })
actions.say("Processing..."); // Add UI-only message
actions.addToQueue("pending"); // setState({ queue: [...queue, "pending"] })
actions.addTask("Deploy staging"); // setState({ taskList: [...taskList, task] })

You decide when to call the AI, what context to include, and how to handle responses. The library provides the infrastructure and UI.

📦 Installation

npm install codex-sdk

🎯 Quick Start

Simple API

The SDK provides a single run function that handles both single and multiple workflows:

import { createAgentWorkflow, run } from "codex-sdk";
import { openai } from "@ai-sdk/openai";
import { generateText } from "ai";

const workflow = createAgentWorkflow(
  "My Assistant",
  ({ state, actions, tools }) => {
    return {
      initialize: async () => {
        actions.say("Ready! How can I help you?");
      },

      message: async (userInput) => {
        actions.setLoading(true);
        actions.addMessage(userInput);

        // Handle user input with AI
        const result = await generateText({
          model: openai("gpt-4o"),
          system: "You are a helpful assistant.",
          messages: state.transcript,
          tools: tools.definitions,
        });

        // Using single-agent (global) helper
        await actions.handleModelResult(result);
        actions.setLoading(false);
      },

      stop: () => actions.setLoading(false),
      terminate: () => {},
    };
  },
);

// Single workflow
const manager = run(workflow);

// Multiple workflows - pass Array<WorkflowFactory>
const multiManager = run([codeAssistant, researchAssistant], {
  title: "🚀 My Development Environment",
});

Multi-Workflow Setup

import { createAgentWorkflow, run } from "codex-sdk";

// Define your AI assistants
const codeAssistant = createAgentWorkflow(
  "Code Assistant",
  ({ state, actions, tools }) => ({
    initialize: async () => {
      actions.say("Code assistant ready!");
    },
    message: async (userInput) => {
      // AI logic for coding tasks
      actions.addMessage(userInput);
      // ... AI processing logic
    },
    stop: () => actions.setLoading(false),
    terminate: () => {},
  }),
);

const researchAssistant = createAgentWorkflow(
  "Research Assistant",
  ({ state, actions, tools }) => ({
    initialize: async () => {
      actions.say("Research assistant ready!");
    },
    message: async (userInput) => {
      // AI logic for research tasks
      actions.addMessage(userInput);
      // ... AI processing logic
    },
    stop: () => actions.setLoading(false),
    terminate: () => {},
  }),
);

// Run multiple workflows with seamless switching
const manager = run([codeAssistant, researchAssistant], {
  title: "🚀 My AI Development Environment",
  approvalPolicy: "suggest",
  config: {
    safeCommands: ["git status", "npm test"],
    headers: [
      { label: "Environment", value: "Development" },
      { label: "Project", value: "My App" },
    ],
  },
});

// Listen to workflow events
manager.on("workflow:switch", (event) => {
  console.log(`Switched to: ${event.workflow.title}`);
});

🎮 Multi-Workflow Controls

Default Keyboard Shortcuts

| Command | Description | | -------- | -------------------------------------------------------- | | Ctrl+P | Switch to next workflow | | Ctrl+O | Switch to previous workflow | | Ctrl+N | Switch to next non-loading workflow | | Ctrl+K | Open app command palette (switch/create/close workflows) |

Workflow tabs appear at the bottom showing all active assistants.

Customizable Controls

Customize keyboard shortcuts and display them in overlays:

import { run, createAgentWorkflow } from "codex-sdk";

const customHotkeyConfig = {
  previousWorkflow: { key: "h", ctrl: true }, // Ctrl+H instead of Ctrl+O
  nextWorkflow: { key: "l", ctrl: true }, // Ctrl+L instead of Ctrl+P
  nextNonLoading: { key: "j", ctrl: true }, // Ctrl+J instead of Ctrl+N
  appCommands: { key: "k", ctrl: true, shift: true }, // Ctrl+Shift+K instead of Ctrl+K
};

run([myWorkflow], {
  title: "Custom Controls Demo",
  hotkeyConfig: customHotkeyConfig,
});

Features:

Always visible controls: Keyboard shortcuts are displayed at the bottom of selection overlays
Full customization: Configure any combination of ctrl, shift, meta, and key
Type safety: Full TypeScript support for hotkey configuration
Per-session: Different hotkey configurations for different workflow sessions

Interactive Overlays

History Overlay (/history): Browse command history with j/k navigation
Help Overlay (/help): View all available slash commands
Approval Overlay (/approval): Switch between approval policies
Selection Overlay: Multi-choice selections with arrow key navigation
Prompt Overlay: Text input dialogs with validation
Confirmation Overlay: Yes/No confirmation dialogs

File System Integration

File Autocompletion

The SDK provides intelligent file system autocompletion with @filename syntax:

// In your message input, type:
@src/components/    // Press Tab to see completions
@package.json       // Auto-completes to exact file
@./README.md        // Relative path completion

Features:

Prefix Matching: Type @src/ and Tab to see all files in src/
Directory Navigation: Navigate directories with Tab completion
Relative Paths: Support for ./ and ../ path patterns
Smart Filtering: Real-time filtering as you type
Keyboard Navigation: Use arrow keys to select from completions

Desktop Notifications

Get system notifications when AI assistants complete responses:

run(workflows, {
  config: {
    notify: true, // Enable desktop notifications
  },
});

Configuration:

Notifications appear when workflows finish processing
Includes workflow name and response preview
Respects system notification settings
Can be enabled/disabled per session

🛠️ API Reference

WorkflowManager

The run function returns an event-driven WorkflowManager:

const manager = run(workflows);

// Access workflow instances
manager.getWorkflows().forEach((workflow) => {
  console.log(workflow.title, workflow.isActive);
});

// Listen to events
manager.on("workflow:create", (event) => {
  console.log(`Created: ${event.workflow.title}`);
});

manager.on("workflow:switch", (event) => {
  console.log(
    `Switched from ${event.previousWorkflow?.title} to ${event.workflow.title}`,
  );
});

// Clean shutdown
manager.terminate();

WorkflowOptions

Configuration options:

interface BaseWorkflowOptions {
  approvalPolicy?: ApprovalPolicy;
  additionalWritableRoots?: ReadonlyArray<string>;
  fullStdout?: boolean;
  config?: LibraryConfig;
  onController?: (controller: WorkflowController) => void;
  title?: React.ReactNode;
  hotkeyConfig?: Partial<CustomizableHotkeyConfig>;
}

interface SingleWorkflowOptions extends BaseWorkflowOptions {
  prompt?: string;
  imagePaths?: Array<string>;
  headless?: boolean;
  format?: {
    roleHeader?: (msg: UIMessage) => string;
    message?: (msg: UIMessage) => string;
  };
  log?: { sink?: (line: string) => void; mode?: "human" | "jsonl" };
}

interface MultiWorkflowOptions extends BaseWorkflowOptions {
  initialWorkflows?: Array<InitialWorkflowRef>;
}

type WorkflowOptions = SingleWorkflowOptions | MultiWorkflowOptions;

Headless mode is only supported for single workflows.

Event Usage Examples

const manager = run([codeAssistant, researchAssistant], {
  title: "🛠️ AI Development Workspace",
});

// Track workflow lifecycle
manager.on("workflow:create", (event) => {
  console.log(`Created workflow: ${event.workflow.title}`);
  addTabToUI(event.workflow);
});

manager.on("workflow:close", (event) => {
  console.log(`Closed workflow: ${event.workflow.title}`);
  removeTabFromUI(event.workflow);
});

// Monitor active workflow changes
manager.on("workflow:switch", (event) => {
  updateActiveTab(event.workflow);
  loadWorkflowContext(event.workflow);
});

// Handle workflow state changes
manager.on("workflow:loading", (event) => {
  showSpinner(event.workflow);
});

manager.on("workflow:ready", (event) => {
  hideSpinner(event.workflow);
});

Available Events:

"workflow:create" - New workflow instance created
"workflow:close" - Workflow instance closed
"workflow:switch" - Active workflow changed
"workflow:loading" - Workflow is processing
"workflow:ready" - Workflow finished processing

Event Data:

interface WorkflowEvent {
  workflow: WorkflowInstance;
  previousWorkflow?: WorkflowInstance; // for switch events
}

interface WorkflowInstance {
  title: string;
  factory: WorkflowFactory;
  isActive: boolean;
  state: WorkflowState; // Access loading, messages, etc.
  displayConfig?: DisplayConfig;
}

Programmatic Workflow Management

The WorkflowManager returned by run() provides complete programmatic control over workflows and manager properties, equivalent to all Ctrl+K menu functionality.

Manager Properties

Access and update all manager-level settings programmatically:

const manager = run([codeAssistant, researchAssistant], {
  title: "🛠️ AI Development Workspace",
  approvalPolicy: "suggest",
});

// Read current properties
console.log(manager.title); // "🛠️ AI Development Workspace"
console.log(manager.approvalPolicy); // "suggest"
console.log(manager.config); // LibraryConfig object
console.log(manager.hotkeyConfig); // CustomizableHotkeyConfig object

// Update properties (triggers immediate UI re-renders)
manager.setTitle("🚀 Updated Development Environment");
manager.setApprovalPolicy("auto-edit");
manager.setConfig({
  safeCommands: ["git status", "npm test", "ls -la"],
  tools: {
    shell: { maxBytes: 2048 * 1024 },
  },
});
manager.setHotkeyConfig({
  nextWorkflow: { key: "l", ctrl: true },
  previousWorkflow: { key: "h", ctrl: true },
});

Workflow Lifecycle Management

Programmatically create, close, and switch between workflows:

const manager = run([codeAssistant, researchAssistant], {
  title: "🛠️ AI Development Workspace",
});

// Create new workflow instances
const newAssistant = await manager.createWorkflow(codeAssistant, {
  activate: true, // Switch to this workflow immediately
});
const backgroundResearcher = await manager.createWorkflow(researchAssistant);

// Get all workflows and active workflow
const allWorkflows = manager.getWorkflows();
const activeWorkflow = manager.getActiveWorkflow();

// Switch between workflows
await manager.switchToWorkflow(backgroundResearcher);

// Close workflows
await manager.closeWorkflow(newAssistant);

Workflow Navigation

Navigate between workflows programmatically:

// Navigate workflows (equivalent to Ctrl+P, Ctrl+O, Ctrl+N)
manager.switchToNextWorkflow();
manager.switchToPreviousWorkflow();
manager.switchToNextNonLoadingWorkflow();

Direct Workflow Control

Each workflow instance provides direct access to its state and control methods:

const activeWorkflow = manager.getActiveWorkflow();
if (activeWorkflow) {
  // Direct state management
  await activeWorkflow.setState({
    statusLine: "Processing request...",
    loading: true,
  });

  const currentState = activeWorkflow.getState();
  console.log(currentState.messages.length);

  // Direct workflow control
  activeWorkflow.message({ role: "user", content: "Hello!" });
  activeWorkflow.stop();

  // Check workflow status
  console.log(activeWorkflow.isLoading);
  console.log(activeWorkflow.isActive);
}

Workflow Iteration and Management

Loop over workflows and manage them programmatically:

// Iterate over all workflows
manager.getWorkflows().forEach((workflow) => {
  console.log(`Workflow: ${workflow.title}`);
  console.log(`Active: ${workflow.isActive}`);
  console.log(`Loading: ${workflow.isLoading}`);

  if (workflow.isActive) {
    workflow.message({ role: "user", content: "Status update" });
  }
});

// Find specific workflows
const codeWorkflows = manager
  .getWorkflows()
  .filter((w) => w.title.includes("Code"));

// Bulk operations
const loadingWorkflows = manager.getWorkflows().filter((w) => w.isLoading);
loadingWorkflows.forEach((w) => w.stop());

Event-Driven Management

Combine programmatic control with event handling:

const manager = run([codeAssistant, researchAssistant], {
  title: "🛠️ AI Development Workspace",
});

// Automatically switch to new workflows
manager.on("workflow:create", async ({ workflow }) => {
  console.log(`New workflow created: ${workflow.title}`);
  await manager.switchToWorkflow(workflow);
});

// Auto-restart failed workflows
manager.on("workflow:error", async ({ workflow }) => {
  console.log(`Workflow ${workflow.title} encountered an error, restarting...`);
  await manager.closeWorkflow(workflow);
  await manager.createWorkflow(workflow.factory, { activate: true });
});

// Dynamic property updates based on workflow state
manager.on("workflow:switch", ({ workflow }) => {
  if (workflow.title.includes("Research")) {
    manager.setApprovalPolicy("suggest"); // More careful for research
  } else {
    manager.setApprovalPolicy("auto-edit"); // Faster for coding
  }
});

Complete Example

Here's a comprehensive example showing all programmatic capabilities:

import { run, createAgentWorkflow } from "codex-sdk";

const codeAssistant = createAgentWorkflow("Code Assistant", /* ... */);
const researchAssistant = createAgentWorkflow("Research Assistant", /* ... */);

const manager = run([codeAssistant, researchAssistant], {
  title: "🛠️ Development Workspace",
  approvalPolicy: "suggest",
});

// Set up event handlers
manager.on("workflow:create", ({ workflow }) => {
  console.log(`✅ Created: ${workflow.title}`);
});

manager.on("workflow:switch", ({ workflow, previousWorkflow }) => {
  console.log(`🔄 Switched from ${previousWorkflow?.title} to ${workflow.title}`);
});

// Programmatic workflow management
async function setupWorkspace() {
  // Update manager properties
  manager.setTitle("🚀 Active Development Session");
  manager.setHotkeyConfig({
    nextWorkflow: { key: "j", ctrl: true },
    previousWorkflow: { key: "k", ctrl: true }
  });

  // Create additional workflows
  const debugger = await manager.createWorkflow(codeAssistant);
  const analyzer = await manager.createWorkflow(researchAssistant, { activate: true });

  // Set up workflow states
  await debugger.setState({
    statusLine: "Ready for debugging",
    queue: ["Check logs", "Analyze stack trace"]
  });

  await analyzer.setState({
    statusLine: "Analyzing codebase...",
    loading: true
  });

  // Send messages to specific workflows
  debugger.message({ role: "user", content: "Debug the login issue" });
  analyzer.message({ role: "user", content: "Analyze performance bottlenecks" });

  console.log(`Total workflows: ${manager.getWorkflows().length}`);
}

setupWorkspace();

This programmatic API provides complete control over the workflow environment, allowing you to build sophisticated multi-agent systems, automated workflow orchestration, and dynamic user interfaces.

🛠️ Workflow API

Workflow Creation

import { createAgentWorkflow } from "codex-sdk";

const workflow = createAgentWorkflow(
  "My Assistant",
  ({ state, actions, tools }) => {
    return {
      initialize: async () => {
        actions.say("Hello! How can I help?");
      },
      message: async (userInput) => {
        // Your workflow logic here
        actions.addMessage(userInput);
        // ... process user input
      },
      stop: () => actions.setLoading(false),
      terminate: () => {},
    };
  },
);

Additional Features

Custom Commands

const workflow = createAgentWorkflow(
  "With Commands",
  ({ state, actions, tools }) => ({
    initialize: async () => {
      actions.say("Ready!");
    },
    message: async (userInput) => {
      actions.addMessage(userInput);
      // Workflow implementation
    },
    stop: () => actions.setLoading(false),
    terminate: () => {},
    commands: {
      reset: {
        description: "Reset conversation",
        handler: () => actions.say("Conversation reset!"),
      },
      analyze: {
        description: "Analyze current context",
        handler: (args) => {
          /* Custom logic */
        },
      },
    },
  }),
);

Message Metadata

Attach custom metadata to any message for enhanced context tracking, analytics, or custom display formatting:

const workflow = createAgentWorkflow(
  "Smart Assistant",
  ({ state, actions, tools }) => ({
    initialize: async () => {
      actions.say("Smart assistant ready!");
    },
    message: async (userInput) => {
      // Add metadata to UI messages
      actions.say("Processing your request...", {
        priority: "high",
        source: "user_request",
        timestamp: new Date().toISOString(),
        userId: "user_123",
      });

      // Add metadata to user messages
      const enrichedMessage = {
        ...userInput,
        metadata: {
          confidence: 0.95,
          intent: "code_review",
          sessionId: "session_456",
        },
      };
      actions.addMessage(enrichedMessage);
    },
    stop: () => actions.setLoading(false),
    terminate: () => {},
  }),
);

Why use metadata?

Analytics: Track message types, user interactions, and workflow performance
Context: Store additional information like timestamps, user IDs, or confidence scores
Custom Display: Build custom UI components that respond to metadata properties
Debugging: Add debugging information that doesn't appear in AI conversations
Integration: Store data needed for external systems or APIs

Per‑Agent Message Scoping (Multi‑Agent in One Workflow)

Keep control state global (loading, inputDisabled, statusLine, queue, taskList). Use lightweight agent handles to scope messages and transcripts per agent within a single workflow.

const planner = actions.createAgent("Planner"); // auto‑generates UUID id
console.log(planner.id); // uuid
console.log(planner.name); // "Planner"
planner.setName("PlanBot"); // updates display name (keeps same id)
planner.say("Starting plan…");

const planResult = await generateText({
  model: openai("gpt-4o"),
  system: "You are the planning agent.",
  messages: planner.transcript(),
  tools: tools.definitions,
});

// Assistant and tool messages are tagged to this agent only
await planner.handleModelResults(planResult);

// Another agent in the same workflow
const executor = actions.createAgent("Executor");
const execResult = await generateText({
  model: openai("gpt-4o-mini"),
  system: "You are the execution agent.",
  messages: executor.transcript(),
  tools: tools.definitions,
});
await executor.handleModelResults(execResult);

Notes:

actions.createAgent(name) returns a handle with id, name, setName, say, addMessage, transcript, handleModelResults.
Messages produced via these methods carry metadata.agentId = id and the UI prefixes messages with [name] (fallback to [id]).
All other state (loading, inputDisabled, statusLine, queue, taskList) remains global at the workflow level.

Message structure and storage

Storage model: a single workflow‑level message log (state.messages). There is no agentsById state; messages are attributed to agents via metadata and names are stored in state.agentNames.
Each message can include agent attribution by id only; names are resolved at render time:

// UI-only messages or model/tool messages with optional agent attribution
type UIMessage =
  | { role: "ui"; content: string; metadata?: { agentId?: string } & Record<string, unknown> }
  | (ModelMessage & { metadata?: { agentId?: string } & Record<string, unknown> });

// Example stored assistant message
const m: UIMessage = {
  role: "assistant",
  content: [{ type: "text", text: "Plan complete." }],
  metadata: { agentId: "planner" },
};

Transcript semantics

Global transcript (state.transcript): excludes UI messages and excludes any message with metadata.agentId (i.e., sub‑agents are filtered out). Use this for simple single‑agent prompts.
Per‑agent transcript (agent.transcript()): includes only non‑UI messages tagged with that agent’s id.
Tool execution: assistant messages and tool results produced via agent.handleModelResults(result) are tagged with that agent. Tool‑synthesized user messages are routed via the normal input path.
UI display: when metadata.agentId is present, the terminal UI prefixes the role header with [name] using state.agentNames[agentId] (fallback to [id]). You can override this by providing displayConfig.formatRoleHeader or a custom agentNameResolver.

📚 Examples

The SDK includes examples:

Simple Agent: Basic AI assistant
Research Assistant: Specialized research workflows with sources
Coding Agent: Code analysis and editing workflows
Text Adventure Game: Interactive storytelling with choices
Deploy Dashboard: DevOps workflows with progress tracking
Queue Demo: Task queue management
Codebase Quiz: Interactive code learning
Multi-Workflow Demo: Complete multi-assistant setup
Customizable Controls Demo: Shows customizable keyboard shortcuts and always-visible controls

Run any example:

node examples/multi-workflow-demo.js

Configuration

Workflow Configuration

Configure tools and security for any workflow setup:

// Single workflow configuration
run(workflow, {
  approvalPolicy: "suggest",
  config: {
    // Tool configuration
    tools: {
      shell: {
        maxBytes: 1024 * 1024, // 1MB output limit
        maxLines: 1000,
      },
    },
    // User-defined safe commands (auto-approved)
    safeCommands: ["git status", "npm test", "ls -la"],

    // Custom headers in terminal
    headers: [
      { label: "Project", value: "MyApp" },
      { label: "Env", value: "Development" },
    ],

    // Custom status line
    statusLine: "Ready for deployment",

    // Command history settings
    history: {
      maxSize: 100,
      saveHistory: true,
      sensitivePatterns: ["password", "token", "key"],
    },
  },
});

// Multi-workflow configuration
run([workflow1, workflow2], {
  approvalPolicy: "auto-edit", // Default policy for all workflows
  initialWorkflows: [{ id: "primary" }], // Start with specific workflows
  title: "Development Environment",

  // Customize keyboard shortcuts
  hotkeyConfig: {
    previousWorkflow: { key: "h", ctrl: true }, // Ctrl+H
    nextWorkflow: { key: "l", ctrl: true }, // Ctrl+L
    appCommands: { key: "k", ctrl: true, shift: true }, // Ctrl+Shift+K
  },
});

Approval Policies

Control command execution safety:

suggest (default): Ask for approval before executing commands
auto-edit: Auto-approve file edits, ask for other commands
full-auto: Auto-approve all commands (use with caution)

run(workflows, {
  approvalPolicy: "suggest", // Safe default
  additionalWritableRoots: ["/safe/directory"], // Limit file access
});

Safe Commands

Pre-approved commands that bypass approval prompts:

run(workflow, {
  config: {
    safeCommands: ["git status", "npm test", "ls -la", "pwd", "docker ps"],
  },
});

How Safe Commands Work:

Commands are matched exactly or by command name only
Built-in safe commands: cd, ls, pwd, cat, grep, git status, find, wc, head, tail
User-defined safe commands are checked alongside built-in ones
Useful for frequently used read-only or safe operations

Headless Mode

Headless mode runs workflows without a terminal UI for programmatic usage. Only available for single workflows.

import { run } from "codex-sdk";

// ✅ Single workflow headless - works
const manager = run(myWorkflow, {
  headless: true, // No terminal UI
  approvalPolicy: "auto-edit",
  onController: (controller) => {
    // Programmatic access to workflow
    controller.message("Hello AI!");
  },
});

// ❌ Multi-workflow headless - throws error
try {
  run([workflow1, workflow2], {
    headless: true, // Error: not supported for multi-workflows
  });
} catch (error) {
  console.log(error.message);
  // "Headless mode is not supported for multi-workflow execution..."
}

Tool Integration

import { generateText } from "ai";
import { openai } from "@ai-sdk/openai";

const workflow = createAgentWorkflow(
  "With Tools",
  ({ state, actions, tools }) => {
    return {
      initialize: async () => {
        actions.say("Tool-enabled assistant ready!");
      },
      message: async (userInput) => {
        actions.setLoading(true);
        actions.addMessage(userInput);

        // AI will use tools automatically when needed
        const result = await generateText({
          model: openai("gpt-4o"),
          system: "You can use shell, apply_patch, and user_select tools.",
          messages: state.transcript,
          tools: tools.definitions,
        });

        // This automatically executes any tool calls the AI made
        await actions.handleModelResult(result);
        actions.setLoading(false);
      },
      stop: () => actions.setLoading(false),
      terminate: () => {},
    };
  },
);

Built-in Tools

These tools are available to your agent when you pass tools.definitions to your model call. Consider including the brief definitions below in your system prompt so the model knows how to use them.

shell
- What it does: Execute a shell/CLI command (including git) in a given working directory.
- Parameters:
  - cmd: string[] — command and args, e.g. ["git","status"]
  - workdir: string — working directory
  - timeout: number — milliseconds
apply_patch
- What it does: Apply file edits via a structured apply_patch payload for auditable changes.
- Parameters:
  - cmd: string[] — invocation and payload, e.g. ["apply_patch","*** Begin Patch\\n...\\n*** End Patch"]
  - workdir: string — working directory
  - timeout: number — milliseconds
user_select (UI mode only)
- What it does: Prompt the user to choose from options. The UI also includes a "None of the above (enter custom option)" choice automatically.
- Parameters:
  - message: string — text shown to the user
  - options: string[] — available choices
  - defaultValue: string — must match one of options; used on timeout

Note: In headless mode, only shell and apply_patch are available; user_select is not supported.

Prompt guidance: include tool definitions

Add a succinct description of the tools and their parameter shapes in your system prompt so the model reliably uses them. For example:

You can use the following tools. Call them with the exact parameter shapes.

- shell: run terminal commands.
  Parameters: { cmd: string[]; workdir: string; timeout: number }

- apply_patch: make file edits using a single apply_patch payload wrapped in *** Begin Patch ... *** End Patch.
  Parameters: { cmd: string[]; workdir: string; timeout: number }

- user_select (UI only): ask the user to pick from options; a custom option entry is available in the UI.
  Parameters: { message: string; options: string[]; defaultValue: string }

Guidelines:
- Use shell for executing commands; prefer safe/read-only commands where possible.
- Use apply_patch for all file modifications; do not edit files via shell.
- Use user_select to disambiguate choices; ensure defaultValue matches one of options.

Sample system prompt

You are an engineering agent assisting inside a terminal UI. You can use tools.

Tools:
- shell(cmd: string[], workdir: string, timeout: number): run terminal commands.
- apply_patch(cmd: string[], workdir: string, timeout: number): apply file edits via a single *** Begin Patch ... *** End Patch block.
- user_select(message: string, options: string[], defaultValue: string): prompt the user to choose (UI only).

Rules:
- Prefer safe commands; request approval before risky actions if applicable.
- All code edits must use apply_patch with a complete patch block. Do not edit files via shell.
- Use user_select when you need the user to choose among options or confirm a decision.

📚 Type Definitions

Full TypeScript support with type definitions:

export interface WorkflowFactory {
  /* ... */
}
export interface WorkflowState {
  /* ... */
}
export interface WorkflowManager {
  /* ... */
}
export interface SingleWorkflowOptions {
  /* ... */
}
export interface MultiWorkflowOptions {
  /* ... */
}
export interface CustomizableHotkeyConfig {
  /* ... */
}

📄 License

Apache-2.0

🤝 Contributing

We welcome contributions! The multi-workflow system is designed to be extensible - add your own workflow types, UI components, and tools.