ai-cli-mcp
v2.21.0
Published
Run Claude, Codex, Gemini, Forge, and OpenCode CLIs through MCP with background jobs
Maintainers
Readme
AI CLI MCP Server
📦 Package Migration Notice: This package was formerly
@mkxultra/claude-code-mcpand has been renamed toai-cli-mcpto reflect its expanded support for multiple AI CLI tools.
An MCP (Model Context Protocol) server that allows running AI CLI tools (Claude, Codex, Gemini, Forge, and OpenCode) in background processes with automatic permission handling.
Did you notice that Cursor sometimes struggles with complex, multi-step edits or operations? This server, with its powerful unified run tool, enables multiple AI agents to handle your coding tasks more effectively.
Demo
Overview
This MCP server provides tools that can be used by LLMs to interact with AI CLI tools. When integrated with MCP clients, it allows LLMs to:
- Run Claude CLI with all permissions bypassed (using
--dangerously-skip-permissions) - Execute Codex CLI with approvals and sandbox bypassed (using
--dangerously-bypass-approvals-and-sandbox) - Execute Gemini CLI with automatic approval mode (using
-y) - Execute Forge CLI in non-interactive mode (using
forge -C <workFolder> -p <prompt>) - Execute OpenCode in non-interactive JSON mode (using
opencode run --format json --dir <workFolder> <prompt>) - Support multiple AI models: Claude (sonnet, sonnet[1m], opus, opusplan, haiku), Codex (gpt-5.4, gpt-5.5, gpt-5.4-mini, gpt-5.3-codex, gpt-5.3-codex-spark, gpt-5.2), Gemini (gemini-2.5-pro, gemini-2.5-flash, gemini-3.1-pro-preview, gemini-3-pro-preview, gemini-3-flash-preview), Forge (
forge), and OpenCode (opencodeplus explicitoc-<provider/model>wrappers such asoc-openai/gpt-5.4) - Manage background processes with PID tracking
- Parse and return structured outputs from both tools
Usage Example (Advanced Parallel Processing)
You can instruct your main agent to run multiple tasks in parallel like this:
Launch agents for the following 3 tasks using acm mcp run:
- Refactor
src/backendcode usingsonnet- Create unit tests for
src/frontendusinggpt-5.3-codex- Update docs in
docs/usinggemini-2.5-proWhile they run, please update the TODO list. Once done, use the
waittool to wait for all completions and report the results together.
Usage Example (Context Caching & Sharing)
You can reuse heavy context (like large codebases) using session IDs to save costs while running multiple tasks.
- First, use
acm mcp runwithopusto read all files insrc/and understand the project structure.- Use the
waittool to wait for completion and retrieve thesession_idfrom the result.- Using that
session_id, run the following two tasks in parallel withacm mcp run:
- Create refactoring proposals for
src/utilsusingsonnet- Add architecture documentation to
README.mdusinggpt-5.3-codex- Finally,
waitagain to combine both results.
Benefits
- True Async Multitasking: Agent execution happens in the background, returning control immediately. The calling AI can proceed with the next task or invoke another agent without waiting for completion.
- CLI in CLI (Agent in Agent): Directly invoke powerful CLI tools like Claude Code or Codex from any MCP-supported IDE or CLI. This enables broader, more complex system operations and automation beyond host environment limitations.
- Freedom from Model/Provider Constraints: Freely select and combine the "strongest" or "most cost-effective" models from Claude, Codex (GPT), Gemini, and Forge without being tied to a specific ecosystem.
Prerequisites
The only prerequisite is that the AI CLI tools you want to use are locally installed and correctly configured.
- Claude Code:
claude doctorpasses, and execution with--dangerously-skip-permissionsis approved (you must run it manually once to login and accept terms). - Codex CLI (Optional): Installed and initial setup (login etc.) completed.
- Gemini CLI (Optional): Installed and initial setup (login etc.) completed.
- Forge CLI (Optional): Installed and initial setup completed.
- OpenCode (Optional): Installed and configured. This integration uses
opencode run --format json, and explicit provider/model selection follows theoc-<provider/model>wrapper syntax exposed byai-cli models.
Installation & Usage
There are now two primary ways to use this package:
ai-cli-mcp: MCP server entrypointai-cli: human-facing CLI for background AI runs
MCP usage with npx
The recommended way to use the MCP server is via npx.
Using npx in your MCP configuration:
"ai-cli-mcp": {
"command": "npx",
"args": [
"-y",
"ai-cli-mcp@latest"
]
},Using Claude CLI mcp add command:
claude mcp add ai-cli '{"name":"ai-cli","command":"npx","args":["-y","ai-cli-mcp@latest"]}'Human CLI usage with global install
If you want to use the production CLI directly from your shell, install the package globally:
npm install -g ai-cli-mcpThis exposes both commands:
ai-cliai-cli-mcp
Examples:
ai-cli doctor
ai-cli models
ai-cli run --cwd "$PWD" --model sonnet --prompt "summarize this repository"
ai-cli run --cwd "$PWD" --model opencode --prompt "summarize this repository with OpenCode defaults"
ai-cli run --cwd "$PWD" --model oc-openai/gpt-5.4 --session-id ses_123 --prompt "continue this session with an explicit OpenCode model"
ai-cli ps
ai-cli result 12345
ai-cli result 12345 --verbose
ai-cli peek 12345 --time 10
ai-cli wait 12345 --timeout 300
ai-cli wait 12345 --verbose
ai-cli kill 12345
ai-cli cleanup
ai-cli-mcpHuman CLI usage with npx
Because the published package name is still ai-cli-mcp, the shortest npx form for the CLI is:
npx -y --package ai-cli-mcp@latest ai-cli run --cwd "$PWD" --model sonnet --prompt "hello"
npx -y --package ai-cli-mcp@latest ai-cli run --cwd "$PWD" --model oc-openai/gpt-5.4 --prompt "hello from OpenCode"Important First-Time Setup
For Claude CLI:
Before the MCP server can use Claude, you must first run the Claude CLI manually once with the --dangerously-skip-permissions flag, login and accept the terms.
npm install -g @anthropic-ai/claude-code
claude --dangerously-skip-permissionsFollow the prompts to accept. Once this is done, the MCP server will be able to use the flag non-interactively.
For Codex CLI:
For Codex, ensure you're logged in and have accepted any necessary terms:
codex loginFor Gemini CLI:
For Gemini, ensure you're logged in and have configured your credentials:
gemini auth loginmacOS might ask for folder permissions the first time any of these tools run. If the first run fails, subsequent runs should work.
CLI Commands
ai-cli currently supports:
runpsresultpeekwaitkillcleanupdoctormodelsmcp
Example flow:
ai-cli doctor
ai-cli models
ai-cli run --cwd "$PWD" --model gpt-5.4 --prompt "use the default Codex model"
ai-cli run --cwd "$PWD" --model codex-ultra --prompt "fix failing tests"
ai-cli run --cwd "$PWD" --model opencode --session-id ses_existing --prompt "continue this OpenCode session"
ai-cli run --cwd "$PWD" --model oc-openai/gpt-5.4 --prompt "run with an explicit OpenCode backend model"
ai-cli ps
ai-cli peek 12345 --time 10
ai-cli peek 12345 12346 --time 10
ai-cli wait 12345
ai-cli wait 12345 --verbose
ai-cli result 12345
ai-cli result 12345 --verbose
ai-cli cleanuprun accepts --cwd as the primary working-directory flag and also accepts the older aliases --workFolder / --work-folder for compatibility.
OpenCode model selection accepts either:
opencodefor the CLI's configured default modeloc-<provider/model>for an explicit OpenCode provider/model, for exampleoc-openai/gpt-5.4
ai-cli models exposes OpenCode machine-readably via opencode: ["opencode"] plus dynamicModelBackends.opencode, which points users to opencode models for backend-native discovery.
Codex model selection uses gpt-5.4 as the default advertised model.
doctor checks only binary availability and path resolution. Its JSON output includes a checks block that marks login state and terms acceptance as unchecked.
CLI State Storage
Background CLI runs are stored under:
~/.local/state/ai-cli/cwds/<normalized-cwd>/<pid>/Each PID directory contains:
meta.jsonstdout.logstderr.logexit-status.jsonfor detached runs
Use ai-cli cleanup to remove completed and failed runs. Running processes are preserved.
Exit Status Tracking
Detached ai-cli runs persist natural exit status for all supported backends through exit-status.json. Non-zero exits are surfaced as failed with the recorded exitCode; zero exits are surfaced as completed with exitCode: 0. ai-cli kill records SIGTERM termination as a failed exit, and a tracked process that disappears without exit metadata is treated as failed rather than assumed successful.
Connecting to Your MCP Client
After setting up the server, add the configuration to your MCP client's settings file (e.g., mcp.json for Cursor, mcp_config.json for Windsurf).
If the file doesn't exist, create it and add the ai-cli-mcp configuration.
Tools Provided
This server exposes the following tools:
run
Executes a prompt using Claude CLI, Codex CLI, Gemini CLI, Forge CLI, or OpenCode. The appropriate CLI is automatically selected based on the model name.
Arguments:
prompt(string, optional): The prompt to send to the AI agent. Eitherpromptorprompt_fileis required.prompt_file(string, optional): Path to a file containing the prompt. Eitherpromptorprompt_fileis required. Can be absolute path or relative toworkFolder.workFolder(string, required): The working directory for the CLI execution. Must be an absolute path. Models:- Ultra Aliases:
claude-ultra(defaults to max effort),codex-ultra(gpt-5.5, defaults to xhigh reasoning),gemini-ultra - Claude:
sonnet,sonnet[1m],opus,opusplan,haiku - Codex:
gpt-5.4,gpt-5.5,gpt-5.4-mini,gpt-5.3-codex,gpt-5.3-codex-spark,gpt-5.2 - Gemini:
gemini-2.5-pro,gemini-2.5-flash,gemini-3.1-pro-preview,gemini-3-pro-preview,gemini-3-flash-preview - Forge:
forge - OpenCode:
opencodefor the configured default backend model, plus explicit wrappers likeoc-openai/gpt-5.4 reasoning_effort(string, optional): Reasoning control for Claude and Codex. Claude uses--effort(allowed: "low", "medium", "high", "xhigh", "max"). Codex usesmodel_reasoning_effort(allowed: "low", "medium", "high", "xhigh"). Gemini, Forge, and OpenCode do not supportreasoning_effort.session_id(string, optional): Optional session ID to resume a previous session. Supported for Claude, Codex, Gemini, Forge, and OpenCode. OpenCode resumes in place via--sessionand may also be combined with an explicitoc-<provider/model>selection.
wait
Waits for multiple AI agent processes to complete and returns their combined results. Blocks until all specified PIDs finish or a timeout occurs.
By default, each returned result item uses the compact shape shared with get_result(verbose: false): operational fields such as pid, agent, status, exitCode, model, parsed output such as agentOutput, and top-level session_id when available. Set verbose: true to include full metadata like startTime, workFolder, prompt, and detailed parsed output such as agentOutput.tools.
Arguments:
pids(array of numbers, required): List of process IDs to wait for (returned by theruntool).timeout(number, optional): Maximum wait time in seconds. Defaults to 180 (3 minutes).verbose(boolean, optional): Iftrue, each result item uses the full result shape. Defaults tofalse.
peek
Starts a one-shot short observation window for running child agents and returns structured events observed during that specific call. By default this includes only natural-language message events; pass include_tool_calls or --include-tool-calls to also include normalized tool-call events. It is not a history API, not gapless streaming, and not shell stdout/stderr tailing. Separate peek calls may miss events emitted between calls; --follow is intentionally not part of v1.
CLI v1:
ai-cli peek 123 --time 10
ai-cli peek 123 456 --time 10
ai-cli peek 123 --time 10 --include-tool-callsArguments:
pids(array of numbers, required): 1..32 process IDs returned byrun. Duplicate PIDs are deduplicated server-side, preserving first occurrence order. Unknown or unmanaged PIDs are returned per process asnot_found, not as a whole-call failure.peek_time_sec(number, optional): Positive integer observation length in seconds. Defaults to 10 and is capped at 60.0, negative values, and fractional values are invalid.include_tool_calls(boolean, optional): Whentrue, each processeventsarray includes normalizedtool_callevents in addition to message events. Defaults tofalse.
Observation and filtering:
peek_started_atandevents[].tsare ai-cli-mcp server-side UTC RFC3339 timestamps.peek_started_atis when the observation window starts after validation and listener registration;events[].tsis when ai-cli-mcp observed and accepted the event.- The window ends when
peek_time_secelapses or all target processes reach a terminal state, whichever comes first. - Events emitted before the window starts are not returned. Concurrent
peekcalls for the same PID are allowed; each has an independent window and may return overlapping events. - Message events are recognized from Codex
agent_messagetext, Claude assistant text content, OpenCodetype: "text"events wherepart.typeis"text", Gemini stream-jsonmessageevents whereroleis"assistant", and best-effort Forge plain-text lines beginning withSummary:orCompleted successfully:. - When tool calls are included,
tool_callevents are normalized for Codex command/MCP calls, Claude tool use/results, Gemini tool use/results, OpenCode completed tool use events, and low-precision ForgeExecute/Finishedmarkers. Tool summaries are bounded one-line strings derived from tool names and input metadata only. Forge command output itself is not tailed or exposed. Raw stdout/stderr, raw JSONL, tool result output, command output,result.response, stats, token usage, and verbose metadata are excluded. - Unknown event shapes are denied by default. Managed agents without supported extraction return their real process status with
events: [],truncated: false, anderror: null. - Each PID keeps the first 50 events observed in the window. If later events are dropped,
truncatedistrue. statusis one ofrunning,completed,failed, ornot_found, and reflects state when the observation window closes.agentisclaude,codex,gemini,forge,opencode, a future tracked string value, ornullwhen the process is not found or the agent cannot be determined.
Example response:
{
"peek_started_at": "2026-04-11T12:34:56.789Z",
"observed_duration_sec": 10.01,
"processes": [
{
"pid": 123,
"agent": "codex",
"status": "running",
"events": [
{ "kind": "message", "ts": "2026-04-11T12:34:59.120Z", "text": "I'm checking the implementation." },
{ "kind": "tool_call", "ts": "2026-04-11T12:35:00.000Z", "phase": "started", "id": "item_0", "tool": "command_execution", "summary": "/bin/sh -c 'echo hi'" }
],
"truncated": false,
"error": null
},
{
"pid": 999,
"agent": null,
"status": "not_found",
"events": [],
"truncated": false,
"error": "process not found"
}
]
}list_processes
Lists all running and completed AI agent processes with their status, PID, and basic info.
doctor
Checks supported AI CLI binary availability and path resolution from MCP clients. Like ai-cli doctor, it returns a checks block and does not verify login state or terms acceptance.
models
Lists supported model names, aliases, and dynamic backend discovery hints from MCP clients. This returns the same structured payload as ai-cli models.
get_result
Gets the current output and status of an AI agent process by PID.
By default, this returns the compact result shape: operational fields such as pid, agent, status, exitCode, model, parsed output such as agentOutput, and top-level session_id when available. It omits metadata fields like startTime, workFolder, and prompt. Set verbose: true to return the full result shape including those metadata fields and detailed parsed output such as agentOutput.tools. If parsed output is unavailable or incomplete, the raw stdout/stderr fallback is preserved.
Arguments:
pid(number, required): The process ID returned by theruntool.verbose(boolean, optional): Iftrue, returns the full result shape. Defaults tofalse.
kill_process
Terminates a running AI agent process by PID.
Arguments:
pid(number, required): The process ID to terminate.
Troubleshooting
- "Command not found" (claude-code-mcp): If installed globally, ensure the npm global bin directory is in your system's PATH. If using
npx, ensurenpxitself is working. - "Command not found" (
ai-cli): If installed globally, ensure your npm global bin directory is inPATH. If usingnpx, usenpx -y --package ai-cli-mcp@latest ai-cli .... - "Command not found" (claude or ~/.claude/local/claude): Ensure the Claude CLI is installed correctly. Run
claude/doctoror check its documentation. - Permissions Issues: Make sure you've run the "Important First-Time Setup" step.
- JSON Errors from Server: If
MCP_CLAUDE_DEBUGistrue, error messages or logs might interfere with MCP's JSON parsing. Set tofalsefor normal operation. - ESM/Import Errors: Ensure you are using Node.js v20 or later.
Contributing
For development setup, testing, and contribution guidelines, see the Development Guide.
Testing
# Deterministic unit, parser, contract, and mocked e2e tests
npm test
# Published npm package contents smoke test
npm run test:package
# Deterministic PR/release gate used by GitHub Actions.
# This does not enable real external CLI runs by itself.
npm run test:release
# Release-time live E2E against real installed AI CLIs
ACM_LIVE_E2E=1 ACM_LIVE_E2E_AGENTS=claude,codex npm run test:live
# Release-time live E2E for both ai-cli and MCP server surfaces
ACM_LIVE_E2E=1 ACM_LIVE_E2E_SURFACE=all ACM_LIVE_E2E_AGENTS=claude,codex npm run test:liveLive E2E is opt-in because it depends on installed and authenticated external CLIs, network access, provider availability, and cost budget. ACM_LIVE_E2E_SURFACE defaults to cli; use mcp or all to include the MCP server surface.
Advanced Configuration (Optional)
Normally not required, but useful for customizing CLI paths or debugging.
CLAUDE_CLI_NAME: Override the Claude CLI binary name or provide an absolute path (default:claude)CODEX_CLI_NAME: Override the Codex CLI binary name or provide an absolute path (default:codex)GEMINI_CLI_NAME: Override the Gemini CLI binary name or provide an absolute path (default:gemini)FORGE_CLI_NAME: Override the Forge CLI binary name or provide an absolute path (default:forge)OPENCODE_CLI_NAME: Override the OpenCode CLI binary name or provide an absolute path (default:opencode)MCP_CLAUDE_DEBUG: Enable debug logging (set totruefor verbose output)
CLI Name Specification:
- Command name only:
CLAUDE_CLI_NAME=claude-custom - Absolute path:
CLAUDE_CLI_NAME=/path/to/custom/claudeRelative paths are not supported.
Example with custom CLI binaries:
"ai-cli-mcp": {
"command": "npx",
"args": [
"-y",
"ai-cli-mcp@latest"
],
"env": {
"CLAUDE_CLI_NAME": "claude-custom",
"CODEX_CLI_NAME": "codex-custom",
"OPENCODE_CLI_NAME": "opencode-custom"
}
},License
MIT


