langchain-codex
LangChain.js chat model adapter for the OpenAI Codex SDK.
langchain-codex lets LangChain and LangGraph code use local Codex through the official @openai/codex-sdk.
import { ChatCodexSDK } from "langchain-codex";
const model = new ChatCodexSDK({
model: "gpt-5.4",
workingDirectory: process.cwd(),
});
const response = await model.invoke("Review this repo and summarize the risks.");
console.log(response.content);
Status
This package is published on npm as langchain-codex. The API is still pre-1.0 and may change between minor releases.
What This Is
This package adapts the local Codex runtime to LangChain's BaseChatModel interface.
LangChain / LangGraph JS
-> ChatCodexSDK
-> @openai/codex-sdk or codex app-server
-> local Codex runtime
-> existing Codex auth session or API-key auth
It is intended for repository review, coding-agent workflows, LangGraph nodes, and automation where Codex's local runtime is the model provider.
What This Is Not
- It is not a wrapper around the OpenAI API.
- It does not handle OAuth tokens directly.
- It does not call private Codex backend APIs.
- It does not support browser use.
- Its bindTools() support is experimental and prompt-mediated, not SDK-native provider tool calling.
Codex has its own local tools, shell access, patching, sandboxing, approvals, and persisted threads. Those are different from provider-side LangChain tool calling.
Installation
npm install langchain-codex @langchain/core zod
For local repository development:
npm install
npm run build
Requirements
- Node.js 20 or later.
- @openai/codex-sdk, installed as a runtime dependency of this package.
- Codex authentication configured through the Codex CLI, IDE/app, or API-key auth.
Runtime Backend
ChatCodexSDK uses the TypeScript Codex SDK backend by default. An experimental App Server backend
is available for users who want to test the migration path toward Codex's richer client protocol:
const model = new ChatCodexSDK({
runtime: "app-server",
model: "gpt-5.4",
workingDirectory: process.cwd(),
});
const response = await model.invoke("Summarize this repo.");
await model.close();
The App Server backend speaks to a local codex app-server process over stdio. It is intended to
become the default after parity testing covers invocation, streaming, structured output, approval
handling, cancellation, and process cleanup. Until then, runtime: "sdk" remains the default.
When App Server asks the client to approve a command or file change, the adapter calls
appServerApprovalHandler. Without a handler, the default is to fail the turn clearly instead of
leaving the App Server process waiting:
const model = new ChatCodexSDK({
runtime: "app-server",
approvalPolicy: "on-request",
appServerApprovalHandler: async (request) => {
if (request.kind === "command") {
return "accept";
}
return "decline";
},
});
Set appServerDefaultApprovalDecision to "decline" or "cancel" when a host wants unattended
approval requests to resolve without throwing.
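For example, a host that runs unattended might construct the model like this (a minimal sketch using only constructor options documented below):
const unattendedModel = new ChatCodexSDK({
  runtime: "app-server",
  approvalPolicy: "on-request",
  // Resolve unhandled approval requests as declines instead of throwing.
  appServerDefaultApprovalDecision: "decline",
});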
Module Format
langchain-codex is ESM-only and supports Node.js 20 or later. Use import syntax from ESM
projects:
import { ChatCodexSDK } from "langchain-codex";
CommonJS output is not published.
Authentication
This package does not handle OAuth tokens directly. It relies on the local Codex runtime and whatever authentication that runtime supports.
For local development, authenticate Codex before using this adapter:
codex login
For hosted or CI usage, configure Codex API-key auth according to the Codex documentation. Do not commit Codex auth files or API keys.
Basic Usage
import { ChatCodexSDK } from "langchain-codex";
const model = new ChatCodexSDK({
model: "gpt-5.4",
workingDirectory: process.cwd(),
sandboxMode: "read-only",
});
const response = await model.invoke("Summarize this repository.");
console.log(response.content);
console.log(response.response_metadata.codex?.threadId);
Examples
The repository includes copyable TypeScript examples:
- examples/basic.ts invokes Codex as a LangChain chat model.
- examples/streaming.ts streams text, v1 content blocks, and streamEvents() custom Codex events.
- examples/structured-output.ts uses withStructuredOutput() with a Zod schema.
- examples/thread-resume.ts resumes a Codex thread across calls.
- examples/langgraph-agent.ts runs a LangGraph StateGraph with ToolNode and toolsCondition.
- examples/langgraph-react-agent.ts shows createReactAgent with experimental bindTools().
- examples/langgraph-thread-resume.ts stores Codex thread IDs in checkpointed LangGraph state.
Streaming
Codex streaming is event-based. When Codex emits updated agent_message items, this adapter diffs
the message text and yields LangChain AIMessageChunk values.
const stream = await model.stream("Review this repo and list the main risks.");
for await (const chunk of stream) {
process.stdout.write(
typeof chunk.content === "string" ? chunk.content : JSON.stringify(chunk.content),
);
}
If Codex emits only a completed message, the stream may contain one large text chunk rather than token-sized chunks.
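LangChain's generic streamEvents() API surfaces the same chunks as events; a minimal sketch using only standard LangChain event names:
const events = model.streamEvents("Review this repo.", { version: "v2" });
for await (const event of events) {
  // on_chat_model_stream carries each AIMessageChunk as it is emitted.
  if (event.event === "on_chat_model_stream") {
    const content = event.data.chunk?.content;
    process.stdout.write(typeof content === "string" ? content : "");
  }
}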
Structured Output
Codex supports native per-turn outputSchema. You can pass JSON Schema directly:
const response = await model.invoke("Summarize repository status.", {
outputSchema: {
type: "object",
properties: {
summary: { type: "string" },
status: { type: "string", enum: ["ok", "action_required"] },
},
required: ["summary", "status"],
additionalProperties: false,
},
});
For Zod schemas, use withStructuredOutput():
import { z } from "zod";
const structured = model.withStructuredOutput(
z.object({
summary: z.string(),
riskLevel: z.enum(["low", "medium", "high"]),
}),
);
const result = await structured.invoke("Review this repo.");
console.log(result.riskLevel);
withStructuredOutput() uses the same Codex outputSchema path. Omit method, or pass
{ method: "jsonSchema" } when code needs to be explicit. OpenAI-style structured-output modes do
not map to Codex here: method: "functionCalling", method: "jsonMode", and strict are rejected
with CodexUnsupportedFeatureError.
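For example, the explicit form is equivalent to the default:
// Explicit { method: "jsonSchema" }; identical to omitting method.
const explicit = model.withStructuredOutput(
  z.object({ summary: z.string() }),
  { method: "jsonSchema" },
);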
Use includeRaw when you need the original LangChain message:
const structured = model.withStructuredOutput(schema, { includeRaw: true });
const result = await structured.invoke("Review this repo.");
console.log(result.raw.response_metadata);
console.log(result.parsed);
Experimental Tool Calling
bindTools() is available as an experimental LangChain compatibility layer. The Codex local
runtimes do not expose LangChain-standard JavaScript tool registration, so this adapter uses Codex
outputSchema plus tool instructions to return LangChain AIMessage.tool_calls.
There are three distinct tool surfaces:
| Surface | Executed by | How it appears |
| --------------------------- | ----------------------------------------------------------- | --------------------------------------------------------------------------- |
| Codex runtime tools | Codex inside the local turn | Codex content blocks, response_metadata.codex.items, and stream events |
| LangChain client-side tools | Your LangChain or LangGraph app, usually through ToolNode | AIMessage.tool_calls followed by ToolMessage results |
| ChatCodexSDK.bindTools() | Prompt-mediated adapter layer | Experimental compatibility that asks Codex to emit LangChain tool-call JSON |
The profile.toolCalling and profile.toolChoice flags are true for the experimental
LangChain-compatible bindTools() path. They should not be read as native Codex SDK tool
registration support, and passing raw provider tools as call options is still rejected. The App Server
runtime also has its own dynamic-tool protocol surface; ChatCodexSDK does not wire that surface to
LangChain tools yet, and dynamic tool requests from App Server are rejected with a clear error.
By default, bound-tool responses are treated as an unreliable protocol boundary. ChatCodexSDK
asks Codex for a per-tool structured-output schema, validates the returned tool name and args, and
retries once in the same Codex thread if the response is malformed or does not match the selected
tool schema. Zod tool schemas are checked with the original safeParse logic so refinements still
apply, while JSON Schema/OpenAI-style tools are checked with Ajv without coercing values or inserting
defaults.
import { tool } from "@langchain/core/tools";
import { z } from "zod";
const multiply = tool(async ({ a, b }) => a * b, {
name: "multiply",
description: "Multiply two numbers.",
schema: z.object({
a: z.number(),
b: z.number(),
}),
});
const modelWithTools = model.bindTools([multiply]);
const response = await modelWithTools.invoke("What is 6 * 7?");
console.log(response.tool_calls);
LangChain or LangGraph can execute those tool calls and feed back ToolMessage results on the next
turn:
import { HumanMessage, ToolMessage } from "@langchain/core/messages";
const toolCall = response.tool_calls?.[0];
const final = await modelWithTools.invoke([
new HumanMessage("What is 6 * 7?"),
response,
new ToolMessage({
content: "42",
tool_call_id: toolCall?.id ?? "missing-tool-call-id",
name: "multiply",
}),
]);
Supported tool_choice values are "auto", "any", "none", a tool name string, and common
OpenAI-style function tool-choice objects. Streaming tool-call chunks are not native yet; streaming
with bound tools yields the completed tool-call message as a final chunk.
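For example, forcing the multiply tool from the snippet above by name:
// tool_choice as a tool name string forces that specific tool.
const forcedTools = model.bindTools([multiply], { tool_choice: "multiply" });
const forcedResponse = await forcedTools.invoke("Compute 6 * 7.");
console.log(forcedResponse.tool_calls?.[0]?.name); // "multiply"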
Strict validation is the default:
const modelWithTools = model.bindTools([multiply], {
tool_choice: "auto",
toolCallValidation: "strict",
toolCallRepairRetries: 1,
});
Use toolCallRepairRetries: 0 to disable repair, or raise it up to 3 for workflows that prefer
extra latency and Codex tokens over surfacing a validation error. Use toolCallValidation: "basic"
only as a compatibility escape hatch for older prompts that emit JSON-encoded string args; basic
mode still checks the tool-call shape, known tool names, and tool_choice, but it does not validate
args against each tool schema.
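A sketch of that escape hatch:
// Basic mode still checks the tool-call shape, known tool names, and
// tool_choice, but args are not validated against each tool schema.
const lenientTools = model.bindTools([multiply], {
  toolCallValidation: "basic",
});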
For best reliability, keep prompt-mediated tool sets small and give each tool a specific name,
description, and object schema. In LangGraph, treat AIMessage.tool_calls as a request for your
graph to authorize and execute client-side tools. Put policy checks, allowlists, tenant/user
authorization, and side-effect controls around the ToolNode or the node that dispatches tools;
Codex is only proposing a LangChain tool call.
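A minimal sketch of such a gate, assuming a static allowlist (ALLOWED_TOOLS and authorizeToolCalls are illustrative names; substitute your own authorization model):
import { AIMessage } from "@langchain/core/messages";

const ALLOWED_TOOLS = new Set(["multiply"]);

// Reject any proposed tool call that is not explicitly allowlisted before the
// graph routes the message to ToolNode.
function authorizeToolCalls(message: AIMessage): AIMessage {
  const blocked = (message.tool_calls ?? []).filter(
    (call) => !ALLOWED_TOOLS.has(call.name),
  );
  if (blocked.length > 0) {
    throw new Error(`Blocked tool calls: ${blocked.map((call) => call.name).join(", ")}`);
  }
  return message;
}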
LangGraph ToolNode
The experimental tool-call shape is compatible with LangGraph's ToolNode and toolsCondition
patterns:
import { END, MessagesAnnotation, START, StateGraph } from "@langchain/langgraph";
import { ToolNode, toolsCondition } from "@langchain/langgraph/prebuilt";
const modelWithTools = model.bindTools([multiply]);
const toolNode = new ToolNode([multiply]);
const graph = new StateGraph(MessagesAnnotation)
.addNode("agent", async (state) => {
const response = await modelWithTools.invoke(state.messages);
return { messages: [response] };
})
.addNode("tools", toolNode)
.addEdge(START, "agent")
.addConditionalEdges("agent", toolsCondition, ["tools", END])
.addEdge("tools", "agent")
.compile();
This uses LangGraph to execute client-side LangChain tools. It does not turn those tools into native Codex runtime tools or App Server dynamic tools.
Model Profile
ChatCodexSDK exposes a conservative LangChain model profile so dynamic LangChain and LangGraph
code can inspect supported capabilities:
console.log(model.profile);
// {
// structuredOutput: true,
// imageInputs: true,
// imageUrlInputs: false,
// reasoningOutput: true,
// toolCalling: true,
// toolChoice: true,
// ...
// }
The tool flags mean ChatCodexSDK.bindTools() can produce LangChain-standard tool calls for
ToolNode/toolsCondition flows. They do not mean Codex has provider-native JavaScript tool
registration; Codex's native runtime activity remains visible through content blocks, metadata, and
stream events.
Thread Resume
By default, each call starts a new Codex thread. This keeps .batch() behavior predictable and close
to other LangChain chat models.
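For example, batching independent prompts with LangChain's standard batch() API (a sketch):
import { getCodexThreadId } from "langchain-codex";

// Each input runs in its own fresh Codex thread.
const results = await model.batch([
  "Summarize the README.",
  "List the npm scripts.",
]);
for (const result of results) {
  console.log(getCodexThreadId(result)); // distinct thread IDs
}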
To continue a Codex thread, pass the returned thread ID into a later call. The
getCodexThreadId() helper avoids metadata casts:
import { getCodexThreadId } from "langchain-codex";
const first = await model.invoke("Inspect this repository.");
const threadId = getCodexThreadId(first);
const second = await model.invoke(
"Continue with a concise risk summary.",
threadId === undefined ? undefined : { threadId },
);
LangGraph Thread State
For LangGraph, store the Codex threadId in graph state or checkpointed state, then pass it back as
the next model call's threadId. The same pattern works with runtime: "sdk" and
runtime: "app-server":
import { HumanMessage, type BaseMessage } from "@langchain/core/messages";
import {
Annotation,
END,
MemorySaver,
START,
StateGraph,
messagesStateReducer,
} from "@langchain/langgraph";
import { ChatCodexSDK, getCodexThreadId } from "langchain-codex";
const CodexGraphState = Annotation.Root({
messages: Annotation<BaseMessage[]>({
reducer: messagesStateReducer,
default: () => [],
}),
codexThreadId: Annotation<string | undefined>(),
});
const model = new ChatCodexSDK({
runtime: "app-server",
workingDirectory: process.cwd(),
sandboxMode: "read-only",
});
const graph = new StateGraph(CodexGraphState)
.addNode("codex", async (state) => {
const inputMessages =
state.codexThreadId === undefined ? state.messages : getPendingCodexMessages(state.messages);
const response = await model.invoke(
inputMessages,
state.codexThreadId === undefined ? undefined : { threadId: state.codexThreadId },
);
return {
messages: [response],
codexThreadId: getCodexThreadId(response) ?? state.codexThreadId,
};
})
.addEdge(START, "codex")
.addEdge("codex", END)
.compile({ checkpointer: new MemorySaver() });
const config = { configurable: { thread_id: "langgraph-thread" } };
await graph.invoke({ messages: [new HumanMessage("Inspect this repo.")] }, config);
await graph.invoke({ messages: [new HumanMessage("Continue the review.")] }, config);
await model.close();
function getPendingCodexMessages(messages: BaseMessage[]): BaseMessage[] {
for (let index = messages.length - 1; index >= 0; index -= 1) {
if (messages[index]?.getType() === "ai") {
return messages.slice(index + 1);
}
}
return messages;
}
The LangGraph thread_id and Codex threadId are different identifiers. LangGraph uses
thread_id to load checkpointed graph state; ChatCodexSDK uses threadId to resume the persisted
Codex session.
messagesStateReducer stores cumulative graph message history. Once a Codex thread is resumed, pass
only the new pending graph messages to Codex; the previous transcript is already present in the Codex
thread.
Thread ownership rules:
- Stateless ChatCodexSDK calls start new Codex threads by default.
- Passing threadId resumes that Codex thread for the current call.
- A model constructed with a default threadId sets maxConcurrency: 1 unless you override it.
- Do not pass LangGraph's checkpoint thread_id as Codex threadId; persist the Codex ID returned in response_metadata.codex.threadId.
- Branching graph paths should not mutate the same Codex thread concurrently.
Working Directory and Sandbox
Codex runs against a local working directory. This adapter defaults to safer settings:
const model = new ChatCodexSDK({
workingDirectory: "/path/to/repo",
sandboxMode: "read-only",
networkAccessEnabled: false,
});
Use workspace-write only when you want Codex to edit files:
const model = new ChatCodexSDK({
workingDirectory: "/path/to/repo",
sandboxMode: "workspace-write",
approvalPolicy: "on-request",
modelReasoningEffort: "medium",
});
Codex requires a git repository by default. For temporary or generated directories, pass
skipGitRepoCheck: true intentionally.
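For example, pointing Codex at a generated scratch directory (a sketch; the directory layout is illustrative):
import { mkdtempSync } from "node:fs";
import { tmpdir } from "node:os";
import { join } from "node:path";

const scratchDir = mkdtempSync(join(tmpdir(), "codex-scratch-"));
const scratchModel = new ChatCodexSDK({
  workingDirectory: scratchDir,
  // Opt out of the git check deliberately for throwaway directories.
  skipGitRepoCheck: true,
  sandboxMode: "read-only",
});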
Use modelReasoningEffort to set Codex's reasoning effort explicitly. The adapter passes it to the
Codex thread and exposes it through LangChain invocation and identifying parameters for tracing.
The examples use "low" for faster smoke runs; use "medium" or higher for deeper repository
analysis.
Constructor Options
type ChatCodexSDKFields = {
model?: string;
workingDirectory?: string;
skipGitRepoCheck?: boolean;
sandboxMode?: "read-only" | "workspace-write" | "danger-full-access";
approvalPolicy?: "never" | "on-request" | "on-failure" | "untrusted";
modelReasoningEffort?: "minimal" | "low" | "medium" | "high" | "xhigh";
networkAccessEnabled?: boolean;
webSearchMode?: "disabled" | "cached" | "live";
additionalDirectories?: string[];
threadId?: string;
env?: Record<string, string>;
baseUrl?: string;
apiKey?: string;
codexPathOverride?: string;
codexConfig?: Record<string, unknown>;
appServerApprovalHandler?: (
request: CodexAppServerApprovalRequest,
) =>
| "accept"
| "acceptForSession"
| "decline"
| "cancel"
| Promise<"accept" | "acceptForSession" | "decline" | "cancel">;
appServerDefaultApprovalDecision?: "decline" | "cancel" | "throw";
timeoutMs?: number;
maxConcurrency?: number;
};
Unsupported Features
Stop sequences are rejected because Codex runs through the local agent runtime rather than a plain text-completion endpoint.
Raw provider tools passed as call options are rejected. Bind tools through model.bindTools(tools) so the
experimental adapter can build the Codex prompt and output schema.
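A sketch of handling the rejection, assuming the stop-sequence rejection also surfaces as the CodexUnsupportedFeatureError exported by langchain-codex:
import { ChatCodexSDK, CodexUnsupportedFeatureError } from "langchain-codex";

const model = new ChatCodexSDK({ workingDirectory: process.cwd() });
try {
  // Stop sequences are a standard LangChain call option, but Codex rejects them.
  await model.invoke("Summarize this repo.", { stop: ["\n\n"] });
} catch (error) {
  if (error instanceof CodexUnsupportedFeatureError) {
    console.error("Unsupported feature:", error.message);
  } else {
    throw error;
  }
}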
Troubleshooting
Missing or expired auth:
Codex authentication is required. This package does not handle OAuth tokens directly.
Run codex login locally, or configure Codex API-key auth in hosted environments.
Missing Codex runtime:
Could not start the local Codex runtime.
Ensure Codex is installed and available on PATH, or pass codexPathOverride.
Non-git working directory:
Codex requires a git working directory by default.
Use a git repository, or pass skipGitRepoCheck: true when intentional.
Development
npm install
npm run lint
npm run typecheck
npm test
npm run build
Integration tests are opt-in because they require local Codex auth:
RUN_CODEX_INTEGRATION_TESTS=1 npm run test:integration
The experimental App Server runtime has a separate opt-in suite:
CODEX_APP_SERVER_INTEGRATION=1 npm run test:integration:app-server
Set CODEX_APP_SERVER_INTEGRATION_MODEL or CODEX_INTEGRATION_MODEL to override the default
model, and CODEX_APP_SERVER_INTEGRATION_TIMEOUT_MS for slower local runs.
Version Compatibility
This package targets:
- @langchain/core 1.x
- @openai/codex-sdk 0.125.x
- Node.js 20 or later
The Codex SDK is moving quickly. This package pins a conservative dependency range and wraps the SDK behind a small adapter surface.
License
MIT
