@monetisebg/circuit-breaker
v0.1.0
Published
Circuit breaker for AI agents — pick budget-guard or loop-killer mode and stop runaway token spend or stuck agents in one wrapper. Adapters for LangChain, OpenAI Agents SDK, and the Claude Agent SDK.
Maintainers
Readme
@monetisebg/circuit-breaker
One wrapper between you and runaway execution.
Minimal circuit breaker for AI agents. Wrap any supported agent and pick a mode — the breaker cuts the run short once provider-reported usage crosses a limit, and (optionally) refuses an oversized prompt before it is even sent.
- Zero-config: defaults work out of the box.
- Two modes, pick one:
budget-guard(token caps) andloop-killer(state-repeat detection). - Post-hoc enforcement by default: token tripping happens after each call
or turn boundary, so the call that crosses the limit still counts. Use the
optional
estimateInputTokenspreflight (see below) to reject oversized initial inputs before any provider work happens. - Visible: emits
CircuitBreakerEvents as the run progresses. - Typed: throws a
CircuitBreakerError, or routes through youronTriphandler. - Optional peer dependencies — only install the framework you actually use.
- No bundled tokenizer: bring your own (
js-tiktoken,tiktoken, provider SDK).
Shipped adapters: LangChain.js, OpenAI Agents SDK, Claude Agent SDK. The core is framework-agnostic; rolling your own adapter is a few lines.
Install
Requires Node ≥ 22 (the breaker uses node:crypto).
npm install @monetisebg/circuit-breaker
# plus the framework you use (minimum versions enforced via peerDependencies):
npm install @langchain/core@^1.1.47 # for the LangChain adapter
npm install @openai/agents@^0.11.0 # for the OpenAI Agents adapter
npm install @anthropic-ai/claude-agent-sdk@^0.2 # for the Claude Agent SDK adapterQuick start (budget-guard, the default)
import { withCircuitBreaker } from "@monetisebg/circuit-breaker/openai-agents";
const safeAgent = withCircuitBreaker(agent); // defaults: 10k input + 10k output
await safeAgent.run("Analyze this dataset");budget-guard caps input and output tokens independently. Default limits:
maxInputToken = 10_000, maxOutputToken = 10_000. Token usage is read from
each provider response, so the breaker trips on the next call/turn after
either bucket is exceeded — the call that pushed the bucket over the limit
still counts. To reject an oversized first prompt before it is sent, pass an
optional estimateInputTokens preflight (next section).
withCircuitBreaker(agent, {
mode: "budget-guard", // optional — this is the default
maxInputToken: 50_000,
maxOutputToken: 20_000,
});Preflight — estimateInputTokens
import { encoding_for_model } from "js-tiktoken";
const enc = encoding_for_model("gpt-4o");
withCircuitBreaker(agent, {
maxInputToken: 50_000,
// input is the wrapper's call argument (typed per adapter)
estimateInputTokens: (input) =>
typeof input === "string" ? enc.encode(input).length : undefined,
});If the estimate exceeds maxInputToken the wrapper throws
CircuitBreakerError with reason: "max_input_tokens" before the
underlying runnable / runner / query is called. Return undefined to skip
the check for that invocation (e.g. when you can't tokenize the input shape).
This is opt-in — without an estimator the wrapper behaves as before. No
tokenizer is bundled.
loop-killer mode
withCircuitBreaker(agent, {
mode: "loop-killer",
maxRetries: 3, // default
detectRepeatedState: true,// default — hashes each step's state
});With detectRepeatedState: true (default), the breaker hashes each step's
state (the latest message / turn input) and trips when any single state
recurs more than maxRetries times. Set detectRepeatedState: false to fall
back to a plain iteration cap.
Visibility — onEvent
The breaker emits events you can log, surface in your UI, or pipe to your observability stack.
withCircuitBreaker(agent, {
mode: "loop-killer",
maxRetries: 2,
onEvent(event) {
// event: CircuitBreakerEvent
console.log(event);
},
});CircuitBreakerEvent shapes:
| Event | When | Modes |
| ----------------------------------------------------- | ------------------------------------------------- | ------------ |
| { type: "retry"; retries: number } | A state recurred (detectRepeatedState: true) or each iteration past the first (detectRepeatedState: false) | loop-killer |
| { type: "stop"; reason: StopReason; saved: number } | The breaker tripped | both |
saved is signed limit - usage: positive means headroom that won't be
spent, negative means the call that pushed us over the limit still counted.
StopReason is one of "max_input_tokens" | "max_output_tokens" |
"max_retries" | "repeated_state".
Graceful handling — onTrip
Provide onTrip to suppress the throw and return a fallback value:
const safe = withCircuitBreaker(agent, {
maxInputToken: 50_000,
maxOutputToken: 20_000,
onTrip: (ctx) => ({
output: "Sorry, I had to stop early.",
reason: ctx.reason,
metrics: ctx.metrics,
}),
});onTrip receives a TripContext:
interface TripContext {
reason: StopReason;
mode: Mode; // "budget-guard" | "loop-killer"
metrics: { iterations: number; retries: number; tokens: {...} };
limits: ResolvedLimits; // the limits actually in force
saved: number;
message: string;
}LangChain.js
import { ChatOpenAI } from "@langchain/openai";
import { AgentExecutor, createOpenAIFunctionsAgent } from "langchain/agents";
import { withCircuitBreaker } from "@monetisebg/circuit-breaker/langchain";
const agent = await createOpenAIFunctionsAgent({ llm, tools, prompt });
const executor = new AgentExecutor({ agent, tools });
const safeExecutor = withCircuitBreaker(executor, {
maxInputToken: 50_000,
maxOutputToken: 20_000,
});
await safeExecutor.invoke({ input: "..." });Iterations are counted on handleLLMStart / handleChatModelStart. Token
usage is read from handleLLMEnd with provider-agnostic extraction
(OpenAI tokenUsage, Anthropic usage, newer usage_metadata).
OpenAI Agents SDK
import { Agent } from "@openai/agents";
import { withCircuitBreaker } from "@monetisebg/circuit-breaker/openai-agents";
const agent = new Agent({ name: "Assistant", instructions: "...", tools });
const safeAgent = withCircuitBreaker(agent, {
mode: "loop-killer",
maxRetries: 3,
});
await safeAgent.run("Hello");Iterations are counted on each agent_start event (one per turn); the most
recent turnInput item is hashed for loop detection. Tokens are read live
from RunContext.usage on each turn boundary. When a limit is hit the
wrapper aborts the in-flight run via AbortSignal; any caller-supplied
signal is chained, so external cancellation still works.
Streaming (
stream: true) is not yet supported. Open an issue if you need it.
Claude Agent SDK
import { query } from "@anthropic-ai/claude-agent-sdk";
import { withCircuitBreaker } from "@monetisebg/circuit-breaker/claude-agent-sdk";
const safeQuery = withCircuitBreaker(query, {
maxInputToken: 50_000,
maxOutputToken: 20_000,
});
for await (const message of safeQuery({ prompt: "Analyze this repo" })) {
// messages stream through untouched
}The wrapper takes the SDK's query function and returns a drop-in
replacement with the same call signature. It's itself an async generator —
SDKMessages stream through unchanged while the breaker watches them.
Iterations are counted on each assistant message (one per turn); its
content blocks are hashed for loop-killer detection. Tokens are read from
each assistant message's usage (input counts input_tokens plus cache
read/creation tokens). When a limit is hit the wrapper aborts the in-flight
query via the SDK's abortController option; any abortController you pass
in options is chained, so external cancellation still works.
With onTrip, the callback's return value is yielded as the generator's
final item instead of throwing.
Trip output
When a limit is reached the wrapper logs and throws:
[circuit-breaker] Agent stopped: input token budget exceeded (10_120/10_000; iterations: 8).Pass silent: true to suppress the log, or logger: (msg, ctx) => … to send
it elsewhere.
Options reference
| Field | Mode | Type | Default | Description |
| ---------------------- | ------------ | ----------- | ---------- | ---------------------------------------------------------------------------- |
| mode | both | Mode | "budget-guard" | "budget-guard" or "loop-killer". |
| maxInputToken | budget-guard | int ≥ 1 | 10_000 | Max aggregate input tokens before trip (post-hoc). |
| maxOutputToken | budget-guard | int ≥ 1 | 10_000 | Max aggregate output tokens before trip (post-hoc). |
| estimateInputTokens | budget-guard | (input) => number \| undefined | — | Preflight estimator; trips before the call when the estimate exceeds maxInputToken. |
| maxRetries | loop-killer | int ≥ 1 | 3 | Max times the same state may recur (or, with detection off, raw iterations). |
| detectRepeatedState | loop-killer | boolean | true | Hash each step's state for loop detection. |
| onEvent | both | EventListener | — | Receives CircuitBreakerEvent updates. |
| onTrip | wrappers | OnTrip<R> | — | Suppress the throw and use the callback's return value instead. |
All numeric options are validated at construction; passing 0, a negative,
NaN, Infinity, or a non-integer throws a TypeError.
Contributing
🤝 Our Philosophy & How You Can Help
We built Circuit Breaker to solve the immediate, visceral pain of runaway agent costs and infinite loops. However, we know that every execution environment is unique, and we do not have all the answers.
Right now, we are intentionally keeping the API minimal with core modes like budget-guard and loop-killer. We believe that the best systems are discovered through real user friction, not designed in a vacuum. Because of this, our roadmap is entirely driven by how you use — or fight — this tool in the wild.
We actively want to hear from you, especially if:
- It almost fits: Our default modes are 80% right for you, but you need one specific tweak or condition to make it perfect.
- You are building workarounds: You find yourself writing custom scripts or wrapping our API to force it to do what you need.
- You have diverging use cases: Your industry requires vastly different behavior (e.g., ultra-strict trading apps vs. loose research agents) and our defaults are breaking.
When you stop asking "what does this do?" and start asking "can I change how it works?", that is our signal to unlock more programmable control for the community.
Please open an issue, share your GitHub gists, or reach out to us directly. Your edge cases are our roadmap!
See AGENTS.md for the project layout, build/test commands, and the recipe for adding a new framework adapter.
License
Apache-2.0 — © 2026 MonetiseBG
