@yzj01/llm-router

v1.1.0

Published

a month ago

LLM Router local routing proxy for OpenAI-compatible Chat Completions APIs.

0High
0Medium
0Low

yzj01

deepseek llm-router local-proxy openai-compatible model-routing

LLM Router

LLM Router is a small local routing proxy for OpenAI-compatible Chat Completions APIs.

Public Model Semantics

LLM Router currently accepts exactly one client-facing request model ID: auto.

config.publicModels can still define aliases such as flash and pro, but those aliases are internal routing vocabulary. They are used by routing.tiers.*, response headers, traces, and diagnostics — not as public request IDs.

The sample config.example.json publishes:

flash
pro

These example aliases are not protocol constants. You can rename them, remove them, or add more aliases as long as auto remains the router entry and routing.tiers.*.publicModel points to alias entries.

auto is routed locally with a Flash-first policy. Simple summaries, short text, ordinary Q&A, lightweight code edits, simple agentic work, and routine structured output usually land on flash. Complex reasoning and natural multi-file debugging or repair workflows usually land on pro.

Clients, OpenClaw, and SDK integrations should always send auto. Routed aliases such as flash and pro appear in x-xy-router-model, traces, and internal configuration only.

Routing Thresholds

Scoring thresholds are internal launch constants and are not part of the public config surface. Configure routing.tiers, structuredOutputMinTier, and ambiguousDefaultTier to control model selection behavior.

Install

npm install
npm run build

Documentation

Reference Project

For related routing and OpenClaw ecosystem context, see BlockRunAI/ClawRouter.

OpenClaw Compatibility

For OpenClaw v2026.4.11 and v2026.3.24, LLM Router does not register a provider and does not declare providers in openclaw.plugin.json.

When loaded by OpenClaw, the plugin does not write or repair models.providers.xiaoyiprovider. It only starts the local Router proxy. OpenClaw provider/model configuration is maintained externally.

If models.providers.xiaoyiprovider already exists, existing apiKey, api_key, headers, and request.headers are read at service start and passed to the local proxy runtime.

CLI Usage

# 启动代理（必须提供配置文件）
llm-router --config config.json

# 自定义端口
llm-router --config config.json --port 9000

# Override API key
llm-router --config config.json --api-key sk-your-key

注意：v0.2.0 起不再支持纯环境变量启动。所有基础配置必须通过 --config 指定，运行时覆写项再通过 CLI 参数传入。

OpenClaw Plugin Usage

默认配置文件名为 .openclaw/llm-router-config.json，位于当前用户目录下。 OpenClaw 插件会自动读取该文件；不要在 openclaw.json 中写入完整 RawConfig 或配置路径。

# 查看外部维护的 provider 配置
openclaw config get models.providers.xiaoyiprovider

If the default plugin config file is missing or invalid, OpenClaw plugin mode falls back to a built-in default config so the local router can still start. The fallback config does not include a real apiKey.

When fallback config is used, the plugin adds the built-in default header x-request-from: openclaw. pluginConfig.defaultHeaders may add or override headers at this fallback-only layer; it is ignored when the default plugin config file is valid.

The plugin may optionally read the configured .xiaoyienv file as a fallback supplement:

SERVICE_URL=https://api.deepseek.com
PERSONAL-API-KEY=sk-example
PERSONAL-UID=123456

SERVICE_URL supplements the fallback upstream URL. Other keys are only sent as headers when mapped by pluginConfig.xiaoyiEnv.headerMap; the default map is { "PERSONAL-API-KEY": "x-api-key", "PERSONAL-UID": "x-uid" }.

OpenClaw 应始终向本地 Router 请求 auto，再由 Router 在内部决定最终落到哪个 alias / physical model。

API

Implemented:

GET  /health
POST /v1/chat/completions

Not implemented:

GET /v1/models

Unsupported model IDs return HTTP 400. Supported request IDs are currently only auto. Configured aliases remain internal routing outputs and are surfaced via response headers, not accepted in request bodies.

Runtime Protections

The local proxy applies conservative runtime limits by default. These limits are internal defaults rather than public config fields:

| Limit | Default | Behavior | | ---------------------- | ------: | ----------------------------------------------------------------------------- | | Request body size | 10 MB | Returns 413 Payload Too Large before forwarding upstream. | | Request body read time | 30 s | Returns 408 Request Timeout if the client does not finish sending the body. | | Upstream request time | 300 s | Aborts the upstream request and returns 504 Gateway Timeout. |

If the client disconnects before the upstream call completes, the proxy aborts the upstream request to avoid keeping stale work running.

Response Headers

The proxy adds routing headers:

x-xy-router-model
x-xy-router-actual-model
x-xy-router-tier
x-xy-router-trace
x-xy-router-routed
x-xy-router-fallback
x-xy-router-upstream

x-xy-router-model is the routed alias chosen inside the router. x-xy-router-actual-model is the physical upstream model ID used in the forwarded request.

Phase 2 Candidate: Response Cache

Response caching is intentionally not included in v0.2.0.

It is reserved as a future opt-in cost optimization for non-streaming, deterministic requests. A cache key must include at least model, messages, tools, temperature, max_tokens, and baseUrl, and it must distinguish between routed aliases and the resolved physical upstream model.

License

This project is licensed under the MIT License. See LICENSE.