agent-endpoint-doctor

v0.1.0

Published

12 days ago

Local-first compatibility tester for OpenAI-compatible endpoints used by AI coding agents.

0High
0Medium
0Low

gowrav_m

ai-agents openai-compatible llm cursor continue opencode tool-calling streaming developer-tools diagnostics

agent-endpoint-doctor

Local compatibility evidence for OpenAI-compatible endpoints used by AI coding agents.

Many providers say "OpenAI-compatible." Coding agents need more than that. They need chat, streaming, tool calls, streaming tool-call deltas, JSON mode, /responses, embeddings, and client-readable response shapes.

agent-endpoint-doctor answers one question:

Can I safely point my coding agent at this endpoint?

npx agent-endpoint-doctor demo

The demo is offline and writes reports under .endpoint-doctor/.

Visual Map

flowchart LR
  A["OpenAI-compatible endpoint"] --> B["agent-endpoint-doctor"]
  B --> C["Capability probes"]
  C --> C1["models"]
  C --> C2["chat"]
  C --> C3["streaming"]
  C --> C4["tool_calls"]
  C --> C5["streaming tool calls"]
  C --> C6["JSON mode"]
  C --> C7["responses"]
  C --> C8["embeddings"]
  B --> D["Agent readiness score"]
  B --> E["Markdown / HTML / JSON report"]
  B --> F["Continue / LiteLLM / OpenCode snippets"]

Agent Trust Suite

flowchart LR
  A["agent-endpoint-doctor"] --> F["agent-trust-center"]
  B["nim-doctor"] --> F
  C["agent-cognicheck"] --> F
  D["agent-skillguard"] --> F
  E["agentops-watchtower"] --> F
  F --> G["one trust report"]
  F --> H["CI gate"]

agent-endpoint-doctor contributes OpenAI-compatible endpoint evidence to Agent Trust Center through npx agent-endpoint-doctor evidence.

Why This Exists

flowchart TD
  A["/v1/models works"] --> B{"Can a coding agent use it?"}
  B -->|Maybe not| C["chat can fail"]
  B -->|Maybe not| D["streaming can hang"]
  B -->|Maybe not| E["tools can return plain text instead of tool_calls"]
  B -->|Maybe not| F["streaming tool deltas can be unparsable"]
  B -->|Maybe not| G["JSON mode can return invalid JSON"]
  B -->|Maybe not| H["/responses may be unsupported"]
  C --> I["Run endpoint doctor"]
  D --> I
  E --> I
  F --> I
  G --> I
  H --> I

What It Tests

| Capability | Why it matters | | --- | --- | | /v1/models | Client can discover or validate model IDs | | /v1/chat/completions | Basic agent conversation path | | streaming | IDEs and agents often stream partial output | | tool_calls | Coding agents need structured tool use | | streaming tool calls | Tool-heavy agent workflows often fail here | | JSON mode | Evals, config generation, and automation need parseable output | | /v1/responses | Some modern clients use the Responses API | | /v1/embeddings | Retrieval and code search workflows need vectors |

Quick Start

# Offline proof
npx agent-endpoint-doctor demo

# Test a local or hosted OpenAI-compatible endpoint
npx agent-endpoint-doctor test \
  --base-url http://localhost:1234/v1 \
  --model qwen3-coder

# Print saved matrix
npx agent-endpoint-doctor matrix

# Generate Continue/LiteLLM/OpenCode snippets
npx agent-endpoint-doctor config \
  --base-url http://localhost:1234/v1 \
  --model qwen3-coder

# Render reports
npx agent-endpoint-doctor report

PowerShell with a provider key:

$env:OPENAI_COMPATIBLE_API_KEY="sk-or-provider-key"
cmd /c npx -y agent-endpoint-doctor test --base-url https://example.com/v1 --model model-id

Example Matrix

offline-localhost-demo
----------------------
Decision: REVIEW | Agent readiness: 75/100
[PASS] models: Endpoint lists models.
[PASS] chat: Endpoint accepted a chat completion request.
[PASS] streaming: Endpoint returned SSE-style streaming data.
[PASS] tools: Endpoint returned OpenAI-style message.tool_calls.
[FAIL] streaming_tools: Endpoint did not stream recognizable tool-call deltas.
[PASS] json_mode: Endpoint returned parseable JSON content.
[FAIL] responses: Endpoint does not implement /responses.
[FAIL] embeddings: Chat model does not expose /embeddings.

Architecture

flowchart TB
  CLI["src/cli.ts"] --> Client["core/endpoint.ts"]
  CLI --> Matrix["core/matrix.ts"]
  CLI --> Report["report/render.ts"]
  CLI --> Configs["config snippets"]

  Client --> Endpoint["Any OpenAI-compatible /v1 endpoint"]
  Matrix --> Score["agentReadinessScore"]
  Matrix --> Decision["ready / review / blocked"]
  Report --> Evidence["JSON / Markdown / HTML"]
  Configs --> Tools["Continue / LiteLLM / OpenCode"]

Output Files

.endpoint-doctor/
  configs/
    continue.yaml
    litellm.yaml
    opencode.json
  reports/
    compatibility-matrix.json
    endpoint-doctor-report.json
    endpoint-doctor-report.md
    endpoint-doctor-report.html

Not A Router

This project does not proxy traffic or route requests. LiteLLM, OpenRouter-style gateways, provider SDKs, and local runtimes already handle routing.

agent-endpoint-doctor is the preflight check before you trust an endpoint inside an agent workflow.

Research notes: docs/research.md

Safety

Do not send private source code, customer data, secrets, medical data, financial data, or regulated data to a provider unless your organization and the provider terms allow it. The test prompts are intentionally tiny, but the endpoint still receives requests during live tests.

Development

npm install
npm run typecheck
npm test
npm run lint
npm run build
node dist/cli.js demo

License

MIT

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

agent-endpoint-doctor

Visual Map

Agent Trust Suite

Why This Exists

What It Tests

Quick Start

Example Matrix

Architecture

Output Files

Not A Router

Safety

Development

License