@mozilla-ai/any-llm

v0.1.0

Published

a day ago

TypeScript SDK for the any-llm gateway

llm ai gateway openai any-llm

any-llm (TypeScript)

TypeScript client for any-llm-gateway. Communicate with any LLM provider through the gateway using a single, typed interface.

Python SDK | Documentation | Platform (Beta)

Quickstart

import { GatewayClient } from "@mozilla-ai/any-llm";

const client = new GatewayClient({
  apiBase: "http://localhost:8000",
  platformToken: "your-token-here",
});

const response = await client.completion({
  model: "openai:gpt-4o-mini",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(response.choices[0].message.content);

That's it! Change the model string to switch between LLM providers through the gateway.
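The model strings in this README follow a `provider:model` pattern such as `openai:gpt-4o-mini`. As a rough sketch, a small helper can validate that shape before a request is sent. `splitModelId` and `ModelRef` are hypothetical names, not part of the SDK, and the sketch assumes the first colon separates the provider from the model name.

```typescript
// Hypothetical helper, not part of the SDK: splits a "provider:model" string.
// Assumption: the first colon separates the provider from the model name.
interface ModelRef {
  provider: string;
  model: string;
}

function splitModelId(id: string): ModelRef {
  const idx = id.indexOf(":");
  // Reject strings with no colon, an empty provider, or an empty model name.
  if (idx <= 0 || idx === id.length - 1) {
    throw new Error(`Expected "provider:model", got "${id}"`);
  }
  return { provider: id.slice(0, idx), model: id.slice(idx + 1) };
}
```

Calling `splitModelId("openai:gpt-4o-mini")` yields the provider `openai` and the model `gpt-4o-mini`; a string without a provider prefix throws early instead of failing at the gateway.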

Installation

Requirements

Node.js 18 or newer
A running any-llm-gateway instance

Install

npm install @mozilla-ai/any-llm

Setting Up Credentials

Set environment variables for your gateway:

export GATEWAY_API_BASE="http://localhost:8000"
export GATEWAY_PLATFORM_TOKEN="your-token-here"
# or for non-platform mode:
export GATEWAY_API_KEY="your-key-here"

Alternatively, pass credentials directly when creating the client (see Usage examples).

any-llm-gateway

This TypeScript SDK is a client for any-llm-gateway, an optional FastAPI-based proxy server that adds enterprise-grade features on top of the core library:

Budget Management - Enforce spending limits with automatic daily, weekly, or monthly resets
API Key Management - Issue, revoke, and monitor virtual API keys without exposing provider credentials
Usage Analytics - Track every request with full token counts, costs, and metadata
Multi-tenant Support - Manage access and budgets across users and teams

The gateway sits between your applications and LLM providers, exposing an OpenAI-compatible API that works with any supported provider.
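Because the gateway exposes an OpenAI-compatible API, it can in principle be called without the SDK. The sketch below only builds a request object and does not send it. The path `/v1/chat/completions` is an assumption based on the OpenAI convention, and `buildChatCompletionRequest`, `ChatMessage`, and `PreparedRequest` are hypothetical names used for illustration.

```typescript
// Sketch only: builds (but does not send) a chat completion request for the gateway.
// Assumption: the gateway serves the OpenAI-style path /v1/chat/completions.
interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

interface PreparedRequest {
  url: string;
  method: "POST";
  headers: Record<string, string>;
  body: string;
}

function buildChatCompletionRequest(
  apiBase: string,
  authHeaders: Record<string, string>,
  model: string,
  messages: ChatMessage[],
): PreparedRequest {
  // Trim trailing slashes so the joined URL has exactly one separator.
  const base = apiBase.replace(/\/+$/, "");
  return {
    url: `${base}/v1/chat/completions`,
    method: "POST",
    headers: { "Content-Type": "application/json", ...authHeaders },
    body: JSON.stringify({ model, messages }),
  };
}
```

The result can be handed to the `fetch` built into Node.js 18 and newer, for example `fetch(req.url, { method: req.method, headers: req.headers, body: req.body })`.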

Gateway Quick Start

docker run \
  -e GATEWAY_MASTER_KEY="your-secure-master-key" \
  -e OPENAI_API_KEY="your-api-key" \
  -p 8000:8000 \
  ghcr.io/mozilla-ai/any-llm/gateway:latest

Note: You can use a specific release version instead of latest (e.g., 1.2.0). See available versions.

Managed Platform (Beta)

Prefer a hosted experience? The any-llm platform provides a managed control plane for keys, usage tracking, and cost visibility across providers, while still building on the same any-llm interfaces.

Usage

Authentication Modes

The client supports two authentication modes, matching the Python SDK:

Platform Mode (Recommended)

Uses a Bearer token in the standard Authorization header:

const client = new GatewayClient({
  apiBase: "http://localhost:8000",
  platformToken: "tk_your_platform_token",
});

Non-Platform Mode

Sends the API key via a custom X-AnyLLM-Key header:

const client = new GatewayClient({
  apiBase: "http://localhost:8000",
  apiKey: "your-api-key",
});
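To make the difference between the two modes concrete, the following sketch shows the headers each mode would produce. `buildAuthHeaders` and `GatewayCredentials` are hypothetical names, and the SDK's internals may differ. Two assumptions are made: a platform token wins when both credentials are supplied, and the API key is sent as-is, since the exact header value format is not stated here.

```typescript
// Hypothetical helper illustrating the two modes; the SDK's internals may differ.
interface GatewayCredentials {
  platformToken?: string;
  apiKey?: string;
}

function buildAuthHeaders(creds: GatewayCredentials): Record<string, string> {
  if (creds.platformToken) {
    // Platform mode: Bearer token in the standard Authorization header.
    // Assumption: the platform token takes priority if both are supplied.
    return { Authorization: `Bearer ${creds.platformToken}` };
  }
  if (creds.apiKey) {
    // Non-platform mode: key in the custom X-AnyLLM-Key header.
    // Assumption: the key is sent as-is; the exact value format is not stated here.
    return { "X-AnyLLM-Key": creds.apiKey };
  }
  throw new Error("Provide either platformToken or apiKey");
}
```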

Auto-Detection from Environment Variables

When no explicit credentials are provided, the client reads from environment variables:

// Uses GATEWAY_API_BASE, GATEWAY_PLATFORM_TOKEN, or GATEWAY_API_KEY
const client = new GatewayClient();
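One way to picture the fallback to environment variables is the sketch below. `resolveGatewayConfig` is a hypothetical function, not the SDK's implementation. It assumes that explicit options override environment variables, that a platform token takes priority over an API key, and that missing values should fail fast; the SDK's actual rules may differ.

```typescript
// Hypothetical resolution logic; the SDK's actual precedence rules may differ.
interface ResolvedGatewayConfig {
  apiBase: string;
  platformToken?: string;
  apiKey?: string;
}

type EnvVars = Record<string, string | undefined>;

function resolveGatewayConfig(
  env: EnvVars,
  explicit: Partial<ResolvedGatewayConfig> = {},
): ResolvedGatewayConfig {
  // Assumption: explicit options override environment variables.
  const apiBase = explicit.apiBase ?? env["GATEWAY_API_BASE"];
  if (!apiBase) {
    throw new Error("Set GATEWAY_API_BASE or pass apiBase explicitly");
  }
  const platformToken = explicit.platformToken ?? env["GATEWAY_PLATFORM_TOKEN"];
  const apiKey = explicit.apiKey ?? env["GATEWAY_API_KEY"];
  if (!platformToken && !apiKey) {
    throw new Error("Set GATEWAY_PLATFORM_TOKEN or GATEWAY_API_KEY");
  }
  // Assumption: a platform token takes priority when both credentials are present.
  return platformToken ? { apiBase, platformToken } : { apiBase, apiKey };
}
```

In a Node.js program this would be called as `resolveGatewayConfig(process.env)`. Taking the environment as a parameter keeps the function easy to test.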

Chat Completions

const response = await client.completion({
  model: "openai:gpt-4o-mini",
  messages: [{ role: "user", content: "Hello!" }],
});

console.log(response.choices[0].message.content);

Streaming

const stream = await client.completion({
  model: "openai:gpt-4o-mini",
  messages: [{ role: "user", content: "Tell me a story." }],
  stream: true,
});

for await (const chunk of stream) {
  const content = chunk.choices[0]?.delta?.content;
  if (content) process.stdout.write(content);
}
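When the full text is needed rather than incremental output, the same loop can accumulate the deltas. The sketch below is self-contained: `StreamChunk` declares only the fields read in the loop above, and `fakeStream` is a hypothetical stand-in for a real stream returned by `client.completion`.

```typescript
// Minimal chunk shape, covering only the fields read in the streaming loop.
interface StreamChunk {
  choices: Array<{ delta?: { content?: string | null } }>;
}

// Concatenates the text deltas of a stream into one string.
// Chunks without content (such as a final chunk) are skipped.
async function collectStreamText(stream: AsyncIterable<StreamChunk>): Promise<string> {
  let text = "";
  for await (const chunk of stream) {
    const content = chunk.choices[0]?.delta?.content;
    if (content) text += content;
  }
  return text;
}

// Hypothetical stand-in stream for local testing without a gateway.
async function* fakeStream(parts: string[]): AsyncGenerator<StreamChunk> {
  for (const part of parts) {
    yield { choices: [{ delta: { content: part } }] };
  }
  // Trailing chunk with an empty delta to exercise the skip path.
  yield { choices: [{ delta: {} }] };
}
```

With a real client, the value returned by `client.completion({ ..., stream: true })` would take the place of `fakeStream(...)`, assuming its chunks have the shape shown in the loop above.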

Responses API

const response = await client.response({
  model: "openai:gpt-4o-mini",
  input: "Summarize this in one sentence.",
});

console.log(response.output_text);

Embeddings

const result = await client.embedding({
  model: "openai:text-embedding-3-small",
  input: "Hello world",
});

console.log(result.data[0].embedding);
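A common next step with embeddings is comparing two vectors. The sketch below computes cosine similarity over plain number arrays such as `result.data[0].embedding`. `cosineSimilarity` is an illustrative helper written for this README, not something the SDK exports.

```typescript
// Cosine similarity between two embedding vectors of equal length.
// Returns a value in [-1, 1]; higher means the vectors point in more similar directions.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length === 0 || a.length !== b.length) {
    throw new Error("Vectors must be non-empty and of equal length");
  }
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    const x = a[i] ?? 0;
    const y = b[i] ?? 0;
    dot += x * y;
    normA += x * x;
    normB += y * y;
  }
  if (normA === 0 || normB === 0) {
    throw new Error("Cosine similarity is undefined for a zero vector");
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}
```

Both vectors should come from the same embedding model, since vectors from different models are generally not comparable.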

Listing Models

const models = await client.listModels();
for (const model of models) {
  console.log(model.id);
}

Error Handling

In platform mode, HTTP errors are mapped to typed exceptions:

import { GatewayClient, AuthenticationError, RateLimitError } from "@mozilla-ai/any-llm";

try {
  const response = await client.completion({
    model: "openai:gpt-4o-mini",
    messages: [{ role: "user", content: "Hello!" }],
  });
} catch (error) {
  if (error instanceof AuthenticationError) {
    console.error("Invalid credentials:", error.message);
  } else if (error instanceof RateLimitError) {
    console.error("Rate limited, retry after:", error.retryAfter);
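  } else {
    // Rethrow errors this handler does not recognize instead of swallowing them.
    throw error;
  }
}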
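Since `RateLimitError` carries a `retryAfter` value, one possible use is a small retry wrapper. The sketch below is an illustration, not an SDK feature. `withRateLimitRetry` is a hypothetical name, the local `RateLimitError` class is a stand-in so the block runs on its own, and the unit of `retryAfter` is assumed to be seconds, which this README does not confirm.

```typescript
// Local stand-in so the sketch runs on its own; in real code, import
// RateLimitError from "@mozilla-ai/any-llm" instead.
class RateLimitError extends Error {
  retryAfter?: number;
  constructor(message: string, retryAfter?: number) {
    super(message);
    this.name = "RateLimitError";
    this.retryAfter = retryAfter;
  }
}

// Retries a call when it fails with RateLimitError, up to maxAttempts in total.
async function withRateLimitRetry<T>(
  call: () => Promise<T>,
  maxAttempts = 3,
  sleep: (ms: number) => Promise<void> = (ms) =>
    new Promise<void>((resolve) => setTimeout(resolve, ms)),
): Promise<T> {
  for (let attempt = 1; ; attempt++) {
    try {
      return await call();
    } catch (error) {
      // Only rate limits are retried; every other error propagates unchanged.
      if (!(error instanceof RateLimitError) || attempt >= maxAttempts) throw error;
      // Assumption: retryAfter is in seconds. Without it, back off 1s, 2s, 4s, ...
      const seconds = error.retryAfter ?? 2 ** (attempt - 1);
      await sleep(seconds * 1000);
    }
  }
}
```

A call would then look like `await withRateLimitRetry(() => client.completion({ ... }))`. The `sleep` parameter exists so tests can replace the real timer.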

| HTTP Status | Error Class | Description |
|-------------|-------------|-------------|
| 401, 403 | AuthenticationError | Invalid or missing credentials |
| 402 | InsufficientFundsError | Budget or credits exhausted |
| 404 | ModelNotFoundError | Model not found or unavailable |
| 429 | RateLimitError | Rate limit exceeded (includes retryAfter) |
| 502 | UpstreamProviderError | Upstream provider unreachable |
| 504 | GatewayTimeoutError | Gateway timed out waiting for provider |
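The table above can also be written as a lookup, which may help when logging or when calling the gateway without the SDK. The sketch below is transcribed from the table. `errorNameForStatus` and `isRetryableStatus` are hypothetical helpers, and treating 429, 502, and 504 as retryable is an assumption of this sketch rather than documented SDK behavior.

```typescript
// Status-to-name lookup transcribed from the error table.
const GATEWAY_ERROR_NAMES: Record<number, string> = {
  401: "AuthenticationError",
  403: "AuthenticationError",
  402: "InsufficientFundsError",
  404: "ModelNotFoundError",
  429: "RateLimitError",
  502: "UpstreamProviderError",
  504: "GatewayTimeoutError",
};

// Hypothetical helper: returns the documented error class name, or undefined
// for statuses the table does not list.
function errorNameForStatus(status: number): string | undefined {
  return GATEWAY_ERROR_NAMES[status];
}

// Assumption (not from the SDK): rate limits and upstream failures are
// treated as transient and therefore worth retrying.
function isRetryableStatus(status: number): boolean {
  return status === 429 || status === 502 || status === 504;
}
```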

Why choose `any-llm`?

Simple, unified interface - A single client for all providers through the gateway; switch models by changing one string
Developer friendly - Full TypeScript types for better IDE support and clear, actionable error messages
Leverages the OpenAI SDK - Built on the official OpenAI Node.js SDK for maximum compatibility
Framework-agnostic - Works across different projects and use cases
Battle-tested - Powers our own production tools (any-agent)

Development

# Install dependencies
npm install

# Run unit tests
npm run test:unit

# Run integration tests (requires a running gateway)
npm run test:integration

# Run all tests
npm test

# Type-check
npm run typecheck

# Build
npm run build

Documentation

Full Documentation - Complete guides and API reference
Supported Providers - List of all supported LLM providers
Gateway Documentation - Gateway setup and deployment
Python SDK - The full Python SDK with direct provider access
any-llm Platform (Beta) - Hosted control plane for key management, usage tracking, and cost visibility

Contributing

We welcome contributions from developers of all skill levels! Please see the Contributing Guide or open an issue to discuss changes.

License

This project is licensed under the Apache License 2.0 - see the LICENSE file for details.