# @fallom/trace

Model A/B testing, prompt management, and tracing for LLM applications. Zero latency, production-ready.
## Installation

```bash
npm install @fallom/trace
```

## Quick Start

```typescript
import fallom from "@fallom/trace";
import OpenAI from "openai";
// Initialize Fallom
await fallom.init({ apiKey: "your-api-key" });
// Wrap your LLM client for automatic tracing
const openai = fallom.trace.wrapOpenAI(new OpenAI());
// Set session context
fallom.trace.setSession("my-agent", sessionId);
// All LLM calls are now automatically traced!
const response = await openai.chat.completions.create({
  model: "gpt-4o",
  messages: [{ role: "user", content: "Hello!" }],
});
```

## Model A/B Testing

Run A/B tests on models with zero latency. The same session always gets the same model (sticky assignment).

```typescript
import { models } from "@fallom/trace";
// Get assigned model for this session
const model = await models.get("summarizer-config", sessionId);
// Returns: "gpt-4o" or "claude-3-5-sonnet" based on your config weights
const response = await openai.chat.completions.create({ model, ... });
```

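To make the sticky behavior concrete, here is a minimal sketch (the session IDs are placeholders): repeated lookups with the same session ID always resolve to the same model, while a different session may be assigned a different variant.

```typescript
import { models } from "@fallom/trace";

// The same session ID resolves to the same model on every call.
const first = await models.get("summarizer-config", "session-123");
const second = await models.get("summarizer-config", "session-123");
console.assert(first === second);

// A different session may receive the other variant, according to
// the weights configured for "summarizer-config" in your dashboard.
const other = await models.get("summarizer-config", "session-456");
```
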
### Fallback for Resilience

```typescript
const model = await models.get("my-config", sessionId, {
  fallback: "gpt-4o-mini", // Used if config not found or Fallom unreachable
});
```

## Prompt Management

Manage prompts centrally and A/B test them with zero latency.
### Basic Prompt Retrieval

```typescript
import { prompts } from "@fallom/trace";
// Get a managed prompt (with template variables)
const prompt = await prompts.get("onboarding", {
  variables: { userName: "John", company: "Acme" },
});
// Use the prompt with any LLM
const response = await openai.chat.completions.create({
  model: "gpt-4o",
  messages: [
    { role: "system", content: prompt.system },
    { role: "user", content: prompt.user },
  ],
});
```

The prompt object contains:

- `key`: The prompt key
- `version`: The prompt version
- `system`: The system prompt (with variables replaced)
- `user`: The user template (with variables replaced)

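As a rough TypeScript sketch of that shape (illustrative only; consult the package's exported types for the authoritative definition):

```typescript
// Illustrative shape of a PromptResult; field names follow the list above.
interface PromptResult {
  key: string;      // the prompt key
  version: number;  // the prompt version that was served
  system: string;   // system prompt, with variables replaced
  user: string;     // user template, with variables replaced
  // Set when the prompt comes from prompts.getAB() (covered below):
  abTestKey?: string;
  variantIndex?: number;
}
```
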
### Prompt A/B Testing

Run experiments on different prompt versions:

```typescript
import { prompts } from "@fallom/trace";
// Get prompt from A/B test (sticky assignment based on sessionId)
const prompt = await prompts.getAB("onboarding-test", sessionId, {
  variables: { userName: "John" },
});
// prompt.abTestKey and prompt.variantIndex are set
// for analytics in your dashboard
```

### Version Pinning

```typescript
// Use the latest version (default)
const latest = await prompts.get("my-prompt");

// Pin to a specific version
const pinned = await prompts.get("my-prompt", { version: 2 });
```

### Automatic Trace Tagging

When you call `prompts.get()` or `prompts.getAB()`, the next LLM call is automatically tagged with the prompt information. This allows you to see which prompts are used in your traces without any extra code.

```typescript
// Get prompt - sets up auto-tagging for next LLM call
const prompt = await prompts.get("onboarding", {
  variables: { userName: "John" },
});
// This call is automatically tagged with promptKey, promptVersion, etc.
const response = await openai.chat.completions.create({
  model: "gpt-4o",
  messages: [
    { role: "system", content: prompt.system },
    { role: "user", content: prompt.user },
  ],
});
```

## Tracing

Wrap your LLM client once, and all calls are automatically traced.
### OpenAI (+ OpenRouter, Azure, LiteLLM, etc.)

```typescript
import OpenAI from "openai";
import fallom from "@fallom/trace";
await fallom.init({ apiKey: "your-api-key" });
// Works with any OpenAI-compatible API
const openai = fallom.trace.wrapOpenAI(
  new OpenAI({
    baseURL: "https://openrouter.ai/api/v1", // or Azure, LiteLLM, etc.
    apiKey: "your-provider-key",
  })
);
fallom.trace.setSession("my-config", sessionId);
// Automatically traced!
const response = await openai.chat.completions.create({
  model: "gpt-4o",
  messages: [{ role: "user", content: "Hello!" }],
});
```

### Anthropic (Claude)

```typescript
import Anthropic from "@anthropic-ai/sdk";
import fallom from "@fallom/trace";
await fallom.init({ apiKey: "your-api-key" });
const anthropic = fallom.trace.wrapAnthropic(new Anthropic());
fallom.trace.setSession("my-config", sessionId);
// Automatically traced!
const response = await anthropic.messages.create({
  model: "claude-3-5-sonnet-20241022",
  max_tokens: 1024, // required by the Anthropic Messages API
  messages: [{ role: "user", content: "Hello!" }],
});
```

### Google AI (Gemini)

```typescript
import { GoogleGenerativeAI } from "@google/generative-ai";
import fallom from "@fallom/trace";
await fallom.init({ apiKey: "your-api-key" });
const genAI = new GoogleGenerativeAI("your-google-api-key");
const model = fallom.trace.wrapGoogleAI(
  genAI.getGenerativeModel({ model: "gemini-pro" })
);
fallom.trace.setSession("my-config", sessionId);
// Automatically traced!
const response = await model.generateContent("Hello!");
```

## What Gets Traced

For each LLM call, Fallom automatically captures:
- ✅ Model name
- ✅ Duration (latency)
- ✅ Token counts (prompt, completion, total)
- ✅ Input/output content (can be disabled)
- ✅ Errors
- ✅ Config key + session ID (for A/B analysis)
- ✅ Prompt key + version (when using prompt management)
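
As an illustration only (the actual span schema is internal to Fallom), a captured call can be pictured roughly like this:

```typescript
// Hypothetical sketch of one traced call -- not the SDK's real wire format.
interface TracedCallSketch {
  model: string;             // model name
  durationMs: number;        // latency
  promptTokens: number;      // token counts
  completionTokens: number;
  totalTokens: number;
  input?: unknown;           // omitted when content capture is disabled
  output?: unknown;
  error?: string;            // set when the call failed
  configKey?: string;        // from fallom.trace.setSession()
  sessionId?: string;
  promptKey?: string;        // set when using prompt management
  promptVersion?: number;
}
```
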
## Custom Metrics

Record business metrics for your A/B tests:

```typescript
import { trace } from "@fallom/trace";
trace.span({
  outlier_score: 0.8,
  user_satisfaction: 4,
  conversion: true,
});
```

## Configuration

### Environment Variables

```bash
FALLOM_API_KEY=your-api-key
FALLOM_BASE_URL=https://spans.fallom.com
FALLOM_CAPTURE_CONTENT=true # set to "false" for privacy mode
```

### Privacy Mode

Disable prompt/completion capture:

```typescript
await fallom.init({ captureContent: false });
```

## API Reference

### `fallom.init(options?)`

Initialize the SDK.

### `fallom.trace.wrapOpenAI(client)`

Wrap an OpenAI client for automatic tracing. Works with any OpenAI-compatible API.

### `fallom.trace.wrapAnthropic(client)`

Wrap an Anthropic client for automatic tracing.

### `fallom.trace.wrapGoogleAI(model)`

Wrap a Google AI model for automatic tracing.

### `fallom.trace.setSession(configKey, sessionId)`

Set the session context for tracing.

### `fallom.models.get(configKey, sessionId, options?)`

Get the model assignment for A/B testing. Returns `Promise<string>`.

### `fallom.prompts.get(promptKey, options?)`

Get a managed prompt. Returns `Promise<PromptResult>`.

- `promptKey`: Your prompt key from the dashboard
- `options.variables`: Template variables (e.g., `{ userName: "John" }`)
- `options.version`: Pin to a specific version (default: latest)

### `fallom.prompts.getAB(abTestKey, sessionId, options?)`

Get a prompt from an A/B test. Returns `Promise<PromptResult>`.

- `abTestKey`: Your A/B test key from the dashboard
- `sessionId`: Session ID for sticky assignment
- `options.variables`: Template variables

### `fallom.trace.span(data)`

Record custom business metrics.

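Putting the pieces together, a typical request handler might look like the sketch below (the config key, prompt key, and metric name are placeholders, not part of the SDK):

```typescript
import fallom, { models, prompts } from "@fallom/trace";
import OpenAI from "openai";

await fallom.init({ apiKey: process.env.FALLOM_API_KEY! });
const openai = fallom.trace.wrapOpenAI(new OpenAI());

export async function handleRequest(sessionId: string, userName: string) {
  fallom.trace.setSession("onboarding-config", sessionId);

  // Model and prompt are both resolved per session (sticky assignment).
  const model = await models.get("onboarding-config", sessionId, {
    fallback: "gpt-4o-mini",
  });
  const prompt = await prompts.get("onboarding", {
    variables: { userName },
  });

  // Traced automatically, and auto-tagged with the prompt info.
  const response = await openai.chat.completions.create({
    model,
    messages: [
      { role: "system", content: prompt.system },
      { role: "user", content: prompt.user },
    ],
  });

  // Record a business metric for A/B analysis.
  fallom.trace.span({ completed_onboarding: true });

  return response.choices[0].message.content;
}
```
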
## Testing

Run the test suite:

```bash
cd sdk/typescript-sdk
npm install
npm test
```

## Deploying

To publish a new version to npm:

```bash
cd sdk/typescript-sdk
# Update version in package.json
# Then:
npm run build
npm publish --access public
# Or use convenience scripts:
npm run publish:patch # 0.1.0 -> 0.1.1
npm run publish:minor # 0.1.0 -> 0.2.0
npm run publish:major # 0.1.0 -> 1.0.0
```

## Requirements

- Node.js >= 18.0.0
Works with ESM and CommonJS. Works with tsx, ts-node, Bun, and compiled JavaScript.
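
For reference, both import styles (the CommonJS line is a sketch; exact default-export interop can vary by toolchain):

```typescript
// ESM (tsx, ts-node, Bun, modern Node):
import fallom from "@fallom/trace";

// CommonJS (sketch -- uncomment in a CJS module):
// const fallom = require("@fallom/trace");
```
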
## License

MIT
