@llm-dev-ops/observatory-sdk

v0.1.1

Published

7 months ago

Official Node.js SDK for LLM Observatory - High-performance observability for LLM applications

Maintainer: gba_admin

Keywords: llm, observability, opentelemetry, monitoring, tracing, openai, anthropic, metrics, logging

LLM Observatory Node.js SDK

Production-ready observability for LLM applications with OpenTelemetry.

The official Node.js SDK for LLM Observatory - a high-performance, open-source observability platform for Large Language Model applications.

Features

🔍 Automatic Instrumentation - Wrap OpenAI clients once; existing call sites need no changes
💰 Cost Tracking - Real-time cost calculation for all major LLM providers
📊 OpenTelemetry Native - Standards-based telemetry with OTLP export
🌊 Streaming Support - Full support for streaming completions with TTFT tracking
⚡ High Performance - Minimal overhead with async/await and batching
🎯 Type Safety - Full TypeScript support with comprehensive types
🔧 Middleware Support - Express middleware for automatic request tracing
📈 Rich Metrics - Token usage, latency, errors, and custom attributes

Installation

npm install @llm-observatory/sdk
# or
yarn add @llm-observatory/sdk
# or
pnpm add @llm-observatory/sdk

Quick Start

1. Initialize Observatory

import { initObservatory } from '@llm-observatory/sdk';

const observatory = await initObservatory({
  serviceName: 'my-llm-app',
  serviceVersion: '1.0.0',
  otlpEndpoint: 'http://localhost:4317',
  environment: 'production',
});

2. Instrument OpenAI Client

import { instrumentOpenAI } from '@llm-observatory/sdk';
import OpenAI from 'openai';

const openai = new OpenAI({
  apiKey: process.env.OPENAI_API_KEY,
});

// Instrument the client
instrumentOpenAI(openai, {
  enableCost: true,
  enableStreaming: true,
});

3. Use as Normal

// All calls are automatically traced and cost-tracked
const response = await openai.chat.completions.create({
  model: 'gpt-4o-mini',
  messages: [{ role: 'user', content: 'Hello!' }],
});

console.log(response.choices[0].message.content);
// Traces and metrics are automatically sent to your collector

Configuration

Observatory Options

interface ObservatoryConfig {
  serviceName: string;              // Required: Service identifier
  serviceVersion?: string;          // Service version (default: '1.0.0')
  otlpEndpoint?: string;            // OTLP endpoint (default: 'http://localhost:4317')
  useGrpc?: boolean;                // Use gRPC protocol (default: true)
  enableMetrics?: boolean;          // Enable metrics collection (default: true)
  enableTraces?: boolean;           // Enable trace collection (default: true)
  sampleRate?: number;              // Sample rate 0.0-1.0 (default: 1.0)
  environment?: string;             // Environment name (default: NODE_ENV)
  resourceAttributes?: Record<...>; // Custom resource attributes
  debug?: boolean;                  // Enable debug logging (default: false)
  exportIntervalMs?: number;        // Export interval (default: 5000ms)
  maxBatchSize?: number;            // Max batch size (default: 512)
}
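The defaults listed in the comments above can be read as a simple resolution step. The following is a minimal sketch of that step, not the SDK's own code: `ConfigSketch` and `resolveConfig` are hypothetical names, and the range check on `sampleRate` is an assumption based on the documented 0.0-1.0 range. `environment` and `resourceAttributes` are left out because their defaults depend on the runtime.

```typescript
// Hypothetical subset of ObservatoryConfig, for illustration only.
interface ConfigSketch {
  serviceName: string;
  serviceVersion?: string;
  otlpEndpoint?: string;
  useGrpc?: boolean;
  enableMetrics?: boolean;
  enableTraces?: boolean;
  sampleRate?: number;
  debug?: boolean;
  exportIntervalMs?: number;
  maxBatchSize?: number;
}

// Fill in the documented defaults for every option the caller omitted.
function resolveConfig(config: ConfigSketch): Required<ConfigSketch> {
  const sampleRate = config.sampleRate ?? 1.0;
  // Assumption: values outside the documented 0.0-1.0 range are rejected.
  if (sampleRate < 0 || sampleRate > 1) {
    throw new RangeError('sampleRate must be between 0.0 and 1.0');
  }
  return {
    serviceName: config.serviceName,
    serviceVersion: config.serviceVersion ?? '1.0.0',
    otlpEndpoint: config.otlpEndpoint ?? 'http://localhost:4317',
    useGrpc: config.useGrpc ?? true,
    enableMetrics: config.enableMetrics ?? true,
    enableTraces: config.enableTraces ?? true,
    sampleRate,
    debug: config.debug ?? false,
    exportIntervalMs: config.exportIntervalMs ?? 5000,
    maxBatchSize: config.maxBatchSize ?? 512,
  };
}
```

With only `serviceName` supplied, the resolved object carries every documented default, which is a convenient way to see what a minimal `initObservatory` call implies.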

Instrumentation Options

interface InstrumentOpenAIOptions {
  enableCost?: boolean;             // Enable cost calculation (default: true)
  enableStreaming?: boolean;        // Enable streaming support (default: true)
  logPayloads?: boolean;            // Log request/response (default: false)
  metadata?: Metadata;              // Custom metadata for all spans
  spanProcessor?: (span) => void;   // Custom span processor
}

Usage Examples

Basic Chat Completion

import { initObservatory, instrumentOpenAI } from '@llm-observatory/sdk';
import OpenAI from 'openai';

async function main() {
  await initObservatory({ serviceName: 'chat-app' });

  const openai = new OpenAI({ apiKey: process.env.OPENAI_API_KEY });
  instrumentOpenAI(openai);

  const response = await openai.chat.completions.create({
    model: 'gpt-4o-mini',
    messages: [{ role: 'user', content: 'Hello!' }],
  });

  console.log(response.choices[0].message.content);
}

Streaming Completions

const stream = await openai.chat.completions.create({
  model: 'gpt-4o-mini',
  messages: [{ role: 'user', content: 'Write a haiku' }],
  stream: true,
});

for await (const chunk of stream) {
  const content = chunk.choices[0]?.delta?.content || '';
  process.stdout.write(content);
}
// Automatically tracks TTFT and streaming metrics
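To make the TTFT (time to first token) measurement concrete, here is a minimal sketch of how such a value can be derived from any async iterable. It is not the SDK's implementation: `measureStream` and `StreamTiming` are hypothetical names, and the injectable `now` clock exists only to make the helper easy to test.

```typescript
// Hypothetical result shape; the field names are illustrative.
interface StreamTiming {
  ttftMs: number | null; // null when the stream yields no chunks
  chunkCount: number;
  durationMs: number;
}

// Consume a stream, recording the delay before the first chunk arrives.
async function measureStream<T>(
  stream: AsyncIterable<T>,
  onChunk: (chunk: T) => void,
  now: () => number = Date.now,
): Promise<StreamTiming> {
  const start = now();
  let ttftMs: number | null = null;
  let chunkCount = 0;
  for await (const chunk of stream) {
    if (ttftMs === null) {
      ttftMs = now() - start; // the first chunk marks time to first token
    }
    chunkCount += 1;
    onChunk(chunk);
  }
  return { ttftMs, chunkCount, durationMs: now() - start };
}
```

The clock is read once at the start, once on the first chunk, and once when the stream ends, so the measurement adds no per-chunk work beyond a counter increment.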

Express Middleware

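import express from 'express';
import OpenAI from 'openai';
import { initObservatory, instrumentOpenAI } from '@llm-observatory/sdk';

const app = express();
const observatory = await initObservatory({ serviceName: 'api' });

// Create and instrument the client used by the route below
const openai = new OpenAI({ apiKey: process.env.OPENAI_API_KEY });
instrumentOpenAI(openai);

// Parse JSON bodies so req.body.message is populated
app.use(express.json());

// Add automatic request tracing
app.use(observatory.middleware({
  captureRequestBody: true,
  ignorePaths: ['/health', '/metrics'],
}));

app.post('/chat', async (req, res) => {
  const response = await openai.chat.completions.create({
    model: 'gpt-4o-mini',
    messages: [{ role: 'user', content: req.body.message }],
  });
  res.json({ response: response.choices[0].message.content });
});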

Custom Metadata

instrumentOpenAI(openai, {
  metadata: {
    userId: 'user-123',
    sessionId: 'session-456',
    environment: 'production',
    tags: ['chat', 'customer-support'],
    attributes: {
      region: 'us-east-1',
      version: '2.0',
    },
  },
});

Cost Tracking

import { PricingEngine } from '@llm-observatory/sdk';

// List all available models
const models = PricingEngine.listModels();
console.log(`Available models: ${models.length}`);

// Compare costs across models
const comparisons = PricingEngine.compareCosts(
  ['gpt-4o', 'gpt-4o-mini', 'claude-3-5-sonnet-20241022'],
  1000, // prompt tokens
  500   // completion tokens
);

comparisons.forEach(({ model, cost }) => {
  console.log(`${model}: $${cost.toFixed(6)}`);
});

// Add custom pricing
PricingEngine.addCustomPricing({
  model: 'my-custom-model',
  promptCostPer1k: 0.001,
  completionCostPer1k: 0.002,
});

Advanced Tracing

import { withSpan, Provider } from '@llm-observatory/sdk';

// generateEmbedding, retrieveDocuments, and generateResponse are
// application-defined helpers, not SDK exports.
const query = 'What is observability?';

// Create custom spans
await withSpan(
  'rag.workflow',
  async (span) => {
    span.setAttribute('query', query);

    // Nested operations are automatically traced
    const embedding = await generateEmbedding(query);
    const documents = await retrieveDocuments(embedding);
    const response = await generateResponse(documents);

    return response;
  },
  { provider: Provider.OpenAI, model: 'gpt-4o' }
);

Error Handling

try {
  const response = await openai.chat.completions.create({
    model: 'gpt-4o',
    messages: [{ role: 'user', content: 'Hello' }],
  });
} catch (error) {
  // Errors are automatically captured in traces
  console.error('LLM call failed:', error);
}

Cost Calculation

The SDK includes pricing data for the LLM providers listed below, updated as of January 2025:

Supported Providers

OpenAI: GPT-4o, GPT-4o mini, GPT-4 Turbo, GPT-3.5 Turbo, o1 models
Anthropic: Claude Sonnet 4.5, Claude 3.5, Claude 3 (Opus, Sonnet, Haiku)
Google: Gemini 2.5 Pro/Flash, Gemini 1.5 Pro/Flash
Mistral: Mistral Large, Small, open-source models

Cost Examples

// Automatic cost tracking
instrumentOpenAI(openai, {
  enableCost: true,
  spanProcessor: (span) => {
    if (span.cost) {
      console.log(`Cost: $${span.cost.amountUsd.toFixed(6)}`);
    }
  },
});

// Manual cost calculation
const cost = PricingEngine.calculateCost('gpt-4o', 1000, 500);
console.log(`Estimated cost: $${cost.toFixed(6)}`);
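The arithmetic behind a per-1k-token price can be sketched in a few lines. This assumes pricing is expressed with the `promptCostPer1k` and `completionCostPer1k` fields shown for `addCustomPricing`; `PricingSketch` and `estimateCostUsd` are hypothetical names, and the SDK's `calculateCost` may handle more cases than this.

```typescript
// Hypothetical pricing record, mirroring the addCustomPricing fields.
interface PricingSketch {
  model: string;
  promptCostPer1k: number;     // USD per 1,000 prompt tokens
  completionCostPer1k: number; // USD per 1,000 completion tokens
}

// cost = prompt/1000 * promptRate + completion/1000 * completionRate
function estimateCostUsd(
  pricing: PricingSketch,
  promptTokens: number,
  completionTokens: number,
): number {
  const promptUsd = (promptTokens / 1000) * pricing.promptCostPer1k;
  const completionUsd = (completionTokens / 1000) * pricing.completionCostPer1k;
  return promptUsd + completionUsd;
}

const customPricing: PricingSketch = {
  model: 'my-custom-model',
  promptCostPer1k: 0.001,
  completionCostPer1k: 0.002,
};

// 1000 prompt tokens cost $0.001 and 500 completion tokens cost $0.001.
console.log(estimateCostUsd(customPricing, 1000, 500).toFixed(6)); // prints "0.002000"
```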

OpenTelemetry Integration

The SDK uses OpenTelemetry semantic conventions with LLM-specific attributes:

Span Attributes

// System attributes
llm.system = 'openai'
llm.request.model = 'gpt-4o'
llm.request.temperature = 0.7
llm.request.max_tokens = 500

// Token usage
llm.usage.prompt_tokens = 100
llm.usage.completion_tokens = 200
llm.usage.total_tokens = 300

// Cost
llm.cost.total_usd = 0.0045
llm.cost.prompt_usd = 0.001
llm.cost.completion_usd = 0.0035

// Latency
llm.latency.ttft_ms = 234
llm.duration_ms = 1567

// Streaming
llm.streaming.enabled = true
llm.streaming.chunk_count = 42
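As an illustration of how these attribute names relate to one call's data, the sketch below builds a flat attribute record from a call summary. `LlmCallSummary` and `toSpanAttributes` are hypothetical names, and the exact set of attributes the SDK emits may differ; only the attribute keys are taken from the list above.

```typescript
type AttributeValue = string | number | boolean;

// Hypothetical summary of one LLM call, for illustration only.
interface LlmCallSummary {
  system: string;
  model: string;
  promptTokens: number;
  completionTokens: number;
  durationMs: number;
  ttftMs?: number; // only known for streaming calls
}

// Map a call summary onto the llm.* attribute keys listed above.
function toSpanAttributes(call: LlmCallSummary): Record<string, AttributeValue> {
  const attributes: Record<string, AttributeValue> = {
    'llm.system': call.system,
    'llm.request.model': call.model,
    'llm.usage.prompt_tokens': call.promptTokens,
    'llm.usage.completion_tokens': call.completionTokens,
    // total_tokens is the sum of prompt and completion tokens
    'llm.usage.total_tokens': call.promptTokens + call.completionTokens,
    'llm.duration_ms': call.durationMs,
  };
  if (call.ttftMs !== undefined) {
    attributes['llm.latency.ttft_ms'] = call.ttftMs;
  }
  return attributes;
}
```

Keeping the record flat, with dotted string keys and primitive values, matches how OpenTelemetry span attributes are stored, so a record like this can be applied to a span one key at a time.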

Development

Building

npm run build

Testing

npm test
npm run test:coverage

Linting

npm run lint
npm run lint:fix

Examples

# Run examples (requires OpenAI API key)
export OPENAI_API_KEY=your-key
npx ts-node examples/basic-usage.ts
npx ts-node examples/streaming.ts
npx ts-node examples/cost-tracking.ts

Architecture

Your App
   ↓
OpenAI Client (instrumented)
   ↓
LLM Observatory SDK
   ↓
OpenTelemetry SDK
   ↓
OTLP Exporter (gRPC/HTTP)
   ↓
LLM Observatory Collector
   ↓
Storage (TimescaleDB, Tempo, Loki)
   ↓
Grafana

Performance

< 1ms overhead per LLM call
Async batching for minimal latency impact
Memory efficient with streaming support
Configurable sampling for high-volume scenarios
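To show what a sample rate means in practice, here is a minimal sketch of a probabilistic sampling decision. It is a simplification and not the SDK's or OpenTelemetry's sampler (OpenTelemetry's ratio sampler decides from the trace ID rather than a random draw); `shouldSample` is a hypothetical name.

```typescript
// Decide whether to record a trace, given a rate between 0.0 and 1.0.
// The random source is injectable so the decision can be tested.
function shouldSample(
  sampleRate: number,
  random: () => number = Math.random,
): boolean {
  if (sampleRate <= 0) return false; // record nothing
  if (sampleRate >= 1) return true;  // record everything
  return random() < sampleRate;      // record roughly sampleRate of traces
}
```

With a rate of 0.1, about one trace in ten is kept, which cuts export volume by roughly 90% at the cost of less complete data.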

Best Practices

Initialize once at application startup
Use middleware for automatic request tracing
Enable cost tracking to monitor spending
Set metadata for better trace filtering
Configure sampling for high-traffic applications
Graceful shutdown to flush telemetry

// Graceful shutdown example
process.on('SIGTERM', async () => {
  await observatory.flush();
  await observatory.shutdown();
  process.exit(0);
});

Troubleshooting

Traces not appearing

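Verify the collector is running: curl http://localhost:4317 (with the default gRPC endpoint, expect a protocol error rather than a normal HTTP response; "connection refused" means nothing is listening)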
Enable debug logging: debug: true
Check for errors in console
Verify OTLP endpoint configuration
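A plain TCP check is one way to confirm that something is listening on the collector port before digging further. The sketch below uses Node's built-in `net` module; `canConnect` is a hypothetical helper, not part of the SDK, and a successful connection only shows that the port is open, not that the listener is an OTLP collector.

```typescript
import * as net from 'node:net';

// Resolve true if a TCP connection to host:port succeeds within timeoutMs.
function canConnect(host: string, port: number, timeoutMs = 2000): Promise<boolean> {
  return new Promise((resolve) => {
    const socket = net.createConnection({ host, port });
    const finish = (ok: boolean): void => {
      socket.destroy(); // always release the socket
      resolve(ok);
    };
    socket.setTimeout(timeoutMs);
    socket.once('connect', () => finish(true));
    socket.once('timeout', () => finish(false));
    socket.once('error', () => finish(false)); // e.g. ECONNREFUSED
  });
}
```

For the default endpoint this would be called as `canConnect('localhost', 4317)`; a `false` result points at the collector or the network rather than at the SDK configuration.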

Cost calculation errors

Check if model is supported: PricingEngine.hasPricing(model)
Add custom pricing if needed
Verify model name matches exactly

High memory usage

Reduce maxBatchSize in config
Increase exportIntervalMs
Lower sampleRate for high traffic

Examples

See the examples/ directory for complete examples:

basic-usage.ts - Simple chat completion
streaming.ts - Streaming responses
express-middleware.ts - Express integration
cost-tracking.ts - Cost analysis
advanced-tracing.ts - RAG workflow

Contributing

Contributions are welcome! Please see CONTRIBUTING.md for guidelines.

License

Apache 2.0 - see LICENSE for details.

Support

Documentation: docs.llm-observatory.io
Issues: GitHub Issues
Discussions: GitHub Discussions

Related Projects

LLM Observatory - Main repository
Rust SDK - Rust implementation
OpenTelemetry - Observability framework

Built with ❤️ for the LLM community
