@traceai/together

v0.1.0

Published

a month ago

TraceAI instrumentation for Together AI

0High
0Medium
0Low

traceai together together-ai opentelemetry instrumentation tracing llm observability

@traceai/together

OpenTelemetry instrumentation for the Together AI TypeScript SDK.

Installation

npm install @traceai/together
# or
pnpm add @traceai/together

Requirements

Node.js 18+
together-ai SDK version >= 0.5.0

Quick Start

import { NodeTracerProvider } from "@opentelemetry/sdk-trace-node";
import { SimpleSpanProcessor, ConsoleSpanExporter } from "@opentelemetry/sdk-trace-base";
import { TogetherInstrumentation } from "@traceai/together";
import Together from "together-ai";

// Set up the tracer provider
const provider = new NodeTracerProvider();
provider.addSpanProcessor(new SimpleSpanProcessor(new ConsoleSpanExporter()));
provider.register();

// Create and enable the instrumentation
const instrumentation = new TogetherInstrumentation();
instrumentation.setTracerProvider(provider);
instrumentation.enable();

// Import and patch the Together module
const togetherModule = await import("together-ai");
instrumentation.manuallyInstrument(togetherModule);

// Use Together AI as normal - all calls will be traced
const client = new Together({
  apiKey: process.env.TOGETHER_API_KEY,
});

const response = await client.chat.completions.create({
  model: "meta-llama/Llama-3-8b-chat-hf",
  messages: [
    { role: "user", content: "Hello!" },
  ],
});

Instrumented Methods

This instrumentation automatically traces the following Together AI SDK methods:

Chat Completions

client.chat.completions.create() - Chat completions with support for streaming

Completions (Legacy)

client.completions.create() - Text completions

Embeddings

client.embeddings.create() - Text embeddings

Trace Attributes

The instrumentation captures the following semantic attributes:

Common Attributes

fi.span_kind - Type of span (LLM, EMBEDDING)
llm.system - "together"
llm.provider - "together"
llm.model_name - Model name used
input.value - Input content
input.mime_type - Input content type
output.value - Output content
output.mime_type - Output content type
raw.input - Raw request JSON
raw.output - Raw response JSON

Chat Completion Attributes

llm.input_messages.{index}.role - Message role
llm.input_messages.{index}.content - Message content
llm.output_messages.{index}.role - Response message role
llm.output_messages.{index}.content - Response message content
llm.invocation_parameters - Model parameters (temperature, max_tokens, etc.)

Tool Call Attributes

llm.tools.{index}.tool.json_schema - Tool definitions
llm.output_messages.{index}.tool_calls.{index}.id - Tool call ID
llm.output_messages.{index}.tool_calls.{index}.function.name - Function name
llm.output_messages.{index}.tool_calls.{index}.function.arguments.json - Function arguments

Token Usage Attributes

llm.token_count.prompt - Input token count
llm.token_count.completion - Output token count
llm.token_count.total - Total token count

Embedding Attributes

embedding.model_name - Embedding model name
embedding.embeddings.{index}.text - Input text
embedding.embeddings.{index}.vector - Embedding vector

Configuration

Instrumentation Options

const instrumentation = new TogetherInstrumentation({
  instrumentationConfig: {
    enabled: true, // Enable/disable instrumentation
  },
  traceConfig: {
    hideInputs: false,  // Hide input content in traces
    hideOutputs: false, // Hide output content in traces
  },
});

Streaming Support

The instrumentation fully supports streaming responses. When using streaming, the complete response is captured after the stream is consumed:

const stream = await client.chat.completions.create({
  model: "meta-llama/Llama-3-8b-chat-hf",
  messages: [{ role: "user", content: "Hello!" }],
  stream: true,
});

for await (const chunk of stream) {
  // Process chunks as they arrive
  console.log(chunk.choices[0]?.delta?.content);
}
// Span is ended with full content captured

Supported Models

Together AI supports a wide variety of models. Some popular ones include:

Chat Models:

meta-llama/Llama-3-8b-chat-hf
meta-llama/Llama-3-70b-chat-hf
mistralai/Mixtral-8x7B-Instruct-v0.1
togethercomputer/llama-2-70b-chat

Text Completion Models:

togethercomputer/RedPajama-INCITE-7B-Base
togethercomputer/GPT-JT-6B-v1

Embedding Models:

togethercomputer/m2-bert-80M-8k-retrieval
BAAI/bge-large-en-v1.5

See the Together AI models page for a complete list.

Examples

See the examples directory for complete working examples.

License

Apache 2.0

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

@traceai/together

Installation

Requirements

Quick Start

Instrumented Methods

Chat Completions

Completions (Legacy)

Embeddings

Trace Attributes

Common Attributes

Chat Completion Attributes

Tool Call Attributes

Token Usage Attributes

Embedding Attributes

Configuration

Instrumentation Options

Streaming Support

Supported Models

Examples

License