@reaatech/otel-genai-semconv-utils

v0.1.0

Published

25 days ago

Token counting, cost calculation, and PII redaction utilities for OTel GenAI

0High
0Medium
0Low

reaatech

@reaatech/otel-genai-semconv-utils

Status: Pre-1.0 — APIs may change in minor versions. Pin to a specific version in production.

Token counting, cost calculation, and PII redaction utilities for LLM observability. Provides accurate token estimation via tiktoken for OpenAI models, character-based estimation for other providers, per-request cost computation with baked-in pricing tables, and automatic detection and redaction of sensitive data (emails, SSNs, credit cards, IP addresses, phone numbers).

Installation

npm install @reaatech/otel-genai-semconv-utils
# or
pnpm add @reaatech/otel-genai-semconv-utils

Feature Overview

Provider-aware token counting — tiktoken integration for OpenAI, estimation fallbacks for Anthropic, Vertex AI, and Bedrock
Built-in pricing tables — pre-configured pricing for GPT-4, GPT-4o, Claude 3, Gemini, and Bedrock model families
Custom pricing overrides — inject per-model pricing at construction time or via setter methods
PII redaction — five default patterns (email, SSN, credit card, phone, IP) with add/remove/exclude API
Object deep redaction — recursively redact PII from nested objects and arrays
LRU caching — token counts and cost calculations are cached with configurable TTL and size limits
Singleton accessors — getDefaultTokenCounter(), getDefaultCostCalculator(), getDefaultPIIRedactor() for zero-config use

Quick Start

import { CostCalculator, TokenCounter, PIIRedactor } from "@reaatech/otel-genai-semconv-utils";

const calculator = new CostCalculator();
const cost = calculator.calculate({
  provider: "openai",
  model: "gpt-4",
  inputTokens: 1000,
  outputTokens: 500,
});
// { total: 0.045, input: 0.03, output: 0.015, currency: "USD" }

const counter = new TokenCounter();
const tokens = counter.countTokens("Hello, world!");
// ~4 (estimated at 4 chars/token)

const redactor = new PIIRedactor();
const safe = redactor.redact("Contact [email protected] or 555-1234");
// "Contact [REDACTED_EMAIL] or [REDACTED_PHONE]"

API Reference

`CostCalculator` (class)

Constructor

new CostCalculator(options?: { customPricing?: Record<string, PricingInfo> })

Methods

| Method | Description | |--------|-------------| | calculate(params) | Compute cost from provider, model, input/output token counts | | calculateFromUsage(params) | Compute cost from a TokenUsage object | | getPricing(provider, model) | Look up pricing for a specific model (checks custom first, then defaults) | | setCustomPricing(model, pricing) | Override pricing for a single model | | setCustomPricingBatch(pricing) | Override pricing for multiple models | | getAllPricing() | Returns all pricing data (including custom overrides) | | clearCache() | Invalidate the cost calculation cache | | getCacheSize() | Number of cached cost calculations |

`CostCalculationParams`

| Property | Type | Description | |----------|------|-------------| | provider | ProviderType | Provider: openai, anthropic, vertexai, bedrock | | model | string | Model name/ID | | inputTokens | number | Input/prompt token count | | outputTokens | number | Output/completion token count | | currency | string | Currency code (default: "USD") |

`TokenCounter` (class)

Constructor

new TokenCounter(options?: {
  provider?: ProviderType;
  customCounter?: ITokenCounter;
  enableCache?: boolean;
  cacheTTL?: number;
  maxCacheSize?: number;
})

Methods

| Method | Description | |--------|-------------| | countTokens(text) | Count tokens in a text string | | countMessageTokens(message) | Count tokens for a single message (role + content) | | countConversationTokens(messages) | Count tokens for an entire conversation | | setCustomCounter(counter) | Replace the token counting implementation | | clearCache() | Invalidate the token count cache | | getCacheSize() | Number of cached token counts |

`ITokenCounter` (interface)

Implement this to create a custom token counter:

interface ITokenCounter {
  countTokens(text: string): number;
  countMessageTokens(message: { role: string; content: string }): number;
  countConversationTokens(messages: Array<{ role: string; content: string }>): number;
}

`EstimationTokenCounter` (class)

A character-based token estimator using ~4 characters per token (English text). Used as the fallback when no provider-specific counter is configured.

`PIIRedactor` (class)

Constructor

new PIIRedactor(options?: PIIRedactionOptions)

`PIIRedactionOptions`

| Property | Type | Default | Description | |----------|------|---------|-------------| | enabled | boolean | true | Enable/disable redaction | | redactMessageContent | boolean | false | Whether to redact message content | | hashInsteadOfRedact | boolean | false | Hash values instead of replacing with [REDACTED_*] | | customPatterns | PIIPattern[] | — | Additional regex patterns to detect | | excludePatterns | string[] | — | Pattern descriptions to skip |

Methods

| Method | Description | |--------|-------------| | redact(text) | Redact PII from a string | | redactObject(obj) | Deep redact PII from an object | | containsPII(text) | Check if text contains any PII | | detectPIITypes(text) | List which PII types were detected | | addPattern(pattern) | Add a custom detection pattern | | removePattern(description) | Remove a pattern by description | | excludePattern(description) | Exclude a pattern (skip during redaction) | | includePattern(description) | Re-include a previously excluded pattern |

Default PII Patterns

| Pattern | Replacement | Description | |---------|-------------|-------------| | Email addresses | [REDACTED_EMAIL] | [email protected] | | SSN | [REDACTED_SSN] | 123-45-6789 | | Credit cards | [REDACTED_CC] | 13–19 digit numbers | | Phone numbers | [REDACTED_PHONE] | 10+ digit sequences | | IP addresses | [REDACTED_IP] | IPv4 addresses |

Usage Patterns

Custom Pricing Overrides

const calculator = new CostCalculator({
  customPricing: {
    "gpt-4": { input: 0.03, output: 0.06 },
    "my-custom-model": { input: 0.001, output: 0.002, provider: "openai" },
  },
});

Provider-Specific Token Counters

import { OpenAITokenCounter } from "@reaatech/otel-genai-semconv-openai";
import { TokenCounter } from "@reaatech/otel-genai-semconv-utils";

const counter = new TokenCounter({
  customCounter: new OpenAITokenCounter(),
});
const tokens = counter.countTokens("Hello, world!");

Selective PII Redaction

const redactor = new PIIRedactor({ excludePatterns: ["Email addresses"] });
redactor.redact("[email protected] called 555-1234");
// "[email protected] called [REDACTED_PHONE]"

Related Packages

@reaatech/otel-genai-semconv-core — Core types and constants
@reaatech/otel-genai-semconv-openai — OpenAI provider instrumentation (with tiktoken counter)

License

MIT

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

@reaatech/otel-genai-semconv-utils

Installation

Feature Overview

Quick Start

API Reference

CostCalculator (class)

Constructor

Methods

CostCalculationParams

TokenCounter (class)

Constructor

Methods

ITokenCounter (interface)

EstimationTokenCounter (class)

PIIRedactor (class)

Constructor

PIIRedactionOptions

Methods

Default PII Patterns

Usage Patterns

Custom Pricing Overrides

Provider-Specific Token Counters

Selective PII Redaction

Related Packages

License

`CostCalculator` (class)

`CostCalculationParams`

`TokenCounter` (class)

`ITokenCounter` (interface)

`EstimationTokenCounter` (class)

`PIIRedactor` (class)

`PIIRedactionOptions`