ai-sdk-cost-calculator

v0.1.22

Published

9 days ago

Cost calculator for Vercel AI SDK token usage. Supports OpenAI, Anthropic, Google, xAI (Grok), and DeepSeek with long context pricing, prompt caching, and reasoning tokens.

0High
0Medium
0Low

renkoya1

ai ai-sdk vercel-ai cost calculator pricing token usage openai anthropic claude google gemini xai grok deepseek llm gpt chatgpt

ai-sdk-cost-calculator

Cost calculator for Vercel AI SDK token usage. Calculate API costs for OpenAI, Anthropic, Google, xAI (Grok), DeepSeek, and Perplexity with support for long context pricing, prompt caching, reasoning tokens, and web search.

Features

Multi-provider support: OpenAI, Anthropic, Google Gemini, xAI Grok, DeepSeek, Perplexity
AI SDK native: Use with onFinish callbacks via withCost() wrapper
Session tracking: Track costs across multiple calls with createCostAwareAI()
Multi-model tracking: Track costs across different models in a single tracker
Web search pricing: Calculate costs for grounding/search features
Google Maps pricing: Calculate costs for Google Maps API requests
Image generation pricing: Calculate costs for DALL-E, Imagen, and other image models
Auto-detection: Automatically detect web search/maps/image generation tool calls from results
Long context pricing: Automatic tier-based pricing (e.g., Claude Sonnet 4.5 >200K tokens)
Prompt caching: Separate pricing for cache reads and writes
Reasoning tokens: Support for o1/o3/R1 reasoning model costs
TypeScript: Full type definitions included

Installation

npm install ai-sdk-cost-calculator

Peer dependency: Requires ai (Vercel AI SDK) version 6.0.0 or higher.

npm install ai

Quick Start

import { generateText } from "ai";
import { openai } from "@ai-sdk/openai";
import { calculateCost, formatCost } from "ai-sdk-cost-calculator";

const result = await generateText({
  model: openai("gpt-4o"),
  prompt: "Hello, world!",
});

const cost = calculateCost({
  model: "gpt-4o",
  usage: result.usage,
});

console.log(`Total cost: ${formatCost(cost.totalCost)}`);
// Output: Total cost: $0.000150

AI SDK Integration

Using `withCost` with `onFinish`

The easiest way to track costs is with the withCost callback wrapper:

import { generateText } from "ai";
import { openai } from "@ai-sdk/openai";
import { withCost, formatCost } from "ai-sdk-cost-calculator";

const result = await generateText({
  model: openai("gpt-4o"),
  prompt: "Hello!",
  onFinish: withCost("gpt-4o", ({ cost, usage }) => {
    console.log(`Cost: ${formatCost(cost.totalCost)}`);
    console.log(`Tokens: ${usage.inputTokens} in, ${usage.outputTokens} out`);
  }),
});

You can also pass the model object directly:

const model = openai("gpt-4o");

await generateText({
  model,
  prompt: "Hello!",
  onFinish: withCost(model, ({ cost }) => {
    console.log(`Cost: ${formatCost(cost.totalCost)}`);
  }),
});

Using `getCost` for One-Shot Calculation

For simple post-hoc cost calculation:

import { generateText } from "ai";
import { openai } from "@ai-sdk/openai";
import { getCost, formatCost } from "ai-sdk-cost-calculator";

const result = await generateText({
  model: openai("gpt-4o"),
  prompt: "Hello!",
});

const cost = getCost("gpt-4o", result.usage);
console.log(`Cost: ${formatCost(cost.totalCost)}`);

Using `createCostAwareAI` for Session Tracking

For tracking costs across multiple calls in a session:

import { generateText } from "ai";
import { openai } from "@ai-sdk/openai";
import { anthropic } from "@ai-sdk/anthropic";
import { createCostAwareAI, formatCost } from "ai-sdk-cost-calculator";

const ai = createCostAwareAI({
  onCost: (model, cost) => {
    console.log(`[${model}] ${formatCost(cost.totalCost)}`);
  },
});

// First call
await generateText({
  model: openai("gpt-4o"),
  prompt: "Hello!",
  onFinish: ai.onFinish("gpt-4o"),
});

// Second call with different model
await generateText({
  model: anthropic("claude-sonnet-4-5"),
  prompt: "Hello!",
  onFinish: ai.onFinish("claude-sonnet-4-5"),
});

// Get totals
console.log(`Total: ${formatCost(ai.getTotal().totalCost)}`);
console.log(`Requests: ${ai.getRequestCount()}`);
console.log(`Models: ${ai.getModels().join(", ")}`);

// Reset when needed
ai.reset();

Multi-Model Tracker

Track costs across multiple models in a single tracker instance.

import { generateText } from "ai";
import { openai } from "@ai-sdk/openai";
import { anthropic } from "@ai-sdk/anthropic";
import { createTracker, formatCost } from "ai-sdk-cost-calculator";

const tracker = createTracker({
  onCost: (model, cost) => {
    console.log(`[${model}] Cost: ${formatCost(cost.totalCost)}`);
  },
});

// Use with onFinish callback
await generateText({
  model: openai("gpt-4o"),
  prompt: "Hello!",
  onFinish: tracker.onFinish("gpt-4o"),
});

await generateText({
  model: anthropic("claude-sonnet-4-5"),
  prompt: "Hello!",
  onFinish: tracker.onFinish("claude-sonnet-4-5", ({ cost, text }) => {
    console.log(`Response: ${text}, Cost: ${formatCost(cost.totalCost)}`);
  }),
});

// Or add usage manually
tracker.add("gpt-4o", result.usage, { webSearchRequests: 5 });

// Get per-model costs
for (const summary of tracker.getAllCosts()) {
  console.log(`${summary.model}: ${formatCost(summary.cost.totalCost)}`);
}

// Get total cost across all models
console.log(`Total: ${formatCost(tracker.getTotal().totalCost)}`);
console.log(`Requests: ${tracker.getTotalRequestCount()}`);
console.log(`Models: ${tracker.getModels().join(", ")}`);

// Reset when needed
tracker.reset();

Web Search & Google Maps Costs

Calculate costs including web search/grounding and Google Maps features.

const cost = calculateCost({
  model: "gemini-2.0-flash",
  usage: result.usage,
  webSearchRequests: 10,   // Number of web searches
  googleMapsRequests: 5,   // Number of Google Maps API calls
});

console.log(cost.webSearchCost);   // $0.14 for 10 searches
console.log(cost.googleMapsCost);  // $0.035 for 5 maps requests

Auto-Detection from Tool Calls

Automatically detect web search and Google Maps usage from AI SDK results:

import { createTracker } from "ai-sdk-cost-calculator";

const tracker = createTracker();

await generateText({
  model: google("gemini-2.0-flash"),
  prompt: "Find coffee shops near Tokyo Station",
  tools: { web_search: mySearchTool, places: myPlacesTool },
  onFinish: tracker.onFinish("gemini-2.0-flash", undefined, {
    webSearchTools: [],  // Use default tool names for detection
  }),
});

// Or add custom tool names
await generateText({
  model: openai("gpt-4o"),
  prompt: "Search the web",
  tools: { my_custom_search: customSearchTool },
  onFinish: tracker.onFinish("gpt-4o", undefined, {
    webSearchTools: ["my_custom_search"],  // Add to defaults
  }),
});

Using `detectRequestsFromResult` Directly

import { detectRequestsFromResult, calculateCost } from "ai-sdk-cost-calculator";

const result = await generateText({
  model: google("gemini-2.0-flash"),
  prompt: "What's the weather?",
  tools: { web_search: searchTool },
});

// Detect tool usage
const { webSearchRequests, googleMapsRequests } = detectRequestsFromResult(result, {
  webSearchTools: ["my_search"],  // Optional: add custom tool names
});

const cost = calculateCost({
  model: "gemini-2.0-flash",
  usage: result.usage,
  webSearchRequests,
  googleMapsRequests,
});

Default Tool Names

The following tool names are detected by default:

Web Search: web_search, webSearch, google_search, googleSearch, grounding, web_search_preview, tavily_search, brave_search, bing_search, duckduckgo_search

Google Maps: google_maps, googleMaps, place_search, placeSearch, geocode, places

Pricing by Provider

| Provider | Model | Web Search (/1k) | Google Maps (/1k) | Notes | | -------------- | ---------------- | ---------------- | ----------------- | ---------------------- | | OpenAI | GPT-4o, GPT-4.1 | $10 | - | + tokens at model rate | | OpenAI | GPT-4o-mini | $10 | - | + 8k tokens per search | | Google | Gemini Flash | $14 | $7 | Grounding with Google | | Google | Gemini Pro | $35 | $7 | Grounding with Google | | xAI | Grok models | $5 | - | Web Search / X Search | | Perplexity | Sonar | $5 | - | Search-native | | Perplexity | Sonar Pro | $6 | - | Search-native |

Image Generation Costs

Calculate costs for image generation models like DALL-E and Imagen.

const cost = calculateCost({
  model: "dall-e-3",
  usage: { promptTokens: 0, completionTokens: 0, totalTokens: 0 },
  imageGenerations: 2,        // Number of images
  imageSize: "hd-1024x1024",  // Size/quality for pricing lookup
});

console.log(cost.imageGenerationCost);  // $0.16 for 2 HD images

Image Generation Pricing

| Provider | Model | Default (/image) | Size Options | | ---------- | ----------------- | ---------------- | ------------------------------------------------- | | OpenAI | DALL-E 3 | $0.04 | 1024x1024: $0.04, 1024x1792: $0.08, hd-*: $0.08-$0.12 | | OpenAI | DALL-E 2 | $0.02 | 1024x1024: $0.02, 512x512: $0.018, 256x256: $0.016 | | OpenAI | gpt-image-1 | $0.04 | 1024x1024: $0.04, 1536x1024: $0.08 | | Google | Imagen 3 | $0.04 | 1024x1024: $0.04, 1536x1536: $0.08 | | Google | Imagen 3 Fast | $0.02 | 1024x1024: $0.02, 1536x1536: $0.04 |

Auto-Detection for Image Generation

await generateText({
  model: openai("gpt-4o"),
  prompt: "Generate an image of a cat",
  tools: { generate_image: myImageGenTool },
  onFinish: tracker.onFinish("gpt-4o", undefined, {
    imageGenerationTools: [],  // Use default tool names
  }),
});

// Custom tool names
onFinish: tracker.onFinish("gpt-4o", undefined, {
  imageGenerationTools: ["my_dalle_tool"],
})

Default Image Generation Tool Names: generate_image, generateImage, image_generation, imageGeneration, create_image, createImage, dall_e, dalle, imagen, stable_diffusion, text_to_image, textToImage

API Reference

`calculateCost(options)`

Calculate the cost for a single API request.

import { calculateCost } from "ai-sdk-cost-calculator";

const cost = calculateCost({
  model: "claude-sonnet-4-5",
  usage: {
    inputTokens: 1000,
    outputTokens: 500,
    inputTokenDetails: {
      noCacheTokens: 700,
      cacheReadTokens: 200,
      cacheWriteTokens: 100,
    },
    outputTokenDetails: {
      textTokens: 500,
      reasoningTokens: 0,
    },
  },
  webSearchRequests: 5,
});

Options

| Property | Type | Required | Description | | -------------------- | -------------------- | -------- | ---------------------------------------------------------- | | model | string | Yes | Model identifier (e.g., "gpt-4o", "claude-sonnet-4-5") | | usage | LanguageModelUsage | Yes | Token usage from AI SDK | | customPricing | ModelPricing | No | Override default pricing | | webSearchRequests | number | No | Number of web search requests | | googleMapsRequests | number | No | Number of Google Maps API requests |

Return Value: `CostBreakdown`

| Property | Type | Description | | ---------------- | --------- | -------------------------------------------- | | inputCost | number | Cost for input tokens (excluding cache) | | outputCost | number | Cost for output tokens (excluding reasoning) | | cacheReadCost | number | Cost for cached input tokens | | cacheWriteCost | number | Cost for writing to cache | | reasoningCost | number | Cost for reasoning tokens | | webSearchCost | number | Cost for web search requests | | googleMapsCost | number | Cost for Google Maps API requests | | totalCost | number | Sum of all costs | | currency | "USD" | Always USD | | isLongContext | boolean | Whether long context pricing was applied |

`createTracker(options?)`

Create a multi-model cost tracker.

import { createTracker } from "ai-sdk-cost-calculator";

const tracker = createTracker({
  customPricing: {
    "my-model": { inputPer1MTokens: 1, outputPer1MTokens: 2 },
  },
  onCost: (model, cost) => console.log(model, cost.totalCost),
});

// Methods
tracker.onFinish(model, callback?, options?);  // Create onFinish callback
tracker.add(model, usage, options?);           // Add usage, returns CostBreakdown
tracker.getModel(model);                       // Get ModelCostSummary for a model
tracker.getModels();                           // Get all tracked model names
tracker.getAllCosts();                         // Get all ModelCostSummary[]
tracker.getTotal();                            // Get total CostBreakdown
tracker.getTotalRequestCount();                // Get total request count
tracker.resetModel(model);                     // Reset specific model
tracker.reset();                               // Reset all

`createCostTracker(options)`

Track cumulative costs for a single model.

import { createCostTracker } from "ai-sdk-cost-calculator";

const tracker = createCostTracker({ model: "gpt-4o" });

tracker.addUsage(result1.usage);
tracker.addUsage(result2.usage);

console.log(tracker.getTotalCost().totalCost);
tracker.reset();

`withCost(model, callback?, options?)`

Create an onFinish callback wrapper that adds cost calculation.

import { withCost } from "ai-sdk-cost-calculator";

await generateText({
  model: openai("gpt-4o"),
  prompt: "Hello!",
  onFinish: withCost("gpt-4o", ({ cost, usage, text }) => {
    console.log(`Cost: $${cost.totalCost}`);
  }),
});

| Parameter | Type | Required | Description | | ---------- | ------------------------- | -------- | ------------------------------------ | | model | LanguageModel \| string | Yes | Model ID or AI SDK model instance | | callback | function | No | Called with result + cost breakdown | | options | CostAwareOptions | No | Custom pricing, web search, onCost |

`getCost(model, usage, options?)`

One-shot cost calculation from AI SDK result.

import { getCost } from "ai-sdk-cost-calculator";

const result = await generateText({ model: openai("gpt-4o"), prompt: "Hello!" });
const cost = getCost("gpt-4o", result.usage);
// or: getCost(openai("gpt-4o"), result.usage)

| Parameter | Type | Required | Description | | --------- | ------------------------- | -------- | ----------------------------------- | | model | LanguageModel \| string | Yes | Model ID or AI SDK model instance | | usage | LanguageModelUsage | Yes | Token usage from AI SDK | | options | CostAwareOptions | No | Custom pricing, web search, onCost |

`createCostAwareAI(options?)`

Create a cost-tracking helper for use across multiple AI SDK calls.

import { createCostAwareAI } from "ai-sdk-cost-calculator";

const ai = createCostAwareAI({
  customPricing: { "my-model": { inputPer1MTokens: 1, outputPer1MTokens: 2 } },
  onCost: (model, cost, usage) => console.log(`[${model}] $${cost.totalCost}`),
});

// Use with onFinish
await generateText({
  model: openai("gpt-4o"),
  onFinish: ai.onFinish("gpt-4o"),
});

// Or calculate directly
const cost = ai.calculate("gpt-4o", result.usage);

// Methods
ai.onFinish(model, callback?, options?);  // Create onFinish callback
ai.calculate(model, usage, options?);     // Calculate and track cost
ai.getModel(model);                       // Get CostBreakdown for a model
ai.getModels();                           // Get all tracked model names
ai.getTotal();                            // Get total CostBreakdown
ai.getRequestCount();                     // Get total request count
ai.reset();                               // Reset all tracking

`getModelId(model)`

Extract model ID from AI SDK model or string.

import { getModelId } from "ai-sdk-cost-calculator";
import { openai } from "@ai-sdk/openai";

getModelId("gpt-4o");           // "gpt-4o"
getModelId(openai("gpt-4o"));   // "gpt-4o"

`formatCost(cost, decimals?)`

Format a cost value as a currency string.

formatCost(0.0015);      // "$0.001500"
formatCost(0.0015, 4);   // "$0.0015"

`formatCostBreakdown(breakdown, decimals?)`

Format a full cost breakdown as a multi-line string.

const output = formatCostBreakdown(cost);
// Input:        $0.002400
// Cache Read:   $0.000060
// Output:       $0.007500
// Web Search:   $0.050000
// Google Maps:  $0.007000
// Total:        $0.066960

`calculateStreamCost(streamResult, options)`

Calculate cost from a streaming response.

import { streamText } from "ai";
import { calculateStreamCost, formatCost } from "ai-sdk-cost-calculator";

const stream = await streamText({
  model: openai("gpt-4o"),
  prompt: "Write a haiku",
});

const cost = await calculateStreamCost(stream, {
  model: "gpt-4o",
  onCost: (cost, usage) => {
    console.log(`Stream completed: ${formatCost(cost.totalCost)}`);
  },
});

`detectRequestsFromResult(result, options?)`

Detect web search and Google Maps tool calls from an AI SDK result.

import { detectRequestsFromResult } from "ai-sdk-cost-calculator";

const result = await generateText({
  model: google("gemini-2.0-flash"),
  prompt: "Find restaurants",
  tools: { web_search: searchTool, places: placesTool },
});

const { webSearchRequests, googleMapsRequests } = detectRequestsFromResult(result);

// With custom tool names (added to defaults)
const detected = detectRequestsFromResult(result, {
  webSearchTools: ["my_search", "custom_tavily"],
  googleMapsTools: ["location_finder"],
});

| Parameter | Type | Required | Description | | ---------------------- | --------------------- | -------- | ------------------------------------------ | | result | ResultWithToolCalls | Yes | AI SDK result with toolCalls/steps | | options.webSearchTools | string[] | No | Additional tool names to count as search | | options.googleMapsTools | string[] | No | Additional tool names to count as maps |

`withDetectedRequests(result, options?)`

Convenience wrapper that returns detected requests for spreading into calculateCost.

import { withDetectedRequests, calculateCost } from "ai-sdk-cost-calculator";

const cost = calculateCost({
  model: "gemini-2.0-flash",
  usage: result.usage,
  ...withDetectedRequests(result),
});

// With custom tool names
const cost2 = calculateCost({
  model: "gemini-2.0-flash",
  usage: result.usage,
  ...withDetectedRequests(result, {
    webSearchTools: ["my_search"],
  }),
});

Supported Models

OpenAI (23 models)

GPT-4o, GPT-4o-mini
GPT-4.1, GPT-4.1-mini, GPT-4.1-nano
GPT-4 Turbo, GPT-4, GPT-4-32k
GPT-3.5 Turbo
o1, o1-mini, o1-pro, o1-preview
o3, o3-mini, o3-pro
o4-mini

Anthropic (23 models)

Claude Opus 4.5, Sonnet 4.5, Haiku 4.5
Claude Opus 4.1, Sonnet 4, Opus 4
Claude 3.5 Sonnet, 3.5 Haiku
Claude 3 Opus, 3 Sonnet, 3 Haiku
All dated variants (e.g., claude-3-5-sonnet-20241022)

Google Gemini (17 models)

Gemini 3 Pro, 3 Flash
Gemini 2.5 Pro, 2.5 Flash, 2.5 Flash-Lite
Gemini 2.0 Flash, 2.0 Flash-Lite
Gemini 1.5 Pro, 1.5 Flash, 1.5 Flash-8B

xAI Grok (12 models)

Grok 4, Grok 4-0709
Grok 4 Fast (reasoning/non-reasoning)
Grok 4.1 Fast (reasoning/non-reasoning)
Grok Code Fast
Grok 3, Grok 3-mini
Grok 2, Grok 2 Vision

DeepSeek (10 models)

DeepSeek Chat (V3, V3.2)
DeepSeek Reasoner (R1)
DeepSeek R1 Distill variants
DeepSeek Coder

Perplexity (6 models)

Sonar, Sonar Small
Sonar Pro
Sonar Reasoning, Sonar Reasoning Pro
Sonar Deep Research

Long Context Pricing

Some models have tiered pricing based on input token count:

| Model | Threshold | Standard | Long Context | | ----------------- | --------- | ----------- | ------------ | | Claude Sonnet 4.5 | 200K | $3/$15 | $6/$22.50 | | Gemini 2.5 Pro | 200K | $1.25/$10 | $2.50/$15 | | Gemini 1.5 Pro | 128K | $1.25/$5 | $2.50/$10 | | Grok 4 Fast | 128K | $0.20/$0.50 | $0.40/$1.00 |

The calculator automatically applies the correct pricing tier based on input token count.

Custom Pricing

Override default pricing for any model:

const cost = calculateCost({
  model: "gpt-4o",
  usage: result.usage,
  customPricing: {
    inputPer1MTokens: 2.0,
    outputPer1MTokens: 8.0,
    cacheReadPer1MTokens: 1.0,
    webSearchPer1kRequests: 15,
    googleMapsPer1kRequests: 10,
  },
});

TypeScript Types

import type {
  // Core
  CostBreakdown,
  CalculateCostOptions,
  ModelPricing,
  ProviderPricing,
  Provider,
  // Single model tracker
  CostTracker,
  CreateCostTrackerOptions,
  StreamCostOptions,
  // Multi-model tracker
  MultiModelCostTracker,
  CreateMultiModelTrackerOptions,
  ModelCostSummary,
  AddUsageOptions,
  // AI SDK integration
  CostAwareAI,
  CreateCostAwareAIOptions,
  CostAwareOptions,
  ResultWithCost,
  FinishResultWithCost,
  // Auto-detection
  DetectOptions,
  DetectedRequests,
  ResultWithToolCalls,
} from "ai-sdk-cost-calculator";

Pricing Sources

Pricing data is sourced from official provider documentation:

Note: Prices may change. Please verify current pricing with official sources for production use.

License

MIT

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

ai-sdk-cost-calculator

Features

Installation

Quick Start

AI SDK Integration

Using withCost with onFinish

Using getCost for One-Shot Calculation

Using createCostAwareAI for Session Tracking

Multi-Model Tracker

Web Search & Google Maps Costs

Auto-Detection from Tool Calls

Using detectRequestsFromResult Directly

Default Tool Names

Pricing by Provider

Image Generation Costs

Image Generation Pricing

Auto-Detection for Image Generation

API Reference

calculateCost(options)

Options

Return Value: CostBreakdown

createTracker(options?)

createCostTracker(options)

withCost(model, callback?, options?)

getCost(model, usage, options?)

createCostAwareAI(options?)

getModelId(model)

formatCost(cost, decimals?)

formatCostBreakdown(breakdown, decimals?)

calculateStreamCost(streamResult, options)

detectRequestsFromResult(result, options?)

withDetectedRequests(result, options?)

Supported Models

OpenAI (23 models)

Anthropic (23 models)

Google Gemini (17 models)

xAI Grok (12 models)

DeepSeek (10 models)

Perplexity (6 models)

Long Context Pricing

Custom Pricing

TypeScript Types

Pricing Sources

License

Using `withCost` with `onFinish`

Using `getCost` for One-Shot Calculation

Using `createCostAwareAI` for Session Tracking

Using `detectRequestsFromResult` Directly

`calculateCost(options)`

Return Value: `CostBreakdown`

`createTracker(options?)`

`createCostTracker(options)`

`withCost(model, callback?, options?)`

`getCost(model, usage, options?)`

`createCostAwareAI(options?)`

`getModelId(model)`

`formatCost(cost, decimals?)`

`formatCostBreakdown(breakdown, decimals?)`

`calculateStreamCost(streamResult, options)`

`detectRequestsFromResult(result, options?)`

`withDetectedRequests(result, options?)`