tokcost

v0.1.0

Published

19 days ago

An instant, offline token + cost counter for LLM text.

0High
0Medium
0Low

abdulmunimjemal

tokens tokenizer tiktoken llm gpt cost cli

tokcost

An instant, offline token + cost counter for LLM text. Daily-use for anyone building with LLMs.

Offline & deterministic — counts tokens locally via gpt-tokenizer. No network, no API key.
Files or stdin — count a file, pipe text in, or pass multiple files for per-file counts plus a total.
Cost estimates — optional --cost flag estimates input dollars from a small built-in pricing table.
Library too — import countTokens / estimateCost in your own code.

Install

# global CLI
npm install -g tokcost

# or as a project dependency / library
pnpm add tokcost

Requires Node.js >= 18.

CLI usage

# count a file
tokcost prompt.md

# count piped stdin
cat prompt.md | tokcost
echo "hello world" | tokcost

# multiple files: per-file counts + a total
tokcost a.md b.md

# pick a model (selects the encoding)
tokcost -m gpt-4 prompt.md

# estimate input cost
tokcost --cost -m gpt-4o prompt.md

# machine-readable output
tokcost --json prompt.md

Flags

| Flag | Description | | --- | --- | | -m, --model <name> | Model to count for (default gpt-4o). Selects the tiktoken encoding. | | --cost | Estimate input $ using the built-in approximate price table. | | --json | Output machine-readable JSON. | | -h, --help | Show help. | | -v, --version | Show version. |

Exit codes: 0 on success, 2 on error (bad model, no input).

`--json` shape

{
  "model": "gpt-4o",
  "encoding": "o200k_base",
  "files": [{ "name": "prompt.md", "tokens": 42 }],
  "total": 42,
  "cost": {
    "available": true,
    "usd": 0.000105,
    "pricePerMTokUsd": 2.5,
    "note": "approximate input pricing; edit the table to suit your needs"
  }
}

Models & encodings

tokcost always prints which encoding it used. Model names map to a tiktoken encoding:

o200k_base — gpt-4o, gpt-4.1, gpt-5, the o-series (o1/o3/o4), and — as an approximation — claude-* and gemini-*.
cl100k_base — gpt-4 and gpt-3.5.

Counts are exact for OpenAI models. For Claude and Gemini there is no public local tokenizer, so tokcost approximates with the closest OpenAI encoding (o200k_base). Treat those counts as estimates, not ground truth.

Pricing caveat

The --cost table is a small, hand-maintained set of approximate input prices (USD per 1M tokens) for common models:

gpt-4o, gpt-4o-mini, gpt-4.1, o3, claude-3.5-sonnet, claude-3.5-haiku, gemini-2.0-flash.

These figures are a rough gut-check, not billing — providers change prices often. Edit src/models.ts to suit your needs. If a model is not in the table, tokcost reports cost as unavailable.

Library API

import { countTokens, estimateCost, resolveEncoding } from "tokcost";

countTokens("hello world");              // => number (uses gpt-4o by default)
countTokens("hello world", "gpt-4");     // => number (uses cl100k_base)

resolveEncoding("claude-3.5-sonnet");    // => "o200k_base"

const tokens = countTokens(text, "gpt-4o");
const cost = estimateCost(tokens, "gpt-4o");
// => { usd: number | undefined, available: boolean, pricePerMTok: number | undefined }

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

tokcost

Install

CLI usage

Flags

--json shape

Models & encodings

Pricing caveat

Library API

License

`--json` shape