@dex-ai/context

v0.7.17

Published

8 days ago

Index-and-pointer context management — indexes all tool outputs into FTS5 knowledge base for zero-loss compression. Provides ctx_search for on-demand retrieval. Zero config.

0High
0Medium
0Low

klxdev

@dex-ai/context

Context management extension for Dex — tracks token usage, provides a visual /context command, enforces context budgets, and guides the LLM to avoid flooding the context window.

What It Does

Context Usage

■ ■ ■ ■ ■ ■ ■ ■ ■ ■      Total Usage    102k ( 51.1%)
■ ■ ■ ■ ■ ■ ■ ■ ■ ■
■ ■ ■ ■ ■ ■ ■ ■ □ □      ■ System Prompt   1k (  0.7%)
□ □ □ □ □ □ □ □ □ □      ■ System Tools    2k (  0.9%)
□ □ □ □ □ □ □ □ □ □      ■ Tool Results   96k ( 48.1%)
                           ■ Messages        3k (  1.5%)
                           □ Available      98k ( 48.9%)

Features

Real-time context tracking — estimates token usage across categories (system prompt, tools, messages, tool calls/results, images, files, reasoning) using fast heuristics calibrated against cl100k_base.
/context command — visual grid + percentage breakdown showing exactly where context is being consumed, with color-coded categories.
Threshold warnings — automatically injects guidance to the LLM when context usage exceeds configurable thresholds (default: warn at 75%, critical at 90%).
Context-awareness skill — teaches the LLM patterns for keeping outputs small (targeted reads, search-before-read, output truncation).
Tool output tracking — records per-tool output sizes to identify the biggest context consumers.

Usage

import { contextExtension } from "@dex-ai/context";
import { Agent } from "@dex-ai/sdk";

const agent = await Agent.create({
  provider: "anthropic",
  model: "claude-sonnet-4-20250514",
  extensions: [
    providerExt,
    contextExtension(), // sensible defaults
  ],
});

With Options

contextExtension({
  maxTokens: 128_000, // override context window (auto-detected from model)
  warnAt: 70, // warn threshold (default: 75%)
  criticalAt: 85, // critical threshold (default: 90%)
  injectGuidance: true, // inject context-awareness skill (default: true)
  trackToolOutputs: true, // track per-tool output sizes (default: true)
  largeOutputThreshold: 4_000, // tokens above which an output is "large"
});

`/context` Command Integration

The extension stores accessor functions in AgentContext.state that the host (CLI/TUI) can invoke:

// In your host/CLI command handler:
const getFormatted = agent.context.state.get(
  "context:getFormatted",
) as () => string;
const getPlain = agent.context.state.get("context:getPlain") as () => string;
const getSnapshot = agent.context.state.get(
  "context:getSnapshot",
) as () => ContextSnapshot;

// For TUI rendering (ANSI color codes):
console.log(getFormatted());

// For plain text (no ANSI):
console.log(getPlain());

// For programmatic access:
const snapshot = getSnapshot();
console.log(
  `${snapshot.usagePercent}% used, ${snapshot.availableTokens} remaining`,
);

How It Works

Token Estimation

Uses character-based heuristics calibrated for modern tokenizers:

English text: ~4 chars per token
Code: ~3.5 chars per token
JSON/structured: ~3 chars per token
Images: Anthropic tile-based estimation

This is intentionally approximate (±10%). Real token counts come from the provider's usage response and are used to calibrate the estimates.

Category Tracking

| Category | What's counted | | ------------- | ---------------------------------------------------- | | System Prompt | System messages + injected skills | | System Tools | Tool definitions (name + description + JSON Schema) | | Tool Calls | Assistant tool invocations (name + serialized input) | | Tool Results | Tool outputs (text, JSON, rich content) | | Messages | User + assistant text messages | | Images | Image content (resolution-based estimation) | | Files | File attachments (size-based estimation) | | Reasoning | Chain-of-thought / extended thinking |

Threshold Behavior

When context usage exceeds a threshold, the extension injects a concise system message into the next model request:

Warning (75%): "⚠️ Context usage: 76%. Be concise. Avoid large reads/outputs."
Critical (90%): "🚨 Context nearly full: 91% used. Complete the current task as concisely as possible."

These are injected once per threshold crossing (not on every model call).

Context-Awareness Skill

When injectGuidance: true (default), a skill is added to the system prompt teaching the LLM:

Prefer targeted reads (line ranges) over full file reads
Use search before read to find relevant sections
Truncate bash output with head/tail/grep
Avoid redundant re-reads of files already in context
Be more concise when context is running low

Comparison with context-mode

| Feature | context-mode | @dex-ai/context | | ------------------- | ---------------------- | ------------------------------------------------- | | Token tracking | ❌ (defers to host) | ✅ Built-in estimation | | Visual display | ❌ | ✅ Grid + categories | | Hard-blocks tools | ✅ (blocks curl/fetch) | ❌ (guidance-based) | | Sandboxed execution | ✅ (separate process) | ❌ (not needed — Dex tools already handle this) | | FTS5 knowledge base | ✅ (SQLite) | ❌ (out of scope — use @dex-ai/knowledge) | | Session persistence | ✅ (SessionDB) | ❌ (out of scope — use @dex-ai/session-extension) | | Native dependencies | better-sqlite3 | None | | Weight | 3.6 MB | ~15 KB |

Philosophy difference: context-mode hard-blocks certain tool calls to prevent context flooding. @dex-ai/context takes a guidance approach — it teaches the LLM good patterns and warns when budgets are exceeded, but trusts the model to make the right choice. This works better with modern models that can follow instructions.

API

`contextExtension(opts?): Extension`

Factory function — creates the extension.

`estimateTokens(text: string): number`

Estimate token count for a string.

`formatContextUsage(snapshot: ContextSnapshot): string`

Format a snapshot with ANSI colors for terminal display.

`formatContextUsagePlain(snapshot: ContextSnapshot): string`

Format a snapshot as plain text (no colors).

Types

interface ContextSnapshot {
  timestamp: number;
  totalTokens: number;
  maxTokens: number;
  usagePercent: number;
  categories: CategoryUsage[];
  availableTokens: number;
}

interface CategoryUsage {
  category: ContextCategory;
  tokens: number;
  percent: number;
}

type ContextCategory =
  | "system-prompt"
  | "system-tools"
  | "messages"
  | "tool-calls"
  | "tool-results"
  | "images"
  | "files"
  | "reasoning";

Development

bun install
bun run typecheck
bun test

License

MIT

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

@dex-ai/context

What It Does

Features

Usage

With Options

/context Command Integration

How It Works

Token Estimation

Category Tracking

Threshold Behavior

Context-Awareness Skill

Comparison with context-mode

API

contextExtension(opts?): Extension

estimateTokens(text: string): number

formatContextUsage(snapshot: ContextSnapshot): string

formatContextUsagePlain(snapshot: ContextSnapshot): string

Types

Development

License

`/context` Command Integration

`contextExtension(opts?): Extension`

`estimateTokens(text: string): number`

`formatContextUsage(snapshot: ContextSnapshot): string`

`formatContextUsagePlain(snapshot: ContextSnapshot): string`