opencode-copilot-usage-detector

v0.1.7

Published

2 days ago

OpenCode plugin that tracks GitHub Copilot token usage, learns rate limits empirically, and proactively informs the user


moodl

opencode opencode-plugin plugin copilot budget rate-limit token-tracking

opencode-copilot-usage-detector

Experimental — This plugin is in early development. Features may change, data formats may evolve, and there will be rough edges. Use at your own risk and please report issues.

An OpenCode plugin that tracks GitHub Copilot token usage across sessions, empirically learns rate limits, and proactively informs you before you hit them.

What This Plugin Does

This plugin has two capabilities:

Token tracking — Tracks token and request usage across sessions, per model, per day. Always works from minute one via /budget status.
Rate limit prediction — Learns your daily token limits empirically by observing when you get rate-limited, then warns you as you approach those limits on subsequent days. This requires hitting a rate limit at least once before predictions begin — the system learns from real data, not assumptions.

Note: If you never hit rate limits (e.g., you have a high-tier plan or light usage), the prediction system won't have data to learn from and won't produce warnings. Cross-session token tracking via /budget status may still be useful, but rate limit prediction is the primary purpose of this plugin.

The Problem

GitHub Copilot doesn't publish concrete token/request limits for the coding assistant. There are multiple opaque limit tiers:

Short-term burst limits per time window
Preview model limits that are separate and stricter
Daily/monthly token budgets that vary by plan

This plugin learns these limits from your own usage patterns and warns you as you approach them.

Features

Token & request tracking — Per-day, per-model usage with RPM monitoring
Adaptive limit learning — Weighted averages with exponential recency decay and confidence scoring
Multi-dimensional hypothesis tracking — Learns whether limits are token-based, request-based, or RPM-based
Blocked model detection — Identifies models not available on your plan (403, access denied, etc.) and separates them from real rate limits
Preview model detection — Automatically identifies models with separate, stricter limits
Rate-limit classification — 5-stage classifier distinguishing burst, preview, and daily limits
Toast notifications — Non-intrusive TUI toasts for rate limits, blocked models, and budget thresholds (doesn't pollute conversation)
System prompt injection — Budget status in every LLM context (zero tool-call overhead, skipped for subagent sessions)
Threshold notifications — Configurable alerts at 60%, 80%, 95% of estimated limits
Config validation — Validates config types on startup, warns on unknown keys
Temporal patterns — Learns what time of day you typically hit limits and how model choice affects runway
Model fallback detection — Detects when Copilot silently downgrades your model
Full error catalog — Logs all API errors for pattern analysis
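
The separation of blocked models from genuine rate limits might look roughly like this. It is a minimal sketch, not the plugin's actual classifier: the `ApiError` shape and the `classifyError` name are assumptions made for illustration, and only the 403 / "access denied" and HTTP 429 signals come from this README.

```typescript
// Hypothetical error shape; the plugin's real event payload may differ.
interface ApiError {
  status: number;
  message: string;
}

type ErrorKind = "blocked_model" | "rate_limit" | "other";

// Keep "model not available on your plan" apart from real rate limits,
// so that blocked models never feed the limit estimates.
function classifyError(err: ApiError): ErrorKind {
  const msg = err.message.toLowerCase();
  if (err.status === 403 || msg.includes("access denied")) {
    return "blocked_model";
  }
  if (err.status === 429) {
    return "rate_limit";
  }
  return "other";
}
```

Checking for the blocked case first means a 403 is never counted as a limit hit, which is what keeps the learned estimates clean.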

Requirements

OpenCode v1.2.0 or later
Node.js 18+
GitHub Copilot subscription

Installation

1. Install the package

cd ~/.config/opencode
npm install opencode-copilot-usage-detector

2. Register the plugin

Add it to your ~/.config/opencode/opencode.json:

{
  "plugin": ["opencode-copilot-usage-detector"]
}

If you already have other plugins, add it to the existing array:

{
  "plugin": ["@tarquinen/opencode-dcp@latest", "opencode-copilot-usage-detector"]
}

3. Restart OpenCode

The plugin loads automatically on startup. Use /budget to verify it's working.

Updating

cd ~/.config/opencode
npm install opencode-copilot-usage-detector@latest

Then restart OpenCode.

Usage

The plugin works automatically; no action is needed. It:

Tracks every LLM request via event hooks
Injects budget status into the system prompt
Notifies you with TUI toasts when approaching estimated limits
Logs everything to ~/.config/copilot-budget/observations.jsonl

`/budget` Command

| Command | Description |
|---------|-------------|
| /budget or /budget status | Current usage, estimates, and model breakdown |
| /budget history | Daily token usage for the last 14 days |
| /budget insights | Learned patterns, limit estimates, temporal analysis |
| /budget errors | Rate limit events and error catalog |
| /budget recompute | Force recompute all estimates from observations |
| /budget reset | Wipe today's data and start fresh |
| /budget clean [target] | Remove specific entries from the observation log |

Clean targets

| Target | Description |
|--------|-------------|
| errors | Remove all logged errors |
| blocked | Remove all blocked model entries |
| limit_hits | Remove all rate limit entries |
| fake_hits | Remove limit_hits from models with no usage (misrecorded blocked models) |
| model <name> | Remove all entries for a specific model |
| before <date> | Remove entries before a date (YYYY-MM-DD) |
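
To make the clean targets concrete, here is a rough sketch of how they could be applied to an in-memory list of observations. The `Entry` shape, its field names, and the `clean` function are hypothetical; the actual log schema is not documented in this README.

```typescript
// Hypothetical observation entry; field names are illustrative only.
interface Entry {
  kind: "usage" | "error" | "blocked" | "limit_hit";
  model: string;
  date: string; // YYYY-MM-DD
}

type CleanTarget =
  | { target: "errors" | "blocked" | "limit_hits" | "fake_hits" }
  | { target: "model"; name: string }
  | { target: "before"; date: string };

// Return the entries that survive the given clean target.
function clean(entries: Entry[], t: CleanTarget): Entry[] {
  switch (t.target) {
    case "errors":
      return entries.filter((e) => e.kind !== "error");
    case "blocked":
      return entries.filter((e) => e.kind !== "blocked");
    case "limit_hits":
      return entries.filter((e) => e.kind !== "limit_hit");
    case "fake_hits": {
      // Models with at least one usage entry; limit hits on any other
      // model are treated as misrecorded blocked models.
      const used = new Set(
        entries.filter((e) => e.kind === "usage").map((e) => e.model),
      );
      return entries.filter((e) => e.kind !== "limit_hit" || used.has(e.model));
    }
    case "model":
      return entries.filter((e) => e.model !== t.name);
    case "before":
      // ISO dates (YYYY-MM-DD) compare correctly as plain strings.
      return entries.filter((e) => e.date >= t.date);
  }
}
```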

Example: `/budget status`

Copilot Budget — 2026-03-21

Tokens today: 1.8M (67 requests)
RPM: 3 req/min (peak: 7)
Estimated limit: ~2.9M (82% confidence)
Usage: ~63%

Models:
  claude-opus-4.5  1.4M   42 req
  gpt-5.4-mini     422K   25 req
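
The RPM line in the status output can be pictured as a sliding one-minute window over request timestamps. The sketch below is an assumption about how such a counter could work; the `RpmTracker` class is hypothetical and not taken from the plugin's source.

```typescript
// Counts requests in the trailing 60 seconds and remembers the peak.
class RpmTracker {
  private timestamps: number[] = [];
  private peakRpm = 0;

  // Record a request at `nowMs` (epoch milliseconds); returns the current RPM.
  record(nowMs: number): number {
    this.timestamps.push(nowMs);
    const cutoff = nowMs - 60_000;
    // Drop timestamps that are 60 seconds old or older.
    while (this.timestamps.length > 0 && this.timestamps[0] <= cutoff) {
      this.timestamps.shift();
    }
    const rpm = this.timestamps.length;
    if (rpm > this.peakRpm) this.peakRpm = rpm;
    return rpm;
  }

  get peak(): number {
    return this.peakRpm;
  }
}
```

Calling `record(Date.now())` on every request would yield the current rate, while `peak` corresponds to the value shown in parentheses.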

Example: System Prompt Injection

Every LLM request automatically includes this context in its system prompt (no tool call needed):

<copilot-budget>
Daily token usage: 1.8M tokens (67 requests)
Estimated daily limit: ~2.9M tokens (confidence: 82%)
Usage percentage: ~63%
Current rate: 3 req/min (peak: 7)

Model breakdown:
  claude-opus-4.5: 1.4M tokens / 42 requests (stable)
  gpt-5.4-mini: 422K tokens / 25 requests (stable)
</copilot-budget>
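
A block like the one above could be rendered from a usage snapshot along these lines. This is only a sketch: the `Snapshot` and `ModelUsage` types and both function names are hypothetical, and the rounding rules are guesses chosen to reproduce the figures in the example.

```typescript
interface ModelUsage {
  model: string;
  tokens: number;
  requests: number;
  category: string; // e.g. "stable" or "preview"
}

interface Snapshot {
  tokens: number;
  requests: number;
  limit: number;      // estimated daily token limit
  confidence: number; // 0..1
  rpm: number;
  peakRpm: number;
  models: ModelUsage[];
}

// Compact token counts: 1_822_000 -> "1.8M", 422_000 -> "422K".
function formatTokens(n: number): string {
  if (n >= 1_000_000) return (n / 1_000_000).toFixed(1) + "M";
  if (n >= 1_000) return Math.round(n / 1_000) + "K";
  return String(n);
}

// Build the text that would be appended to the system prompt.
function renderBudgetBlock(s: Snapshot): string {
  const pct = Math.round((s.tokens / s.limit) * 100);
  const lines = [
    "<copilot-budget>",
    `Daily token usage: ${formatTokens(s.tokens)} tokens (${s.requests} requests)`,
    `Estimated daily limit: ~${formatTokens(s.limit)} tokens (confidence: ${Math.round(s.confidence * 100)}%)`,
    `Usage percentage: ~${pct}%`,
    `Current rate: ${s.rpm} req/min (peak: ${s.peakRpm})`,
    "",
    "Model breakdown:",
    ...s.models.map(
      (m) => `  ${m.model}: ${formatTokens(m.tokens)} tokens / ${m.requests} requests (${m.category})`,
    ),
    "</copilot-budget>",
  ];
  return lines.join("\n");
}
```

Because the block is plain text built from already-aggregated numbers, injecting it costs no tool call at request time.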

Toast Notifications

Alerts appear as non-intrusive TUI toasts that don't pollute the conversation:

Budget warning — 80% of daily budget used (2.3M / ~2.9M est.)
Rate limited — 2.8M tokens, 142 req | claude-opus-4.5 | hard_daily_limit
Model blocked — claude-opus-4.6 is not available on your plan (status: 403)
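
One way to decide when a budget warning should fire is to compare usage before and after each request against the configured thresholds, so that every threshold triggers only once. The `crossedThresholds` helper below is a hypothetical sketch under that assumption, not the plugin's own logic.

```typescript
// Return the thresholds (in percent) crossed when usage moves from
// `prevTokens` to `currTokens`, given an estimated daily limit.
function crossedThresholds(
  prevTokens: number,
  currTokens: number,
  estimatedLimit: number,
  thresholds: number[] = [60, 80, 95],
): number[] {
  if (estimatedLimit <= 0) return []; // no estimate yet, nothing to warn about
  const prevPct = (prevTokens / estimatedLimit) * 100;
  const currPct = (currTokens / estimatedLimit) * 100;
  return thresholds.filter((t) => prevPct < t && currPct >= t);
}
```

With an estimated limit of 2.9M tokens, moving from 2.2M to 2.35M tokens crosses only the 80% threshold, so a single toast would be shown.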

Example: `/budget insights`

After accumulating data over several days:

Copilot Budget Insights

Data since: 2026-03-01
Days observed: 21
Days with limit hit: 8

Global Daily Budget
  Token estimate: ~2.9M (+/- 210K)
  Confidence: 82% (8 data points)
  Active limit type: tokens

Model Categories
  claude-opus-4.5  stable   auto  95%  5 errors
  claude-opus-4.6  preview  auto  88%  4 errors  limit ~400K
  gpt-5.4-mini     stable   auto  90%  1 errors

Temporal Patterns
  Typical limit time: 16:30
  Std dev: +/- 75 min
  Reset type: daily_fixed
  Estimated reset: 00:00

Insights
  [model_impact] claude-opus-4.5-heavy days hit limits ~2.1h earlier than mixed days (75%, 8 data points)
  [preview_detection] claude-opus-4.6 has separate preview limit (~400K tokens) (88%, 4 data points)
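
The "typical limit time" and its spread can be thought of as the mean and standard deviation of the times of day at which limits were hit. The sketch below assumes those times are stored as minutes since local midnight; the function names are hypothetical. It averages linearly, so it would be misleading for hits that cluster around midnight.

```typescript
// Mean and population standard deviation of limit-hit times,
// given as minutes since local midnight.
function limitTimeStats(
  minutesOfDay: number[],
): { mean: number; stdDev: number } | null {
  const n = minutesOfDay.length;
  if (n === 0) return null;
  const mean = minutesOfDay.reduce((a, b) => a + b, 0) / n;
  const variance =
    minutesOfDay.reduce((a, m) => a + (m - mean) ** 2, 0) / n;
  return { mean, stdDev: Math.sqrt(variance) };
}

// Format minutes since midnight as HH:MM.
function toClock(minutes: number): string {
  const total = Math.round(minutes);
  const h = Math.floor(total / 60);
  const m = total % 60;
  return String(h).padStart(2, "0") + ":" + String(m).padStart(2, "0");
}
```

For example, hits at 15:15 and 17:45 (915 and 1065 minutes) give a mean of 16:30 with a standard deviation of 75 minutes.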

Configuration

Optionally create ~/.config/copilot-budget/config.json:

{
  "debug": false,
  "known_preview_models": [],
  "known_stable_models": [],
  "notification_thresholds": [60, 80, 95],
  "premium_request_multipliers": {
    "claude-opus-4.5": 3.0,
    "claude-sonnet-4.5": 1.0,
    "gpt-5.4-mini": 0.33
  },
  "timezone": "Europe/Berlin",
  "quiet_mode": false
}

All fields are optional; any field you omit falls back to its default.
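
Merging a user config with defaults while validating types and warning on unknown keys could be sketched as follows. The `loadConfig` function and the exact warning texts are hypothetical; the field names and default values mirror the ones documented in this README.

```typescript
interface BudgetConfig {
  debug: boolean;
  known_preview_models: string[];
  known_stable_models: string[];
  notification_thresholds: number[];
  premium_request_multipliers: Record<string, number>;
  timezone: string;
  quiet_mode: boolean;
}

const DEFAULTS: BudgetConfig = {
  debug: false,
  known_preview_models: [],
  known_stable_models: [],
  notification_thresholds: [60, 80, 95],
  premium_request_multipliers: {},
  timezone: "UTC",
  quiet_mode: false,
};

const isBool = (v: unknown): boolean => typeof v === "boolean";
const isString = (v: unknown): boolean => typeof v === "string";
const isArrayOf = (type: string) => (v: unknown): boolean =>
  Array.isArray(v) && v.every((x) => typeof x === type);
const isNumberRecord = (v: unknown): boolean =>
  typeof v === "object" && v !== null && !Array.isArray(v) &&
  Object.values(v).every((x) => typeof x === "number");

// One type check per known key.
const CHECKS: Record<keyof BudgetConfig, (v: unknown) => boolean> = {
  debug: isBool,
  known_preview_models: isArrayOf("string"),
  known_stable_models: isArrayOf("string"),
  notification_thresholds: isArrayOf("number"),
  premium_request_multipliers: isNumberRecord,
  timezone: isString,
  quiet_mode: isBool,
};

// Merge `raw` over the defaults; invalid or unknown keys produce warnings.
function loadConfig(
  raw: Record<string, unknown>,
): { config: BudgetConfig; warnings: string[] } {
  const warnings: string[] = [];
  const merged: Record<string, unknown> = { ...DEFAULTS };
  for (const [key, value] of Object.entries(raw)) {
    if (!Object.prototype.hasOwnProperty.call(CHECKS, key)) {
      warnings.push(`unknown key: ${key}`);
      continue;
    }
    if (!CHECKS[key as keyof BudgetConfig](value)) {
      warnings.push(`invalid type for ${key}, using default`);
      continue;
    }
    merged[key] = value;
  }
  return { config: merged as unknown as BudgetConfig, warnings };
}
```

Falling back to the default for an invalid value, rather than aborting, keeps the plugin usable even when the config file contains a typo.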

| Field | Description | Default |
|-------|-------------|---------|
| debug | Log all events to debug-events.jsonl | false |
| known_preview_models | Models to always treat as preview | [] |
| known_stable_models | Models to always treat as stable | [] |
| notification_thresholds | Percentage thresholds for chat warnings | [60, 80, 95] |
| premium_request_multipliers | Model cost multipliers for weighted tracking | {} |
| timezone | Timezone for day boundaries (e.g., Europe/Berlin, America/New_York) | "UTC" |
| quiet_mode | Suppress threshold notifications | false |
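
The timezone setting matters because the same instant can belong to different calendar days. As an illustration, a per-day bucket key could be derived with the standard `Intl.DateTimeFormat` API; the `dayKey` helper is a hypothetical name and not necessarily how the plugin computes day boundaries.

```typescript
// Return the calendar day (YYYY-MM-DD) that `date` falls on in `timeZone`.
function dayKey(date: Date, timeZone: string): string {
  const parts = new Intl.DateTimeFormat("en-US", {
    timeZone,
    year: "numeric",
    month: "2-digit",
    day: "2-digit",
  }).formatToParts(date);
  const get = (type: string): string =>
    parts.find((p) => p.type === type)?.value ?? "";
  return `${get("year")}-${get("month")}-${get("day")}`;
}
```

A request made at 23:30 UTC on 2026-03-21 counts toward 2026-03-21 with the default "UTC" setting, but toward 2026-03-22 with "Europe/Berlin", where it is already 00:30.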

Data Storage

All data is stored locally in ~/.config/copilot-budget/:

| File | Description |
|------|-------------|
| observations.jsonl | Append-only event log (source of truth) |
| estimates.json | Derived limit model (can be deleted and regenerated) |
| config.json | User configuration |
| debug-events.jsonl | Debug event log (only when debug: true) |

The JSONL file rotates automatically once it reaches 50 MB or contains entries older than 90 days. No data is sent anywhere; everything stays on your machine.
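
The age-based part of that rotation can be illustrated on the log's text content. The sketch below assumes each line is a JSON object with an ISO timestamp in a `ts` field; that field name and the `pruneJsonl` function are hypothetical, since the log schema is not documented here.

```typescript
// Keep only JSONL lines whose `ts` is within `maxAgeDays` of `now`.
function pruneJsonl(content: string, now: Date, maxAgeDays = 90): string {
  const cutoff = now.getTime() - maxAgeDays * 24 * 60 * 60 * 1000;
  const kept: string[] = [];
  for (const line of content.split("\n")) {
    if (line.trim() === "") continue;
    try {
      const entry = JSON.parse(line) as { ts?: string };
      const t = entry.ts ? Date.parse(entry.ts) : NaN;
      // Entries without a usable timestamp are kept rather than lost.
      if (Number.isNaN(t) || t >= cutoff) kept.push(line);
    } catch {
      // Malformed lines are kept untouched for manual inspection.
      kept.push(line);
    }
  }
  return kept.length > 0 ? kept.join("\n") + "\n" : "";
}
```

Keeping lines that cannot be parsed is a deliberate choice in this sketch: since the log is described as the source of truth, dropping data silently seems riskier than carrying a few stray lines.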

How It Learns

1. Aggregation: Every LLM response's token counts are recorded per model per day
2. Error detection: API errors (especially HTTP 429) are captured with full context, including response headers
3. Classification: A 5-stage classifier determines whether an error is a burst limit, preview limit, or daily limit
4. Estimation: Weighted averages with a 14-day half-life produce limit estimates with confidence scores
5. Insight generation: After accumulating data, the system identifies patterns (e.g., "opus-heavy days hit limits 2h earlier")
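
The estimation step can be sketched as a recency-weighted average in which an observation's weight halves every 14 days. The `LimitHit` shape and the `estimateLimit` function are assumptions made for illustration; the plugin's real estimator and its confidence formula are not described in this README and may differ.

```typescript
// Hypothetical record of a day on which the limit was hit.
interface LimitHit {
  tokens: number;  // tokens used when the limit was hit
  ageDays: number; // how many days ago the hit occurred
}

// Weighted average with exponential recency decay:
//   weight = 0.5 ^ (ageDays / halfLifeDays)
function estimateLimit(
  hits: LimitHit[],
  halfLifeDays = 14,
): { estimate: number; weightSum: number } | null {
  if (hits.length === 0) return null;
  let weightSum = 0;
  let weighted = 0;
  for (const h of hits) {
    const w = Math.pow(0.5, h.ageDays / halfLifeDays);
    weightSum += w;
    weighted += w * h.tokens;
  }
  // weightSum acts as an effective sample size that a confidence
  // score could be derived from.
  return { estimate: weighted / weightSum, weightSum };
}
```

For instance, a hit at 3.0M tokens today (weight 1) and one at 2.0M tokens 14 days ago (weight 0.5) give an estimate of about 2.67M, closer to the recent observation.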

Development

git clone https://github.com/moodl/opencode-copilot-usage-detector.git
cd opencode-copilot-usage-detector
npm install
npm run build
npm test

After code changes:

npm run build
cd ~/.config/opencode
npm install /path/to/opencode-copilot-usage-detector
# Restart OpenCode

Disclaimer

This project is not affiliated with, endorsed by, or associated with GitHub, Microsoft, or OpenCode in any way. It is an independent, community-built tool.

This plugin observes your local usage patterns and API error responses. It does not access any external APIs — all data is derived from local observation of API responses.
Rate limit estimates are empirical approximations, not official figures. GitHub may change limits at any time without notice.
The authors assume no responsibility for any consequences of using this plugin, including but not limited to: account restrictions, incorrect estimates, missed rate limits, or any impact on your GitHub Copilot service.
All data collected by this plugin is stored locally on your machine and is never transmitted to any external service.

License

MIT. See LICENSE for the full text.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND. See the license for the complete terms.

Contributing

Contributions are welcome! Please open an issue first to discuss what you'd like to change.
