opencode-rate-limit

v1.4.0

Published

a month ago

OpenCode plugin that automatically switches to fallback models when rate limited

Downloads

413

0High
0Medium
0Low

zaplakhov

opencode plugin rate-limit fallback ai

opencode-rate-limit

🛑 No more "429 Rate Limit Exceeded" interruptions. Keep your coding session in the flow.

OpenCode plugin that keeps your dev flow unblocked by automatically switching between multiple AI models when you hit rate limits, quota exceedances, or high concurrency errors.

If you "vibe code" with several providers (Gemini Pro, Claude Sonnet, OpenAI, etc.), this plugin turns them into a single resilient virtual model pool. When your primary model drops, OpenCode transparently falls back to the next configured model without breaking your session or losing context.

🛡️ Trust & Security

We know your code and API keys are sensitive.

100% Local: All configurations live on your machine.
Zero Telemetry: The plugin does not track you, send analytics, or transmit your code anywhere. It only intercepts standard HTTP status codes (like 429) to trigger the fallback.
Strictly Validated: Built with a strict ConfigValidator and backed by Vitest unit tests to ensure it never crashes your main OpenCode process.

✨ Features

🔄 Automatic Fallback — Instantly detects API limits and switches to a backup model.
📋 Priority-Based Pool — Define exactly which models to use and in what order.
🤖 Context Preservation — Maintains your exact agent context/mode during the switch.
⏱️ Smart Cooldown — Remembers exhausted models and skips them for a configurable duration.
⚡ Circuit Breaker — Automatically disconnects consistently failing models to prevent wasted API calls.
📈 Local Metrics — Optional detailed stats on rate limits, fallbacks, and model health.
🔃 Hot Reload — Tweak your configuration file on the fly without restarting OpenCode.
🗄️ OpenCode DB Integration — Reads historical metrics from OpenCode's SQLite database with heuristic-based fallback estimation.
📅 30-Day Statistics Window — Default time horizon for historical data analysis.

⚡ Quick Start

Get protected from API limits in under a minute.

1. Enable the plugin

Add "opencode-rate-limit" to your opencode.json configuration file. OpenCode handles the automatic installation.

{
  "plugins": ["opencode-rate-limit"]
}

2. Configure your backup pool

Create a config file at ~/.opencode/rate-limit-fallback.json to define your models.

Mac/Linux One-Liner (Copy & Paste):

echo '{
  "fallbackModels": [
    { "providerID": "anthropic", "modelID": "claude-sonnet-4-20250514" },
    { "providerID": "google", "modelID": "gemini-2.5-pro" },
    { "providerID": "google", "modelID": "gemini-2.5-flash" }
  ]
}' > ~/.opencode/rate-limit-fallback.json

Windows / Manual: Create rate-limit-fallback.json in your home .opencode folder and paste the JSON content manually.

🎉 Done! Next time Claude or Gemini hits a limit, the plugin will seamlessly switch to the next available model.

⚙️ Advanced Configuration (Optional)

You can customize almost everything: retry strategies, jitter, circuit breakers, and health trackers. Place this in ~/.opencode/rate-limit-fallback.json or your project root:

{
  "enabled": true,
  "cooldownMs": 60000,
  "fallbackMode": "cycle",
  "fallbackModels": [
    { "providerID": "anthropic", "modelID": "claude-sonnet-4-20250514" },
    { "providerID": "google", "modelID": "gemini-2.5-pro" }
  ],
  "retryPolicy": {
    "maxRetries": 3,
    "strategy": "exponential",
    "baseDelayMs": 1000,
    "jitterEnabled": true
  },
  "circuitBreaker": {
    "enabled": true,
    "failureThreshold": 5,
    "recoveryTimeoutMs": 60000
  },
  "dynamicPrioritization": {
    "enabled": true,
    "updateInterval": 10
  }
}

Tip: Set "fallbackMode": "cycle" to loop through models, or "stop" to halt when all models are exhausted.

📦 Manual Installation (For Devs)

If you prefer to install the package manually via npm:

npm install opencode-rate-limit

📊 Monitoring & Diagnostics

Data Sources

The plugin combines two types of metrics for comprehensive model health monitoring:

Historical Data (OpenCode SQLite DB):

Reads directly from OpenCode's internal database: ~/.opencode/data/opencode.db
Uses heuristic-based estimation for fallback counts based on retry patterns
Includes all configured models even if no fallback events occurred
Default time window: 30 days (configurable)

Real-Time Data (MetricsManager):

Tracks current session statistics
Monitors active fallbacks, retries, and cooldowns
Provides immediate feedback on model health

Real-Time Status

Use the /rate-limit-status command in OpenCode to get a Markdown report of your current model health, fallbacks, and retry stats. The report includes:

Historical metrics from OpenCode DB (30-day window)
Real-time session statistics from MetricsManager
Model ranking with fallback counts (estimated via heuristics)
Current cooldown status per model

The command is automatically installed when the plugin loads.

Configuration

You can customize the data source behavior in your rate-limit-fallback.json:

{
  "statistics": {
    "enabled": true,
    "windowDays": 30,
    "dbPath": "~/.opencode/data/opencode.db"
  }
}

windowDays: Time horizon for historical data (default: 30)
dbPath: Path to OpenCode's SQLite database (default: auto-detected)

Auto-Detection Logic: If dbPath is not specified, the plugin automatically tries to locate the OpenCode database:

Primary: ~/.opencode/data/opencode.db (current OpenCode location)
Fallback: ~/.local/share/opencode/opencode.db (legacy location)

This ensures compatibility with both current and older OpenCode installations.

Troubleshooting

If the plugin isn't falling back:

Ensure your config file is valid JSON.
Check that fallbackModels contains valid Provider and Model IDs.
Add "verbose": true to your config file to see detailed circuit breaker logs in the OpenCode console.
Verify OpenCode's database exists at the configured path if historical statistics don't appear.

🤝 Support & Feedback

Found a bug or have a feature request? Let's make this plugin better together.

Open an Issue
Submit a Pull Request

📄 License

MIT

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

opencode-rate-limit

🛡️ Trust & Security

✨ Features

⚡ Quick Start

1. Enable the plugin

2. Configure your backup pool

⚙️ Advanced Configuration (Optional)

📦 Manual Installation (For Devs)

📊 Monitoring & Diagnostics

Data Sources

Real-Time Status

Configuration

Troubleshooting

🤝 Support & Feedback

📄 License