opencode-rate-limit-fallback

v0.3.1

Published

11 days ago

OpenCode plugin that automatically switches to a fallback model when rate limits are hit

0High
0Medium
0Low

liamvinberg

opencode rate-limit fallback model plugin

opencode-rate-limit-fallback

OpenCode plugin that automatically switches to a fallback model when rate limits are hit.

Installation

Add to your opencode.jsonc:

{
  "plugin": ["opencode-rate-limit-fallback"]
}

Configuration

Create rate-limit-fallback.json in your OpenCode config directory:

Locations checked (in order):

~/.config/opencode/rate-limit-fallback.json
~/.config/opencode/config/rate-limit-fallback.json
~/.config/opencode/plugins/rate-limit-fallback.json
~/.config/opencode/plugin/rate-limit-fallback.json

Example config:

{
  "enabled": true,
  "fallbackModel": "anthropic/claude-opus-4-5",
  "cooldownMs": 300000,
  "patterns": [
    "rate limit",
    "usage limit",
    "too many requests",
    "quota exceeded",
    "overloaded"
  ],
  "logging": true
}

Options

| Option | Type | Default | Description | |--------|------|---------|-------------| | enabled | boolean | true | Enable/disable the plugin | | fallbackModel | string | object | "anthropic/claude-opus-4-5" | Fallback model (see formats below) | | cooldownMs | number | 300000 | Cooldown period in ms (default: 5 minutes) | | patterns | string[] | (see below) | Custom rate limit detection patterns | | logging | boolean | false | Enable file-based logging |

Fallback Model Formats

String format (recommended):

{
  "fallbackModel": "anthropic/claude-opus-4-5"
}

Custom Patterns

Add your own rate limit detection patterns:

{
  "patterns": [
    "rate limit",
    "usage limit",
    "too many requests",
    "quota exceeded",
    "overloaded",
    "capacity exceeded"
  ]
}

Patterns are case-insensitive and matched against the retry message.

Logging

When logging: true, logs are written to:

~/.local/share/opencode/logs/rate-limit-fallback.log

Log entries include timestamps and details about rate limit detection, fallback attempts, and errors.

How It Works

Detection: Listens for session.status events with retry messages matching configured patterns.
Fallback: When detected:
- Aborts the current retry loop
- Retrieves the last user message from the session
- Reverts the session to before that message (removing the failed attempt)
- Re-sends the original message with the fallback model
- Starts a cooldown timer
Cooldown: During the cooldown period, subsequent rate limits on the same session are ignored (prevents spam). After cooldown expires, normal model selection resumes.

This approach keeps the conversation history clean - no "continue" messages or duplicates. The session seamlessly continues with the fallback model as if the rate limit never happened.

Local Development

For local development, use a file:// URL in your config:

{
  "plugin": [
    "file:///path/to/opencode-rate-limit-fallback/index.ts"
  ]
}

License

MIT

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

opencode-rate-limit-fallback

Installation

Configuration

Options

Fallback Model Formats

Custom Patterns

Logging

How It Works

Local Development

License