opencode-session-recall

v0.11.0

Published

7 days ago

Everything your agent ever did is already in the database — this plugin lets it look

0High
0Medium
0Low

maelos

opencode opencode-plugin plugin recall history compaction context search memory

opencode-session-recall

Every conversation your agent has ever had — across every session, every project — is already in the database. It's just not looking.

OpenCode stores the full conversation history from every session your agent has ever run — messages, tool calls, tool outputs, reasoning traces. All of it. Not just the current session. Not just the current project. Every project on the machine. Even after compaction shrinks what the model can see, the original content stays in the database — just no longer visible to the agent.

This plugin gives the agent five tools to search and retrieve all of it on demand.

No new database. No embeddings. No summarization. No duplication. No overhead.

Just install the plugin. The agent gains access to its entire history.

The problem is absurd when you think about it

Your agent solves a tricky build error. Twenty minutes later, compaction runs. An hour later, the same error shows up. The agent starts from zero — debugging something it already figured out, while the answer sits in the database it's connected to.

You built rate-limiting middleware in your API project last week. Now you need it in another project. The agent has no idea it ever existed — while the original implementation, the requirements discussion, the edge cases you worked through, all of it is sitting in the same database, in a session from a different project.

You're 200 tool calls and 3 compactions deep. The agent has drifted from your original request. Your exact words are gone from context. But they're not gone — they're in the database. The agent just can't see them.

The data already exists. This plugin removes the blindfold.

What it looks like

"We already fixed this."

recall({ query: "ECONNREFUSED retry", scope: "session" })

Agent finds its own solution from 2 hours ago. Doesn't re-derive it.

"It was in that other project."

recall_sessions({ scope: "global", search: "rate limit" })
recall_get({ sessionID: "...", messageID: "..." })

Finds the implementation from your API project. Reuses it instead of reinventing it.

"What did I originally ask for?"

recall_messages({ limit: 5, role: "user" })

Pulls up exact original requirements after 3 compactions. Checks its own work against what you actually said.

"What was that error?"

recall({ query: "TypeError", type: "tool", scope: "session" })

Gets the full stack trace from a tool output that got pruned. Doesn't re-run the failing command.

"Why did we decide on that approach?"

recall({ query: "chose postgres over", scope: "project", type: "reasoning" })

Recovers the reasoning behind an architectural decision from three sessions ago. Context that no summary captures.

"Find it even with a typo."

recall({ query: "prefiltr", match: "fuzzy", scope: "session" })

Fuzzy search finds prefilter even when the agent misremembers the exact spelling. Results ranked by relevance, not just recency.

"Which sessions touched this topic?"

recall({ query: "rate limiting", scope: "global", match: "smart", group: "session" })

4 sessions across 3 projects, each with hitCount and best representative snippet. One call to discover everywhere a topic came up.

Smart and fuzzy search

Ranked fuzzy retrieval powered by Fuse.js. Three matching strategies:

| Mode | Behavior | Best for | | ------------------- | ----------------------------------- | ----------------------------------------------- | | literal (default) | Case-insensitive substring match | Exact terms, all scopes | | smart | Fuzzy ranked search (threshold 0.3) | Uncertain wording, typos, separator differences | | fuzzy | Looser fuzzy search (threshold 0.5) | Very approximate queries, exploratory search |

recall({ query: "rate limit middleware", match: "smart", scope: "project" })

Smart and fuzzy modes:

Handle typos — prefiltr finds prefilter, ECONNREFUSD finds ECONNREFUSED
Normalize separators — rate-limit matches rateLimit matches rate_limit
Rank by relevance — results scored 0–1 with structural boosts for exact phrases, full token coverage, reasoning traces, and recency
Fall back gracefully — if smart/fuzzy finds nothing, literal search runs automatically
Time-budget degradation — if ranking takes too long, returns prefilter-ranked results instead of timing out
Explain mode — add explain: true to see scoring breakdowns via matchReasons

Available across all scopes — "session", "project", and "global".

Recall is not memory

This is not a memory system. Memory is selective and curated. Recall is raw history retrieval — verbatim, exhaustive, on demand.

If you use a persistent memory system alongside this plugin, recall gives it source material. The agent searches history, follows promising hits with recall_get or recall_context, then stores only durable findings deliberately.

Good memory candidates: user preferences, project decisions, reusable root causes, environment facts, corrections, and approaches that clearly succeeded or failed. Do not store ephemeral session details, one-off commands, transient errors, or implementation minutiae.

Install

opencode plugin opencode-session-recall

Or add it to your opencode.json:

{
  "plugin": ["opencode-session-recall"],
}

To disable cross-project search:

{
  "plugin": [["opencode-session-recall", { "global": false }]],
}

Tools

Five tools, designed around how agents actually navigate conversation history:

`recall` — Search

The primary tool. Full-text search across session titles, messages, tool outputs, tool-input commands and cwd values, reasoning, and subtask descriptions. Searches globally by default, or narrow to the current project or session.

Use before real work when prior history could change the approach: debugging, unexpected behavior, feature work, architecture or configuration changes, past commands, root causes, decisions, or "what did we do last time?" questions.

Do not call it for every request. Skip it for trivial commands, local file/code inspection, simple edits with complete context, and questions that do not benefit from prior conversations. Use code search for the current codebase.

recall({ query: "authentication", scope: "project" })
recall({ query: "error", type: "tool", scope: "session" })
recall({ query: "JWT", sessionID: "ses_from_another_project" })
recall({ query: "rate limit", match: "smart", scope: "session", group: "session" })
recall({ query: "prefiltr", match: "fuzzy", scope: "session", explain: true })
recall({ query: "unauthorized", expand: "context", window: 1 })
recall({ query: "auth failure", expand: "context", window: "auto" })
recall({ query: "migration", last: "7d", directory: "/workspace/project" })
recall({ query: "release notes", from: "30d ago", to: "now" })
recall({ query: "legacy config", before: "2026-01-01" })
recall({ query: "deploy", directory: "/workspace/project", fallback: true })
recall({ query: "npm test", type: "tool", toolName: "bash" })

First call guidance: omit sessions unless you need a hard scan cap; the default scans all eligible sessions subject to configured and provider limits. Use match: "smart" for topic discovery, naming variants, and likely typos. Use group: "session" for broad discovery. Add expand: "context" or expand: "message" when you already know you need evidence from the top hit. Reserve literal matching for exact errors, commands, function names, or file paths.

| Param | ---------------------- | query | scope | match | explain | sessionID | type | role | before/ | last | from/to | since/< | directory | fallback | toolName | expand | expandResults | window | expandBudgetMessages | expandBudgetChars | width | sessions | title | group | results | Default | Description | | ------------ | ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | | required | Text to search for | | "global" | "session", "project", or "global" | | "literal" | "literal", "smart", or "fuzzy" | | false | Include scoring metadata in results | | — | Target a specific session (overrides scope); blank values are ignored | | "all" | "text", "tool", "reasoning", or "all" | | "all" | "user", "assistant", or "all" | after | — | Time bounds as ms epoch or dates like "2026-01-01"; nonpositive numbers are ignored | | — | Recent-history lower bound like "2h", "7d", or "3w" | | — | Explicit time bounds like "365d ago" and "now" | code>until | — | Compatibility relative filters; prefer last, from/to, or before/after in new calls | | — | Filter sessions by exact directory or descendant path | | false | With directory, fill remaining results from same project/worktree and then global history | | — | Exact tool-name filter; only valid with type: "all" or type: "tool" | | "none" | "none", "context", or "message"; inlines bounded evidence from top results | | 1 | Number of results to expand; oversized values are clamped to the safe maximum | | 3 | Messages on each side for expand: "context", or "auto"; oversized values are clamped to plugin maxWindow | | 30 | Total context messages to inline across expanded results | | 30000 | Total expanded text budget | | 200 | Snippet size (50–1000 chars) | | all eligible | Optional max sessions to scan, capped by plugin maxSessions | | — | Filter by session title substring; blank values are ignored | | "part" | "part" or "session" — when "session", collapses results by session (one entry per session with the best-scoring or most-recent hit as representative, plus hitCount) | | 10 | Max results to return |

Blank optional filters are ignored. Malformed optional time filters are ignored or normalized with warnings when safe. toolName is exact; if unsure, run a broad type: "tool" search first and inspect returned toolName values. Combining time bounds is allowed only when the resulting time window is valid.

Result entries can include these fields. Ranked-mode fields are present for smart/fuzzy matches; evidence fields explain why any result matched.

| Field | Description | | -------------------- | ------------------------------------------------------------------------ | | score | Relevance score (0–1, higher is better) | | matchMode | Which strategy produced this result | | matchedTerms | Query tokens found in the candidate | | matchReasons | Scoring breakdown (only when explain: true) | | hitCount | Number of part-level hits in this session (only when group: "session") | | source | "message", "title", "tool", or "reasoning" | | why | Compact match explanation: matched fields, terms, confidence, recency | | directoryRelevance | "exact", "project", "global", or "unknown" | | titleMatch | Title evidence when a session title matched |

Response metadata:

| Field | Description | | ---------------- | ----------------------------------------------------------------------------- | | loadErrorCount | Number of scanned sessions that failed to load; omitted when zero | | loadErrors | Sample load failures; use this to distinguish load failures from zero matches | | matchMode | "smart", "fuzzy", or "literal" (if fell back) | | degradeKind | "none", "time", "budget", or "fallback" | | group | "part" or "session" — echoes back the grouping applied | | expanded | Bounded context or message entries when expand is not "none" | | warnings | Safe downgrades, clamped options, expansion caps, fallback broadening | | suggestions | Actionable next steps for empty or weak searches | | coverage | Sessions/messages/parts searched, skipped reasons, and limiting factors | | nearMisses | Closest searched sessions for empty results when cheap to report |

Expanded entries inline full message structure, but large text/output/error fields are truncated with a [truncated by recall expansion] marker to keep recall responses bounded. If context expansion exceeds message or text budgets, recall returns the base hits plus as much expansion as fits and reports the cap in warnings; it does not hard-fail a successful base search because expansion was too large.

`recall_get` — Retrieve

Get the full content of a specific message, including all parts. Tool outputs are returned in their original form, even if they were pruned from context. Use after recall finds something interesting.

recall_get({ sessionID: "ses_abc", messageID: "msg_def" })

`recall_context` — Expand

Get a window of messages around a specific message. After recall finds a match, see what was asked before it and what happened after. Supports symmetric and asymmetric windows.

recall_context({ sessionID: "ses_abc", messageID: "msg_def", window: 3 })
recall_context({ sessionID: "ses_abc", messageID: "msg_def", before: 1, after: 5 })

Returns hasMoreBefore/hasMoreAfter so the agent knows if it's at a boundary.

`recall_messages` — Browse

Paginated message browsing. Walk through a session chronologically, read the beginning, check the most recent messages, or filter by role. Also supports content filtering to combine search and pagination.

recall_messages({ limit: 5, role: "user", reverse: true })
recall_messages({ sessionID: "ses_abc", offset: 10, limit: 10 })
recall_messages({ query: "npm", role: "user", reverse: true })

Defaults to the current session. Pagination metadata includes total, hasMore, and offset. Blank sessionID or query values are treated as omitted.

`recall_sessions` — Discover

List sessions by title. Use this for lightweight recent-session browsing or recency checks. For topical discovery, prefer recall; it searches titles and content together and labels title-only hits.

recall_sessions({ scope: "project", search: "auth" })
recall_sessions({ scope: "global", search: "deployment" })

Blank search values are treated as omitted.

Options

| Option | Type | Default | Description | | --------- | --------- | ------- | --------------------------------------------------- | | primary | boolean | true | Register tools as primary (available to all agents) | | global | boolean | true | Allow cross-project search via scope: "global" |

Advanced limits (all have sensible defaults):

| Option | Default | Description | | ---------------- | --------- | --------------------------------------------------------------------------------------- | | concurrency | 3 | Parallel session loads | | maxSessions | unlimited | Hard max sessions per search; caps recall.sessions and directory-filter broad listing | | maxResults | 50 | Max results per search | | maxSessionList | 100 | Max sessions in listing | | maxMessages | 50 | Max messages per browse | | maxWindow | 10 | Max context window size | | defaultWidth | 200 | Default snippet width |

How it works

When OpenCode compacts a session, it doesn't delete anything. Tool outputs get a compacted timestamp and are replaced with placeholder text in the LLM's context — but the original data stays in the database. Messages before a compaction boundary are skipped when building the LLM context — but they're still there.

This plugin reads all of it through the OpenCode SDK:

No direct database queries, no separate storage
Zero setup — no embeddings to generate, no indexes to build, no data to sync
Eligible sessions scanned newest-first with bounded concurrency; maxSessions is the hard safety cap
Respects abort signals for long-running searches
Cross-project search enabled by default (disable with global: false)
Smart and fuzzy ranking works across all scopes — session, project, and global

Smart/fuzzy pipeline

When match is "smart" or "fuzzy", the search goes through a multi-stage ranking pipeline:

Candidate construction — Messages are scanned newest-first. Session titles and each part's searchable text are extracted and tokenized. Per-session and global budgets cap the candidate pool.
Prefiltering — Cheap lexical gate using exact substring, quoted phrase, token overlap, and bounded edit-distance (Levenshtein ≤ 1 for tokens ≥ 4 chars). Only candidates with at least one match survive.
Normalization — Surviving candidates get full stage-2 normalization (camelCase splitting, separator normalization, whitespace collapse) for Fuse.js field matching.
Fuse.js ranking — Weighted search across primary text (0.65), project directory (0.20), session title (0.10), and tool name (0.05). Returns all matches above the mode threshold.
Structural re-ranking — Fuse scores are adjusted with deterministic boosts (exact phrase, full token coverage, reasoning traces, error text, user role, recency) and penalties (weak single-token fuzzy, poor coverage).
Snippet selection — Token-density sliding window finds the most relevant excerpt from the raw text.

The entire pipeline runs within a 2-second post-fetch time budget. If the pre-Fuse stage alone exceeds 1.5 seconds, Fuse.js is skipped and prefilter-ranked results are returned with degradeKind: "time". If the full pipeline completes but exceeds the total budget, Fuse-ranked results are still returned but marked as time-degraded.

Contributing

See CONTRIBUTING.md for architecture details, module guide, and development setup.

License

MIT

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

opencode-session-recall

The problem is absurd when you think about it

What it looks like

Smart and fuzzy search

Recall is not memory

Install

Tools

recall — Search

recall_get — Retrieve

recall_context — Expand

recall_messages — Browse

recall_sessions — Discover

Options

How it works

Smart/fuzzy pipeline

Contributing

License

`recall` — Search

`recall_get` — Retrieve

`recall_context` — Expand

`recall_messages` — Browse

`recall_sessions` — Discover