opencode-diane

v0.0.10

Published

17 days ago

OpenCode plugin: hierarchical, token-efficient memory for any git repository. Convention-free — pre-fills from git diff-structure and project files, no LLM at the core, no commit-message parsing. Optional cross-lingual semantic search; skill mining.

0High
0Medium
0Low

alexeyev

opencode opencode-plugin memory experience-reuse token-saving bm25 language-agnostic agents.md

opencode-diane

A memory layer for OpenCode. It gives the coding agent a persistent, searchable store of structural facts about a repository, so it can navigate code with one bounded memory_recall call instead of many raw git log, grep, and file-read tool calls.

Named for Diane in Twin Peaks — the recipient of Dale Cooper's recorded case notes. The plugin works the same way: it keeps the record of what your coding agent learns about a codebase.

What it actually does — and what it costs

These numbers are from a 40-session controlled eval: 2 SWE-bench Lite instances (psf__requests-1963, pylint-dev__pylint-6506), 5 runs per condition × 2 models (Haiku 4.5, Sonnet 4.6), file-level diff overlap as the metric.

| Model | Condition | hit-rate | $/session | wall-clock | |---|---:|---:|---:|---:| | Sonnet 4.6 | diane | 9/10 (90 %) | $0.182 | 61 s | | Sonnet 4.6 | baseline | 9/10 (90 %) | $0.100 | 104 s | | Haiku 4.5 | diane | 5/10 (50 %) | $0.156 | 80 s | | Haiku 4.5 | baseline | 5/9 (56 %) | $0.082 | 85 s |

The honest read:

Solve-rate. No measurable benefit on this slice. Both conditions hit 9/10 on Sonnet and 5/10 vs 5/9 on Haiku.
Wall-clock. Diane on Sonnet is ~40 % faster — 61 s vs 104 s average. On requests-1963 specifically, 37 s vs 110 s. The recall surfaces the right file faster than the agent finds it by grep.
Cost. Diane is ~1.8× more expensive per session at both models (the ratio holds across Sonnet and Haiku). Cost comes from tool descriptions in the system prompt and from recall results that linger in conversation history.
Trade. Tokens for latency. If your bottleneck is developer wait-time on agent runs, that trade may be worth it. If your bottleneck is API spend on bulk runs, it isn't.

Caveats: n = 2 instances, both small Python libraries. File-level overlap, not test-pass. Not benchmarked against agentmemory or aider. A 30-instance, multi-repo eval is in the roadmap. See EVAL.md for the harness and raw session traces.

What it is, mechanically

A hierarchical, BM25-ranked store of facts about a repository.
Three tools the agent reaches for in practice: memory_recall, memory_outline, memory_remember. (Earlier versions exposed ten; the eval showed seven were never called.)
Ingestion runs once on plugin load: code-map (tree-sitter signatures) and git history (co-change, churn). Other ingesters (docs, project notes, table headers, cross-refs, sessions) are opt-in via config — they were on by default in earlier versions but added startup time without measurable benefit.
Deterministic. BM25 over a hand-built inverted index — no embeddings, no model, no API key, no GPU, no network. Reproducible and debuggable. An opt-in semantic add-on exists for cross-lingual recall.
Convention-free. Never parses commit messages or natural language for meaning. Every signal is a physical fact (files touched, lines ±, co-change frequency). It behaves identically on a wip / . / non-English history and a pristine one.
Languages. Code map covers 13 tree-sitter grammars. The grammar-agnostic cross-reference ingester extends this to 30+ languages (Pascal, Ruby, Perl, Elixir, Verilog, VHDL, COBOL, Fortran, Solidity, Smalltalk, Racket, Common Lisp, Vim script, and more) plus low-code DSLs (GitHub Actions, k8s, Terraform, n8n).
What it costs. ~1.6 MB packed, ~17 MB unpacked (includes tree-sitter grammar .wasm files and SheetJS for spreadsheet headers). A few hundred MB RAM for a large store. Dependencies: @opencode-ai/plugin, web-tree-sitter, xlsx.
What it is not. Not an LLM, not an unbounded archive (a configurable disk budget, 50 MB by default, ages out least-used facts), not a replacement for AGENTS.md (though we read AGENTS.md and index its contents). Not a vector store by default — lexical BM25 — though cross-lingual semantic search is available as an explicit opt-in.
Maturity. 691 assertions across 26 test suites, ~90 % line coverage; verified against the documented plugin contract in 30+ languages and against live builds with oh-my-opencode and caveman as coexisting plugins. Not yet run end-to-end inside a live OpenCode server — see the WIKI.

The full design — how the memory is structured, how retrieval works, what happens without git, scaling numbers, how it compares to other approaches, and every honest limitation — is in WIKI.md. Start there with Straight answers for a decision-maker.

The tools

| Tool | What it does | |---|---| | memory_recall | BM25 search over the store — co-change-boosted, token-budgeted, with optional category / subject filters. The recall-first entry point. Takes an optional by: "name" (default) for exact matches you already know — a function, class, file, or error message — or "meaning" to also blend semantic similarity for plain-English descriptions of what code does. | | memory_code_map | Aider-style structural map: per-file signatures of functions/classes/types, ranked and budgeted. Needs enableCodeMap. | | memory_remember | Store an explicit note for future turns. | | memory_snapshot | Record this session's understanding — mental model, decisions, conventions — for a later or parallel session to resume from. | | memory_outline | Counts per category — token-cheap orientation. | | memory_status | Size, byte usage vs budget, last-ingest timestamps. | | memory_ingest_sessions | Pull task + tool-trace summaries from past OpenCode sessions. | | memory_ingest_git | Re-scan git history for new commits after a pull / merge / rebase. Idempotent — already-known commits are skipped. The plugin also auto-runs this in the background when it detects HEAD moved as a side effect of a bash call. | | memory_mine_skills | Cluster memories by subject into SKILL.md files. Runs in the background. | | memory_skill | List the mined skill files, or load one into the conversation — so a skill mined this session is usable now, no restart. |

Install

npm install opencode-diane

Then in opencode.json:

{
  "$schema": "https://opencode.ai/config.json",
  "plugin": ["opencode-diane"]
}

Open OpenCode in any git repository, in any language. The plugin loads, runs prefill in the background, registers all ten tools, and the agent can use them immediately. If the directory is neither a git repo nor has a recognised manifest, the plugin logs one idle line and does nothing.

The Aider-style code map is on by default since v0.0.4 — it gives memory_code_map and recall enough structural signal (per-file function/class/type signatures, 13 tree-sitter grammars) that turning it off is rarely worth it. The grammar .wasm files (~16 MB, vendored under grammars/) ship with the package regardless, since they're loaded lazily on first use; the option only controls whether the plugin parses files at prefill. If you want to skip that parsing — for a tighter prefill on a huge monorepo, or on a non-source repo where the code map adds no signal — disable it via the [name, options] tuple form:

{
  "plugin": [["opencode-diane", { "enableCodeMap": false }]]
}

Install from a local clone

git clone <repo-url> opencode-diane
cd opencode-diane
bun install      # fetches @opencode-ai/plugin into ./node_modules
bun run build    # compiles src/ -> dist/

Then point opencode.json at the built file:

{
  "plugin": ["file:///absolute/path/to/opencode-diane/dist/index.js"]
}

bun install is required for the local form — OpenCode resolves plugin imports through the module resolver, so node_modules/@opencode-ai/plugin must sit next to dist/.

Configuration

Every setting is optional with a sensible default. To override, list the plugin as a [name, options] tuple — OpenCode passes the options object straight through, and bad or unknown keys are ignored so a malformed config never breaks the plugin.

interface UserConfig {
  maxMemoryDiskMB?: number       // default 50
  autoIngestOnStartup?: boolean  // default true
  gitHistoryDepth?: number       // default 500
  forceActive?: boolean          // default false
  skillsOutputDir?: string       // default ".opencode/skills"
  skillMiningMinCluster?: number // default 3
  ingestSessions?: boolean       // default true
  enableCodeMap?: boolean        // default true  (see WIKI: Code map)
  installUsageSkill?: boolean    // default true  — write a using-memory skill on first startup
  ingestDocs?: boolean           // default true  — index docs/ headings as section pointers
  ingestProjectNotes?: boolean   // default true  — index AGENTS.md, CLAUDE.md, .cursorrules, …
  ingestTableHeaders?: boolean   // default true  — index CSV/TSV/XLSX column headers
  ingestCrossRefs?: boolean      // default true  — grammar-agnostic cross-file edge discovery
  crossRefsRarityThreshold?: number // default 3 — max files a symbol can appear in to count
  enableNudgeHook?: boolean      // default true  (see WIKI: Compatibility)
  adaptive?: boolean             // default true  (see WIKI: Adaptive sizing)
  enableSemanticSearch?: boolean // default false (see WIKI: Semantic search)
  embeddingModel?: string        // default "Xenova/multilingual-e5-small"
  personalizedPageRank?: boolean // default false (co-change ranking; see WIKI)
  enableSessionProvenance?: boolean    // default true  — record the recall→edit causal link
  enableSessionWorkingSet?: boolean    // default true  — adapt recall to the active task within a session
  semanticRerank?: boolean             // default false — EXPERIMENTAL: rerank meaning-mode top results by per-function embedding (R@1). Not yet live-validated.
  semanticRerankWindow?: number        // default 3     — window size for the meaning-mode rerank
  recordSessionActivity?: boolean      // default true  — record this session's edits + bash as a rolling memory
  bashFileTrackingMaxFiles?: number    // default 20    — refresh code-map for files a bash call touched (0 = off)
  autoReingestGitOnHeadChange?: boolean // default true — re-ingest git when bash moves HEAD (pull/merge/rebase)
}

With adaptive on (the default), prefill measures one cheap signal — commit count, or file count when there is no git — and scales the history depth and code-map file cap to the repo's size. An explicit value in your config always wins.

maxMemoryDiskMB is the disk budget for the memory store: once it is exceeded, least-used memories are evicted (pinned ones never). The default is 50 MB — generous enough that a realistic store (even on a large repo, ~4–6 MB) never hits it, so eviction acts as a safety valve rather than a routine clip. Note it also bounds RAM: the search index is held in memory at roughly 70 MB of heap per 1 MB of budget if filled, so on an unusually large monorepo, lower it to cap memory or raise it for a deeper store. See Performance in the WIKI.

enableSemanticSearch (default off) adds opt-in cross-lingual recall: with it on, the plugin also embeds memories with a small multilingual e5 model and fuses vector similarity with the BM25 ranking, so a query in one language (Russian, Chinese, …) can retrieve code and comments written in another. It needs the optional @huggingface/transformers dependency (bun add @huggingface/transformers); the model downloads on first use. When off — the default — no model is downloaded, the dependency is never loaded, and retrieval is the unchanged lexical path. See Semantic search in the WIKI.

enableSessionWorkingSet (default on) makes recall adapt to the task in flight: when a file you recalled then gets edited, it joins a per-session working set, and subsequent recalls are nudged toward those files and their co-change neighbours — so locating one file of a multi-file change helps surface the rest. The boost decays over the session and clears on a sustained topic shift, falling back to the plain lexical+history ranking when the set is empty. It is observable in the event stream (wsBoost); set it to false to pin recall to the deterministic baseline.

Fine-grained tuning

Most users never set these — the defaults cover typical repos. They exist for monorepos, documentation-heavy projects, and locked-down environments where every walk needs an explicit ceiling. All numeric limits are clamped to a safe minimum and rounded; garbage input in opencode.json never breaks the plugin.

| Option | Default | What it does | |---|---|---| | docsMaxFiles | 200 | Cap on .md / .markdown files walked under docs/ plus conventional root docs (CHANGELOG, CONTRIBUTING, ARCHITECTURE, ROADMAP, …). | | docsBodyChars | 240 | Characters of body text captured after each heading as the recall snippet. | | docsMaxHeadingLevel | 3 | Deepest heading level indexed (3 = H1–H3). Clamped to [1, 6]. | | notesMaxBytes | 6144 | Maximum bytes read from each agent-instruction file (AGENTS.md, CLAUDE.md, .cursorrules, …). | | tablesMaxFiles | 200 | Cap on table files (CSV / TSV / XLSX / XLS) walked per prefill pass. | | tablesMaxXlsxMB | 50 | Skip XLSX/XLS files larger than this (in MB). Set 0 to skip all spreadsheets. | | tablesMaxColumns | 40 | Maximum column headers listed per table/sheet. Wider tables get a (N more) note. | | crossRefsMaxFiles | 2000 | Cap on files the cross-reference ingester walks per prefill. Raise for monorepos. | | crossRefsMaxEdges | 10000 | Hard cap on cross-reference edges emitted per pass. | | coChangeMinOccurrences | 3 | Minimum commits in which two files must co-change before a co-change edge is recorded. | | codeMapMaxFiles | adaptive (1500 / 4000 / 10000) | Cap on source files the code-map ingester parses per pass. With adaptive: true (the default), this is sized at startup by the small / medium / large tier. Setting it explicitly overrides the adaptive choice. | | coChangeMaxCommits | 5000 | Cap on git commits the co-change graph builder scans. Adaptive sizing keeps this uniform across tiers in the current implementation; only codeMapMaxFiles and gitHistoryDepth vary by tier. |

Learn more

WIKI.md covers everything else, including:

Straight answers for a decision-maker — the questions above, in depth
How the memory is structured — the record shape and the hierarchy, with diagrams
The pillars — retrieval, prefill, code-health, and the rest, with diagrams
How it compares — versus embeddings, aider's repo-map, and AGENTS.md
Without git history — what works, what doesn't, and why
Semantic search — the opt-in cross-lingual embedding feature
Token savings — what reduction to expect, and how it is measured
Performance & Scaling — measured numbers, and the honest heap caveat
Prompt-cache friendliness — what's byte-stable across calls, what's deliberately not
Code map, Session snapshots, Skill mining, Rich logs, Tests & CI

License

MIT.

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

opencode-diane

What it actually does — and what it costs

What it is, mechanically

The tools

Install

Install from a local clone

Configuration

Fine-grained tuning

Learn more

License