opencode-fractal-memory

v0.6.25

Published

4 days ago

Fractal memory system for OpenCode with semantic search and automatic compression.

0High
0Medium
0Low

holgerreppert

opencode opencode-plugin memory embeddings semantic-search hnsw vector-search bm25 agent-memory

opencode-fractal-memory

Fractal memory system for OpenCode with semantic search, automatic compression, and multi-level retrieval.

about me and the usage

I made this because I needed a longterm memory at first. Then while working with it I extended it's functionality. It might be a little bit overwhelming but if you work with it you will start to love it. You can tell the coding agent to make a memory of everything. And later on you can tell it to read it. You can also use the management app that includes a nice threejs visualization and searching from there in the memory nodes. You can also inject nodes directly to the agent from there. You can edit the nodes too. I think I forgot to mention some of features here. I'll update this project constantly. Feel free to use it and tell me how much you hate or like it ;)

Have phun

Holger

PS.: Did I mention that this is alpha? So feel free to post issues with suggestions if you find bugs or if you just want to suggest improvements

Features

Memory nodes — structured persistent memory with labels, content, metadata, and type system
Semantic search — ONNX-powered embeddings (all-MiniLM-L6-v2) with HNSW vector index for fast ANN retrieval
BM25 full-text search — keyword search as a fallback, hybrid-scored with embeddings for best results
Fractal retrieval — drill-down from high-level summaries to granular details
Automatic compression — periodically summarizes low-level nodes into progressively higher-level abstractions (4 levels)
Auto-retrieve — context-aware injection of relevant memories, skills, and playbooks into the prompt
Ollama reranking — re-ranks memory search results with a local LLM for better relevance
LLM compression — uses LLM to generate richer summaries instead of regex extraction
Auto-distill — automatically extracts actionable rules from lesson nodes
Predictive rating — adjusts memory usefulness scores over time based on usage patterns
Cache system — in-memory LRU cache for frequently accessed nodes with configurable TTL
Journal — append-only searchable journal entries with semantic search
Playbooks — reusable workflow templates (sticky memory nodes) proposed by the agent
Management server — local web UI (port 8787) for browsing, searching, and editing memory
Sub-agents — memory-hints and memory-researcher agents for guided memory interaction

Prerequisites

| Requirement | Version | Notes | |---|---|---| | OpenCode | v1.15.13+ | SDK peer dependency | | Bun | >=1.0.0 | Plugin runtime | | Node.js | >=18 | For npm-based installs only |

Installation

For OpenCode users

Add the plugin name to ~/.config/opencode/opencode.json:

{
  "plugin": ["opencode-fractal-memory"]
}

OpenCode installs it automatically at startup from npm. Model files (~24 MB) download on first plugin load via ensureModels() — no manual steps needed.

Updating

OpenCode caches plugins at ~/.cache/opencode/packages/. When a new version is published to npm, the cache may stay pinned to the old version due to bun's dual caching (lockfile + global metadata cache):

# Clear bun's global metadata cache and force re-resolve
rm -rf ~/.bun/install/cache/
cd ~/.cache/opencode/packages/opencode-fractal-memory@latest/node_modules/opencode-fractal-memory/
bun add opencode-fractal-memory@latest

If that doesn't work, copy the published files manually:

cd <your-local-clone>
npm run build
cp -r dist management package.json LICENSE README.md commands agent \
  ~/.cache/opencode/packages/opencode-fractal-memory@latest/node_modules/opencode-fractal-memory/

This is a known OpenCode issue: #6774, #10546, #25293.

For development / manual install

npm install opencode-fractal-memory

Models download on first run. Use --ignore-scripts if installing via Bun (Bun skips lifecycle scripts).

MCP server setup

Enables memory tools in IDEs that support the Model Context Protocol (Cursor, Windsurf, etc.):

{
  "mcp": {
    "fractal-memory": {
      "type": "local",
      "command": ["bun", "run", "~/.config/opencode/node_modules/opencode-fractal-memory/dist/mcp-server.js"],
      "enabled": true
    }
  }
}

Configuration

Create ~/.config/opencode/opencode-mem.json to customize (optional — all defaults work out of the box). This is the single config file — all settings including journal and management live here:

{
  "autoRetrieve": {
    "enabled": true,
    "candidateCount": 30,
    "maxInjectNodes": 5,
    "maxInjectPlaybooks": 3,
    "minQueryLength": 10,
    "injectionCooldownMs": 30000
  },
  "ollama": {
    "enabled": false,
    "baseUrl": "http://localhost:11434",
    "model": "qwen2.5-coder:1.5b",
    "mode": "binary"
  },
  "llmCompression": {
    "enabled": false,
    "maxSummaryTokens": 500
  },
  "autoDistill": {
    "enabled": false,
    "minLessons": 3,
    "useLlm": false
  },
  "autoConsolidate": {
    "enabled": false,
    "similarityThreshold": 0.3,
    "maxFactsPerCluster": 5,
    "minClusterSize": 2
  },
  "predictiveRating": {
    "enabled": false,
    "decayDays": 30,
    "confidenceThreshold": 0.3,
    "positiveBoost": 0.1,
    "negativePenalty": 0.05
  },
  "management": {
    "enabled": true,
    "port": 8787
  },
  "journal": {
    "enabled": false
  },
  "maxInjectionTokens": 8000,
  "coreInjectionTokens": 2000,
  "cacheSize": 8,
  "cacheTTLHours": 2,
  "autoCompressThreshold": 0.7,
  "highContextThreshold": 0.6,
  "criticalContextThreshold": 0.8,
  "defaultTtlDays": 0,
  "enableMiddleTermCapture": true
}

Config reference

| Field | Type | Default | Description | |---|---|---|---|---| | autoRetrieve.enabled | bool | false | Enable automatic memory injection into prompts | | autoRetrieve.candidateCount | int | 30 | Number of candidates to fetch for injection | | autoRetrieve.maxInjectNodes | int | 5 | Max memory nodes to inject per turn | | autoRetrieve.maxInjectPlaybooks | int | 3 | Max matching playbooks to list | | autoRetrieve.minQueryLength | int | 10 | Min user message length to trigger injection | | autoRetrieve.injectionCooldownMs | int | 30000 | Min ms between injections (rate limit) | | ollama.enabled | bool | false | Use local LLM for reranking search results | | ollama.baseUrl | string | http://localhost:11434 | Ollama server URL | | ollama.model | string | qwen2.5-coder:1.5b | Model for reranking | | ollama.mode | enum | "binary" | "binary" (relevant/not) or "score" (0-1 rating) | | llmCompression.enabled | bool | false | Use LLM for richer compression summaries | | llmCompression.model | string | none | LLM model name (uses ollama if not set) | | llmCompression.maxSummaryTokens | int | 500 | Max tokens per LLM-generated summary | | autoDistill.enabled | bool | false | Auto-extract rules from lesson nodes | | autoDistill.minLessons | int | 3 | Min lessons before extraction | | autoDistill.useLlm | bool | false | Use LLM for more specific rules | | autoDiscover.enabled | bool | false | Auto-detect playbook patterns from tool call sequences | | autoDiscover.minSequenceLength | int | 3 | Min steps for a detected pattern | | autoDiscover.minRepeatCount | int | 2 | Min repeats to qualify as a pattern | | autoDiscover.maxInjectPlaybooks | int | 3 | Max proposed playbooks per detection | | autoConsolidate.enabled | bool | false | Extract semantic facts from episodic session clusters on idle | | autoConsolidate.similarityThreshold | float | 0.3 | Cosine similarity threshold for clustering episodic nodes | | autoConsolidate.maxFactsPerCluster | int | 5 | Max facts to extract per cluster | | autoConsolidate.minClusterSize | int | 2 | Min episodic nodes needed to form a cluster | | predictiveRating.enabled | bool | false | Auto-decay and boost node usefulness | | predictiveRating.decayDays | int | 30 | Days until usefulness decay (exponential half-life) | | predictiveRating.confidenceThreshold | float | 0.3 | Min confidence to count as relevant | | predictiveRating.positiveBoost | float | 0.1 | Usefulness boost on positive rate | | predictiveRating.negativePenalty | float | 0.05 | Usefulness penalty on negative rate | | maxInjectionTokens | int | 8000 | Max tokens allowed in a single injection | | coreInjectionTokens | int | 2000 | Tokens reserved for core rules in injection | | cacheSize | int | 8 | Max cached nodes in LRU cache | | cacheTTLHours | int | 2 | Cache entry TTL in hours | | autoCompressThreshold | float | 0.7 | Context usage ratio triggering auto-compression | | highContextThreshold | float | 0.6 | Token usage ratio for high context warning | | criticalContextThreshold | float | 0.8 | Token usage ratio for critical warning | | defaultTtlDays | int | 0 | Default TTL for new nodes (0 = no expiry) | | enableMiddleTermCapture | bool | true | Save middle-term snapshots before compression | | management.enabled | bool | false | Auto-start the management web UI on plugin init | | management.port | int | 8787 | Port for the management server | | journal.enabled | bool | false | Enable append-only searchable journal entries | | autoFileSummarization.enabled | bool | false | Auto-summarize files on read |

Advanced Features

Ollama Reranking

When enabled in config, auto-retrieve results are re-ranked by a local LLM (via Ollama) for better relevance. The reranker scores candidates against the user's query and only keeps the most relevant ones:

{
  "ollama": {
    "enabled": true,
    "baseUrl": "http://localhost:11434",
    "model": "qwen2.5-coder:1.5b",
    "mode": "binary"
  }
}

In "binary" mode, the LLM labels each candidate as relevant or not. In "score" mode, it assigns a 0-1 relevance score.

LLM Compression

Instead of regex-based compression (which extracts keywords), LLM compression generates richer natural-language summaries:

{
  "llmCompression": {
    "enabled": true,
    "model": "qwen2.5-coder:1.5b",
    "maxSummaryTokens": 500
  }
}

Invoke manually with memory_llm_compress.

Auto-Distill

Periodically extracts actionable rules from lesson-type nodes created by memory_reflect. Rules are stored as rule:standard:* / rule:suggestion:* nodes for immediate injection:

{
  "autoDistill": {
    "enabled": true,
    "minLessons": 3,
    "useLlm": false
  }
}

Set useLlm: true for LLM-generated rules instead of keyword extraction.

Episodic / Semantic Memory Categories

Every memory node is auto-categorized on creation based on its type. This affects retrieval, decay, and consolidation:

| Category | Types | Half-life | Search weight | |---|---|---|---| | Episodic | event, note, session, task, plan, exploration, debug-investigation, improvement, review | 7 days | 0.5× importance | | Semantic | concept, fact, lesson, rule:*, decision, architecture, howto, preference, convention, skill, playbook, knowledge, research, core, summary, bug, fix, etc. | 365 days | 1.0× importance |

Episodic nodes decay fast and are weighted lower in search — they represent session-level traces.
Semantic nodes persist long-term and are boosted in search — they represent learned knowledge.
Use category_filter on memory_search to scope searches (e.g. memory_search(category_filter="semantic")).
Dashboard shows the category distribution.

Consolidation

When a session goes idle, autoConsolidate extracts semantic facts from episodic clusters and promotes them to type: "fact" nodes. This creates a bridge from ephemeral session traces to long-term knowledge:

{
  "autoConsolidate": {
    "enabled": true,
    "similarityThreshold": 0.3,
    "maxFactsPerCluster": 5,
    "minClusterSize": 2
  }
}

How it works:

Collects all episodic nodes created during the session
Clusters them by cosine similarity of their embeddings
Extracts declarative statements (uses "is"/"has"/"uses"/"defines" patterns)
Creates new type: "fact" semantic nodes with parentIds pointing back to source episodic nodes
Facts persist with full semantic weight and long decay half-life (365 days)

Predictive Rating

Automatically adjusts node usefulness scores over time. Frequently accessed nodes get boosted; nodes that haven't been touched in decayDays get gradually decayed:

{
  "predictiveRating": {
    "enabled": true,
    "decayDays": 30,
    "confidenceThreshold": 0.3,
    "positiveBoost": 0.1,
    "negativePenalty": 0.05
  }
}

Commands

Memory tools

| Command | Description | |---|---| | memory_set | Create or update a memory node | | memory_get | Get a single node by ID or label | | memory_fetch | Fetch a node by exact label | | memory_search | Search nodes by text, embedding, or BM25 — supports category_filter, expand_links, expand_temporal | | memory_delete | Delete a node by ID or label | | memory_list | List nodes with optional scope/level filters | | memory_replace | Replace content in a memory node | | memory_rate | Rate a node's usefulness (helps ranking) | | memory_prune | Find and remove stale/unused nodes | | memory_temporal_edges | Inspect temporal edges between nodes (conversation flow) | | memory_inject | Inject relevant memories into the prompt with token budgeting | | memory_injection_debug | Show what was injected in the last session | | memory_injection_feedback | Rate injected memory usefulness | | memory_injection_stats | View injection efficiency metrics | | memory_drilldown | Retrieve a node with its source chain (fractal retrieval) | | memory_drilldown_query | Top-down drilldown by query (find + expand) | | memory_detect_topics | Detect topic clusters in memory | | memory_stats | Show memory statistics (nodes per level, compression ratios) | | memory_dashboard | Display memory dashboard with visual overview | | memory_tool_stats | View tool call statistics and efficiency | | memory_session_stats | Get statistics about the current session | | memory_compress | Compress old nodes into higher-level summaries | | memory_llm_compress | LLM-powered compression (richer summaries) | | memory_extract_patterns | Extract cross-topic pattern summaries | | memory_distill | Extract actionable rules from lesson nodes | | memory_summarize | Generate an LLM prompt to summarize a node | | memory_check_context | Check token usage of memory nodes | | memory_total_tokens | Complete token analysis (memory + conversation) | | memory_generate_embeddings | Generate embeddings for nodes that lack them | | memory_middle_term | View context snapshots before compaction | | memory_cache_status | Show working-memory cache usage | | memory_skill_load | Load a skill's instructions by name | | memory_playbook_execute | Execute a playbook workflow | | memory_verify | Verify that a node's information is correct | | memory_reflect | Analyze a session and create lesson nodes | | memory_help | Show all available memory commands | | memory_version | Show installed plugin version |

Playbook tools

| Command | Description | |---|---| | memory_playbook_execute | Execute a playbook (returns steps for the agent to run) |

Playbooks are stored as type: "playbook" memory nodes with steps in metadata. CRUD uses generic memory_set / memory_get / memory_search tools. The agent proposes playbooks when it spots repeated multi-step patterns.

Journal tools

| Command | Description | |---|---| | journal_write | Write a new journal entry | | journal_read | Read a journal entry by ID | | journal_search | Search journal entries semantically |

MCP tools

When the MCP server is configured, the memory_search, memory_get, and related tools are available as MCP resources for IDE integration.

Skills

Skills are specialized instruction sets stored as memory nodes. When a task matches a skill's trigger keywords, its instructions load into context to guide the agent.

Available skills

| Skill | Triggers | |---|---| | debug-workflow | bug, error, fix, crash | | write-tests | tests, coverage, test suites | | refactor-component | refactor, restructure, clean up | | refactoring-expert | SOLID, code smell, technical debt | | code-reviewer | review, PR, pull request, code quality, audit | | ai-code-pitfalls | AI generated, hallucinated, copilot, cursor, LLM output | | security-review | security, audit, vulnerability, deploy | | threejs-skills | 3D, WebGL, visualization | | svelte-core-bestpractices | svelte, component, runes | | svelte-code-writer | svelte 5, sveltekit, component | | customize-opencode | opencode config, agent, plugin | | context-engineering | context, prompt, system message | | git-workflow-and-versioning | git, commit, branch, version, publish | | incremental-implementation | step by step, increment, gradual | | opencode-plugin-installation | installation, update, upgrade, cache, stale, version, publish |

Loading a skill

memory_skill_load(name="debug-workflow")

Skills are auto-injected when triggers match the task context. You can also load them explicitly with memory_skill_load.

Creating a skill

Skills are memory nodes with type: "skill". Create one with:

memory_set(
  label: "skill:my-skill",
  content: "## Skill instructions...",
  type: "skill",
  metadata: JSON.stringify({ triggers: ["keyword1", "keyword2"] }),
  sticky: true
)

Sub-agents

The plugin ships with two agent instruction files for specialized memory interaction:

| Agent | File | Purpose | |---|---|---| | memory-hints | agent/memory-hints.md | System-level hints for using memory effectively — injected by the agent when memory-related context is needed | | memory-researcher | agent/memory-researcher.md | Analyzes and reports on fractal memory state — invoked via memory_skill_load(name="memory-researcher") |

These are loaded by OpenCode's agent system and provide structured guidance for memory operations.

Management App

A local web UI for browsing, searching, and editing memory — available when the plugin is active.

Starting

The server starts automatically when management.enabled: true is set in ~/.config/opencode/opencode-mem.json (see Configuration), or manually:

bun run view

Opens at http://localhost:8787. The server starts as a background process and auto-stops on plugin shutdown.

Usage

3D Graph — the default view shows memory nodes as spheres connected by [[wiki-link]] relationships:

Drag to rotate the scene
Scroll to zoom in/out
Left-click a node to select and inspect it
Right-click drag to pan
Nodes are color-coded by level and type (skill = gold icosahedron, playbook = orange torus, note = blue sphere)
Playbook nodes render as orange torus shapes with steps visible in the detail panel

Filters — narrow down visible nodes:

Scope (global/project)
Level (L0–L5), Type (note, skill, playbook, etc.), Shape, Custom Type
Project — when multiple projects exist, filter by project name
Clear All Filters button resets everything at once
Search — find nodes by content, label, or type:
- Type a query and press Enter
- Results show relevance scores and preview snippets
- Click a result to navigate to it in the graph

Inspect — when you click a node (graph or search results):

View full content and summary
See metadata JSON (type, importance, access count, timestamps)
View embedding vector (truncated)
See linked nodes and navigate between them

Edit — modify node fields directly:

Update content, summary, importance, or type
Changes persist immediately to the SQLite database
Embedding auto-regenerates on content change

Inject — push a node directly into the agent's context:

Click "Inject" on any node
The node appears in the agent's next prompt
Useful for reminding the agent of past decisions mid-session

Manage — the node list view shows all nodes with:

Scope (global vs project), level, access count
Last accessed and last verified timestamps
Actions: edit, delete, verify, inject

Backup — the Backup tab lets you create and restore snapshots of your memory data:

Select sources to back up (config, global DB, project DB) via checkboxes
Backups stored at ~/.config/opencode/backups/ as flat directories with a manifest.json
DB snapshots use sqlite3_serialize() for consistent WAL-safe copies
Restore with per-source selection — a pre-restore safety backup is auto-created
Manual retention: list, inspect, and delete backups from the UI

How Plugin Initialization Works

When OpenCode loads the plugin, initStorage() runs automatically:

SQLite database — created at ~/.config/opencode/memory.db with all tables and indexes. Project-scope nodes are stored alongside global nodes with a project_name discriminator column
Seed nodes — rule nodes, built-in playbooks (6), and skills (15) inserted into memory_nodes
Model files — ensureModels() checks ~/.config/opencode/models/ and downloads ONNX + tokenizer (~24 MB) if missing
Agent files — ensureAgentFiles() copies agent/ directory to ~/.config/opencode/agent/
Command files — ensureCommandFiles() copies commands/ directory to ~/.config/opencode/commands/
Background embeddings — after 1s, generates embeddings for nodes that lack them
Auto-retrieve hook — if enabled in config, injects relevant context into prompts

Every initialization step is logged with timing in logs/memory-plugin.log, making it easy to diagnose startup issues.

All of this happens automatically — no manual intervention required.

Logs

All plugin logs are consolidated under ~/.config/opencode/logs/:

| Log | Path | Contents | |-----|------|----------| | Plugin | logs/memory-plugin.log | Plugin operations, init steps with timing, auto-retrieve, session events | | MCP server | logs/mcp-server.log | MCP tool calls, resources, errors | | Injection debug | logs/memory-injection.log | Full auto-retrieve injection payloads (rotated at 1 MB) | | Context dump | logs/context-dump.log | Full context snapshots for debugging | | OpenCode | ~/.local/share/opencode/log/ | Application lifecycle, tool calls |

Development

git clone <repo>
cd opencode-fractal-memory
bun install
bun run build
bun run typecheck

Testing

bun test

Installing locally (development)

bun run build
npm pack
cd ~/.config/opencode
rm -rf node_modules/opencode-fractal-memory package-lock.json
npm install --ignore-scripts <path-to-tgz>

Use --ignore-scripts to avoid trust prompts. Models download automatically on first plugin load via ensureModels() in initStorage().

Architecture

┌──────────────────────────────────────────────────────┐
│  Plugin Layer (plugin/index.ts)                       │
│  ┌──────────┬──────────┬──────────┬───────────────┐  │
│  │ Memory    │ Skills   │ Journal  │ Auto-         │  │
│  │ Store     │ (nodes)  │ Store    │ Retrieve      │  │
│  └────┬─────┴────┬─────┴────┬─────┴───────┬───────┘  │
│       │          │          │             │           │
│  ┌────┴──────────┴──────────┴─────────────┴───────┐  │
│  │ SQLite (~/.config/opencode/memory.db)           │  │
│  │  - memory_nodes (labels, content, embeds)       │  │
│  │    - scope: "global" | "project"                │  │
│  │    - project_name (for project-scope nodes)     │  │
│  │    - type: "note" / "skill" / "playbook"       │  │
│  │    - sticky playbooks/skills never pruned       │  │
│  │    - metadata.steps for playbook steps          │  │
│  │  - memory_links (wiki-link crossrefs)           │  │
│  │  - bm25_index (full-text search)               │  │
│  │  - injection_metrics / session_metrics          │  │
│  └─────────────────────────────────────────────────┘  │
│                                                       │
│  ┌─────────────────────────────────────────────────┐  │
│  │ HNSW Vector Index (in-memory, 384-dim)          │  │
│  └─────────────────────────────────────────────────┘  │
│                                                       │
│  ┌─────────────────────────────────────────────────┐  │
│  │ ONNX Embedding Model (all-MiniLM-L6-v2)         │  │
│  │ onnxruntime-web + @huggingface/tokenizers       │  │
│  └─────────────────────────────────────────────────┘  │
└──────────────────────────────────────────────────────┘

Storage

Unified SQLite database with project_name discriminator:

| Path | Purpose | |---|---| | ~/.config/opencode/memory.db | Global rules, persona, preferences (scope=global) + project-specific memory, nodes, playbooks (scope=project, discriminated by project_name) |

License

MIT

Changelog

v0.6.24 (2026-06-15)

Episodic / Semantic memory categories — all nodes auto-categorized on creation. Episodic types (event, session, task, etc.) decay with 7-day half-life and 0.5× search weight. Semantic types (concept, fact, lesson, rule, etc.) decay with 365-day half-life and 1.0× search weight. Dashboard shows category distribution; search/drilldown show [episodic]/[semantic] tags; category_filter arg on memory_search.
Consolidation bridge — autoConsolidate extracts semantic facts from episodic clusters on session.idle and stores them as persistent type: "fact" nodes with parentIds back to source nodes. New "fact" node type added.
Auto-retrieve relevance filters — maxLevel: 0 blocks L1+ compression summaries from injection; categoryFilter: "semantic" blocks episodic session traces. Config gains minQueryLength and injectionCooldownMs.
Auto-retrieve dedup + rate limit — session-level injection cache (prevents re-injecting same node IDs), query similarity skip (cosine > 0.95 skips re-injection), 30s cooldown, short message bypass (minQueryLength=10), skills cache with 5-minute TTL.
Migration v23 — adds category column to memory_nodes with index.
Bug fix: file summary UNIQUE constraint race — concurrent file reads of the same path can collide on (scope, label) UNIQUE constraint. Now catches the error and falls back to updateNode.
memory_temporal_edges tool — inspect temporal edges (conversation flow) between nodes.
category_filter arg added to memory_search and category_filter option to memory_drilldown.
Cross-project auto-retrieve pollution fix — added (scope === "global" || projectName === currentProject) post-search filter in auto-retrieve hook. Prevents nodes from other projects being injected into the current session.
memory_list scope=project auto-scopes to current project — memory_list scope=project now defaults project_name to the current project, avoiding confusing cross-project node listings. To see all projects, pass project_name="" explicitly.
Management UI project dropdown — replaced button-based project filter with a <select> dropdown for cleaner project selection.
Management API ?project_name= support — /api/nodes, /api/links, /api/stats accept optional project_name query param for server-side filtering.
Fix: validateLabel allows periods in file labels — validateLabel() regex now allows . characters, fixing file summary UNIQUE constraint recovery for files with extensions (e.g. file:sqlite.ts:5zc).
Bug fix: 10 unawaited async calls in sqlite.ts — queryDeleteNode inside withRetryableTransaction, session-tracking calls (insertAgentToolCall, createSessionMetricsRow, updateSessionMetrics, incrementSessionToolCall), and injection-event calls (insertInjectionMetrics, updateMemoryToolCall, finalizeInjection, insertInjectionFeedback, insertToolUsageLog) now properly awaited. Critical: queryDeleteNode inside a transaction callback could commit before the DELETE completed.
Bug fix: HNSW search returning ghost entries — HNSW.removeNode only removed the label-map entry (the HNSW library doesn't support point deletion), causing search() to return { id: "", score } for deleted nodes. Added .filter(r => r.id !== "") to strip ghost results.
Bug fix: pruneNodes not cleaning up HNSW index — pruneNodes deleted nodes from the database but didn't call hnsw.removeNode(), leaving ghost points in the HNSW graph. Added cleanup loop for each pruned node.

v0.6.23 (2026-06-08)

Backup/Restore — new Backup tab in the management UI. Create timestamped snapshots of config, global DB, and project DB. Restore with per-source selection; pre-restore safety backup auto-created. Backups stored as flat directories at ~/.config/opencode/backups/ — zero external deps.
projectName cross-project filtering — Added project_name arg to all CLI tools, MCP tools, and storage layer
Bug fix: global scope always skipped — Removed ?? store.projectName default that caused projectName to always be set, preventing global memory from being searched. Now project_name is only passed when explicitly provided; when omitted, searches both global and project scopes
Config unification — management.enabled/port and journal.* moved from separate agent-memory.json into the main opencode-mem.json config file. Single config source of truth
Arg description updates — All tool arg descriptions and command files updated to reflect the new behavior

v0.6.21 (2026-06-07)

Command file audit — consistent name=value named arg format across all command files
memory-rate.md — added frontmatter so it registers as a valid command
memory-set.md — replaced fictional Supabase example with generic JWT example
memory-list.md, memory-compress.md, memory-prune.md — added proper Usage/Arguments sections
agent/memory-hints.md — all examples converted to named arg syntax, types fixed

v0.6.20 (2026-06-07)

README update — cache staleness workaround, plugin version endpoint docs

v0.6.19 (2026-06-07)

Metadata support — memory_set and MCP memory_set now accept metadata JSON string arg
MemoryGet now displays metadata section when present
skill:opencode-plugin-installation created with auto-detection triggers
Docs: memory-set.md, memory-get.md, memory-help.md, agent/memory-hints.md updated

v0.6.18 (2026-06-07)

README cache staleness workaround added

v0.6.17 (2026-06-07)

Duplicate file node fix — replaced listNodes("project") with getNodeByLabel() in hooks.ts
File nodes now update on re-read instead of being skipped
DB cleanup: removed 552 duplicate file nodes (reduced 1375→825 nodes)

v0.6.16 (2026-06-07)

Plugin version displayed in management app sidebar
GET /api/version endpoint added

v0.6.15 (2026-06-06)

Project switcher — filter memory nodes by project name in management UI
Clear all filters — one-click reset of all active filters
Playbook nodes — now render as orange torus with step details in management UI
TYPE_COLORS — playbooks (orange) and skills (gold) have dedicated colors in 3D scene
Backend: project_name in API responses, GET /api/projects endpoint

v0.6.14 (2026-06-06)

Session reference counter fix — management server only stops when all sessions end

v0.6.13 (2026-06-06)

Event hook refactor — management server lifecycle tied to real session.created/session.deleted events

v0.6.12 (2026-06-06)

Fixed management server lifecycle — SIGKILL instead of SIGTERM, proper event hooks

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

opencode-fractal-memory

about me and the usage

Features

Prerequisites

Installation

For OpenCode users

Updating

For development / manual install

MCP server setup

Configuration

Config reference

Advanced Features

Ollama Reranking

LLM Compression

Auto-Distill

Episodic / Semantic Memory Categories

Consolidation

Predictive Rating

Commands

Memory tools

Playbook tools

Journal tools

MCP tools

Skills

Available skills

Loading a skill

Creating a skill

Sub-agents

Management App

Starting

Usage

How Plugin Initialization Works

Logs

Development

Testing

Installing locally (development)

Architecture

Storage

License

Changelog

v0.6.24 (2026-06-15)

v0.6.23 (2026-06-08)

v0.6.21 (2026-06-07)

v0.6.20 (2026-06-07)

v0.6.19 (2026-06-07)

v0.6.18 (2026-06-07)

v0.6.17 (2026-06-07)

v0.6.16 (2026-06-07)

v0.6.15 (2026-06-06)

v0.6.14 (2026-06-06)

v0.6.13 (2026-06-06)

v0.6.12 (2026-06-06)