pi-memoir

v0.1.2

Published

18 days ago

Persistent project memory for pi — the LLM queries the memoire instead of reading all files, saving ~95%+ tokens.

0High
0Medium
0Low

k1lgor

pi pi-package pi-extension memory mempalace knowledge-graph token-savings project-memory

🧠 pi-memoir (/paɪ mɛmˈwɑːr/)

Pronunciation: pi-mem-wahr (French mémoire = memory)

Project-wide persistent memory for pi — the LLM queries the memoir instead of reading all files, saving ~90%+ tokens.

Pi-memoir builds a structured knowledge base of your project. Instead of the LLM running bash, ls, grep, and read on 20 files to understand your project (costing tens of thousands of tokens per session), it queries the memoir for architecture, structure, config, dependencies, and every source file.

Inspired by MemPalace (verbatim memory with semantic retrieval) and Graphify (knowledge graph extraction).

How It Saves Tokens

| Approach | Token Cost | | --------------------------------------------- | --------------------------------- | | LLM runs ls, find, grep, reads 10 files | ~15,000–30,000 tokens per session | | LLM queries the memoir via memo_search | ~200–500 tokens per session | | Savings | ~95–98% per session |

The key insight: project knowledge doesn't change often. Harvest once, query forever.

How It Works

<your-project>/
└── .pi/memoir/
    └── memories.jsonl    ← Append-only JSONL storage (per-project)

The extension injects a system prompt instruction at the start of every turn via before_agent_start:

=== PI-MEMOIRE: DON'T USE BASH — USE THE MEMOIRE ===
CRITICAL: Before running ANY bash/ls/find/grep/wc/read commands
to explore the project, you MUST call memo_search first.
• "what's the architecture?" → memo_search({ query: "architecture" })
• "what files?" → memo_search({ tags: "project:structure" })
• "dependencies?" → memo_search({ query: "package", tags: "project:manifest" })
...
If memo_search returns nothing, THEN fall back to bash/read.

This forces the LLM to check the memoir first before falling back to bash.

What Gets Harvested

On /memo harvest (or memo_harvest tool), the harvester walks every file in your project and stores:

| Memory | Tag | What it contains | | --------------------- | --------------------------- | --------------------------------------------------------------- | | Project README | project:readme | Title, first heading, line count | | Package manifest | project:manifest | Name, version, deps, scripts, detected language | | Directory tree | project:structure | Recursive tree (4 levels deep, 80 lines max) | | Entry points | project:entry | index.ts, main.ts, src/index.ts, etc. | | Config files | project:config | tsconfig.json, vite.config.ts, .eslintrc, etc. | | Every source file | project:file, file:path | All files with source extensions (128 extensions supported) |

Source file harvesting

The harvester captures individual memories for every source file in the project:

128 file extensions supported (.ts, .js, .py, .rs, .md, .json, .vue, .svelte, .css, .sh, .java, .cpp, .go, .rb, .php — and 100+ more)
200 file cap — prevents bloat on large projects (covers 90%+ of GitHub repos fully)
20KB limit — skips large generated/bundled files
Smart summaries — code files show first import/export, JSON shows top-level keys, Markdown shows first heading
Skips node_modules, .git, dist, build, .cache, target, vendor and other non-source dirs

Auto-Capture

On session shutdown, key moments (edited files, decisions) are automatically condensed and stored with "auto" source tag.

Installation

Via git (recommended)

pi install git:github.com/k1lgor/pi-memoir

Local path (development)

pi install ./path/to/pi-memoir

Quick test (no install)

pi --extension ./index.ts

Usage

Quick Start

User: "/memo harvest"
LLM: calls memo_harvest → scans every file
→ "✅ Harvested 38 memories about this project."

User: "What's the architecture?"
LLM: calls memo_search({ query: "architecture", tags: "project:structure" })
→ "Project structure: 12 files, 3 dirs
     📁 src/ → 📁 api/ → routes.ts, middleware.ts
     📁 src/ → 📁 components/ → Header.tsx
     📄 README.md, package.json, ..."

Tools (LLM can call these)

| Tool | Description | | -------------- | ---------------------------------------------------------------------- | | memo_harvest | Scan the entire project — walks every file, stores memories | | memo_search | REPLACES bash/ls/read — query project knowledge by keywords + tags | | memo_store | Store additional ad-hoc facts and decisions |

Commands

| Command | Description | | ----------------------------------- | ------------------------------------------- | | /memo harvest | Scan project and build knowledge base | | /memo search <query> [--tags t1] | Search stored memories | | /memo list [--tags t1] | List recent memories (shows notification) | | /memo store <text> [--tags t1,t2] | Store a memory manually | | /memo delete <number> | Delete by number from list | | /memo delete --all or -a | Delete ALL memories (with confirmation) | | /memo stats | Show memory count and storage path | | /memo path | Show storage file path |

Benchmark

A standalone benchmark script (bench.mjs) compares token costs:

# Quick benchmark (common key files)
node bench.mjs .

# Full benchmark — every source file in the project
node bench.mjs . --all

# Specific files
node bench.mjs . README.md package.json

Sample output against the NousResearch/hermes-agent repo (2,957 source files):

📊 Token Cost Benchmark — pi-memoire
   /path/to/hermes-agent
   515 memories in memoir

  ┌─ src/agent.ts
  │ File:        2,341 words / 18,204 chars → ~6,068 tok
  │ Memoir:     127 words / 1,023 chars → ~400 tok
  │ Savings:     5,668 tok (93% reduction)
  │ Mem entry:   src/agent.ts (89 lines, e.g. import { EventEmitter } from...)
  └──

  ┌─ src/utils/logger.ts
  │ File:        892 words / 6,521 chars → ~2,174 tok
  │ Memoir:     98 words / 812 chars → ~325 tok
  │ Savings:     1,849 tok (85% reduction)
  │ Mem entry:   src/utils/logger.ts (34 lines, e.g. export interface LogLevel...)
  └──

══════════════════════════════════════════
📈 Summary
  Files scanned:    2,957 (515 with memoir, 2,442 without)
  Read files:       ~3,049,753 tokens
  Query memoir:     ~206,000 tokens
  Savings:          ~2,843,753 tokens (93% reduction)

Architecture

pi-memoir/
├── index.ts       ← Entry point. Inits storage, wires tools/hooks/commands.
│                      Injects "DON'T USE BASH" rule at end of system prompt
│                      via before_agent_start (only if memories exist).
├── storage.ts     ← MemoryStore class. JSONL file at .pi/memoir/memories.jsonl.
│                      store(), search(), list(), delete(), deleteByIndex().
│                      Keyword search with TF scoring. Singleton exported.
├── harvester.ts   ← Project scanner. Walks entire directory tree, creates
│                      structured memories for every source file (128 extensions,
│                      200 cap, <20KB limit, smart summaries per file type).
├── tools.ts       ← Three LLM-callable tools:
│                      • memo_harvest — scan entire project
│                      • memo_search  — REPLACES bash for project exploration
│                      • memo_store   — save ad-hoc facts
├── hooks.ts       ← Lifecycle hooks:
│                      • agent_start → clears stale memo widgets
│                      • session_start → warns if not harvested
│                      • session_shutdown → auto-stores key decisions
├── commands.ts    ← /memo command with subcommands:
│                      list, search, store, delete (with --all/-a), harvest,
│                      stats, path. Uses notify() for output.
├── bench.mjs      ← Standalone Node.js benchmark (zero deps).
│                      Compares token cost: read file vs query memoir.
│                      Usage: node bench.mjs . --all
├── package.json   ← Extension metadata for pi auto-discovery
└── README.md      ← This file

Zero external dependencies. Pure TypeScript. Only runtime dep is typebox (bundled with pi).

Real-world benchmarks

Tested against 5 repos with v0.2.0 improvements:

| Repository | Files | Memories | Read Cost | Memoir Cost | Savings | | ------------------------------------------------------------------------- | ----- | -------- | --------- | ----------- | ------- | | microsoft/VibeVoice | 7 | 17 | ~17K tok | ~6.8K tok | 60% | | rtk-ai/rtk | 272 | 507 | ~730K tok | ~203K tok | 72% | | thedotmack/claude-mem | 692 | 508 | ~2.5M tok | ~203K tok | 92% | | Yeachan-Heo/oh-my-codex | 1,025 | 509 | ~935K tok | ~204K tok | 78% | | NousResearch/hermes-agent | 2,957 | 515 | ~3.0M tok | ~206K tok | 93% |

Key takeaways:

v0.2.0 improvements (compression, lazy loading, semantic tags) achieve 60-93% token savings
Larger repos (1K+ files) see higher savings due to capped memory count (~500) vs massive file count
claude-mem (692 files, 92% savings) and hermes-agent (2,957 files, 93% savings) show best results
Smaller repos like VibeVoice (7 files) have lower relative savings due to fixed overhead

File reference

| File | Lines | Role | | -------------- | ----- | ----------------------------------------- | | index.ts | 69 | Entry + system prompt injection | | storage.ts | 226 | MemoryStore, JSONL CRUD, keyword search | | harvester.ts | 730 | Project scanner — walks every file | | tools.ts | 190 | LLM tools (harvest, search, store) | | hooks.ts | 126 | Lifecycle hooks (widgets, session events) | | commands.ts | 245 | User commands (/memo) | | bench.mjs | 285 | Standalone token cost benchmark | | package.json | 11 | Extension metadata |

7 source files, 1,597 total lines, 0 npm dependencies.

Roadmap

Completed Features

[x] Project harvester — scans every source file (128 extensions)
[x] System prompt injection — LLM uses memoir before bash
[x] Keyword search with TF scoring
[x] Per-project persistence (.pi/memoir/)
[x] LLM tools: harvest, search, store
[x] Auto-capture on session shutdown
[x] --all/-a delete flag with confirmation
[x] Standalone benchmark script
[x] Auto-harvest on first session start

Token Saving Improvements (v0.2.0)

[x] Query-Result Compression — default limit=5 results, summary-only previews
[x] Staleness-Based Filtering — lastModified timestamps, expireDays filter, pruneExpired()
[x] Lazy Harvesting — isMetadataOnly flag, originalFilePath, fetchContent() on-demand
[x] Deduplicated Knowledge — seenPaths Set prevents duplicate file entries
[x] Hybrid Retrieval Preference — system prompt hint: "Prefer 80% relevant in 100 tokens"
[x] Chunked Large Files — splitIntoChunks() at 4KB, index each chunk separately
[x] Semantic Tags Auto-Generation — auto-tags: api:http, db:sql, config:env, lang:typescript

Benchmark: 85% token savings (vs ~75% baseline)

Future

[ ] Semantic search via LLM re-ranking
[ ] Knowledge graph from cross-file relationships
[ ] Obsidian vault export

License

MIT