@s_s/mnemo

v1.6.0

Published

4 months ago

Memory management MCP server for AI coding assistants

0High
0Medium
0Low

mlc_

mcp memory agent

Mnemo

中文文档

Mnemosyne, mother of all creation, stood against Lethe. Mnemo keeps only what still matters.

Persistent, high-value long-term context for AI coding assistants via MCP.

Mnemo is not a transcript archive. It captures only the context that will still matter across future sessions — decisions, preferences, rules, and unresolved threads — and makes it available through semantic search. Think of it as durable long-term memory for your AI agent.

Features

Memory types — 8 semantic categories (preference, profile, goal, continuity, fact, decision, rule, experience) with save-time classification
Lifecycle hooks — per-turn reminders injected via agent-native hooks (Claude Code, Codex, OpenClaw, OpenCode) so the agent actually remembers to use memory tools
Hybrid search — find memories by meaning and keywords (vector + keyword, with automatic fallback)
Progressive disclosure — search returns summaries; retrieve full content on demand
Multi-agent support — works with OpenCode, Claude Code, OpenClaw, and Codex; auto-detects agent type via MCP protocol
Fully local — no API calls, no cloud storage; all data stays on your machine
Auto-prompted — injects instructions into your agent's config so it knows when to save and recall memories
Passive eviction — automatic lifecycle management based on access frequency + time decay; old, unused memories are archived to keep the active set focused

Quick Start

Install

npm install -g @s_s/mnemo

Configure your MCP client

Add Mnemo to your MCP client configuration.

Add to opencode.json:

{
  "mcp": {
    "mnemo": {
      "type": "local",
      "command": ["mnemo"]
    }
  }
}

Via CLI (user scope, available across all projects):

claude mcp add --transport stdio --scope user mnemo -- mnemo

This stores the config in ~/.claude.json.

Via CLI:

codex mcp add mnemo -- mnemo

Or add to ~/.codex/config.toml:

[mcp_servers.mnemo]
command = "mnemo"

OpenClaw uses mcporter to manage MCP servers. Add Mnemo to config/mcporter.json (or ~/.mcporter/mcporter.json for global config):

{
  "mcpServers": {
    "mnemo": {
      "command": "mnemo"
    }
  }
}

Or via mcporter CLI:

mcporter config add mnemo --command mnemo --scope home

Initialize

After installing, run the setup CLI to initialize Mnemo:

npx @s_s/mnemo setup --agent claude-code

This does two things:

Prompt injection — writes memory management instructions into your agent's config file (e.g., AGENTS.md for OpenCode, CLAUDE.md for Claude Code)
Hook installation — installs lifecycle hooks that remind the agent to use memory tools at key moments (per-turn for Claude Code/Codex, session-start for OpenClaw, session lifecycle events for OpenCode)

Both steps are independent — if one fails, the other still succeeds.

By default, setup initializes global memory shared across projects. For project-isolated memory, run in the project directory:

cd your-project
npx @s_s/mnemo setup --agent claude-code --scope project

Options:

| Flag | Description | | ----------------- | ------------------------------------------------------------------------------------------------------------------------------------------------- | | --agent <type> | opencode, claude-code, openclaw, or codex. Recommended to specify explicitly. If omitted, auto-detected from config files in cwd and ~. | | --scope <scope> | global (default) or project | | --project-root | Explicit project root (for project scope) |

Storage scopes

global (default) — shared memory across projects; prompt is written to the user-level agent config
project — isolated memory for the current project; prompt is written to the project config and Mnemo creates a local .mnemo/ directory

You can also pass --project-root <path> when using --scope project to explicitly choose the project root.

Usage Examples

Important: You don't call Mnemo tools directly. You chat with your AI agent in natural language, and the agent decides when to call Mnemo tools behind the scenes. After running npx @s_s/mnemo setup, the agent already knows when and how to use them.

Saving memories (automatic)

The agent saves memories on its own when it recognizes important information:

You:   Let's use 4-space indentation and single quotes for this project.
Agent: Got it. I'll follow that style.
       → [calls memory_save: "Project code style: 4-space indentation,
          single quotes", tags: ["preference", "code-style"]]
       I've saved this as a memory so I'll remember it next time.

Recalling context (automatic)

When you start a new conversation, the agent searches for relevant memories:

You:   Let's continue working on the auth module.
Agent: → [calls memory_search: "auth module"]
       Based on my memories, last time we decided to use JWT with
       refresh tokens and store them in httpOnly cookies. Let me
       pick up from there.

Searching memories (on request)

You:   Do you remember what database we chose?
Agent: → [calls memory_search: "database choice"]
       Found a relevant memory about database selection.
       → [calls memory_get: "<note-id>"]
       Yes — we decided on PostgreSQL with Prisma ORM, mainly for
       its type safety and migration tooling.

Tools

Mnemo provides 4 MCP tools:

| Tool | Description | | --------------- | ------------------------------------------------------------------- | | memory_save | Save a memory note with type, optional tags, and source | | memory_search | Hybrid search across memories; returns summaries (supports filters) | | memory_get | Retrieve full content of specific notes by ID | | memory_delete | Delete notes by ID |

Memory Model

Every memory note is classified into one of 8 types before saving:

| Type | Purpose | Example | | ------------ | ----------------------------------------------- | ----------------------------------------------- | | preference | User preferences and collaboration habits | "Prefers 4-space indentation, single quotes" | | profile | Stable background about user, project, or topic | "Project uses Next.js 14 with App Router" | | goal | Long-term directions and objectives | "Migrate from REST to GraphQL by Q3" | | continuity | Unresolved threads to resume later | "Auth module: left off at refresh token logic" | | fact | Stable objective information | "Production DB is on PostgreSQL 16" | | decision | Confirmed choices from a discussion | "Chose Prisma over Drizzle for type safety" | | rule | Reusable conventions and agreements | "All API errors return { code, message } shape" | | experience | Validated, reusable lessons (high bar) | "Batch DB writes cut migration time by 10x" |

A memory must meet at least 2 of 3 criteria to be worth saving: (1) useful across future sessions, (2) affects future work, (3) would require re-alignment if forgotten.

How It Works

Storage

Memory notes are stored as Markdown files with YAML frontmatter.

Global mode:

~/Library/Application Support/mnemo/    # macOS
~/.local/share/mnemo/                   # Linux
%APPDATA%/mnemo/                        # Windows
├── config.json                         # Global storage marker
├── notes/                              # Markdown files
│   ├── 20260305-172200-a3f1.md
│   └── 20260305-183015-b7c2.md
└── index/                              # Vector index (vectra)

Project mode:

<projectRoot>/.mnemo/
├── config.json                          # Project storage marker
├── notes/                               # Markdown files
└── index/                               # Vector index (vectra)

Override the global data directory with MNEMO_DATA_DIR environment variable.

Important: Mnemo must be initialized with npx @s_s/mnemo setup before memory tools are used. Storage resolution follows: project marker first, global marker second, otherwise the tools report that Mnemo is not initialized.

Hybrid Search

Mnemo uses a hybrid search strategy combining vector search (semantic similarity via all-MiniLM-L6-v2, 33MB, 384 dimensions) and keyword search (case-insensitive term matching). Results from both are merged with weighted scoring (vector: 0.7, keyword: 0.3). If the embedding model isn't ready yet, keyword search works as a graceful fallback.

Search results return summaries by default. Use memory_get with note IDs to retrieve full content — this keeps context usage minimal when browsing results.

Memory Lifecycle

Save — Agent saves key info during conversations (decisions, preferences, architecture choices, or when context is running low)
Search — Agent retrieves relevant context at the start of new conversations or when needed
Eviction — When notes exceed the configured limit, the lowest-scored notes (by access frequency + time decay) are automatically archived to keep the active set focused

Development

git clone [email protected]:See-Cat/mnemo.git
cd mnemo
npm install
npm run build
npm test

Scripts

| Command | Description | | ---------------------- | ---------------------------- | | npm run build | Compile TypeScript | | npm run dev | Watch mode compilation | | npm test | Run tests (Vitest) | | npm run test:watch | Watch mode tests | | npm run prettier:fix | Format all files | | npm run release | Interactive release workflow |

Release

npm run release

Interactive script that walks through: git check → branch check → version selection → format → test → build → publish → push.

License

MIT