@aibrain/mcp

v0.1.2

Published

2 months ago

AI agent memory MCP server with hybrid BM25 + vector search — no Docker required

0High
0Medium
0Low

nkamau

mcp memory ai agent vector-search lancedb

aiBrain MCP

AI agent memory server with hybrid BM25 + vector search. No Docker required — runs anywhere via npx.

Quick Start

# Add to Claude Code (or any MCP client)
npx -y @aibrain/mcp

# Optional: install Ollama for semantic search
npx -y @aibrain/mcp --setup

# Optional: set up the web dashboard
npx -y @aibrain/mcp --setup-ui
npx -y @aibrain/mcp --setup-ui ~/projects   # custom location

Features

6 memory tools: save_memory, search_memories, get_recent_memories, get_memory, delete_memory, list_tags
Hybrid search: BM25 full-text + vector semantic search with Reciprocal Rank Fusion
Zero external dependencies: embedded LanceDB + local ONNX embeddings via Transformers.js — no separate server needed
Persistent storage: memories saved to ~/.aibrain/memories by default
First-run model download: the nomic-embed-text-v1 ONNX model (~50MB) downloads once to ~/.cache/huggingface and is reused on all subsequent runs

Client Integration

Claude Code

Add to ~/.claude/settings.json:

{
  "mcpServers": {
    "aibrain": {
      "command": "npx",
      "args": ["-y", "@aibrain/mcp"]
    }
  }
}

Claude Desktop

Add to ~/Library/Application Support/Claude/claude_desktop_config.json (macOS) or %APPDATA%\Claude\claude_desktop_config.json (Windows):

{
  "mcpServers": {
    "aibrain": {
      "command": "npx",
      "args": ["-y", "@aibrain/mcp"]
    }
  }
}

Cursor

Add to .cursor/mcp.json in your project, or ~/.cursor/mcp.json globally:

{
  "mcpServers": {
    "aibrain": {
      "command": "npx",
      "args": ["-y", "@aibrain/mcp"]
    }
  }
}

Amp

Add to ~/.config/amp/settings.json (global) or .amp/settings.json in your project root:

{
  "mcpServers": {
    "aibrain": {
      "command": "npx",
      "args": ["-y", "@aibrain/mcp"]
    }
  }
}

Configuration

| Environment Variable | Default | Description | |---------------------|---------|-------------| | AIBRAIN_DATA_DIR | ~/.aibrain/memories | Where memories are stored | | EMBEDDING_PROVIDER | transformers | Embedding backend: transformers or ollama | | OLLAMA_URL | http://localhost:11434 | Ollama server URL (only used when EMBEDDING_PROVIDER=ollama) | | OLLAMA_MODEL | nomic-embed-text | Ollama embedding model (only used when EMBEDDING_PROVIDER=ollama) | | OLLAMA_TIMEOUT_MS | 5000 | Ollama request timeout (only used when EMBEDDING_PROVIDER=ollama) | | LOG_LEVEL | info | Log level (debug/info/warn/error) |

Set these in your shell profile or pass them via env in your MCP client config:

{
  "mcpServers": {
    "aibrain": {
      "command": "npx",
      "args": ["-y", "@aibrain/mcp"],
      "env": {
        "AIBRAIN_DATA_DIR": "/custom/path/to/memories"
      }
    }
  }
}

CLI Flags

| Flag | Description | |------|-------------| | --setup | Install Ollama and pull the embedding model (only needed for EMBEDDING_PROVIDER=ollama) | | --setup-ui | Clone and set up the aibrain-ui web dashboard | | --setup-ui <path> | Clone aibrain-ui into a custom directory (defaults to current directory) |

Tools

`save_memory`

Save a memory with content, summary, tags, and metadata.

`search_memories`

Hybrid BM25 + vector search. Falls back to fulltext if embeddings are unavailable.

`get_recent_memories`

Get most recent memories, optionally filtered by agent, session, or project.

`get_memory`

Fetch full content of a memory by ID.

`delete_memory`

Delete a memory by ID.

`list_tags`

List all tags sorted by usage count.

Embeddings

Default: Transformers.js (no setup required)

Embeddings run locally via Transformers.js using the nomic-embed-text-v1 ONNX model.

On first run, the model (~50MB) downloads automatically to ~/.cache/huggingface/hub. You'll see:

[aibrain] Loading embedding model on first run (may download ~50MB to ~/.cache/huggingface)...
[aibrain] Embedding model ready

Every subsequent run loads the model from disk instantly — no network call.

Alternative: Ollama

If you prefer Ollama (e.g. you're already running it, or want GPU acceleration):

# Install Ollama and pull the model
npx -y @aibrain/mcp --setup

# Then tell aibrain to use it
EMBEDDING_PROVIDER=ollama npx -y @aibrain/mcp

Or set it permanently in your MCP client config:

{
  "mcpServers": {
    "aibrain": {
      "command": "npx",
      "args": ["-y", "@aibrain/mcp"],
      "env": {
        "EMBEDDING_PROVIDER": "ollama"
      }
    }
  }
}

Agent Instructions

Installing the MCP server gives your AI agent access to the tools, but you also need to tell it when and how to use them. Add the instructions below to your agent's rules/instructions file.

Where to put them

| Scope | Claude Code | Amp | Cursor | |-------|------------|-----|--------| | Global (all projects) | ~/.claude/CLAUDE.md | ~/.config/amp/AGENTS.md | ~/.cursor/rules | | Project-local | CLAUDE.md in project root | .amp/AGENTS.md in project root | .cursor/rules in project root |

Use global for general memory behaviour you always want. Use project-local to scope memories to a specific codebase or workflow.

Recommended instructions

Add this block to your chosen file:

## Memory (aiBrain)

At the start of every session:
1. Call `aibrain:get_recent_memories` (limit: 10, filter by current `projectPath`) — returns summaries only
2. Call `aibrain:search_memories` with a query summarizing what the user just asked — returns summaries only
3. For any result that looks relevant, call `aibrain:get_memory` with its `id` to fetch the full content

During and after work, call `aibrain:save_memory` whenever you learn something worth remembering:
- Decisions made and why
- Bugs found and how they were fixed
- Architecture patterns or conventions in this project
- User preferences and feedback
- External service details (API quirks, endpoint structures, config conventions)

When saving:
- `projectPath`: absolute path of the current working directory (or `""` for global context)
- `tags`: lowercase kebab-case (e.g. `bug-fix`, `architecture`, `user-preference`)
- `summary`: under 200 chars — the tldr
- `content`: full detail

At the end of every session, save one memory summarizing what was accomplished, what was left incomplete, and any important context for the next session.

For in-progress or incomplete work, still save a memory but include the tag `in-progress`. Update or delete it once the work is complete.

Do NOT save: things already in the codebase, or git history.

## Directory Exploration → aiBrain

Whenever you read or explore a directory and discover new information (project structure, tech stack, conventions, dependencies, config patterns, etc.) that isn't already in aiBrain, save it immediately with `aibrain:save_memory`. Tag with `codebase-discovery` plus any relevant tags.

## aiBrain Subagents

Run all aiBrain operations (save, search, get, delete) as background subagents where possible, so they don't block the main conversation. Batch multiple saves into a single subagent call. Only await aiBrain results when the response directly depends on them (e.g. session-start memory load).

Minimal version

If you want lighter-touch behaviour, this shorter version works well:

## Memory (aiBrain)

- At session start: call `aibrain:get_recent_memories` and `aibrain:search_memories` to load relevant context
- During work: call `aibrain:save_memory` for decisions, bugs, conventions, and user preferences
- At session end: save a summary of what was done and what's left
- Always set `projectPath` to the current working directory when saving

Web Dashboard (aibrain-ui)

Browse, search, and manage your memories visually with aibrain-ui — a dark-themed web dashboard built on top of aibrain-mcp.

Automatic setup

# Clones aibrain-ui into current directory
npx -y @aibrain/mcp --setup-ui

# Or specify a custom location
npx -y @aibrain/mcp --setup-ui ~/projects

This clones aibrain-ui, installs dependencies, and creates the .env file. Then:

cd aibrain-ui
npm run dev

Dashboard opens at http://localhost:5173.

Manual setup

See the aibrain-ui README for manual installation instructions.

Troubleshooting

Server doesn't start

Ensure Node.js >= 20 is installed: node --version

First run is slow / hangs

The embedding model is downloading (~50MB). Wait for [aibrain] Embedding model ready to appear. This only happens once.

Search returns no results

Check that memories have been saved first via get_recent_memories
If using EMBEDDING_PROVIDER=ollama, verify Ollama is running: curl http://localhost:11434/api/tags

Embedding model fails to load

Check you have internet access for the one-time download
Check available disk space (~/.cache/huggingface needs ~200MB for model files)
To force a re-download, delete ~/.cache/huggingface/hub/models--Xenova--nomic-embed-text-v1

Ollama setup fails (when using EMBEDDING_PROVIDER=ollama)

Run EMBEDDING_PROVIDER=ollama npx -y @aibrain/mcp --setup again — it's idempotent
Or install Ollama manually from ollama.com, then ollama pull nomic-embed-text

Memories stored in wrong location

Set AIBRAIN_DATA_DIR to your preferred path (see Configuration above)

License

MIT