@mrabhishek1105/codemem

v0.26.0

Published

a month ago

AI-agnostic local memory layer for codebases. Index once, remember forever, switch AI freely.

0High
0Medium
0Low

mrabhishek1105

ai memory codebase semantic-search vector-database embeddings cli local offline rag claude copilot cursor

CodeMem

AI-agnostic local memory layer for codebases. Index once, remember forever, switch AI freely.

Status: v0.26.0 — Stable core with new agentic plan→patch→apply pipeline. Active development.

CodeMem runs as a local sidecar and gives any AI assistant a persistent, semantic memory of your codebase without sending your source code to the cloud. Use it from the CLI, from HTTP, or as an MCP tool bridge for Claude Desktop / Cursor.

What CodeMem Solves

AI chat contexts forget your project between sessions.
Pasting large files wastes tokens and slows down responses.
Different AI tools require different integration paths.

CodeMem solves this by indexing your codebase locally and retrieving only the most relevant chunks for each query. That keeps prompts small, answers accurate, and your code private.

Requirements

Node.js >= 18.0.0 (node -v)
npm >= 8 (npm -v)
Windows, macOS, or Linux
No Python required
No external database required
Disk space: ~50 MB for model cache + ~5 MB per 1,000 indexed files
RAM: ~300 MB while running

Verify your environment

node -v
npm -v

If a command is missing, install Node.js from https://nodejs.org or use your platform package manager.

Installation

Option A — Clone and build locally (recommended)

git clone https://github.com/mrabhishek1105/Codemem.git
cd Codemem
npm install
npm run build
npm link

Option B — Install from npm (when published)

npm install -g @mrabhishek1105/codemem

Getting Started

Open a terminal in your project folder:

cd /path/to/your-project

Initialize CodeMem for that project:

codemem init

This creates a .codemem/ directory and downloads the embedding model on the first run.

Start the local sidecar server:

codemem start

By default, the server listens on localhost:8432.

Search your codebase from the CLI:

codemem search "how does authentication work"

Query the HTTP API directly:

curl -X POST http://localhost:8432/api/v1/query \
  -H "Content-Type: application/json" \
  -d '{"query":"how does the payment flow work","options":{"top_k":5}}'

If the sidecar is running on a custom port, replace 8432 with that port.

Need command help?

codemem --help
codemem <command> --help

Core Usage

`codemem init`

Create the local project index and download the model.

codemem init
codemem init --yes
codemem init --debug

`codemem start`

Start the sidecar server.

codemem start
codemem start --port 9000
codemem start --debug

`codemem stop`

Stop the running sidecar.

codemem stop

`codemem status`

Show the current index and server state.

codemem status

`codemem stats`

Show token savings and usage statistics.

codemem stats

`codemem search <query>`

Search your indexed codebase.

codemem search "how does auth work"
codemem search "database models" --top 8

`codemem reindex`

Rebuild the local index.

codemem reindex
codemem reindex --full

`codemem ask <query>`

Ask an AI about your project with code-based context.

This command auto-detects provider from OPENAI_API_KEY or ANTHROPIC_API_KEY, but you can override using --provider.

codemem ask "what does the payment service do"
codemem ask "where is auth handled" --provider openai --model gpt-4o --mode direct
codemem ask "explain the codebase" --provider openai --base-url https://openrouter.ai/api/v1

`codemem chat`

Start an interactive multi-turn chat session with code memory.

codemem chat
codemem chat --provider openai --base-url https://openrouter.ai/api/v1

Hosted web AI providers

CodeMem now supports OpenAI-compatible hosted APIs through --base-url or OPENAI_BASE_URL.

export OPENAI_API_KEY=your-key
export OPENAI_BASE_URL=https://openrouter.ai/api/v1
codemem ask "summarize the authentication flow" --provider openai

This works with OpenAI-compatible gateways such as OpenRouter and similar web-based AI services.

`codemem plan <query>`

Generate a step-by-step implementation plan for a code change. Saves the plan to .codemem/last-plan.json for use with codemem apply.

Requires the sidecar to be running and an AI API key (OPENAI_API_KEY or ANTHROPIC_API_KEY).

codemem plan "add JWT validation to the login route"
codemem plan "refactor the database layer to use connection pooling" --provider anthropic
codemem plan "add error handling to all API routes" --top 10

`codemem apply`

Load the plan from .codemem/last-plan.json, generate complete file patches, show a diff preview, and apply them after your confirmation.

codemem apply
codemem apply --validate          # run build/tests after applying
codemem apply --provider openai   # override AI provider

The command will:

Show the saved plan summary
Generate complete updated file contents
Display a line-level diff preview per file
Prompt Apply? [y/N] — no files are written until you say y
Back up originals to .codemem/backups/<timestamp>/ before overwriting
Optionally run npm run build and npm test to verify the changes

`codemem ask <query>` (updated)

The ask command now supports --output=patch to instruct the AI to return complete updated files instead of an explanation.

codemem ask "fix the authentication bug" --output patch
codemem ask "add input validation to the signup endpoint" --output patch --stream

Output modes:

context (default) — explanation and code snippets
patch — complete updated file contents ready to copy-paste

`codemem mcp`

Start the MCP JSON-RPC bridge for Claude Desktop / Cursor style tools.

codemem mcp

If your sidecar is running on a non-default port:

CODEMEM_PORT=9000 codemem mcp

`codemem doctor`

Run a health check on the installation and environment.

codemem doctor

HTTP API

CodeMem provides a local REST API on the running sidecar.

`POST /api/v1/query`

Search the current project and return assembled code context.

curl -X POST http://localhost:8432/api/v1/query \
  -H "Content-Type: application/json" \
  -d '{"query":"how does authentication work","options":{"top_k":5}}'

`POST /api/v1/index`

Trigger indexing from the API.

curl -X POST http://localhost:8432/api/v1/index \
  -H "Content-Type: application/json" \
  -d '{"mode":"incremental"}'

`GET /api/v1/status`

Check server and index state.

curl http://localhost:8432/api/v1/status

If you use a custom port, replace 8432 with the port passed to codemem start --port <port>.

MCP Tool Bridge

codemem mcp starts a JSON-RPC tool bridge over stdin/stdout.

It exposes four tools for local code intelligence:

| Tool | Description | |------|-------------| | search_codebase | Semantic search — always call before answering code questions | | plan_change | Generate a step-by-step implementation plan | | generate_patch | Generate complete updated file contents based on a plan | | apply_patch | Apply patches to disk — requires approved: true from the user |

Use CODEMEM_PORT to point the MCP bridge at a custom sidecar port.

CODEMEM_PORT=9000 codemem mcp

This is useful for AI applications that support local tool integration via Claude Desktop, Cursor, or other JSON-RPC tool runners.

Agent HTTP API

The sidecar also exposes agent endpoints for programmatic use.

`POST /api/v1/plan`

Generate an implementation plan.

curl -X POST http://localhost:8432/api/v1/plan \
  -H "Content-Type: application/json" \
  -d '{"query":"add rate limiting to the API","top_k":8}'

Returns { query, summary, steps: [{ file, action, description }] }.

`POST /api/v1/patch`

Generate file patches from a plan.

curl -X POST http://localhost:8432/api/v1/patch \
  -H "Content-Type: application/json" \
  -d '{"plan":{...}}'

Returns { description, patches: [{ file, content }], preview }.

`POST /api/v1/validate`

Run the project's build and test scripts.

curl -X POST http://localhost:8432/api/v1/validate

Returns { success, ran, errors, duration_ms }.

`POST /api/v1/apply`

Apply patches. approved must be true — never set this without user review.

curl -X POST http://localhost:8432/api/v1/apply \
  -H "Content-Type: application/json" \
  -d '{"patches":[{"file":"src/api.ts","content":"..."}],"approved":true}'

Returns { applied, backups, backupDir }. Original files are saved to .codemem/backups/ before overwriting.

How to resolve common requirements

If codemem is not installed, run npm install and npm run build.
If the CLI is not found, run npm link from the repo root.
If the sidecar cannot connect, ensure it is running with codemem start and use CODEMEM_PORT when needed.
If the index is stale or corrupted, remove .codemem/db and run codemem init again.
If the model is missing, codemem init downloads it automatically.

Notes

.codemem/ is generated automatically by codemem init.
The first initialization downloads the model and performs the full index build.
Incremental updates after that are much faster.
Use different project folders for separate indexes.
If a port is busy, choose a different port and update CODEMEM_PORT for MCP.

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

CodeMem

What CodeMem Solves

Requirements

Verify your environment

Installation

Option A — Clone and build locally (recommended)

Option B — Install from npm (when published)

Getting Started

Need command help?

Core Usage

codemem init

codemem start

codemem stop

codemem status

codemem stats

codemem search <query>

codemem reindex

codemem ask <query>

codemem chat

Hosted web AI providers

codemem plan <query>

codemem apply

codemem ask <query> (updated)

codemem mcp

codemem doctor