@rbalchii/anchor-engine v4.9.5
Anchor Engine ⚓
Deterministic semantic memory for LLMs – local-first, graph traversal, <1GB RAM
🌟 Why Anchor Engine?
Modern AI memory is broken.
Vector databases demand GPUs, gigabytes of RAM, and cloud infrastructure. They're opaque, expensive, and fundamentally incompatible with personal, local‑first AI.
Anchor Engine takes a different path.
It's a deterministic, explainable, CPU‑only memory engine that runs on anything—from a $200 laptop to a workstation—and retrieves context using a physics‑inspired graph algorithm instead of dense vectors.
If you want:
- Local‑first AI – your data stays yours
- Explainable retrieval – know why something was returned
- Deterministic results – same query, same answer, every time
- Zero cloud dependency – no API keys, no servers
- <1GB RAM usage – runs alongside your browser
- High‑speed ingestion – 100MB in minutes
…then Anchor Engine is built for you.
⚡ Quick Start (5 Minutes)
Option 1: npm Install (Recommended)
# Install the package
npm install @rbalchii/anchor-engine
# Or globally for CLI access
npm install -g @rbalchii/anchor-engine
# Start the engine
anchor start
# Open your browser
open http://localhost:3160
Option 2: From Source
# 1. Clone & Install
git clone https://github.com/RSBalchII/anchor-engine-node.git
cd anchor-engine-node
pnpm install
pnpm build
# 2. Start the engine
pnpm start
# 3. Open your browser
open http://localhost:3160
That's it! You now have a sovereign memory system for your LLM.
Data Directory Structure
All user data is stored in the local-data/ directory:
- local-data/inbox/ - Your sovereign content (internal provenance, 3.0x boost)
  - Files you create or write directly
  - Higher priority in search results
- local-data/external-inbox/ - External content (external provenance, 1.0x boost)
  - Web scrapes, news articles, RSS feeds
  - Content from external sources
- local-data/mirrored_brain/ - Auto-generated mirror of ingested content
  - Cleaned, processed versions of your files
  - Stripped of noise, timestamps, and PII
Note: The local-data/ directory is gitignored by default to protect your data and privacy.
Full installation guide: docs/guides/INSTALL_NPM.md
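The provenance boosts above can be sketched as a simple multiplier applied to a retrieval score. This is a minimal illustration, not the engine's actual code: the function and type names are hypothetical; only the 3.0x/1.0x multipliers come from the bucket descriptions.

```typescript
// Hypothetical sketch of provenance-weighted scoring.
// Multipliers match the bucket docs; names are illustrative only.
type Bucket = "inbox" | "external-inbox";

const PROVENANCE_BOOST: Record<Bucket, number> = {
  "inbox": 3.0,            // sovereign content you wrote yourself
  "external-inbox": 1.0,   // web scrapes, feeds, untrusted sources
};

function boostedScore(baseScore: number, bucket: Bucket): number {
  return baseScore * PROVENANCE_BOOST[bucket];
}

// Identical raw scores: sovereign content ranks 3x higher.
console.log(boostedScore(0.5, "inbox"));          // → 1.5
console.log(boostedScore(0.5, "external-inbox")); // → 0.5
```

This is why a note you wrote in local-data/inbox/ outranks a web scrape with the same raw relevance.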
📊 Anchor Engine vs. Vector RAG
| Feature | Anchor Engine | Vector RAG |
|---------|---------------|------------|
| Hardware | CPU‑only | GPU preferred |
| RAM Usage | <1GB | 4–8GB |
| Explainability | Native (tags, hops, decay) | None (black box) |
| Deterministic | ✅ Yes | ❌ No |
| Cloud Required | ❌ No | Often |
| Retrieval Complexity | O(k·d̄) linear | O(n log n) |
| License | AGPL-3.0 (open) | Varies (often proprietary) |
🧠 The STAR Algorithm
Anchor Engine uses STAR (Semantic Temporal Associative Retrieval)—a physics‑inspired scoring equation that combines:
W(q,a) = |T(q) ∩ T(a)| · γ^d(q,a) · e^(−λΔt) · (1 − H(h_q, h_a)/64)

The three factors are, in order, semantic gravity, temporal decay, and structural gravity:

| Component | What It Does |
|-----------|--------------|
| Semantic Gravity | Shared tags × hop‑distance damping (γ = 0.85) |
| Temporal Decay | Recent memories pull harder (half-life ~115 min) |
| Structural Gravity | SimHash proximity (64-bit fingerprints) |
Result: O(k·d̄) retrieval—dramatically faster than vector ANN for personal datasets. Every atom includes provenance: shared tags, hop distance, recency, and byte‑range pointers.
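As a rough illustration, the scoring equation can be sketched in a few lines of TypeScript. The function and input names here are hypothetical; only γ = 0.85 and the ~115-minute half-life come from the table above.

```typescript
// Sketch of the STAR scoring equation, for illustration only.
const GAMMA = 0.85;                // hop-distance damping γ
const LAMBDA = Math.log(2) / 115;  // temporal decay λ, half-life ~115 minutes

/** Hamming distance H between two 64-bit SimHash fingerprints. */
function hamming(a: bigint, b: bigint): number {
  let x = a ^ b;
  let count = 0;
  while (x) { count += Number(x & 1n); x >>= 1n; }
  return count;
}

/** W(q,a) = |T(q) ∩ T(a)| · γ^d · e^(−λΔt) · (1 − H/64) */
function starScore(
  sharedTags: number,   // |T(q) ∩ T(a)|
  hopDistance: number,  // d(q,a)
  ageMinutes: number,   // Δt
  hashQ: bigint,        // 64-bit SimHash of the query
  hashA: bigint,        // 64-bit SimHash of the atom
): number {
  const semanticGravity = sharedTags * Math.pow(GAMMA, hopDistance);
  const temporalDecay = Math.exp(-LAMBDA * ageMinutes);
  const structuralGravity = 1 - hamming(hashQ, hashA) / 64;
  return semanticGravity * temporalDecay * structuralGravity;
}

// An atom 115 minutes old scores half what an identical fresh atom does.
const fresh = starScore(3, 1, 0, 0b1011n, 0b1001n);
const aged  = starScore(3, 1, 115, 0b1011n, 0b1001n);
console.log((aged / fresh).toFixed(3)); // → "0.500"
```

Because every factor is a pure function of stored metadata, the same query over the same graph always yields the same ranking.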
⚓ Why "Anchor"?
Drop a query into your memory graph.
The Anchor finds the semantic bottom.
The chain plays out to your chosen radius.
What's retrieved is what's relevant—nothing more, nothing less.
Same anchor, same spot, same result. Every time.
This isn't a metaphor. It's the algorithm.
| Anchor Quality | How It Maps to STAR |
|----------------|---------------------|
| Drops to a point | Query finds seed atoms (anchor discovery) |
| Defines a radius | Chain length = token budget (hop distance) |
| Holds against drift | Deterministic retrieval (no semantic drift) |
| Connects to the ship | Retrieved context → LLM (your ship) |
| Works in deep water | Scales from 10MB to 10GB graphs |
The name isn't marketing. It's architecture.
📥 Ingest Your Data
Option 1: Web UI (Easiest)
- Open http://localhost:3160
- Click "Manage Paths" → Add folders to watch
- Or use "Paste & Ingest" tab for quick text
Option 2: API
curl -X POST http://localhost:3160/v1/research/upload-raw \
-H "Content-Type: application/json" \
-d '{
"content": "Your text here...",
"filename": "notes.md",
"bucket": "inbox"
}'
Option 3: MCP (Claude, Cursor, Qwen Code)
/anchor_ingest_text content="Meeting notes..." filename="meeting.md" bucket="inbox"
🔍 Search Your Memory
Web UI
- Natural language queries
- Adjustable token budget slider
- Filter by buckets/tags
API
curl -X POST http://localhost:3160/v1/memory/search \
-H "Content-Type: application/json" \
-d '{
"query": "What did we discuss about OAuth?",
"token_budget": 2048
}'
MCP Tools
/anchor_query query="OAuth authentication setup"
/anchor_search_index query="career planning"
/anchor_fetch_session session_id="abc-123"
🧠 MCP Server – Memory for Your AI Agent
Anchor Engine can act as an MCP (Model Context Protocol) server, giving any MCP‑compatible agent persistent, deterministic memory.
Install
# Install globally for CLI access
npm install -g @rbalchii/anchor-engine
# Or use npx directly
npx @rbalchii/anchor-engine
Start the MCP Server
# Option 1: Use the dedicated MCP binary
anchor-mcp
# Option 2: Start via main CLI
anchor mcp
# Option 3: Run directly with npx
npx anchor-mcp
The server runs in stdio mode, communicating directly with your AI agent.
Available Tools
| Tool | Description |
|------|-------------|
| anchor_query | Search memory with token budget and provenance tracking |
| anchor_distill | Create checkpoint summaries from sessions |
| anchor_illuminate | BFS graph traversal for exploration |
| anchor_read_file | Read files by line ranges (token-efficient) |
| anchor_list_compounds | List available source files |
| anchor_get_stats | Check engine health and database stats |
| anchor_ingest_text | Add raw text content (opt-in, disabled by default) |
| anchor_ingest_file | Ingest files from filesystem (opt-in) |
Configuration
Create a user_settings.json in your project root:
{
"server": {
"port": 3160,
"api_key": "your-secret-key"
},
"mcp": {
"allow_write_operations": false,
"default_bucket_for_writes": "external-inbox",
"rate_limit_requests_per_minute": 60,
"max_query_results": 50
}
}
Or set environment variables:
export ANCHOR_API_URL="http://localhost:3160"
export ANCHOR_API_KEY="your-secret-key"
Agent Examples
Claude Desktop
Add to claude_desktop_config.json:
{
"mcpServers": {
"anchor": {
"command": "npx",
"args": ["@rbalchii/anchor-engine", "mcp"]
}
}
}
Or if installed globally:
{
"mcpServers": {
"anchor": {
"command": "anchor-mcp"
}
}
}
Qwen Code
Qwen Code automatically detects MCP servers. Just run:
anchor-mcp
Then use tools in chat:
/anchor_query query="What did we decide about authentication?"
Cursor
Add to Cursor MCP settings:
{
"command": "anchor-mcp",
"env": {
"ANCHOR_API_URL": "http://localhost:3160"
}
}
Quick-Start Demo (2 Minutes)
# 1. Start the MCP server
anchor-mcp &
# 2. Ingest some text (via API for demo)
curl -X POST http://localhost:3160/v1/research/upload-raw \
-H "Content-Type: application/json" \
-d '{
"content": "The STAR algorithm uses physics-based scoring with temporal decay.",
"filename": "star-notes.md",
"bucket": "inbox"
}'
# 3. Search for it
curl -X POST http://localhost:3160/v1/memory/search \
-H "Content-Type: application/json" \
-d '{
"query": "STAR algorithm physics scoring",
"token_budget": 1024
}'
# 4. Or use MCP tools in your AI agent
# In Claude/Qwen: "/anchor_query query=STAR algorithm"
Security
- Write operations disabled by default – Must explicitly enable in settings
- Rate limiting – Configurable requests per minute
- API key authentication – Optional but recommended
- Localhost restriction – Only accept local connections by default
- Bucket safety – Defaults to external-inbox for untrusted content
See full documentation: mcp-server/README.md
🏛️ Our Philosophy: AI Memory Should Work Like Your Brain
Human memory is remarkably efficient. It runs on ~20 watts, forgets irrelevant details, and over time clarifies core truths rather than drowning in noise. It doesn't store raw experiences—it stores patterns, relationships, and meaning.
Most AI memory systems do the opposite: they hoard data, brute‑force compute similarity, and require massive infrastructure.
Anchor Engine was built on a different premise: AI memory should work like the human mind—lightweight, connected, and self‑clarifying.
| Principle | What It Means | How Anchor Implements It |
|-----------|---------------|--------------------------|
| 🧠 Forgetting is a feature | The brain forgets constantly, leaving only what matters | distill: command removes redundancy; temporal decay |
| 🔗 Meaning lives in relationships | We store how concepts connect, not isolated facts | Graph model with typed edges; STAR algorithm |
| ⚡ Low power, high efficiency | The brain achieves its magic on ~20 watts | Pointer‑only database; content on disk; <1GB RAM |
| 💎 Clarity through distillation | Memory builds higher‑level abstractions over time | Decision Records v2.0 extract the why |
| 🔍 Explainability builds trust | You know why a memory came to mind | Provenance tracking; receipts with timestamps |
Why This Matters: Most AI memory systems are built for scale, not for sense. Anchor Engine is designed for sense‑making—for agents that need to remember not just what happened, but why, and to get clearer over time.
🏗️ Architecture at a Glance
┌─────────────────────────────────────────────────────────────┐
│ YOU │
└────────────────────┬────────────────────────────────────────┘
│
▼
┌─────────────────────────────────────────────────────────────┐
│ ⚡ ANCHOR ENGINE │
│ (Deterministic Memory Layer) │
│ │
│ - Graph traversal (STAR algorithm) │
│ - Pointer-only index (<1GB RAM) │
│ - Deterministic retrieval (same query = same result) │
└────────────────────┬────────────────────────────────────────┘
│
┌───────────┼───────────┐
│ │ │
▼ ▼ ▼
┌──────────────┐ ┌──────────┐ ┌─────────────┐
│ PGlite │ │ mirrored │ │ MCP Clients │
│ (WASM DB) │ │ _brain/ │ │ (Claude, │
│ │ │ (Content)│ │ Cursor) │
└──────────────┘  └──────────┘  └─────────────┘
Key Insight: Content lives in the mirrored_brain/ filesystem. The database stores pointers only (byte offsets + metadata). This makes the database disposable and rebuildable—wipe it and restore in minutes.
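A minimal sketch of the pointer-only idea, assuming a pointer is just a {file, start, end} record (the engine's actual schema may differ; all names here are illustrative):

```typescript
// Illustrative sketch: the database row stores only a byte range into
// mirrored_brain/, and content is read back from disk on demand.
import { openSync, readSync, closeSync, writeFileSync } from "node:fs";

interface AtomPointer {
  file: string;   // path under mirrored_brain/
  start: number;  // byte offset of the atom's content
  end: number;    // exclusive end offset
}

function readAtom(ptr: AtomPointer): string {
  const fd = openSync(ptr.file, "r");
  try {
    const buf = Buffer.alloc(ptr.end - ptr.start);
    readSync(fd, buf, 0, buf.length, ptr.start); // read only the pointed-to range
    return buf.toString("utf8");
  } finally {
    closeSync(fd);
  }
}

// Demo: write a mirror file, then resolve a pointer into it.
writeFileSync("demo-mirror.md", "alpha beta gamma");
console.log(readAtom({ file: "demo-mirror.md", start: 6, end: 10 })); // → "beta"
```

Because RAM holds only offsets and metadata, the index stays small no matter how large the mirrored content grows.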
📊 Benchmarks (Real Production Data)
Dataset: 91MB chat history (~25M tokens)
| Metric | Result |
|--------|--------|
| Molecules | 280,000 |
| Atoms | 151,876 |
| Files | 436 |
| Ingestion Time | 178 seconds |
| Search Latency (p95) | <200ms |
| Memory (idle) | ~600MB |
| Memory (peak) | ~1.6GB |
| Restore Speed | 281,690 atoms in 13.8 min |
Hardware: AMD Ryzen / Intel i7, 16GB RAM, NVMe SSD, no GPU.
🛠️ What's New in v4.8.0
MCP Write Operations
- anchor_ingest_text - Ingest raw text directly
- anchor_ingest_file - Ingest files from filesystem
- Security toggle (opt-in via user_settings.json)
Session Index
- anchor_search_index - Fast chat session lookup
- anchor_fetch_session - Targeted session retrieval
- Two-tier memory retrieval
Web UI Improvements
- Paste & Ingest tab - Quick text ingestion
- Version badge (v4.8.0)
- Bucket selector (inbox vs external-inbox)
Documentation Overhaul
- 5 new docs (API, Deployment, Troubleshooting, Source Overview, Testing)
- Philosophy embedded throughout
- 7 redundant files archived
📚 Documentation
Getting Started
- Quick Start - Install & first query
- docs/API.md - Complete API reference
- docs/DEPLOYMENT.md - Deployment guide (local, Docker, VPS, K8s)
- docs/TROUBLESHOOTING.md - Common issues & fixes
Deep Dive
- docs/whitepaper.md - STAR algorithm whitepaper
- specs/spec.md - System specification with diagrams
- specs/current-standards/ - Active standards (001-010)
- engine/src/README.md - Source code overview
Integration
- mcp-server/README.md - MCP integration (Claude, Cursor, Qwen)
- tests/README.md - Testing guide
- benchmarks/README.md - Performance benchmarks
🤝 Contributing
We're building in the open and welcome your input!
- Star the repo – Helps others find it
- Open an issue – Bugs, features, questions
- Start a discussion – Share use cases
- Contribute – PRs welcome!
See CONTRIBUTING.md for guidelines.
Community Health:
- CODEOWNERS - Automatic reviewer assignment
- CODE_OF_CONDUCT.md - Community standards
- CONTRIBUTING.md - Contribution guide
📜 License
AGPL-3.0 – see LICENSE.
🙏 Acknowledgments
Built with ❤️ by Robert Balch II and contributors.
Inspired by the r/LocalLLaMA and r/aiagents communities.
Citation:
@software{anchor_engine,
title = {STAR: Semantic Temporal Associative Retrieval},
author = {Balch II, R.S.},
version = {4.8.0},
date = {2026-03-18},
url = {https://github.com/RSBalchII/anchor-engine-node},
doi = {10.5281/zenodo.18841399}
}

Your AI's anchor to reality. ⚓
