n2-arachne

v4.0.4

Published

2 months ago

Arachne — Weaves code into optimal AI context via MCP. BM25 + semantic search, 4-layer assembly, any AI provider.


lagi01

mcp ai context code-indexing llm rag local-first typescript

Arachne (n2-arachne)


ARACHNE V4.0.0 TITANIUM EDITION

1GB (1,000,000,000 Bytes) Codebase Search in 0.54 Seconds. Powered by Zero-Marshaling Architecture. 0 Bytes JS Heap Bloat. No OOM Crashes.

Weave your codebase into the perfect context for AI

What's New in v4.0 (Titanium Edition)

Arachne has been completely rewritten from JS to strict TypeScript, with heavy computational paths natively rewritten in C++ SIMD (sqlite-vec) and Rust (napi-rs).

TypeScript (Strict): 100% type safety with zero runtime regressions.
Rust Acceleration: BM25 1.3x faster (memchr SIMD + rayon parallel); BatchCosine 19.9x faster (96ms → 4.8ms).
C++ SIMD Search: sqlite-vec scans 10,000 × 768D vectors in just 25ms natively.
KV-Cache Integration: Soul-bridge incremental re-indexing enables sub-second history load on start.

Arachne is named after the greatest weaver of Greek mythology.

The Problem — Why AI Gets Your Code Wrong

Imagine going to a doctor and saying "I have a headache."

Bad doctor: reads your entire 500-page medical history, gets confused, prescribes the wrong medicine
Good doctor: looks at relevant records only — recent symptoms, medications, allergies — and nails the diagnosis

AI coding assistants are like that bad doctor.

When your project has 500 files, AI can't read them all. So what happens?

 Your Project (500 files, 2M tokens)
│
├── auth/login.ts         ← The bug is HERE
├── auth/session.ts       ← login imports this
├── api/http.ts           ← session imports this
├── utils/config.ts       ← timeout settings live here
│
├── pages/home.tsx        ← completely irrelevant
├── pages/about.tsx       ← completely irrelevant
├── components/Button.tsx ← completely irrelevant
└── ... 493 more files    ← all irrelevant

| Approach | What AI receives | Result |
|----------|-----------------|--------|
| Dump everything | 2,000,000 tokens | Exceeds context window, AI confused |
| Random files | ~50,000 tokens | Misses critical code, wrong fix |
| Arachne | 30,000 tokens (4 relevant files) | Precise fix, every time |

Tokens are the units of text an AI model reads. More tokens mean higher cost, slower responses, and lower accuracy. AI has a limited "context window", like a desk that can only hold so many papers.

Real-World Benchmark (N2 Browser — 3,219 files)

| Metric | Value |
|--------|:-----:|
| Project size | 3,219 files, 4.68M tokens |
| Arachne output | 14,074 tokens |
| Compression | 333x (99.7% reduction) |
| Index time | 627ms (incremental: 0ms) |
| DB size | 24 MB |

Measured on a real production project. Arachne delivered exactly what AI needed — 333x less data, same accuracy.
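The compression figures in the table can be checked with a few lines of arithmetic. The sketch below is only a worked example; `compressionStats` is a hypothetical helper, not part of Arachne's API, and the input uses the rounded 4.68M figure from the table.

```typescript
// Hypothetical helper for illustration; not part of Arachne's API.
interface CompressionStats {
  ratio: number;            // input tokens divided by output tokens
  reductionPercent: number; // share of tokens removed, in percent
}

function compressionStats(inputTokens: number, outputTokens: number): CompressionStats {
  if (inputTokens <= 0 || outputTokens <= 0) {
    throw new RangeError("token counts must be positive");
  }
  return {
    ratio: inputTokens / outputTokens,
    reductionPercent: (1 - outputTokens / inputTokens) * 100,
  };
}

// Figures from the benchmark table: 4.68M tokens in, 14,074 tokens out.
const benchmark = compressionStats(4680000, 14074);
console.log(Math.round(benchmark.ratio));            // 333
console.log(benchmark.reductionPercent.toFixed(1));  // "99.7"
```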

The Solution: Arachne Picks Exactly What AI Needs

Arachne is a local MCP server that acts like that good doctor. It reads your entire codebase once, understands the structure, and only sends what's relevant to AI.

You: "Fix the login timeout bug"
                │
                ▼
┌──────────────────────────────────────────────────────┐
│  Arachne: "I'll find exactly what you need"          │
│                                                      │
│  L1  Project tree (so AI knows the structure)       │
│  L2  login.ts (the file you're working on)          │
│  L3  http.ts, session.ts (found via search +        │
│      dependency chain: login → session → http)      │
│  L4  config.ts (frequently accessed, has timeout)    │
│                                                      │
│  → 30,000 tokens of perfectly curated context        │
└──────────────────────────────────────────────────────┘
                │
                ▼
        AI generates accurate fix

No manual file selection. No prompt engineering. Just ask.

Why Arachne?

98.5% token savings — 30K instead of 2M tokens. Real money saved on API calls
Beats "Lost in the Middle" — Smart output ordering (L1→L3→L4→L2) keeps critical code where AI pays attention (research-backed)
Zero external services: No Docker, no cloud, no API keys. Just npm install and go
Blazing fast: 21 files indexed in 12ms. Sub-second incremental updates
Ultralight — Only 3 deps: better-sqlite3, sqlite-vec, zod. No bloat
Free for personal/OSS — Apache-2.0 for individuals and open-source projects. No telemetry
Plug & play — Add MCP config → done. Zero code changes to your project
Multi-language — Follows import chains across JS/TS, Python, Rust, Go, Java
Ollama optional — Works perfectly without Ollama (BM25 search). Add Ollama for bonus semantic search
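For readers unfamiliar with BM25, here is a minimal sketch of the standard Okapi BM25 ranking formula over in-memory documents. It is not Arachne's Rust implementation; the tokenizer, the `k1` and `b` defaults, and all names (`Doc`, `tokenize`, `bm25Search`) are assumptions chosen for illustration.

```typescript
// Minimal Okapi BM25 over in-memory documents. Illustrative only.
type Doc = { id: string; text: string };

function tokenize(text: string): string[] {
  return text.toLowerCase().match(/[a-z0-9_]+/g) ?? [];
}

function bm25Search(
  docs: Doc[],
  query: string,
  topK = 5,
  k1 = 1.2,
  b = 0.75,
): Array<{ id: string; score: number }> {
  const tokenized = docs.map((d) => tokenize(d.text));
  const avgLen =
    tokenized.reduce((sum, t) => sum + t.length, 0) / Math.max(docs.length, 1);

  // Document frequency: in how many documents each term appears.
  const df = new Map<string, number>();
  for (const tokens of tokenized) {
    for (const term of new Set(tokens)) df.set(term, (df.get(term) ?? 0) + 1);
  }

  const queryTerms = Array.from(new Set(tokenize(query)));
  const scored = docs.map((doc, i) => {
    const tokens = tokenized[i] ?? [];
    const tf = new Map<string, number>();
    for (const t of tokens) tf.set(t, (tf.get(t) ?? 0) + 1);

    let score = 0;
    for (const term of queryTerms) {
      const f = tf.get(term) ?? 0;
      if (f === 0) continue;
      const n = df.get(term) ?? 0;
      // Non-negative IDF variant (as used by Lucene).
      const idf = Math.log(1 + (docs.length - n + 0.5) / (n + 0.5));
      const lengthNorm = 1 - b + b * (tokens.length / avgLen);
      score += (idf * (f * (k1 + 1))) / (f + k1 * lengthNorm);
    }
    return { id: doc.id, score };
  });

  return scored
    .filter((s) => s.score > 0)
    .sort((x, y) => y.score - x.score)
    .slice(0, topK);
}

const hits = bm25Search(
  [
    { id: "auth/login.ts", text: "export function login() { /* session timeout */ }" },
    { id: "utils/config.ts", text: "export const timeout = 30" },
    { id: "pages/home.tsx", text: "render home page" },
  ],
  "login timeout",
);
console.log(hits.map((h) => h.id)); // ["auth/login.ts", "utils/config.ts"]
```

Documents that match no query term are dropped, so irrelevant files never enter the result set.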

Arachne in 4 Panels — Titanium Edition

Arachne V4.0 Titanium — AI drowning in 500 files, Titanium Spider arrives, scans 1GB in 0.54s, delivers 4 perfect files. 30K tokens, done.

Soul + Arachne Synergy

Soul remembers past sessions. Arachne finds the code. Together, AI never forgets and never misses.

Key Features

| Feature | Description |
|---------|-------------|
| MCP Standard | Works with Claude, Gemini, GPT, Ollama, or any other AI provider |
| Local-First | All indexing in local SQLite. Zero data leaves your machine |
| Incremental | Only re-indexes changed files. Sub-second updates |
| Hybrid Search | BM25 keyword + semantic vector search (Ollama embeddings) |
| 4-Layer Assembly | Smart context paging within token budget |
| Dependency Graph | Follows import chains across JS/TS, Python, Rust, Go, Java |
| Backup & Restore | SQLite online backup with in-backup search |
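To make the Dependency Graph row concrete, the sketch below extracts import specifiers from JS/TS source with a simplified regex and then walks the resulting graph breadth-first. It is an assumption about how such a feature can work, not Arachne's actual code; `extractImports`, `dependencyChain`, and the `maxDepth` parameter are hypothetical names.

```typescript
type ImportGraph = Map<string, string[]>; // file -> files it imports

// Simplified JS/TS import extraction. A real parser handles more cases
// (dynamic imports, comments, multi-line statements).
function extractImports(source: string): string[] {
  const pattern = /(?:from\s+|require\(\s*|import\s+)['"]([^'"]+)['"]/g;
  const found: string[] = [];
  let match: RegExpExecArray | null;
  while ((match = pattern.exec(source)) !== null) {
    const spec = match[1];
    if (spec !== undefined) found.push(spec);
  }
  return found;
}

// Breadth-first walk from an entry file, limited to maxDepth hops.
function dependencyChain(graph: ImportGraph, entry: string, maxDepth = 3): string[] {
  const visited = new Set<string>([entry]);
  let frontier = [entry];
  const order: string[] = [];
  for (let depth = 0; depth < maxDepth && frontier.length > 0; depth++) {
    const next: string[] = [];
    for (const file of frontier) {
      for (const dep of graph.get(file) ?? []) {
        if (!visited.has(dep)) {
          visited.add(dep);
          order.push(dep);
          next.push(dep);
        }
      }
    }
    frontier = next;
  }
  return order;
}

const graph: ImportGraph = new Map<string, string[]>([
  ["auth/login.ts", ["auth/session.ts"]],
  ["auth/session.ts", ["api/http.ts"]],
  ["api/http.ts", []],
]);
console.log(dependencyChain(graph, "auth/login.ts")); // ["auth/session.ts", "api/http.ts"]
```

The depth limit keeps the walk from pulling in the whole project when a file sits near the root of the import graph.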

Architecture: 4-Layer Context Assembly

┌─────────────────────────────────────────────┐
│              Token Budget (e.g. 30K)        │
├────────────┬────────────────────────────────┤
│ L1: Fixed  │ File tree overview (10%)       │
│ (always)   │ Project structure snapshot     │
├────────────┼────────────────────────────────┤
│ L2: Short  │ Current file + recent (20%)   │
│ (context)  │ What you're working on now     │
├────────────┼────────────────────────────────┤
│ L3: Assoc  │ Search + dependencies (50%)  │
│ (relevant) │ BM25 + semantic + dep chain   │
├────────────┼────────────────────────────────┤
│ L4: Spare  │ Frequently accessed (20%)     │
│ (backup)   │ Files you use most            │
└────────────┴────────────────────────────────┘

Output order: L1 → L3 → L4 → L2  (mitigates "Lost in the Middle")
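The budget split and output order in the diagram can be sketched as follows. The percentages and the L1 → L3 → L4 → L2 order come from the diagram; the function names and the `Section` shape are hypothetical, and how Arachne trims content inside each layer is not shown here.

```typescript
type Layer = "L1" | "L2" | "L3" | "L4";

// Budget shares (in percent) and output order, as given in the diagram.
const LAYER_PERCENT: Record<Layer, number> = { L1: 10, L2: 20, L3: 50, L4: 20 };
const OUTPUT_ORDER: Layer[] = ["L1", "L3", "L4", "L2"];

// Split a total token budget across the four layers.
function allocateBudget(totalTokens: number): Record<Layer, number> {
  const total = Number.isFinite(totalTokens) ? Math.max(0, Math.floor(totalTokens)) : 0;
  const share = (layer: Layer): number => Math.floor((total * LAYER_PERCENT[layer]) / 100);
  return { L1: share("L1"), L2: share("L2"), L3: share("L3"), L4: share("L4") };
}

interface Section {
  layer: Layer;
  text: string;
}

// Reorder assembled sections so the active file (L2) comes last.
function orderForOutput(sections: Section[]): Section[] {
  return [...sections].sort(
    (a, b) => OUTPUT_ORDER.indexOf(a.layer) - OUTPUT_ORDER.indexOf(b.layer),
  );
}

console.log(allocateBudget(30000)); // { L1: 3000, L2: 6000, L3: 15000, L4: 6000 }
console.log(
  orderForOutput([
    { layer: "L2", text: "login.ts" },
    { layer: "L1", text: "tree" },
    { layer: "L4", text: "config.ts" },
    { layer: "L3", text: "session.ts" },
  ]).map((s) => s.layer),
); // ["L1", "L3", "L4", "L2"]
```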

Semantic Search (Optional, Zero Lock-in)

When Ollama is available, Arachne upgrades from keyword-only to hybrid search:

BM25 Score (keyword) ──┐
                       ├── Weighted Merge (α=0.5) → Best Results
Cosine Similarity ─────┘
(nomic-embed-text 768D)

sqlite-vec for SIMD-accelerated (AVX2/SSE2/Neon) KNN vector search
768-dimensional embeddings via Ollama nomic-embed-text — runs 100% local
Graceful degradation: No Ollama? Falls back to BM25-only. Zero crashes. Always works.
Enable in config: embedding.enabled = true
Vector storage: ~3KB per chunk. 5000 chunks = just 15MB on disk
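A weighted merge with BM25-only fallback could look like the sketch below. Two assumptions are made that the text above does not confirm: scores are min-max normalized before merging so the two scales are comparable, and α weights the keyword side. The names `Scored`, `normalize`, and `hybridMerge` are hypothetical.

```typescript
interface Scored {
  id: string;
  score: number;
}

// Min-max normalize scores to [0, 1] (an assumption, see lead-in).
function normalize(results: Scored[]): Map<string, number> {
  const out = new Map<string, number>();
  if (results.length === 0) return out;
  const scores = results.map((r) => r.score);
  const min = Math.min(...scores);
  const max = Math.max(...scores);
  for (const r of results) {
    out.set(r.id, max === min ? 1 : (r.score - min) / (max - min));
  }
  return out;
}

function hybridMerge(bm25: Scored[], cosine: Scored[] | null, alpha = 0.5): Scored[] {
  // Graceful degradation: without embeddings, return the keyword ranking.
  if (cosine === null || cosine.length === 0) {
    return [...bm25].sort((a, b) => b.score - a.score);
  }
  const keyword = normalize(bm25);
  const semantic = normalize(cosine);
  const ids = new Set<string>([...keyword.keys(), ...semantic.keys()]);
  const merged: Scored[] = [];
  for (const id of ids) {
    const score = alpha * (keyword.get(id) ?? 0) + (1 - alpha) * (semantic.get(id) ?? 0);
    merged.push({ id, score });
  }
  return merged.sort((a, b) => b.score - a.score);
}

const keywordHits: Scored[] = [
  { id: "a", score: 10 },
  { id: "b", score: 5 },
  { id: "c", score: 0 },
];
const semanticHits: Scored[] = [
  { id: "b", score: 0.9 },
  { id: "c", score: 0.8 },
  { id: "a", score: 0.1 },
];
console.log(hybridMerge(keywordHits, semanticHits).map((r) => r.id)); // ["b", "a", "c"]
console.log(hybridMerge(keywordHits, null).map((r) => r.id));         // ["a", "b", "c"]
```

In the first call, "b" wins because it is strong on both signals, even though "a" has the best keyword score.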

Benchmark (v4.0)

All benchmarks run on AMD Ryzen 5 5600G, Node v24, Windows x64. Three engines: TypeScript (V8 JIT), Rust (napi-rs), sqlite-vec (C++ SIMD).

Search Performance (10,000 chunks / 768D vectors)

| Search Mode | Engine | Performance | Notes |
|-------------|--------|:-----------:|-------|
| Keyword | 🦀 Rust BM25 (memchr + rayon) | 4.98 ms / query | 1.3x faster than TS |
| Keyword | SQLite LIKE | 0.021 ms / query | DB index, fastest |
| Semantic KNN | sqlite-vec (C++ SIMD) | 29.52 ms / query | 10K × 768D in-DB native scan |
| Batch Cosine | 🦀 Rust (napi-rs) | 4.91 ms / query | Legacy: 22.3x faster, but triggers FFI heap spikes |

Architecture Pivot: 1GB+ Scale Stability

While Rust BatchCosine achieved blazing speeds (4.9ms), loading millions of vectors across the Node.js FFI boundary triggered massive V8 Garbage Collection (GC) pauses and Heap OOM crashes on 1GB+ codebases.

To achieve 100% stability on large-scale datasets, Arachne v4.0 enforces a Zero-Marshaling Policy:

Semantic Search: Delegated 100% to sqlite-vec. At ~29ms per query it is about 6x slower than the Rust BatchCosine path (4.9ms), but vectors never enter the JS heap, so memory overhead in JS is 0 bytes. The Node event loop remains completely unblocked.
Keyword Search: Powered by the Rust BM25 Cache. Chunk data is cached heavily in the Rust heap, crossing the FFI boundary only for small queries and resulting IDs.
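The idea behind the policy is that vectors stay in one place and only a small result set crosses the boundary. The sketch below is a pure-TypeScript stand-in for that shape, not the sqlite-vec or napi-rs API: `knn` scans a flat `Float32Array` and returns nothing but row IDs and cosine distances. All names are hypothetical.

```typescript
// Stand-in for a native vector index. The caller passes one query vector
// in and gets only top-k row IDs and distances back; it never receives
// the stored vectors themselves.
interface Hit {
  rowid: number;
  distance: number; // cosine distance: 0 means same direction
}

function knn(flat: Float32Array, dim: number, query: Float32Array, k: number): Hit[] {
  if (dim <= 0 || query.length !== dim || flat.length % dim !== 0) {
    throw new RangeError("dimension mismatch");
  }
  const rows = flat.length / dim;
  const hits: Hit[] = [];
  for (let row = 0; row < rows; row++) {
    const base = row * dim;
    let dot = 0;
    let normA = 0;
    let normB = 0;
    for (let i = 0; i < dim; i++) {
      const a = flat[base + i] ?? 0;
      const b = query[i] ?? 0;
      dot += a * b;
      normA += a * a;
      normB += b * b;
    }
    const denom = Math.sqrt(normA) * Math.sqrt(normB);
    hits.push({ rowid: row, distance: denom === 0 ? 1 : 1 - dot / denom });
  }
  return hits.sort((x, y) => x.distance - y.distance).slice(0, Math.max(0, k));
}

// Three 2-D vectors stored back to back: rows 0, 1, 2.
const stored = new Float32Array([1, 0, 0, 1, 0.9, 0.1]);
const nearest = knn(stored, 2, new Float32Array([1, 0]), 2);
console.log(nearest.map((h) => h.rowid)); // [0, 2]
```

With 10,000 × 768D vectors the stored data is about 30 MB, while a top-10 result is a few hundred bytes, which is why returning IDs instead of vectors keeps the JS heap flat.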

npm run build && node test/bench-hybrid-engine.js   # Raw engine comparison
npm run build && node test/bench-10mb.js             # Memory scale impact

Results saved to data-hybrid-bench/benchmark-report.json.

Java Support — Built for Enterprise

Arachne provides first-class Java support, designed for large-scale enterprise codebases (5M+ LOC):

| Feature | Description |
|---------|-------------|
| Smart Chunking | Detects class, interface, enum, method, @interface (annotations) |
| Large Class Splitting | Classes over 500 tokens are automatically sub-chunked into individual methods |
| Import Resolution | Parses import com.example.Service and import static org.junit.Assert.* |
| Access Modifiers | Handles public, private, protected, abstract, final, synchronized |
| Generics | Correctly processes <T extends Comparable<T>> and complex generic types |
| Spring/JUnit | Tested with Spring Boot @RestController, JUnit5 static imports, Mockito |
| Binary Exclusion | Automatically ignores .class, .jar, .war, .ear files |

How Large Class Sub-Chunking Works

// 500+ token class → automatically split into methods
public class UserService {       // ← detected as container
    public User findById() {}    // ← sub-chunk 1
    public List<User> findAll() {} // ← sub-chunk 2
    public User save() {}        // ← sub-chunk 3
    // ... fields, constructor   // ← remainder chunk
}

// Small class (<500 tokens) → kept as single chunk (no overhead)
public class TinyDTO { ... }     // ← single chunk, efficient

Why this matters for 5M LOC projects: A single Java class can have 50+ methods spanning thousands of lines. Without sub-chunking, AI would receive the entire class as one blob. With Arachne, AI gets individual methods — enabling precise, targeted code generation.
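A rough sketch of the splitting rule follows. It assumes a 4-characters-per-token estimate and a simplified regex for method headers, neither of which is confirmed to match Arachne's chunker; braces inside strings or comments are not handled. `chunkJavaClass` and the other names are hypothetical.

```typescript
interface Chunk {
  name: string;
  kind: "class" | "method" | "remainder";
  text: string;
}

// Rough estimate: ~4 characters per token (an assumption, not Arachne's tokenizer).
const estimateTokens = (text: string): number => Math.ceil(text.length / 4);

// Index just past the brace closing the one at openIndex, or -1 if unbalanced.
function matchBrace(src: string, openIndex: number): number {
  let depth = 0;
  for (let i = openIndex; i < src.length; i++) {
    if (src[i] === "{") depth++;
    else if (src[i] === "}" && --depth === 0) return i + 1;
  }
  return -1;
}

function chunkJavaClass(className: string, src: string, limit = 500): Chunk[] {
  // Small classes stay whole.
  if (estimateTokens(src) <= limit) return [{ name: className, kind: "class", text: src }];

  // Modifiers, return type, method name, parameter list, optional throws, "{".
  const method =
    /(?:(?:public|private|protected|static|final|abstract|synchronized)\s+)+[\w<>\[\],.? ]+?\s+(\w+)\s*\([^)]*\)\s*(?:throws\s+[\w., ]+)?\{/g;
  const chunks: Chunk[] = [];
  let remainder = "";
  let cursor = 0;
  let m: RegExpExecArray | null;
  while ((m = method.exec(src)) !== null) {
    const end = matchBrace(src, m.index + m[0].length - 1);
    if (end === -1) break;
    remainder += src.slice(cursor, m.index); // header, fields, constructors
    chunks.push({
      name: `${className}.${m[1] ?? "?"}`,
      kind: "method",
      text: src.slice(m.index, end),
    });
    cursor = end;
    method.lastIndex = end; // skip the method body
  }
  remainder += src.slice(cursor);
  if (remainder.trim().length > 0) {
    chunks.push({ name: className, kind: "remainder", text: remainder });
  }
  return chunks;
}
```

Constructors have no return type, so the regex skips them and they land in the remainder chunk together with fields, matching the layout shown in the Java example above.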

Token Impact: Less Is More

Without sub-chunking:
  AI asks: "Fix the findById bug"
  → BM25 hits UserService class
  → Entire class sent: 6,000 tokens  

With sub-chunking:
  AI asks: "Fix the findById bug"
  → BM25 hits findById() method only
  → Just the method sent: 80 tokens (75x savings)

Sub-chunking doesn't cost extra — it saves tokens by sending only what's relevant instead of entire classes.

Stability: 128 Tests, Zero Failures

Arachne is built for production. Every edge case is tested:

| Category | What's Tested |
|----------|---------------|
| SQL Injection | 5 attack patterns including Bobby Tables |
| Null/Empty Input | null, undefined, empty string → safe return |
| Huge Input | 10KB queries → no crash |
| Special Characters | Unicode, emoji, regex chars → handled |
| Ollama Disconnect | Bad endpoint → graceful BM25 fallback |
| Idempotency | Triple re-indexing → same result |
| Extreme Budgets | Budget 0, 1, 1M → all safe |
| Edge topK | topK = -1, 0, 99999 → no crash |
| Schema Safety | Triple init → data survives |

Phase 1 (Indexing/Search):    15/15 
Phase 2 (Assembly/Deps):      26/26 
Phase 2 (KV-Cache Bridge):   33/33 
Phase 3 (Semantic/Hybrid):    10/10 
Stability (Reddit-proof):     44/44 
─────────────────────────────────────
Total:                       128/128
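Several of the categories above come down to normalizing untrusted input before it reaches the index. The sketch below shows one way to do that; the caps, defaults, and the `sanitizeRequest` name are hypothetical and are not taken from Arachne's source.

```typescript
interface SearchRequest {
  query: string;
  topK: number;
  budget: number;
}

// Hypothetical limits chosen for illustration.
const MAX_QUERY_CHARS = 10000;
const MAX_TOP_K = 100;
const MAX_BUDGET = 1000000;

// Coerce a value to an integer in [min, max], or use the fallback.
function clampInt(value: unknown, min: number, max: number, fallback: number): number {
  if (typeof value !== "number" || !Number.isFinite(value)) return fallback;
  return Math.min(max, Math.max(min, Math.floor(value)));
}

function sanitizeRequest(query: unknown, topK: unknown, budget: unknown): SearchRequest {
  // null, undefined, and non-strings all become an empty query.
  const text = typeof query === "string" ? query.slice(0, MAX_QUERY_CHARS).trim() : "";
  return {
    query: text,
    topK: clampInt(topK, 0, MAX_TOP_K, 10),
    budget: clampInt(budget, 0, MAX_BUDGET, 30000),
  };
}

console.log(sanitizeRequest(null, -1, 0));
// { query: "", topK: 0, budget: 0 }
console.log(sanitizeRequest("timeout", 99999, undefined));
// { query: "timeout", topK: 100, budget: 30000 }
```

Sanitizing does not replace parameterized SQL: user text should still be bound as a statement parameter rather than concatenated into the query string.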

Installation

Pro tip: The best way to install? Just ask your AI agent: "Install n2-arachne for me." It knows what to do.

npm install n2-arachne

MCP Config (Claude Desktop / Cursor / etc.)

{
  "mcpServers": {
    "n2-arachne": {
      "command": "node",
      "args": ["/path/to/n2-arachne/dist/index.js"],
      "env": {
        "ARACHNE_PROJECT_DIR": "/path/to/your/project"
      }
    }
  }
}

Configuration

Create config.local.js in the Arachne directory:

module.exports = {
    projectDir: '/path/to/your/project',
    dataDir: './data',

    indexing: {
        autoIndex: true,
        maxFileSize: 512 * 1024,    // 512KB max per file
    },

    // Enable semantic search (requires Ollama)
    embedding: {
        enabled: true,              // default: false
        provider: 'ollama',
        model: 'nomic-embed-text',
        endpoint: 'http://localhost:11434',
    },

    assembly: {
        defaultBudget: 30000,       // tokens
    },
};

Usage (MCP Tool)

Arachne registers a single MCP tool n2_arachne with these actions:

| Action | Description |
|--------|-------------|
| search | BM25 keyword search (+ semantic if enabled) |
| assemble | 4-Layer context assembly within token budget |
| index | Index/re-index project files |
| status | Show indexing stats + embedding status |
| files | List indexed files |
| backup | Create/list/restore backups |

Example: Assemble Context

{
  "action": "assemble",
  "query": "HTTP request timeout error handling",
  "activeFile": "lib/executor.js",
  "budget": 20000
}
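On the wire, an MCP client wraps these arguments in a JSON-RPC 2.0 `tools/call` request. The sketch below builds that envelope by hand for illustration; in practice your MCP client does this for you. The `AssembleArgs` type reflects only the fields shown in the example above, and treating `activeFile` and `budget` as optional is an assumption.

```typescript
// Argument shape inferred from the example above (optional fields are an assumption).
interface AssembleArgs {
  action: "assemble";
  query: string;
  activeFile?: string;
  budget?: number;
}

// Build the JSON-RPC 2.0 request an MCP client sends for a tool call.
function toolCallRequest(id: number, args: AssembleArgs) {
  return {
    jsonrpc: "2.0" as const,
    id,
    method: "tools/call" as const,
    params: { name: "n2_arachne", arguments: args },
  };
}

const request = toolCallRequest(1, {
  action: "assemble",
  query: "HTTP request timeout error handling",
  activeFile: "lib/executor.js",
  budget: 20000,
});
console.log(JSON.stringify(request, null, 2));
```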

Connect with Soul / QLN

Arachne works great standalone, but becomes far more powerful with Soul (session memory) and QLN (tool routing).

Setup is simple — just register them together in your MCP config:

Soul + Arachne Together

{
  "mcpServers": {
    "n2-soul": {
      "command": "node",
      "args": ["/path/to/n2-soul/index.js"]
    },
    "n2-arachne": {
      "command": "node",
      "args": ["/path/to/n2-arachne/dist/index.js"],
      "env": {
        "ARACHNE_PROJECT_DIR": "/path/to/your/project"
      }
    }
  }
}

Zero extra config needed! Register both servers in the same MCP config and AI automatically uses both tools.
Soul remembers past session work and decisions
Arachne finds the exact code and delivers it to AI
Result: AI picks up right where you left off — no "what was I working on?"

Full N2 Stack (Soul + Arachne + QLN)

{
  "mcpServers": {
    "n2-soul": {
      "command": "node",
      "args": ["/path/to/n2-soul/index.js"]
    },
    "n2-arachne": {
      "command": "node",
      "args": ["/path/to/n2-arachne/dist/index.js"],
      "env": {
        "ARACHNE_PROJECT_DIR": "/path/to/your/project"
      }
    },
    "n2-qln": {
      "command": "node",
      "args": ["/path/to/n2-qln/index.js"]
    }
  }
}

Add QLN, and even with 100+ MCP tools registered, AI automatically finds and uses only the tools it needs through QLN's semantic routing.

N2 Ecosystem — Better Together

| Package | Role | npm | Standalone |
|---------|------|-----|:----------:|
| QLN | Tool routing (1000+ tools → 1 router) | n2-qln | Yes |
| Soul | Agent memory & session management | n2-soul | Yes |
| Ark | Security policies & code verification | n2-ark | Yes |
| Arachne | Code context auto-assembly | n2-arachne | Yes |

Every package works 100% standalone. But when combined, magic happens:

Synergy: How They Work Together

User: "Fix the login timeout bug"
     │
     ▼
┌─── QLN (Router) ──────────────────────────────────────┐
│ 1000+ tools → Semantic routing finds:                 │
│   → n2_arachne.assemble (context)                     │
│   → n2_arachne.search (code search)                   │
│ Token cost: 2 tool defs instead of 1000+              │
└────────────────┬──────────────────────────────────────┘
                 │
                 ▼
┌─── Arachne (Context) ─────────────────────────────────┐
│ L1: Project tree overview                              │
│ L2: auth/login.ts (current file)                       │
│ L3: BM25 + semantic search → timeout-related code      │
│     + dependency chain: login.ts → api.ts → http.ts    │
│ L4: Frequently accessed config files                   │
│ → 30K tokens of perfectly curated context              │
└────────────────┬──────────────────────────────────────┘
                 │
                 ▼
┌─── Soul (Memory) ─────────────────────────────────────┐
│ "Last session, Rose fixed a similar timeout in         │
│  api.ts line 47. Decision: increased to 30s."          │
│ → Past context + decisions + handoff notes             │
│ → KV-Cache: instant session restoration                │
└────────────────┬──────────────────────────────────────┘
                 │
                 ▼
┌─── Ark (Security) ────────────────────────────────────┐
│  No hardcoded credentials in generated code          │
│  Timeout value from config, not magic number         │
│  Error handling follows project conventions           │
│ → Code verification before commit                      │
└───────────────────────────────────────────────────────┘

Solo vs Combined

| Scenario | Solo | Combined |
|----------|------|----------|
| Token usage | AI sees all 1000+ tools | QLN routes → AI sees 2-3 tools |
| Context quality | AI guesses which files matter | Arachne provides exact relevant code |
| Memory | AI forgets everything each turn | Soul remembers past sessions + decisions |
| Code safety | No guardrails | Ark validates before deploy |
| Setup | Each tool works independently | Zero extra config (auto-detection) |

Real-World Impact

QLN + Arachne: QLN routes the request to Arachne → Arachne provides perfect context → AI generates accurate code on the first try. No more "which file was that in?"
Soul + Arachne: Soul remembers what you worked on last session → Arachne indexes those files with higher priority → continuity across sessions
Ark + Arachne: Arachne provides code context → AI generates code → Ark validates it follows project patterns. Catch bugs before they ship.
All 4 together: The AI becomes a team member who remembers everything, finds anything, uses the right tools, and follows the rules.

License

This project is dual-licensed:

| Use Case | License | Cost |
| --- | --- | --- |
| Personal / Educational | Apache 2.0 | Free |
| Open-source (non-commercial) | Apache 2.0 | Free |
| Commercial / Enterprise | Commercial License | Contact us |

See LICENSE for full details.


Arachne, the greatest weaver. Your code, perfectly woven.