🦎 RAG-lite TS
Simple by default, powerful when needed
Local-first semantic search that actually works
Quick Start • Features • Documentation • Examples • MCP Integration
🎯 Why RAG-lite TS?
Stop fighting with complex RAG frameworks. Get semantic search running in 30 seconds:
npm install -g rag-lite-ts
raglite ingest ./docs/
raglite search "your query here"That's it. No API keys, no cloud services, no configuration hell.
🎬 See It In Action
// 1. Ingest your docs
const pipeline = new IngestionPipeline('./db.sqlite', './index.bin');
await pipeline.ingestDirectory('./docs/');
// 2. Search semantically
const search = new SearchEngine('./index.bin', './db.sqlite');
const results = await search.search('authentication flow');
// 3. Get relevant results instantly
console.log(results[0].text);
// "To authenticate users, first obtain a JWT token from the /auth endpoint..."Real semantic understanding - not just keyword matching. Finds "JWT token" when you search for "authentication flow".
What Makes It Different?
- 🏠 100% Local - Your data never leaves your machine
- 🚀 Actually Fast - Sub-100ms queries, not "eventually consistent"
- 🦎 Chameleon Architecture - Automatically adapts between text and multimodal modes
- 🖼️ True Multimodal - Search images with text, text with images (CLIP unified space)
- 📦 Self-Contained - No Python, no Docker, no external services
- 🎯 TypeScript Native - Full type safety, modern ESM architecture
- 🔌 MCP Ready - Built-in Model Context Protocol server for AI agents

🎉 What's New in 2.0
Chameleon Multimodal Architecture - RAG-lite TS now seamlessly adapts between text-only and multimodal search:
🖼️ Multimodal Search
- CLIP Integration - Unified 512D embedding space for text and images
- Cross-Modal Search - Find images with text queries, text with image queries
- Image-to-Text Generation - Automatic descriptions using vision-language models
- Smart Reranking - Text-derived, metadata-based, and hybrid strategies
🏗️ Architecture Improvements
- Layered Architecture - Clean separation: core (model-agnostic) → implementation (text/multimodal) → public API
- Mode Persistence - Configuration stored in database, auto-detected during search
- Unified Content System - Memory-based ingestion for AI agents, format-adaptive retrieval
- Simplified APIs - `createEmbedder()` and `createReranker()` replace complex factory patterns (see the sketch below)
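A minimal sketch of the simplified factories; the option names here are assumptions rather than the documented signatures, so check the API Reference for the real options:

```typescript
import { createEmbedder, createReranker } from 'rag-lite-ts';

// Hypothetical usage — the option names are assumptions, not the documented API.
const embedder = await createEmbedder({ mode: 'multimodal' });
const reranker = await createReranker({ strategy: 'text-derived' });
```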
🤖 MCP Server Enhancements
- Multimodal Tools - `multimodal_search` and `ingest_image` with URL download
- Base64 Image Delivery - Automatic encoding for AI agent integration
- Content-Type Filtering - Filter results by text, image, pdf, docx
- Dynamic Tool Descriptions - Context-aware tool documentation
📦 Migration from 1.x
Existing databases need schema updates for multimodal support. Two options:
- Automatic Migration: Use the `migrateToRagLiteStructure()` function (see the sketch below)
- Fresh Start: Re-ingest content with v2.0.0
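A hedged sketch of the automatic path; the real signature may differ, so treat this as pseudocode and confirm against the changelog:

```typescript
import { migrateToRagLiteStructure } from 'rag-lite-ts';

// Hypothetical call — the parameters are an assumption; see CHANGELOG.md for the actual usage.
await migrateToRagLiteStructure('./db.sqlite');
```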
See CHANGELOG.md for complete details.
📋 Table of Contents
- Why RAG-lite TS?
- Quick Start
- Features
- Real-World Examples
- How It Works
- Supported Models
- Documentation
- MCP Server Integration
- Development
- Contributing
- License
🚀 Quick Start
Installation
npm install -g rag-lite-ts
Basic Usage
# Ingest documents
raglite ingest ./docs/
# Search your documents
raglite search "machine learning concepts"
# Get more results with reranking
raglite search "API documentation" --top-k 10 --rerankUsing Different Models
# Use higher quality model (auto-rebuilds if needed)
raglite ingest ./docs/ --model Xenova/all-mpnet-base-v2 --rebuild-if-needed
# Search automatically uses the correct model
raglite search "complex query"Content Retrieval and MCP Integration
import { SearchEngine, IngestionPipeline } from 'rag-lite-ts';
// Memory-based ingestion for AI agents
const pipeline = new IngestionPipeline('./db.sqlite', './index.bin');
const content = Buffer.from('Document from AI agent');
await pipeline.ingestFromMemory(content, {
displayName: 'agent-document.txt'
});
// Format-adaptive content retrieval
const search = new SearchEngine('./index.bin', './db.sqlite');
const results = await search.search('query');
// Get file path for CLI clients
const filePath = await search.getContent(results[0].contentId, 'file');
// Get base64 content for MCP clients
const base64 = await search.getContent(results[0].contentId, 'base64');
Multimodal Search (Text + Images)
RAG-lite TS now supports true multimodal search using CLIP's unified embedding space, enabling cross-modal search between text and images:
# Enable multimodal processing for text and image content
raglite ingest ./docs/ --mode multimodal
# Cross-modal search: Find images using text queries
raglite search "architecture diagram" --content-type image
raglite search "red sports car" --content-type image
# Find text documents about visual concepts
raglite search "user interface design" --content-type text
# Search across both content types (default)
raglite search "system overview"
# Use different reranking strategies for optimal results
raglite ingest ./docs/ --mode multimodal --rerank-strategy text-derived
Key Features:
- Unified embedding space: Text and images embedded in the same 512-dimensional CLIP space
- Cross-modal search: Text queries find semantically similar images
- Automatic mode detection: Set mode once during ingestion, automatically detected during search
- Multiple reranking strategies: text-derived, metadata, hybrid, or disabled
- Seamless experience: Same CLI commands work for both text-only and multimodal content
→ Complete Multimodal Tutorial
Programmatic Usage
import { SearchEngine, IngestionPipeline } from 'rag-lite-ts';
// Text-only mode (default)
const ingestion = new IngestionPipeline('./db.sqlite', './vector-index.bin');
await ingestion.ingestDirectory('./docs/');
// Multimodal mode (text + images)
const multimodalIngestion = new IngestionPipeline('./db.sqlite', './index.bin', {
mode: 'multimodal',
embeddingModel: 'Xenova/clip-vit-base-patch32',
rerankingStrategy: 'text-derived'
});
await multimodalIngestion.ingestDirectory('./mixed-content/');
// Search (mode auto-detected from database)
const search = new SearchEngine('./vector-index.bin', './db.sqlite');
const results = await search.search('machine learning', { top_k: 10 });
// Cross-modal search in multimodal mode
const imageResults = results.filter(r => r.contentType === 'image');
const textResults = results.filter(r => r.contentType === 'text');
Memory Ingestion & Unified Content System (NEW)
// Ingest content directly from memory (perfect for MCP integration)
const content = Buffer.from('# AI Guide\n\nComprehensive AI concepts...');
const contentId = await ingestion.ingestFromMemory(content, {
displayName: 'AI Guide.md',
contentType: 'text/markdown'
});
// Retrieve content in different formats based on client needs
const filePath = await search.getContent(contentId, 'file'); // For CLI clients
const base64Data = await search.getContent(contentId, 'base64'); // For MCP clients
// Batch content retrieval for efficiency
const contentIds = ['id1', 'id2', 'id3'];
const contents = await search.getContentBatch(contentIds, 'base64');
// Content management with deduplication
const stats = await ingestion.getStorageStats();
console.log(`Content directory: ${stats.contentDirSize} bytes, ${stats.fileCount} files`);
// Cleanup orphaned content
const cleanupResult = await ingestion.cleanup();
console.log(`Removed ${cleanupResult.removedFiles} orphaned files`);
Configuration Options
import { SearchEngine, IngestionPipeline } from 'rag-lite-ts';
// Custom model configuration
const search = new SearchEngine('./vector-index.bin', './db.sqlite', {
embeddingModel: 'Xenova/all-mpnet-base-v2',
enableReranking: true,
topK: 15
});
// Ingestion with custom settings
const ingestion = new IngestionPipeline('./db.sqlite', './vector-index.bin', {
embeddingModel: 'Xenova/all-mpnet-base-v2',
chunkSize: 400,
chunkOverlap: 80
});
→ Complete CLI Reference | API Documentation
💡 Real-World Examples
import { SearchEngine, IngestionPipeline } from 'rag-lite-ts';
// Ingest your docs once
const pipeline = new IngestionPipeline('./db.sqlite', './index.bin');
await pipeline.ingestDirectory('./docs/');
// Search instantly
const search = new SearchEngine('./index.bin', './db.sqlite');
const results = await search.search('authentication flow');
results.forEach(r => {
console.log(`${r.metadata.title}: ${r.text}`);
console.log(`Relevance: ${r.score.toFixed(3)}\n`);
});
Use case: Internal documentation, API references, knowledge bases
# Ingest mixed content (text + images)
raglite ingest ./assets/ --mode multimodal
# Find images using text descriptions
raglite search "architecture diagram" --content-type image
raglite search "team photo" --content-type image
raglite search "product screenshot" --content-type imageUse case: Digital asset management, photo libraries, design systems
// Agent ingests conversation context
const content = Buffer.from('User prefers dark mode. Uses TypeScript.');
await pipeline.ingestFromMemory(content, {
displayName: 'user-preferences.txt'
});
// Later, agent retrieves relevant context
const context = await search.search('user interface preferences');
// Agent now knows: "User prefers dark mode"
Use case: Chatbots, AI assistants, context-aware agents
// Index your codebase
await pipeline.ingestDirectory('./src/', {
chunkSize: 500, // Larger chunks for code
chunkOverlap: 100
});
// Find code by intent, not keywords
const results = await search.search('authentication middleware');
// Finds relevant code even if it doesn't contain those exact words
Use case: Code navigation, refactoring, onboarding
{
"mcpServers": {
"my-docs": {
"command": "raglite-mcp",
"env": {
"RAG_DB_FILE": "./docs/db.sqlite",
"RAG_INDEX_FILE": "./docs/index.bin"
}
}
}
}
Now Claude can search your docs directly! Works with any MCP-compatible AI tool.
Use case: AI-powered documentation, intelligent assistants
✨ Features
🎯 Developer Experience
- One-line setup - `new SearchEngine()` just works
- TypeScript native - Full type safety
- Zero config - Sensible defaults everywhere
- Hackable - Clean architecture, easy to extend
🚀 Performance
- Sub-100ms queries - Fast vector search
- Offline-first - No network calls
- Efficient chunking - Smart semantic boundaries
- Optimized models - Multiple quality/speed options
🦎 Chameleon Architecture
- Auto-adapting - Text or multimodal mode
- Mode persistence - Set once, auto-detected
- No fallbacks - Reliable or clear failure
- Polymorphic runtime - Same API, different modes
🖼️ Multimodal Search
- CLIP unified space - Text and images together
- Cross-modal queries - Text finds images, vice versa
- Multiple strategies - Text-derived, metadata, hybrid
- Seamless experience - Same commands, more power
🔌 Integration Ready
- MCP server included - AI agent integration
- Memory ingestion - Direct buffer processing
- Format-adaptive - File paths or base64 data
- Multi-instance - Run multiple databases
🛠️ Production Ready
- Content management - Deduplication, cleanup
- Model compatibility - Auto-detection, rebuilds
- Error recovery - Clear messages, helpful hints
📁 Supported File Formats
RAG-lite TS supports the following file formats with full processing implementations:
Text Mode:
- Markdown: `.md`, `.mdx`
- Plain text: `.txt`
- Documents: `.pdf`, `.docx`
Multimodal Mode (includes all text formats plus):
- Images: `.jpg`, `.jpeg`, `.png`, `.gif`, `.webp`, `.bmp`
All formats work seamlessly with both single file and directory ingestion:
# Single file ingestion
raglite ingest ./document.pdf
raglite ingest ./readme.md
raglite ingest ./notes.txt
# Directory ingestion (processes all supported formats)
raglite ingest ./docs/
# Multimodal ingestion (includes images)
raglite ingest ./mixed-content/ --mode multimodal
🔧 How It Works
RAG-lite TS follows a clean, efficient pipeline:
📄 Documents → 🧹 Preprocessing → ✂️ Chunking → 🧠 Embedding → 💾 Storage
🎯 Results ← 🔄 Reranking ← 🔍 Vector Search ← 🧠 Query Embedding ← ❓ Query
Pipeline Steps
| Step | What Happens | Technologies |
|------|--------------|--------------|
| 1. Ingestion | Reads .md, .txt, .pdf, .docx, images | Native parsers |
| 2. Preprocessing | Cleans JSX, Mermaid, code blocks, generates image descriptions | Custom processors |
| 3. Chunking | Splits at natural boundaries with token limits | Semantic chunking |
| 4. Embedding | Converts text/images to vectors | transformers.js |
| 5. Storage | Indexes vectors, stores metadata | hnswlib + SQLite |
| 6. Search | Finds similar chunks via cosine similarity | HNSW algorithm |
| 7. Reranking | Re-scores results for relevance | Cross-encoder/metadata |
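For intuition on step 6, here is a minimal sketch of the cosine similarity that vector search is built on. It is illustrative only: RAG-lite TS delegates the actual nearest-neighbour lookup to hnswlib's HNSW index rather than scanning every chunk.

```typescript
// Cosine similarity between a query embedding and one chunk embedding.
function cosineSimilarity(a: number[], b: number[]): number {
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Brute-force ranking for illustration; the HNSW index avoids comparing every vector.
function rankBySimilarity(query: number[], chunks: number[][]): number[] {
  return chunks
    .map((vec, index) => ({ index, score: cosineSimilarity(query, vec) }))
    .sort((a, b) => b.score - a.score)
    .map(({ index }) => index);
}
```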
🦎 Chameleon Architecture
The system automatically adapts based on your content:
📝 Text Mode
Text Docs → Sentence Transformer
↓
384D Vectors
↓
HNSW Index + SQLite
↓
Cross-Encoder Reranking
Best for: Documentation, articles, code
🖼️ Multimodal Mode
Text + Images → CLIP Embedder
↓
512D Unified Space
↓
HNSW Index + SQLite
↓
Text-Derived Reranking
Best for: Mixed content, visual search
🎯 Key Benefits:
- Set mode once during ingestion → Auto-detected during search
- Cross-modal search - Text queries find images, image queries find text
- No fallback complexity - Each mode works reliably or fails clearly
- Same API - Your code doesn't change between modes
→ Document Preprocessing Guide | Model Management Details
🧠 Supported Models
Choose the right model for your use case:
📝 Text Mode Models
| Model | Dims | Speed | Quality | Best For |
|-------|------|-------|---------|----------|
| sentence-transformers/all-MiniLM-L6-v2 ⭐ | 384 | ⚡⚡⚡ | ⭐⭐⭐ | General purpose (default) |
| Xenova/all-mpnet-base-v2 | 768 | ⚡⚡ | ⭐⭐⭐⭐ | Complex queries, higher accuracy |
🖼️ Multimodal Models
| Model | Dims | Speed | Quality | Best For |
|-------|------|-------|---------|----------|
| Xenova/clip-vit-base-patch32 ⭐ | 512 | ⚡⚡ | ⭐⭐⭐ | Text + images (default) |
| Xenova/clip-vit-base-patch16 | 512 | ⚡ | ⭐⭐⭐⭐ | Higher visual quality |
✨ Model Features
- ✅ Auto-download - Models cached locally on first use
- ✅ Smart compatibility - Detects model changes, prompts rebuilds
- ✅ Offline support - Pre-download models for air-gapped environments (see the sketch below)
- ✅ Zero config - Works out of the box with sensible defaults
- ✅ Cross-modal - CLIP enables text ↔ image search
→ Complete Model Guide | Performance Benchmarks
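As a hedged illustration of the offline workflow, the sketch below assumes RAG-lite TS picks up the shared transformers.js environment settings; that assumption, and the exact variables, should be verified against the Model Guide.

```typescript
// Hypothetical air-gapped setup — assumes RAG-lite TS honours the shared
// transformers.js environment settings, which is an assumption on our part.
import { env } from '@xenova/transformers';

env.cacheDir = './models-cache';   // directory pre-populated on a machine with internet access
env.allowRemoteModels = false;     // fail fast instead of attempting a download
```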
📚 Documentation
🚀 Getting Started
🔧 Advanced
🛠️ Support
🎯 Quick Start by Role
| I want to... | Start here |
|--------------|------------|
| 🆕 Try it out | CLI Reference → npm i -g rag-lite-ts |
| 🖼️ Search images | Multimodal Tutorial → --mode multimodal |
| 💻 Build an app | API Reference → new SearchEngine() |
| 🤖 Integrate with AI | MCP Guide → raglite-mcp |
| ⚡ Optimize performance | Model Guide → Choose your model |
| 🐛 Fix an issue | Troubleshooting → Common solutions |
🔌 MCP Server Integration
Give your AI agents semantic memory. RAG-lite TS includes a built-in Model Context Protocol (MCP) server.
# Start MCP server (works with Claude, Cline, and other MCP clients)
raglite-mcp
Single Instance Configuration
MCP Configuration:
{
"mcpServers": {
"rag-lite": {
"command": "raglite-mcp",
"args": []
}
}
}
Multiple Instance Configuration (NEW)
Run multiple MCP server instances for different databases with intelligent routing:
{
"mcpServers": {
"rag-lite-text-docs": {
"command": "npx",
"args": ["rag-lite-mcp"],
"env": {
"RAG_DB_FILE": "./text-docs/db.sqlite",
"RAG_INDEX_FILE": "./text-docs/index.bin"
}
},
"rag-lite-multimodal-images": {
"command": "npx",
"args": ["rag-lite-mcp"],
"env": {
"RAG_DB_FILE": "./mixed-content/db.sqlite",
"RAG_INDEX_FILE": "./mixed-content/index.bin"
}
}
}
}
Dynamic Tool Descriptions: Each server automatically detects and advertises its capabilities:
- `[TEXT MODE]` - Text-only databases clearly indicate supported file types
- `[MULTIMODAL MODE]` - Multimodal databases advertise image support and cross-modal search
- AI assistants can intelligently route queries to the appropriate database
Available Tools: search, ingest, ingest_image, multimodal_search, rebuild_index, get_stats, get_mode_info, list_supported_models, list_reranking_strategies, get_system_stats
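To illustrate how an agent invokes these, the sketch below shows the generic MCP `tools/call` shape for the `search` tool; the argument names (`query`, `top_k`) are assumptions rather than the documented input schema, so consult the MCP Integration Guide for the real tool inputs.

```typescript
// Illustrative MCP tools/call request an AI client might send to raglite-mcp.
// Argument names are assumptions, not the documented schema.
const searchRequest = {
  jsonrpc: '2.0',
  id: 1,
  method: 'tools/call',
  params: {
    name: 'search',                                        // one of the tools listed above
    arguments: { query: 'authentication flow', top_k: 5 }, // assumed argument names
  },
};

console.log(JSON.stringify(searchRequest, null, 2));
```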
Multimodal Features:
- Search across text and image content
- Retrieve image content as base64 data
- Cross-modal search capabilities (text queries find images)
- Automatic mode detection from database
- Content type filtering
- Multiple reranking strategies
→ Complete MCP Integration Guide | MCP Multimodal Guide | Multi-Instance Setup
🛠️ Development
Building from Source
# Clone and setup
git clone https://github.com/your-username/rag-lite-ts.git
cd rag-lite-ts
npm install
# Build and link for development
npm run build
npm link # Makes raglite/raglite-mcp available globally
# Run tests
npm test
npm run test:integration
Project Structure
src/
├── index.ts # Main exports and factory functions
├── search.ts # Public SearchEngine API
├── ingestion.ts # Public IngestionPipeline API
├── core/ # Model-agnostic core layer
│ ├── search.ts # Core search engine
│ ├── ingestion.ts # Core ingestion pipeline
│ ├── db.ts # SQLite operations
│ ├── config.ts # Configuration system
│ ├── content-manager.ts # Content storage and management
│ └── types.ts # Core type definitions
├── text/ # Text-specific implementations
│ ├── embedder.ts # Sentence-transformer embedder
│ ├── reranker.ts # Cross-encoder reranking
│ └── tokenizer.ts # Text tokenization
├── multimodal/ # Multimodal implementations
│ ├── embedder.ts # CLIP embedder (text + images)
│ ├── reranker.ts # Text-derived and metadata reranking
│ ├── image-processor.ts # Image description and metadata
│ └── content-types.ts # Content type detection
├── cli.ts # CLI interface
├── mcp-server.ts # MCP server
└── preprocessors/ # Content type processors
dist/ # Compiled output
Design Philosophy
Simple by default, powerful when needed:
- ✅ Simple constructors work immediately with sensible defaults
- ✅ Configuration options available when you need customization
- ✅ Advanced patterns available for complex use cases
- ✅ Clean architecture with minimal dependencies
- ✅ No ORMs or heavy frameworks - just TypeScript and SQLite
- ✅ Extensible design for future capabilities
This approach ensures that basic usage is effortless while providing the flexibility needed for advanced scenarios.
🤝 Contributing
We welcome contributions! Whether it's:
- 🐛 Bug fixes
- ✨ New features
- 📝 Documentation improvements
- 🧪 Test coverage
- 💡 Ideas and suggestions
Guidelines:
- Fork the repository
- Create a feature branch (`git checkout -b feature/amazing-feature`)
- Make your changes with tests
- Ensure all tests pass (`npm test`)
- Submit a pull request
We maintain clean architecture principles while enhancing functionality and developer experience.
🎯 Why We Built This
Existing RAG solutions are either:
- 🔴 Too complex - Require extensive setup and configuration
- 🔴 Cloud-dependent - Need API keys and external services
- 🔴 Python-only - Not ideal for TypeScript/Node.js projects
- 🔴 Heavy - Massive dependencies and slow startup
RAG-lite TS is different:
- ✅ Simple - Works out of the box with zero config
- ✅ Local-first - Your data stays on your machine
- ✅ TypeScript native - Built for modern JS/TS projects
- ✅ Lightweight - Fast startup, minimal dependencies
🙏 Acknowledgments
Built with amazing open-source projects:
- transformers.js - Client-side ML models by Xenova
- hnswlib - Fast approximate nearest neighbor search
- better-sqlite3 - Fast SQLite3 bindings
📄 License
MIT License - see LICENSE file for details.
⭐ Star us on GitHub — it helps!
Report Bug • Request Feature • Documentation
Made with ❤️ by developers, for developers
