@echoes-io/rag
v1.4.0
Semantic search and context retrieval system for Echoes.io storytelling platform
Overview
The RAG (Retrieval-Augmented Generation) system provides semantic search capabilities across all Echoes timeline content, enabling intelligent context retrieval for AI-powered storytelling assistance.
Features
- Vector Embeddings: Generate embeddings for chapters using Gemini or local models (Qwen3, Nomic, BGE, E5)
- Semantic Search: Find similar content across timelines and characters
- Character Extraction: Automatic NER (Named Entity Recognition) to extract character names
- Context Retrieval: Intelligent context selection for AI interactions
- Timeline Awareness: Search within specific timelines or across all content
- Character Filtering: Find chapters by characters present
- LanceDB Storage: Lightweight, file-based vector database with cosine similarity search
Architecture
RAG System (LlamaIndexTS + LanceDB)
├── LlamaIndexTS - Document indexing and retrieval framework
├── LanceDB - High-performance vector database with ANN search
├── NER Extractor - Extract character names from content
└── Search API - Query interface with semantic + metadata filtering
Installation
npm install @echoes-io/rag
Usage
Basic Setup
import { RAGSystem } from '@echoes-io/rag';
const rag = new RAGSystem({
  provider: 'qwen3', // or 'nomic-embed', 'bge-base', 'e5-large', 'e5-small', 'gemini'
  geminiApiKey: process.env.GEMINI_API_KEY, // Only required for 'gemini'
  dbPath: './lancedb'
});
Indexing Chapters
await rag.addChapter({
  id: 'anima-discovery-03-05',
  metadata: {
    pov: 'nic',
    title: 'Sotto le stelle',
    timeline: 'anima',
    arc: 'discovery',
    // ... other metadata
  },
  content: 'Nic guardò Alex negli occhi. Sara sorrise...'
});
// Characters are automatically extracted: ['Nic', 'Alex', 'Sara']
Semantic Search
// Search across all timelines
const results = await rag.search('passionate encounter in London');
// Search within specific timeline
const erosResults = await rag.search('work dynamics', {
  timeline: 'eros'
});
// Search by POV
const nicChapters = await rag.search('internal conflict', {
  pov: 'nic'
});
Character-Based Search
// Find chapters where specific characters appear
const bothPresent = await rag.search('romantic moment', {
  characters: ['Nic', 'Alex'],
  allCharacters: true // Both must be present (AND)
});
// Find chapters with any of the characters (OR)
const anyPresent = await rag.search('conflict', {
  characters: ['Nic', 'Alex'],
  allCharacters: false // At least one present (default)
});
// Get all characters that interact with a specific character
const characters = await rag.getCharacterMentions('Nic');
// → ['Alex', 'Sara', 'Marco', ...]
Context Retrieval for AI
// Get relevant context for AI interaction
const context = await rag.getContext({
  query: 'relationship development',
  timeline: 'anima',
  maxChapters: 5,
  characters: ['Nic', 'Alex']
});
Use Cases
Story Queries
// "In which chapter do X and Y kiss?"
await rag.search("kiss", {
  characters: ['Nic', 'Alex'],
  allCharacters: true
});
// "What's the relationship between X and Y?"
await rag.search("relationship Nic Alex", {
  characters: ['Nic', 'Alex']
});
// "In which chapter does XYZ happen?"
await rag.search("xyz event description");
// "Who does character X interact with?"
await rag.getCharacterMentions('Nic');
Configuration
const config = {
  provider: 'qwen3', // 'qwen3', 'nomic-embed', 'bge-base', 'e5-large', 'e5-small', or 'gemini'
  geminiApiKey: process.env.GEMINI_API_KEY, // Required for 'gemini' provider
  dbPath: './lancedb', // LanceDB directory
  maxResults: 10, // Default max results
  storeFullContent: true // true = store full content (default)
};
Embedding Providers
- qwen3 - Qwen3-Embedding-0.6B-ONNX (1024 dimensions, state-of-the-art multilingual performance on MTEB)
- nomic-embed - Nomic Embed v1 (768 dimensions, excellent accuracy, 86.2% benchmark score)
- bge-base - BGE-Base-v1.5 (768 dimensions, balanced performance, 84.7% accuracy)
- e5-large - E5 multilingual large (1024 dimensions, robust multilingual)
- e5-small - E5 multilingual small (384 dimensions, fast, offline, good baseline)
- gemini - Google's gemini-embedding-001 (768 dimensions, API required)
All local embeddings run via HuggingFace Transformers.js and don't require API keys.
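A small guard mirroring the constraint above (only the `gemini` provider needs an API key) might look like the following. `validateRagConfig` and `RagConfigSketch` are hypothetical names for illustration, not part of the package's API:

```typescript
// Hypothetical config guard: only the 'gemini' provider requires an API key;
// all local providers run via Transformers.js without one.
type Provider = 'qwen3' | 'nomic-embed' | 'bge-base' | 'e5-large' | 'e5-small' | 'gemini';

interface RagConfigSketch {
  provider: Provider;
  geminiApiKey?: string;
  dbPath?: string;
}

function validateRagConfig(config: RagConfigSketch): string[] {
  const errors: string[] = [];
  if (config.provider === 'gemini' && !config.geminiApiKey) {
    errors.push("provider 'gemini' requires geminiApiKey");
  }
  return errors;
}

console.log(validateRagConfig({ provider: 'e5-small' })); // [] — local model, no key needed
console.log(validateRagConfig({ provider: 'gemini' }));   // ["provider 'gemini' requires geminiApiKey"]
```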
Character Extraction (NER)
The system automatically extracts character names from chapter content using Named Entity Recognition:
- Model: Xenova/bert-base-multilingual-cased-ner-hrl
- Language Support: Multilingual (including Italian)
- Performance: ~100-200ms per chapter (cached after first extraction)
- Automatic: No configuration needed, always enabled
Characters are stored in metadata.characterNames and can be used for filtering.
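Token-classification models like the one above emit B-PER/I-PER word pieces rather than whole names, so extraction needs a merge step. The sketch below illustrates that post-processing under the token shape Transformers.js typically returns; the package's actual implementation may differ:

```typescript
// Merge B-PER/I-PER tokens from a token-classification (NER) model into
// full person names. Illustrative only — not the package's internal code.
interface NerToken { entity: string; word: string }

function extractPersonNames(tokens: NerToken[]): string[] {
  const names: string[] = [];
  let current = '';
  for (const t of tokens) {
    if (t.entity === 'B-PER') {
      if (current) names.push(current); // flush previous name
      current = t.word;
    } else if (t.entity === 'I-PER' && current) {
      // '##' marks a WordPiece continuation of the previous token
      current += t.word.startsWith('##') ? t.word.slice(2) : ` ${t.word}`;
    } else {
      if (current) names.push(current);
      current = '';
    }
  }
  if (current) names.push(current);
  return [...new Set(names)]; // de-duplicate repeated mentions
}

const tokens = [
  { entity: 'B-PER', word: 'Nic' },
  { entity: 'O', word: 'guardò' },
  { entity: 'B-PER', word: 'Alex' },
  { entity: 'O', word: '.' },
  { entity: 'B-PER', word: 'Sara' },
];
console.log(extractPersonNames(tokens)); // ['Nic', 'Alex', 'Sara']
```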
API Reference
RAGSystem
Main class for semantic search operations.
Constructor
new RAGSystem(config: RAGConfig)
Methods
search(query, options?)
Semantic search across content.
search(
  query: string,
  options?: {
    timeline?: string,
    arc?: string,
    pov?: string,
    maxResults?: number,
    characters?: string[], // Filter by characters present
    allCharacters?: boolean // true = AND, false = OR
  }
): Promise<SearchResult[]>
getContext(options)
Retrieve context for AI interactions.
getContext(options: {
  query: string,
  timeline?: string,
  arc?: string,
  pov?: string,
  maxChapters?: number,
  characters?: string[]
}): Promise<SearchResult[]>
addChapter(chapter)
Add single chapter to vector database (with automatic character extraction).
addChapter(chapter: EmbeddingChapter): Promise<void>
addChapters(chapters)
Batch add chapters.
addChapters(chapters: EmbeddingChapter[]): Promise<void>
deleteChapter(id)
Remove chapter from database.
deleteChapter(id: string): Promise<void>
getCharacterMentions(characterName)
Get all characters that appear in chapters with the specified character.
getCharacterMentions(characterName: string): Promise<string[]>
Types
interface SearchOptions {
  timeline?: string;
  arc?: string;
  pov?: string;
  maxResults?: number;
  characters?: string[];
  allCharacters?: boolean;
}

interface ContextOptions {
  query: string;
  timeline?: string;
  arc?: string;
  pov?: string;
  maxChapters?: number;
  characters?: string[];
}

interface SearchResult {
  id: string;
  metadata: ChapterWithCharacters;
  content: string;
  similarity: number; // 0-1 cosine similarity
}

interface ChapterWithCharacters extends Chapter {
  characterNames?: string[]; // Extracted by NER
}

interface RAGConfig {
  provider: 'qwen3' | 'nomic-embed' | 'bge-base' | 'e5-large' | 'e5-small' | 'gemini';
  geminiApiKey?: string;
  openaiApiKey?: string;
  dbPath?: string;
  maxResults?: number;
  storeFullContent?: boolean;
}
Storage
The system uses LanceDB for vector storage with optimized ANN (Approximate Nearest Neighbor) search. This approach is:
- High Performance: Native ANN indices (IVF, HNSW) for fast similarity search
- Scalable: Handles thousands to millions of vectors efficiently
- File-based: Directory-based storage, portable and easy to backup
- Zero-copy: Efficient memory usage with columnar storage
- Type-safe: Full TypeScript support via LlamaIndexTS
The LanceDB directory contains embeddings, metadata (including extracted characters), and content for all indexed chapters.
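The similarity score returned on each SearchResult is a cosine similarity between the query embedding and a chapter's stored embedding. The metric itself is simple; this is an illustrative implementation, not the database's internal code:

```typescript
// Cosine similarity between two embedding vectors — the metric behind the
// 0-1 similarity score on SearchResult. Scores near 1 mean near-identical meaning.
function cosineSimilarity(a: number[], b: number[]): number {
  let dot = 0, normA = 0, normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

console.log(cosineSimilarity([1, 0], [1, 0])); // 1 — identical direction
console.log(cosineSimilarity([1, 0], [0, 1])); // 0 — orthogonal, unrelated
```

LanceDB avoids computing this exhaustively over every vector by using ANN indices, which trade a tiny amount of recall for large speedups at scale.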
Database Management
- Test databases: Ignored in git (patterns: test-*, *-test*)
- Production databases: Should be committed in timeline repositories (e.g., lancedb/, rag-db/)
- Backup: Simply copy the LanceDB directory
Development
# Install dependencies
npm install
# Build
npm run build
# Test
npm test
# Lint
npm run lint
Integration
With MCP Server
The RAG system integrates with @echoes-io/mcp-server to provide context-aware AI tools:
- semantic-search - Find similar content
- get-context - Retrieve relevant chapters for AI prompts
- find-echoes - Discover narrative connections across timelines
With Timeline Content
Automatically processes content from timeline repositories:
- timeline-anima/content/
- timeline-eros/content/
- timeline-bloom/content/
Performance
Embedding Model Comparison
| Model | Accuracy | Speed | Dimensions | Best For |
|-------|----------|-------|------------|----------|
| qwen3 | 70.58 MTEB | Medium | 1024 | Best overall, multilingual |
| nomic-embed | 86.2% | Slow | 768 | High accuracy tasks |
| bge-base | 84.7% | Medium | 768 | Balanced performance |
| e5-large | 83.5% | Fast | 1024 | Good multilingual baseline |
| e5-small | 78.1% | Very Fast | 384 | Speed-critical applications |
- Indexing: ~100-200ms per chapter (including NER)
- Search: <50ms for typical queries
- Character Extraction: Cached after first run
- Database: Suitable for 100-1000+ chapters
License
MIT
Echoes - Multi-POV storytelling platform ✨
