# Flux-Vector: Semantic Search with HNSW and Embeddings
A lightweight, browser-based semantic search library that combines the Mememo HNSW (Hierarchical Navigable Small World) index with transformer-based embeddings for fast, accurate vector similarity search.
## Features
- 🚀 Fast Vector Search: Optimized HNSW algorithm with LRU caching and cross-layer prefetching (~75% faster queries)
- 🧠 Semantic Understanding: Uses transformer models to create meaningful text embeddings
- 💾 Persistent Storage: IndexedDB integration with incremental saves (~90% faster for updates)
- 🔧 Flexible Configuration: Bring your own embedding model or use the default
- 📦 Modular Architecture: Clean separation between indexing, embeddings, and storage
- ⚡ Production-Ready: Battle-tested optimizations eliminate race conditions and memory bloat
## Architecture
The library consists of four main components working together:
```
┌─────────────────────────────────────────────────────────────────┐
│                      VectorSearchManager                        │
│  ┌──────────────┐   ┌──────────────┐   ┌──────────────┐         │
│  │  Embedding   │   │ ContentStore │   │    Mememo    │         │
│  │    Engine    │   │ (IndexedDB)  │   │ (HNSW Index) │         │
│  └──────────────┘   └──────────────┘   └──────────────┘         │
│         ▲                  ▲                  ▲                 │
│         │                  │                  │                 │
│         └──────────────────┴──────────────────┘                 │
│                      Orchestrates all                           │
└─────────────────────────────────────────────────────────────────┘
                             ▲
                             │ Uses
                             │
┌────────────────────────────┴────────────────────────────────────┐
│                  DocumentProcessor (Optional)                   │
│  ┌──────────────┐   ┌──────────────┐   ┌──────────────┐         │
│  │   Document   │   │     Text     │   │   Chunking   │         │
│  │  Extractor   │ → │   Content    │ → │  Strategies  │         │
│  │ (PDF/Image)  │   │              │   │ (Recursive)  │         │
│  └──────────────┘   └──────────────┘   └──────────────┘         │
└─────────────────────────────────────────────────────────────────┘
```

### 1. Mememo (/mememo)
A TypeScript implementation of the HNSW (Hierarchical Navigable Small World) algorithm for efficient vector similarity search.
- Configurable distance functions (cosine, cosine-normalized)
- IndexedDB persistence with incremental saves
- Customizable graph parameters (m, efConstruction)
- Fast approximate nearest neighbor search with intelligent caching
- Performance Optimizations:
  - LRU cache system (~75% fewer IndexedDB fetches)
  - Cross-layer prefetching (4 ms average query time for 100 nodes)
  - Incremental saves with dirty tracking (~90% faster updates)
  - Race condition protection with a ready() pattern
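These optimizations are exposed on the index instance that VectorSearchManager makes available as `manager.index`; a minimal sketch using methods documented in the Performance Tips and Persistence sections below:

```typescript
const manager = new VectorSearchManager();

// Debounced background persistence (see Performance Tips)
manager.index.setAutosave(true, 5000);

// Persist only changed nodes/layers instead of a full export
await manager.index.incrementalSaveIndex();
```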
### 2. Embeddings (/embeddings)
Handles text-to-vector conversion and document storage:
- EmbeddingPipeline: Default HuggingFace Transformers.js implementation (Xenova/all-MiniLM-L6-v2)
- IEmbeddingEngine: Interface for custom embedding models (OpenAI, Cohere, local models)
- ContentStore: Dexie (IndexedDB) storage for original text documents
- VectorSearchManager: Orchestrates embeddings, storage, and search
### 3. VectorSearchManager (/embeddings/VectorSearchManager.ts)
The main interface that combines all components:
- Accepts text → generates embeddings → stores in ContentStore → indexes in Mememo
- Handles search queries end-to-end
- Returns results with original text and similarity scores
- Manages persistence across browser sessions
### 4. Document Processing (/document-processing)
Optional module for extracting and chunking text from various document formats:
- DocumentExtractor: PDF (pdf.js), Images (Tesseract.js OCR), Text files
- TextChunker: Smart text chunking strategies (recursive, sentence, paragraph)
- DocumentProcessor: End-to-end pipeline from files to searchable chunks
## Quick Start
### Installation
```bash
npm install
```

### Basic Usage
```typescript
import VectorSearchManager from './embeddings/VectorSearchManager';

// Initialize with default configuration
const searchManager = new VectorSearchManager();

// Add documents
await searchManager.addDocument("A small, fast boat is called a skiff.");
await searchManager.addDocument("Apples and oranges are types of fruit.");
await searchManager.addDocument("A car is a form of wheeled transport.");

// Search
const results = await searchManager.search("What is a vehicle?", 3);
console.log(results);
// Output: [{ key: "...", text: "A car is...", distance: 0.12 }, ...]
```

## Configuration
### Using Default Configuration
The library comes with sensible defaults:
```typescript
import VectorSearchManager, { DEFAULT_CONFIG } from './embeddings/VectorSearchManager';

// Uses default settings:
// - Embedding: Xenova/all-MiniLM-L6-v2 (384 dimensions)
// - Distance: cosine-normalized
// - m: 16
// - efConstruction: 200
// - IndexedDB: enabled
const searchManager = new VectorSearchManager();
```

### Custom Configuration
```typescript
import VectorSearchManager from './embeddings/VectorSearchManager';

const searchManager = new VectorSearchManager({
  indexConfig: {
    distanceFunction: 'cosine',
    m: 32,               // More connections = better recall, slower build
    efConstruction: 400, // Higher = better quality, slower build
    useIndexedDB: true,  // Enable persistence
  }
});
```

### Custom Embedding Engine
You can provide your own embedding model by implementing the `IEmbeddingEngine` interface:
```typescript
import { IEmbeddingEngine } from './embeddings/EmbeddingPipeline';
import VectorSearchManager from './embeddings/VectorSearchManager';

// Example: Custom OpenAI-based embedding engine
class OpenAIEmbeddingEngine implements IEmbeddingEngine {
  private apiKey: string;

  constructor(apiKey: string) {
    this.apiKey = apiKey;
  }

  async embed(text: string): Promise<number[]> {
    const response = await fetch('https://api.openai.com/v1/embeddings', {
      method: 'POST',
      headers: {
        'Authorization': `Bearer ${this.apiKey}`,
        'Content-Type': 'application/json',
      },
      body: JSON.stringify({
        model: 'text-embedding-ada-002',
        input: text,
      }),
    });
    const data = await response.json();
    return data.data[0].embedding;
  }
}

// Use the custom engine
const customSearchManager = new VectorSearchManager({
  embeddingEngine: new OpenAIEmbeddingEngine('your-api-key'),
  indexConfig: {
    distanceFunction: 'cosine-normalized',
  }
});
```

### Example: Local Model with ONNX
```typescript
import { IEmbeddingEngine } from './embeddings/EmbeddingPipeline';

class CustomONNXEngine implements IEmbeddingEngine {
  private session: any; // Your ONNX session

  async embed(text: string): Promise<number[]> {
    // Your custom embedding logic here:
    // 1. Tokenize the text
    // 2. Run it through the ONNX model
    // 3. Return the embedding vector
    return [/* your embedding vector */];
  }
}

const searchManager = new VectorSearchManager({
  embeddingEngine: new CustomONNXEngine(),
});
```

## API Reference
### VectorSearchManager
#### `constructor(config?: VectorSearchConfig)`
Creates a new VectorSearchManager instance.
Parameters:
- `config.embeddingEngine` (optional): Custom embedding engine implementing `IEmbeddingEngine`
- `config.indexConfig.distanceFunction` (optional): `'cosine' | 'cosine-normalized'` (default: `'cosine-normalized'`)
- `config.indexConfig.m` (optional): Number of bi-directional links per node (default: 16)
- `config.indexConfig.efConstruction` (optional): Dynamic candidate list size (default: 200)
- `config.indexConfig.useIndexedDB` (optional): Enable IndexedDB persistence (default: true)
#### `async addDocument(text: string, id?: string): Promise<string>`
Adds a document to the search index.
Parameters:
- `text`: The document text to index
- `id` (optional): Custom document ID. If not provided, a UUID is generated.
Returns: The document ID
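For example:

```typescript
// Auto-generated UUID
const autoId = await searchManager.addDocument('Some text');

// Caller-supplied ID
await searchManager.addDocument('Other text', 'notes_intro');
```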
#### `async search(queryText: string, k?: number): Promise<ISearchResult[]>`
Searches for similar documents.
Parameters:
- `queryText`: The search query
- `k` (optional): Number of results to return (default: 3)

Returns: Array of search results with `key`, `text`, and `distance` fields
#### `async size(): Promise<number>`
Returns the total number of indexed documents.
#### `async getDocument(id: string): Promise<IDocument | undefined>`
Retrieves a document by ID.
Returns: Document object with id and text, or undefined if not found.
#### `async hasDocument(id: string): Promise<boolean>`
Checks if a document exists and is active (not deleted).
#### `async updateDocument(id: string, newText: string): Promise<void>`
Updates an existing document. Regenerates the embedding and updates both ContentStore and HNSW index.
Throws: Error if document doesn't exist.
#### `async deleteDocument(id: string): Promise<void>`
Soft deletes a document. Removes from ContentStore and marks as deleted in HNSW (excluded from searches).
#### `async compactIndex(): Promise<void>`
Permanently removes all soft-deleted nodes from the index, reclaiming memory. Expensive operation (O(n log n)).
#### `async getStats(): Promise<IndexStats>`
Returns index statistics:
- `totalNodes`: Total nodes (active + deleted)
- `activeNodes`: Active nodes (not deleted)
- `deletedNodes`: Soft-deleted nodes
### IEmbeddingEngine Interface
```typescript
interface IEmbeddingEngine {
  embed(text: string): Promise<number[]>;
}
```

Implement this interface to create custom embedding engines.
### Default Embedding Model
The default model is Xenova/all-MiniLM-L6-v2:
- Dimensions: 384
- Optimized for: Semantic similarity search
- Normalization: Embeddings are normalized (use 'cosine-normalized' distance)
- Source: HuggingFace Transformers.js (runs in browser/Node.js)
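For reference, a minimal sketch of what the default engine does under the hood with Transformers.js (this uses the library's public `pipeline` API; the internals of `EmbeddingPipeline` may differ):

```typescript
import { pipeline } from '@xenova/transformers';

// Load the feature-extraction pipeline once (the model is cached after the first call)
const extractor = await pipeline('feature-extraction', 'Xenova/all-MiniLM-L6-v2');

// Mean-pool the token embeddings and L2-normalize them into a single 384-dim vector
const output = await extractor('Your text here', { pooling: 'mean', normalize: true });
const embedding = Array.from(output.data);
console.log(embedding.length); // 384
```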
## Performance Tips
- Adjust m parameter: Higher values (32-48) improve recall but increase memory and build time
- Tune efConstruction: Higher values (200-400) improve index quality at the cost of build time
- Use normalized embeddings: If your embeddings are normalized, use 'cosine-normalized' for faster distance calculations
- Enable autosave: For frequent updates, enable autosave to automatically persist changes without blocking operations
  ```typescript
  const manager = new VectorSearchManager();
  // Enable autosave with a 5-second debounce
  manager.index.setAutosave(true, 5000);
  ```
- Use incremental saves: For large indexes, use `incrementalSaveIndex()` instead of full saves
  ```typescript
  // Only saves changed nodes/layers (~90% faster for small updates)
  await manager.index.incrementalSaveIndex();
  ```
- Batch operations: When adding multiple documents, cross-layer cache sharing automatically optimizes repeated queries
## Document Processing & Storage Pipeline
The library provides an end-to-end pipeline for processing documents and storing them in a searchable vector index.
### How Documents Are Stored
When you call `searchManager.addDocument(text, id)`, the system performs three operations:
1. Embedding Generation: Converts the text to a vector using the embedding engine
2. Text Storage: Saves the original text in `ContentStore` (an IndexedDB database)
3. Vector Indexing: Adds the vector to the HNSW graph for similarity search
This design ensures that:
- ✅ Original text is preserved and retrievable
- ✅ Search returns actual content, not just IDs
- ✅ Data persists across browser sessions
- ✅ No external database required
Under the hood:

```typescript
// What happens when you call addDocument
async addDocument(text: string, id?: string): Promise<string> {
  // 1. Generate the embedding vector
  const vector = await this.embeddingEngine.embed(text);

  // 2. Generate or use the provided ID
  const key = id || uuidv4();

  // 3. Store the original text in ContentStore (IndexedDB)
  await this.contentStore.addDocument(key, text);

  // 4. Add the vector to the HNSW index for similarity search
  await this.index.add(key, vector);

  return key;
}
```

### ContentStore: Your Document Database
The ContentStore is a Dexie-based IndexedDB wrapper that stores your documents:
```typescript
// Access the ContentStore directly
const contentStore = searchManager.contentStore;

// Get a single document
const doc = await contentStore.documents.get('document-id');
console.log(doc.text); // Original text

// Get multiple documents
const docs = await contentStore.getDocuments(['id1', 'id2', 'id3']);

// Query all documents
const allDocs = await contentStore.documents.toArray();

// Delete a document
await contentStore.documents.delete('document-id');

// Clear all documents
await contentStore.documents.clear();
```

### Complete Document Processing Workflow
Here's a complete example of processing a PDF and making it searchable:
```typescript
import { DocumentProcessor } from './document-processing';
import VectorSearchManager from './embeddings/VectorSearchManager';

// Step 1: Initialize components
const processor = new DocumentProcessor({
  chunkingConfig: {
    chunkSize: 1000,   // Target chunk size in characters
    chunkOverlap: 200, // Overlap between chunks for context
  },
  extractorConfig: {
    enableOCRFallback: true, // Enable OCR for scanned PDFs
  }
});
const searchManager = new VectorSearchManager();

// Step 2: Process a document (PDF, image, or text file)
const chunks = await processor.processDocument(
  pdfFile, // File, Buffer, or ArrayBuffer
  'application/pdf',
  { filename: 'research-paper.pdf' }
);
console.log(`Extracted ${chunks.length} chunks from the PDF`);

// Step 3: Index all chunks
// Each chunk is automatically:
// - Embedded into a vector
// - Stored in ContentStore (IndexedDB) with its original text
// - Indexed in the HNSW graph for similarity search
for (const chunk of chunks) {
  const chunkId = `${chunk.metadata.filename}_chunk_${chunk.index}`;
  await searchManager.addDocument(chunk.text, chunkId);
  console.log(`Indexed chunk ${chunk.index}/${chunk.metadata.totalChunks}`);
}

// Step 4: Search across all indexed chunks
const results = await searchManager.search('machine learning algorithms', 5);

// Step 5: Results include original text and similarity scores
results.forEach((result, i) => {
  console.log(`\n--- Result ${i + 1} ---`);
  console.log(`Similarity: ${(1 - result.distance).toFixed(3)}`);
  console.log(`Document: ${result.key}`);
  console.log(`Text: ${result.text.substring(0, 200)}...`);
});
```

What's happening behind the scenes:
1. DocumentProcessor extracts text from the PDF and splits it into manageable chunks
2. Each chunk contains:
   - `text`: The actual content
   - `index`: Position in the document
   - `metadata`: Filename, MIME type, total chunks, etc.
3. `VectorSearchManager.addDocument()` for each chunk:
   - Generates an embedding vector from the text
   - Stores the original text in ContentStore (IndexedDB)
   - Adds the vector to the HNSW index
4. Search queries are embedded the same way and matched against the index
5. Results include the original stored text from ContentStore
### Processing Different Document Types
```typescript
// PDF with text
const pdfChunks = await processor.processDocument(
  pdfFile,
  'application/pdf',
  { filename: 'document.pdf' }
);

// Scanned PDF (uses OCR)
const scannedChunks = await processor.processDocument(
  scannedPdfFile,
  'application/pdf',
  { filename: 'scanned.pdf' }
);

// Image (automatic OCR)
const imageChunks = await processor.processDocument(
  imageFile,
  'image/png',
  { filename: 'screenshot.png' }
);

// Plain text
const textChunks = await processor.processDocument(
  textFile,
  'text/plain',
  { filename: 'notes.txt' }
);

// Markdown
const mdChunks = await processor.processDocument(
  markdownFile,
  'text/markdown',
  { filename: 'readme.md' }
);

// Index all chunks from any format
for (const chunk of [...pdfChunks, ...imageChunks, ...textChunks]) {
  await searchManager.addDocument(chunk.text, `${chunk.metadata.filename}_${chunk.index}`);
}
```

### Batch Processing Multiple Documents
```typescript
const documents = [
  { file: file1, mimeType: 'application/pdf', filename: 'doc1.pdf' },
  { file: file2, mimeType: 'image/png', filename: 'diagram.png' },
  { file: file3, mimeType: 'text/plain', filename: 'notes.txt' },
];

let totalChunks = 0;
for (const doc of documents) {
  console.log(`Processing: ${doc.filename}`);
  const chunks = await processor.processDocument(
    doc.file,
    doc.mimeType,
    { filename: doc.filename }
  );

  // Add each chunk to the search index
  for (const chunk of chunks) {
    const id = `${doc.filename}_chunk_${chunk.index}`;
    await searchManager.addDocument(chunk.text, id);
    totalChunks++;
  }
}

console.log(`Indexed ${totalChunks} chunks from ${documents.length} documents`);

// Now search across all documents
const results = await searchManager.search('important concept', 10);
```

### Advanced: Custom Chunking Strategies
```typescript
import {
  DocumentProcessor,
  SentenceTextSplitter,
  ParagraphTextSplitter
} from './document-processing';

const processor = new DocumentProcessor({
  // Register multiple chunking strategies
  customChunkers: [
    { name: 'sentence', chunker: new SentenceTextSplitter({ chunkSize: 500 }) },
    { name: 'paragraph', chunker: new ParagraphTextSplitter({ chunkSize: 1200 }) },
  ],
  defaultChunkingStrategy: 'sentence'
});

// Use sentence-based chunking
const sentenceChunks = await processor.processDocument(
  file,
  'application/pdf',
  {
    filename: 'article.pdf',
    chunkingStrategy: 'sentence' // Override the default
  }
);
```

### Retrieving Stored Documents
```typescript
// Access the ContentStore directly for advanced operations
const contentStore = searchManager.contentStore;

// Get a single document by ID
const document = await contentStore.documents.get('document_id');
if (document) {
  console.log(`ID: ${document.id}`);
  console.log(`Text: ${document.text}`);
}

// Get multiple documents in one call
const docIds = ['id1', 'id2', 'id3'];
const docs = await contentStore.getDocuments(docIds);
docs.forEach(doc => {
  if (doc) {
    console.log(doc.text);
  }
});

// Query all documents (useful for exports or backups)
const allDocs = await contentStore.documents.toArray();
console.log(`Total documents: ${allDocs.length}`);

// Delete a document (note: also remove it from the HNSW index separately)
await contentStore.documents.delete('document_id');

// Clear all documents
await contentStore.documents.clear();

// Check the total number of indexed documents
const totalDocs = await searchManager.size();
console.log(`Indexed documents: ${totalDocs}`);
```

Important: The ContentStore is automatically managed by VectorSearchManager. When you call `addDocument()`, text storage and vector indexing happen together. If you delete from ContentStore directly, remember to also remove the entry from the HNSW index if needed.
### Supported Document Formats
| Format | MIME Type | Example |
|--------|-----------|---------|
| PDF | application/pdf | processor.processDocument(file, 'application/pdf') |
| Text | text/plain | processor.processDocument(file, 'text/plain') |
| Markdown | text/markdown | processor.processDocument(file, 'text/markdown') |
| JSON | application/json | processor.processDocument(file, 'application/json') |
| PNG | image/png | processor.processDocument(file, 'image/png') |
| JPEG | image/jpeg | processor.processDocument(file, 'image/jpeg') |
Note:
- PDFs use pdf.js for text extraction, with OCR fallback for scanned documents
- Images use Tesseract.js for automatic OCR text extraction
- All text formats are supported via TextDecoder
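In the browser, the MIME type can usually be read straight off the `File` object; the helper below is a hypothetical sketch (the extension fallback map is illustrative), reusing the `processor` and `searchManager` from the earlier examples:

```typescript
// Hypothetical helper: route a browser File into the pipeline by MIME type
async function indexFile(file: File) {
  const byExtension: Record<string, string> = {
    pdf: 'application/pdf',
    md: 'text/markdown',
    txt: 'text/plain',
  };
  const ext = file.name.split('.').pop()?.toLowerCase() ?? '';
  const mimeType = file.type || byExtension[ext] || 'text/plain';

  const chunks = await processor.processDocument(file, mimeType, { filename: file.name });
  for (const chunk of chunks) {
    await searchManager.addDocument(chunk.text, `${file.name}_chunk_${chunk.index}`);
  }
}
```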
See document-processing/README.md for detailed documentation on extractors, chunking strategies, and customization options.
## Use Cases
- Documentation Search: Semantic search through documentation with automatic chunking
- PDF Analysis: Extract, chunk, and search through PDF documents
- Content Recommendation: Find similar articles or products based on semantic similarity
- FAQ Matching: Match user questions to FAQ entries using semantic understanding
- Chatbot Context: Retrieve relevant context for conversational AI applications
- Code Search: Find similar code snippets semantically (not just keyword matching)
- Document Q&A: Question-answering over large document collections
- Image Text Search: OCR + search through scanned documents and images
### Example: FAQ Chatbot
```typescript
// Index FAQs
const faqs = [
  { id: 1, q: "How do I reset password?", a: "Go to settings and click reset..." },
  { id: 2, q: "Where is my order?", a: "Check tracking in your account..." },
];
for (const faq of faqs) {
  await manager.addDocument(faq.q + " " + faq.a, `faq_${faq.id}`);
}

// Match user questions semantically
const matches = await manager.search(userQuestion, 3);
const bestMatch = matches[0];
console.log(`Best match: ${bestMatch.text}`);
```

### Example: Knowledge Base
```typescript
// Index documentation from an API
const docs = await fetch('/api/docs').then(r => r.json());
for (const doc of docs) {
  await manager.addDocument(doc.content, doc.id);
}

// Semantic search
const results = await manager.search("how to configure authentication", 10);
results.forEach(r => console.log(r.text));
```

## Quick Reference
### Common Patterns
Simple text search:
```typescript
const manager = new VectorSearchManager();
await manager.addDocument("Your text here");
const results = await manager.search("query", 5);
```

Process and index a PDF:
```typescript
const processor = new DocumentProcessor();
const chunks = await processor.processDocument(pdfFile, 'application/pdf');
for (const chunk of chunks) {
  await manager.addDocument(chunk.text);
}
```

Custom embedding model:
```typescript
const manager = new VectorSearchManager({
  embeddingEngine: new MyCustomEngine(),
});
```

Access stored documents:
```typescript
const doc = await manager.contentStore.documents.get('doc-id');
const totalDocs = await manager.size();
```

### Key Configuration Parameters
| Parameter | Default | Description |
|-----------|---------|-------------|
| distanceFunction | 'cosine-normalized' | Distance metric for similarity |
| m | 16 | Graph connectivity (higher = better recall) |
| efConstruction | 200 | Build quality (higher = better index) |
| chunkSize | 1000 | Characters per chunk |
| chunkOverlap | 200 | Overlap between chunks |
| enableOCRFallback | true | OCR for scanned PDFs/images |
| clearOnInit | true | Clear IndexedDB on initialization (set false for persistence) |
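Note that these parameters span two constructors: `distanceFunction`, `m`, `efConstruction`, and `clearOnInit` belong to `VectorSearchManager`'s `indexConfig`, while `chunkSize`, `chunkOverlap`, and `enableOCRFallback` belong to `DocumentProcessor`:

```typescript
const searchManager = new VectorSearchManager({
  indexConfig: {
    distanceFunction: 'cosine-normalized',
    m: 16,
    efConstruction: 200,
    useIndexedDB: true,
    clearOnInit: false, // keep data across sessions
  },
});

const processor = new DocumentProcessor({
  chunkingConfig: { chunkSize: 1000, chunkOverlap: 200 },
  extractorConfig: { enableOCRFallback: true },
});
```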
## CRUD Operations
### Update Documents
```typescript
// Update an existing document (regenerates the embedding)
await manager.updateDocument(documentId, 'Updated text content');

// Verify the update
const updatedDoc = await manager.getDocument(documentId);
console.log(updatedDoc.text); // "Updated text content"
```

### Delete Documents
Soft Delete (fast, marks as deleted but keeps in memory):
```typescript
await manager.deleteDocument(documentId);
```

Hard Delete (reclaims memory by compacting the index):
```typescript
// Delete several documents
await manager.deleteDocument(id1);
await manager.deleteDocument(id2);

// Compact the index to permanently remove deleted nodes
await manager.compactIndex();
```

When to Compact:
- Deleted nodes exceed 20-30% of total nodes
- Memory usage is a concern
- Application is idle (compaction is expensive)
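A small sketch tying these heuristics together; the 25% threshold is an example value, and `requestIdleCallback` (available in most browsers) defers the expensive compaction to idle time:

```typescript
async function compactIfNeeded(manager: VectorSearchManager) {
  const stats = await manager.getStats();
  // Compact once soft-deleted nodes exceed ~25% of the index
  if (stats.deletedNodes > stats.totalNodes * 0.25) {
    // Defer the O(n log n) rebuild until the browser is idle
    requestIdleCallback(() => {
      manager.compactIndex().catch(console.error);
    });
  }
}
```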
### Check Document Status
```typescript
// Check if a document exists
const exists = await manager.hasDocument(documentId);

// Get index statistics
const stats = await manager.getStats();
console.log(`Total: ${stats.totalNodes}, Active: ${stats.activeNodes}, Deleted: ${stats.deletedNodes}`);
```

## Performance Characteristics
| Operation | Time Complexity | Notes |
|-----------|-----------------|-------|
| Add Document | O(log n) | Includes embedding generation + HNSW insertion |
| Update Document | O(log n) | Re-embedding + updating connections in the graph |
| Soft Delete | O(1) | Just marks the node as deleted |
| Hard Delete (Compact) | O(n log n) | Rebuilds the entire index |
| Search | O(log n) | HNSW approximate nearest neighbor search |
## Persistence & Index Management
### Enabling Persistence
By default, IndexedDB is cleared on initialization. To persist data across sessions:
```typescript
const manager = new VectorSearchManager({
  indexConfig: {
    useIndexedDB: true,
    clearOnInit: false // Enable persistence
  }
});

// Wait for async initialization
await new Promise(resolve => setTimeout(resolve, 100));

// Check whether persisted data was loaded
const size = await manager.size();
console.log(`Loaded ${size} documents from persistence`);
```

### Saving and Loading Index
```typescript
// Save the index structure (graph topology)
await manager.index.saveIndex();

// Load the persisted index (automatic if clearOnInit: false)
const persistedData = await manager.index.loadPersistedIndex();
if (persistedData) {
  console.log('Index loaded from persistence');
}
```

### Storage Size Estimates
| Index Size | Graph Structure | Total Storage |
|------------|-----------------|---------------|
| 1k nodes | ~100-500 KB | ~2-5 MB |
| 10k nodes | ~1-5 MB | ~15-20 MB |
| 100k nodes | ~10-50 MB | ~150-200 MB |
What's Stored:
- Embeddings: Always persisted in the IndexedDB `mememo` table (when `useIndexedDB: true`)
- Graph Structure: Persisted in the `indexMetadata` table (when `clearOnInit: false`)
- Text Documents: Persisted in `ContentStore` (IndexedDB)
### Best Practices for Persistence
```typescript
// 1. Always save after bulk operations
for (const doc of manyDocs) {
  await manager.addDocument(doc.text, doc.id);
}
await manager.index.saveIndex(); // Don't forget!

// 2. Use incremental saves for large indexes with small changes
await manager.index.incrementalSaveIndex(); // ~90% faster than a full save

// 3. Enable autosave for automatic background persistence
manager.index.setAutosave(true, 5000); // Auto-save with a 5s debounce
```

## Progress Callbacks
Track embedding progress for user feedback during long-running operations:
### Model Loading Progress
```typescript
import { DefaultEmbeddingEngine } from './embeddings/EmbeddingPipeline';

const engine = new DefaultEmbeddingEngine((progress) => {
  console.log(`Model loading: ${progress.status} - ${Math.round(progress.progress * 100)}%`);
});

const manager = new VectorSearchManager({ embeddingEngine: engine });
```

### Embedding Progress
```typescript
await manager.addDocument(
  'Document text',
  'doc-1',
  {}, // metadata
  (progress) => {
    console.log(`Embedding: ${Math.round(progress * 100)}%`);
    // Update a UI progress bar here
  }
);
```

### Batch Processing with Progress
```typescript
const documents = ['doc1', 'doc2', 'doc3'];

for (let i = 0; i < documents.length; i++) {
  await manager.addDocument(
    documents[i],
    `doc-${i}`,
    {},
    (embeddingProgress) => {
      const overallProgress = ((i + embeddingProgress) / documents.length) * 100;
      console.log(`Overall: ${Math.round(overallProgress)}%`);
    }
  );
}
```

Progress Stages:
- 0% - Operation started
- 30% - Model loaded (if needed)
- 90% - Embedding computed
- 100% - Operation complete
Note: After the first embedding, the model stays loaded, so subsequent embeddings skip the 0-30% stage.
## Advanced Storage Management
### Monitor Storage Usage
```typescript
// Check the document count
const total = await manager.size();
const allDocs = await manager.contentStore.documents.toArray();
const totalBytes = allDocs.reduce((sum, d) => sum + d.text.length, 0);
console.log(`Documents: ${total}`);
console.log(`Size: ${(totalBytes / 1024 / 1024).toFixed(2)} MB`);

// Check the browser storage quota
if (navigator.storage?.estimate) {
  const { usage, quota } = await navigator.storage.estimate();
  console.log(`Used: ${((usage / quota) * 100).toFixed(1)}%`);
}
```

### Export and Backup
```typescript
// Export all documents
const backup = await manager.contentStore.documents.toArray();
const json = JSON.stringify(backup, null, 2);
// Save to a file or server

// Restore from a backup
const restored = JSON.parse(backupJson);
for (const doc of restored) {
  await manager.addDocument(doc.text, doc.id);
}
```

### Advanced ContentStore Queries
Since ContentStore extends Dexie, you can use all Dexie query features:
```typescript
// Filter documents
const filtered = await manager.contentStore.documents
  .filter(doc => doc.text.includes('machine learning'))
  .toArray();

// Limit results
const first10 = await manager.contentStore.documents
  .limit(10)
  .toArray();

// Count documents
const count = await manager.contentStore.documents.count();

// Iterate with a cursor
await manager.contentStore.documents.each(doc => {
  console.log(doc.id);
});
```

## React Integration
### Custom Hook Example
```jsx
import { useState, useEffect } from 'react';
import VectorSearchManager from 'flux-vector/embeddings/VectorSearchManager';

export function useVectorSearch() {
  const [searchManager, setSearchManager] = useState(null);
  const [loading, setLoading] = useState(true);
  const [indexSize, setIndexSize] = useState(0);

  useEffect(() => {
    // Keep a local reference so the cleanup below doesn't close over
    // the stale (initially null) searchManager state value.
    let manager = null;

    const initSearch = async () => {
      manager = new VectorSearchManager({
        indexConfig: {
          distanceFunction: 'cosine-normalized',
          useIndexedDB: true,
          clearOnInit: false // Enable persistence
        }
      });
      await new Promise(resolve => setTimeout(resolve, 100));
      const size = await manager.size();
      setIndexSize(size);
      setSearchManager(manager);
      setLoading(false);
    };
    initSearch();

    return () => {
      if (manager) {
        manager.index.saveIndex().catch(console.error);
      }
    };
  }, []);

  const addDocument = async (text, id) => {
    if (!searchManager) return;
    const docId = await searchManager.addDocument(text, id);
    await searchManager.index.saveIndex();
    setIndexSize(await searchManager.size());
    return docId;
  };

  const search = async (query, k = 5) => {
    if (!searchManager) return [];
    return await searchManager.search(query, k);
  };

  return { searchManager, loading, indexSize, addDocument, search };
}
```

### Component Example
```jsx
import React, { useState } from 'react';
import { useVectorSearch } from './hooks/useVectorSearch';

function SemanticSearch() {
  const { loading, indexSize, search } = useVectorSearch();
  const [query, setQuery] = useState('');
  const [results, setResults] = useState([]);

  const handleSearch = async () => {
    if (!query.trim()) return;
    const searchResults = await search(query, 10);
    setResults(searchResults);
  };

  if (loading) return <div>Loading semantic search index...</div>;

  return (
    <div>
      <p>{indexSize} documents indexed</p>
      <input
        value={query}
        onChange={(e) => setQuery(e.target.value)}
        onKeyDown={(e) => e.key === 'Enter' && handleSearch()}
      />
      <button onClick={handleSearch}>Search</button>
      {results.map((result) => (
        <div key={result.key}>
          <p>Similarity: {(1 - result.distance).toFixed(3)}</p>
          <p>{result.text.substring(0, 200)}...</p>
        </div>
      ))}
    </div>
  );
}
```

## Best Practices
### ID Strategy
```typescript
// ✅ Good: Descriptive, unique IDs
await manager.addDocument(text, "whitepaper_2024_intro");
await manager.addDocument(text, "faq_question_42");

// ❌ Avoid: Generic IDs unless auto-generated
await manager.addDocument(text, "1"); // Too generic
```

### Batch Processing
```typescript
// ✅ Good: Process in batches
for (let i = 0; i < files.length; i += 10) {
  const batch = files.slice(i, i + 10);
  await Promise.all(batch.map(f => processFile(f)));
}

// ❌ Avoid: All at once (memory issues)
await Promise.all(files.map(f => processFile(f)));
```

### Error Handling
```typescript
// ✅ Good: Handle errors gracefully
for (const chunk of chunks) {
  const id = `chunk_${chunk.index}`;
  try {
    await manager.addDocument(chunk.text, id);
  } catch (error) {
    console.error(`Failed: ${id}`, error);
    // Continue with the other chunks
  }
}
```

### Index Maintenance
```typescript
// Monitor and compact when needed
const stats = await manager.getStats();
if (stats.deletedNodes > stats.totalNodes * 0.25) {
  console.log('Compacting index...');
  await manager.compactIndex();
}
```

### Storage Monitoring
```typescript
// Check usage before adding large batches
if (navigator.storage?.estimate) {
  const { usage, quota } = await navigator.storage.estimate();
  const percentUsed = (usage / quota) * 100;
  if (percentUsed > 80) {
    console.warn('Storage almost full:', percentUsed.toFixed(1) + '%');
  }
}
```

## Troubleshooting
### No Search Results?
```typescript
const count = await manager.size();
console.log(`Documents indexed: ${count}`);
if (count === 0) {
  console.log('No documents indexed yet');
}
```

### Out of Storage?
```typescript
if (navigator.storage?.estimate) {
  const { usage, quota } = await navigator.storage.estimate();
  const percentUsed = (usage / quota) * 100;
  if (percentUsed > 80) {
    console.warn(`Storage ${percentUsed.toFixed(1)}% full`);
    // Clear old data or prompt the user
  }
}
```

### Slow Processing?
```typescript
// Disable OCR if it's not needed
const processor = new DocumentProcessor({
  extractorConfig: { enableOCRFallback: false }
});

// Process in batches to avoid memory issues
for (let i = 0; i < files.length; i += 10) {
  const batch = files.slice(i, i + 10);
  await processBatch(batch);
  await new Promise(resolve => setTimeout(resolve, 100)); // GC pause
}
```

### Persistence Not Working?
- Verify `clearOnInit: false` is set
- Check that `saveIndex()` was called before closing
- Ensure `useIndexedDB: true` (an in-memory index can't persist)
- Wait for async initialization: `await new Promise(resolve => setTimeout(resolve, 100))`
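If saves are being skipped because the tab closes first, one option is to trigger a save from the standard `beforeunload` event (a sketch; the save is asynchronous, so very large indexes may not finish before the page unloads):

```typescript
window.addEventListener('beforeunload', () => {
  // Fire-and-forget: persist the latest graph state on page close
  manager.index.saveIndex().catch(console.error);
});
```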
### High Memory Usage?
```typescript
// Check for deleted nodes
const stats = await manager.getStats();
if (stats.deletedNodes > stats.totalNodes * 0.25) {
  await manager.compactIndex(); // Reclaim memory
}
```

## Browser Compatibility
- IndexedDB: Required for persistence (supported in all modern browsers)
- WebAssembly: Required for transformers.js (supported in all modern browsers)
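A quick feature check before initializing, using plain Web APIs (independent of this library):

```typescript
const supported =
  typeof indexedDB !== 'undefined' &&    // persistence
  typeof WebAssembly !== 'undefined';    // transformers.js runtime

if (!supported) {
  console.warn('flux-vector requires IndexedDB and WebAssembly');
}
```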
## Development
```bash
# Install dependencies
npm install

# Build TypeScript
npx tsc

# Run tests
npm test

# Run Mememo-specific tests (45 tests covering all optimizations)
npm test -- Mememo
```

## Performance Benchmarks
Based on test results with 384-dimensional embeddings:
| Operation | Before Optimization | After Optimization | Improvement |
|-----------|---------------------|--------------------|-------------|
| Query (100 nodes, 5 queries) | ~16 ms | ~4 ms avg | ~75% faster |
| IndexedDB fetches (repeated queries) | Every prefetch | LRU cached | ~75% reduction |
| Save (large index, small change) | Full export | Incremental | ~90% faster |
| Page reload | Race condition crash | Stable | 100% reliable |
| Memory usage | Unbounded growth | LRU eviction | Stable at 50 MB |
Test Coverage: 45 comprehensive tests across 4 optimization phases
- Phase 1: Race condition fix (16 tests)
- Phase 2: LRU caching (6 tests)
- Phase 3: Cross-layer cache sharing (8 tests)
- Phase 4: Incremental saves (15 tests)
## License
ISC
## Credits
- Mememo: HNSW implementation by Jay Wang
- Transformers.js: HuggingFace transformers library for JavaScript
- Dexie: IndexedDB wrapper
