olingo

v2.4.0

Published

a month ago

Lightweight semantic search database with text embeddings for Node.js and Bun

0High
0Medium
0Low

artmann

embeddings semantic-search vector-database bun nodejs rag nlp machine-learning text-search cosine-similarity

Olingo

A lightweight semantic search database with text embeddings for Node.js and Bun

Olingo lets you build semantic search into your applications with just a few lines of code. Store text, search by meaning, and find similar content—perfect for RAG systems, chatbots, and recommendation engines.

What is Olingo?

Olingo is an embedding database that automatically converts text into vector embeddings and stores them in an efficient binary format. Instead of searching by exact keywords, you can search by semantic similarity—finding documents that mean the same thing, even if they use different words.

Example: Search for "how to reset password" and find results like "forgot my login credentials" or "change account password".

Why Olingo?

Simple API - No complex setup, just store and search
Semantic Search - Find content by meaning, not just keywords
Fast & Efficient - Binary storage format ~50% smaller than JSON
Zero Dependencies - Embeddings generated locally, no API keys needed
Works Everywhere - Compatible with Node.js and Bun
Built for RAG - Perfect for Retrieval Augmented Generation systems

Use Cases

FAQ Bots - Match user questions to answers by meaning
Document Search - Semantic search over documentation or knowledge bases
Code Search - Find code snippets by describing what they do
Content Recommendations - "More like this" functionality
RAG Systems - Retrieve relevant context for LLM prompts

Installation

# Using npm
npm install olingo

# Using bun
bun add olingo

Quick Start

Programmatic API

import { EmbeddingEngine } from 'olingo'

const engine = new EmbeddingEngine({
  storePath: './my-database.raptor'
})

// Store documents
await engine.storeMany([
  { key: 'doc1', text: 'How to reset your password' },
  { key: 'doc2', text: 'Machine learning basics' },
  { key: 'doc3', text: 'Getting started with Bun' }
])

// Search by meaning
const results = await engine.search('forgot my password', 5)
console.log(results[0].key) // 'doc1' - matched by meaning!
console.log(results[0].similarity) // 0.87 - high similarity score

// "More like this" — find documents similar to an existing one
const similar = await engine.similarTo('doc1', 5)
console.log(similar[0].key) // most similar document to doc1

Command Line Interface

# Store documents
olingo store doc1 "How to reset your password"
olingo store doc2 "Machine learning basics"

# Search by meaning
olingo search "forgot my password" --limit 5

# Find documents similar to an existing key ("more like this")
olingo similar-to doc1 --limit 5

# Retrieve by key
olingo get doc1

# Database management
olingo count                    # Show record count
olingo keys                     # List all keys
olingo stats                    # Show database statistics
olingo verify                   # Check database integrity
olingo compact                  # Remove dead records

Examples

See the examples/ directory for complete, runnable examples:

| Example | Description | Run | | -------------------------- | ------------------------------------------------------- | ----------------------------------------------- | | Document Search / RAG | Semantic search over documentation chunks | bun run examples/01-document-search.ts | | FAQ Bot | Match user questions to FAQs with confidence thresholds | bun run examples/02-faq-bot.ts | | Code Snippet Library | Search code by natural language descriptions | bun run examples/03-code-snippets.ts | | Content Recommendation | "More like this" functionality | bun run examples/04-content-recommendation.ts |

API Reference

`EmbeddingEngine`

`constructor(options)`

Create a new embedding engine.

const engine = new EmbeddingEngine({
  storePath: './database.raptor', // Path to storage file
  embeddingCacheSize: 100 // Optional: cache up to 100 text-to-embedding mappings
})

Options:

storePath - Path to the database file (required)
cacheDir - Directory to cache downloaded models (default: ./.cache/models)
readOnly - Open database in read-only mode (default: false)
embeddingCacheSize - Size of the LRU cache for text-to-embedding lookups (default: 0 = disabled)
embeddingProvider - Custom embedding provider (optional). When provided, the default model is not loaded. See Custom Embedding Providers.

`store(key, text)`

Store a single text entry with auto-generated embedding.

await engine.store('doc1', 'The quick brown fox')

`storeMany(items)`

Store multiple entries in batch (faster than multiple store() calls).

await engine.storeMany([
  { key: 'doc1', text: 'First document' },
  { key: 'doc2', text: 'Second document' }
])

`search(query, limit?, minSimilarity?)` / `search(query, options?)`

Search for semantically similar entries. Supports both positional arguments and a SearchOptions object.

// Positional arguments (backward compatible)
const results = await engine.search('artificial intelligence', 10, 0.7)

// SearchOptions object
const results2 = await engine.search('artificial intelligence', {
  limit: 10,
  minSimilarity: 0.7
})

// Detailed results with similarity breakdown
const detailed = await engine.search('artificial intelligence', {
  limit: 10,
  minSimilarity: 0.7,
  includeDetails: true
})
detailed.forEach((r) => {
  console.log(r.key, r.similarity)
  console.log(r.queryNorm, r.resultNorm, r.dotProduct)
})

`similarTo(key, limit?, minSimilarity?)` / `similarTo(key, options?)`

Find the documents most similar to an existing key, using that entry's stored embedding as the query. The source key is excluded from its own results. Throws KeyNotFoundError if the key doesn't exist. Accepts the same arguments and returns the same shape as search().

// Positional arguments
const neighbors = await engine.similarTo('doc1', 5, 0.7)

// SearchOptions object (incl. includeDetails)
const detailed = await engine.similarTo('doc1', {
  limit: 5,
  minSimilarity: 0.7,
  includeDetails: true
})

`update(key, text)`

Update the text for an existing key. Throws KeyNotFoundError if the key doesn't exist.

await engine.update('doc1', 'new text for doc1')

`storeEmbedding(key, embedding)`

Store a pre-computed embedding directly, bypassing the embedding model.

const embedding = new Float32Array(384) // or number[]
await engine.storeEmbedding('doc1', embedding)

`storeManyEmbeddings(items)`

Store multiple pre-computed embeddings in batch.

await engine.storeManyEmbeddings([
  { key: 'doc1', embedding: new Float32Array(384) },
  { key: 'doc2', embedding: [0.1, 0.2 /* ... */] }
])

`searchMany(queries, limit?, minSimilarity?)`

Search for multiple queries in batch. Deduplicates identical queries and generates embeddings in parallel.

const results = await engine.searchMany(['hello', 'goodbye', 'test'], 10, 0.7)
// Returns Map<string, SearchResult[]>
for (const [query, queryResults] of results) {
  console.log(`${query}: ${queryResults.length} results`)
}

`keysIterator()`

Returns an async iterator over all keys. More memory-efficient than keys().

for await (const key of engine.keysIterator()) {
  console.log(key)
}

`searchStream(query, options?)`

Returns an async iterator that yields search results one at a time.

for await (const result of engine.searchStream('query', {
  minSimilarity: 0.7
})) {
  console.log(result.key, result.similarity)
}

`stats()`

Returns database statistics.

const stats = await engine.stats()
console.log(stats.recordCount) // Number of live records
console.log(stats.dataFileSize) // Data file size in bytes
console.log(stats.walFileSize) // WAL file size in bytes
console.log(stats.dimension) // Embedding dimension (e.g., 384)
console.log(stats.isReadOnly) // Whether in read-only mode

`get(key)`

Retrieve a specific entry by key.

const entry = await engine.get('doc1')
if (entry) {
  console.log(entry.key) // 'doc1'
  console.log(entry.embedding) // Float32Array (384 dimensions)
}

`delete(key)`

Delete an entry by key. Returns true if the entry was deleted, false if it didn't exist.

const deleted = await engine.delete('doc1')

`has(key)`

Check if a key exists in the database.

const exists = await engine.has('doc1')

`count()`

Get the number of live entries in the database.

const count = await engine.count()

`keys()`

Get all keys in the database as an array.

const keys = await engine.keys()

`generateEmbedding(text)`

Generate an embedding for a text string without storing it.

const embedding = await engine.generateEmbedding('hello world')
// Returns number[] (384 dimensions)

`dispose()`

Dispose of the engine and release all resources. Always call this when done to properly clean up native resources and persist the HNSW index.

await engine.dispose()

CLI Reference

Commands

# Data operations
olingo store <key> <text> [--storePath path]    # Store text with embedding
olingo search <query> [options] [--storePath]    # Semantic search
olingo similar-to <key> [options] [--storePath]  # Find similar to existing key
olingo get <key> [--storePath path]              # Retrieve by key
olingo delete <key> [--storePath path]           # Delete by key

# Database management
olingo count [--storePath path]                  # Show record count
olingo keys [--storePath path]                   # List all keys
olingo stats [--storePath path]                  # Show database statistics
olingo verify [--storePath path]                 # Check database integrity
olingo compact [--storePath path]                # Remove dead records
olingo wal [--storePath path]                    # Display WAL entries

Options

-s, --storePath - Path to database file (default: ./database.raptor)
-l, --limit - Maximum results to return (default: 10)
-m, --minSimilarity - Minimum similarity threshold 0-1 (default: 0)

Examples

# Store documents
olingo store doc1 "The quick brown fox jumps over the lazy dog"
olingo store doc2 "Machine learning is a subset of artificial intelligence"
olingo store doc3 "Bun is a fast JavaScript runtime"

# Search with default settings
olingo search "artificial intelligence"

# Search with custom limit and threshold
olingo search "AI and ML" --limit 3 --minSimilarity 0.7

# Find documents most similar to an existing key
olingo similar-to doc2 --limit 5

# Use custom database path
olingo store key1 "Some text" --storePath ./data/custom.raptor

How It Works

Text → Embeddings: Olingo uses the BGE-Small-EN-V1.5 model to convert text into 384-dimensional vector embeddings
Storage: Embeddings are stored in a WAL-based binary format (.raptor files) with CRC32 checksums for integrity
Search: Queries are compared against stored embeddings using an HNSW (Hierarchical Navigable Small World) index for fast approximate nearest neighbor search
Results: Returns the most similar results ranked by cosine similarity

Embedding Model: BAAI/bge-small-en-v1.5 (384 dimensions, ~67MB)

`backup(destPath)`

Create a consistent backup of the database. Copies the data file, WAL, and HNSW sidecar to the destination.

await engine.backup('./backups/database.raptor')

`compact()`

Compact the database by rewriting only live records. Removes dead records from deletes and updates, reducing file size.

const result = await engine.compact()
console.log(result.recordsBefore) // Total records before (including dead)
console.log(result.recordsAfter) // Live records after
console.log(result.bytesBefore) // File size before
console.log(result.bytesAfter) // File size after

`verify()`

Verify the integrity of the database by scanning all records and validating checksums.

const result = await engine.verify()
console.log(result.totalRecords) // Total records scanned
console.log(result.validRecords) // Valid records
console.log(result.corruptRecords) // Corrupt records
console.log(result.issues) // Array of { offset, message }

Events

EmbeddingEngine extends EventEmitter and emits events for database operations:

engine.on('store', ({ key }) => console.log(`Stored: ${key}`))
engine.on('update', ({ key }) => console.log(`Updated: ${key}`))
engine.on('delete', ({ key }) => console.log(`Deleted: ${key}`))
engine.on('search', ({ query, resultCount }) =>
  console.log(`Search: "${query}" → ${resultCount} results`)
)
engine.on('similarTo', ({ key, resultCount }) =>
  console.log(`Similar to "${key}" → ${resultCount} results`)
)

Custom Embedding Providers

You can use any embedding model or service by implementing the EmbeddingProvider interface:

import { EmbeddingEngine } from 'olingo'
import type { EmbeddingProvider } from 'olingo'

const myProvider: EmbeddingProvider = {
  dimension: 1536,
  async generateEmbedding(text: string): Promise<Float32Array> {
    // Call your embedding API (OpenAI, Cohere, etc.)
    const response = await fetch('https://api.openai.com/v1/embeddings', {
      method: 'POST',
      headers: { Authorization: `Bearer ${apiKey}` },
      body: JSON.stringify({ model: 'text-embedding-3-small', input: text })
    })
    const data = await response.json()
    return new Float32Array(data.data[0].embedding)
  }
}

const engine = new EmbeddingEngine({
  storePath: './database.raptor',
  embeddingProvider: myProvider
})

When a custom provider is set, the default BGE model is never downloaded or loaded.

Performance

HNSW search index: Fast approximate nearest neighbor search via persistent HNSW index (saved to .raptor-hnsw sidecar file, loaded on startup)
Batch operations: Use storeMany() for faster bulk inserts
WAL-based durability: Write-ahead log ensures crash safety
Embedding cache: Use embeddingCacheSize to cache text-to-embedding mappings (~1.5KB per cached entry)
Deduplication: Latest entry automatically used for duplicate keys
Compaction: Use compact() to reclaim space from deleted records

This library uses node-llama-cpp which includes native bindings. When bundling, mark these packages as external:

// esbuild
{
  external: ['node-llama-cpp', '@node-llama-cpp/*']
}

// Vite
{
  build: {
    rollupOptions: {
      external: ['node-llama-cpp', /^@node-llama-cpp\/.*/]
    }
  }
}

// Webpack
{
  externals: ['node-llama-cpp', /^@node-llama-cpp\/.*/]
}

Multi-Process Safety

Olingo uses file-based locking to prevent concurrent writes. If a process crashes without releasing the lock, the lock file becomes stale. Olingo automatically detects stale locks by checking if the PID in the lock file is still alive. If the owning process has exited, the lock is automatically recovered.

Error Handling

Olingo provides custom error classes for common failure scenarios:

KeyNotFoundError - Thrown when attempting to update a key that doesn't exist. Includes the key property.
DimensionMismatchError - Thrown when an embedding's dimension doesn't match the database's expected dimension (384 for the default model). Includes expectedDimension and actualDimension properties.
DatabaseLockedError - Thrown when the database is locked by another process.
ReadOnlyError - Thrown when attempting to write to a read-only database.
LockPermissionError - Thrown when the lock file cannot be created due to permissions.
ModelInitializationError - Thrown when the embedding model fails to initialize.
EmbeddingGenerationError - Thrown when embedding generation fails.

import { DimensionMismatchError } from 'olingo'

try {
  await engine.store('doc1', 'some text')
} catch (error) {
  if (error instanceof DimensionMismatchError) {
    console.log(
      `Expected ${error.expectedDimension}d, got ${error.actualDimension}d`
    )
  }
}

Contributing

Contributions are welcome! Please see CONTRIBUTING.md for development setup, architecture details, and guidelines.

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

Olingo

What is Olingo?

Why Olingo?

Use Cases

Installation

Quick Start

Programmatic API

Command Line Interface

Examples

API Reference

EmbeddingEngine

constructor(options)

store(key, text)

storeMany(items)

search(query, limit?, minSimilarity?) / search(query, options?)

similarTo(key, limit?, minSimilarity?) / similarTo(key, options?)

update(key, text)

storeEmbedding(key, embedding)

storeManyEmbeddings(items)

searchMany(queries, limit?, minSimilarity?)

keysIterator()

searchStream(query, options?)

stats()

get(key)

delete(key)

has(key)

count()

keys()

generateEmbedding(text)

dispose()

CLI Reference

Commands

Options

Examples

How It Works

backup(destPath)

compact()

verify()

Events

Custom Embedding Providers

Performance

Multi-Process Safety

Error Handling

Contributing

License

`EmbeddingEngine`

`constructor(options)`

`store(key, text)`

`storeMany(items)`

`search(query, limit?, minSimilarity?)` / `search(query, options?)`

`similarTo(key, limit?, minSimilarity?)` / `similarTo(key, options?)`

`update(key, text)`

`storeEmbedding(key, embedding)`

`storeManyEmbeddings(items)`

`searchMany(queries, limit?, minSimilarity?)`

`keysIterator()`

`searchStream(query, options?)`

`stats()`

`get(key)`

`delete(key)`

`has(key)`

`count()`

`keys()`

`generateEmbedding(text)`

`dispose()`

`backup(destPath)`

`compact()`

`verify()`