@soulcraft/brainy
v6.6.1
Published
Universal Knowledge Protocol™ - World's first Triple Intelligence database unifying vector, graph, and document search in one API. Stage 3 CANONICAL: 42 nouns × 127 verbs covering 96-97% of all human knowledge.
Maintainers
Readme
Brainy
The Knowledge Operating System
Every piece of knowledge in your application — living, connected, and intelligent.
Stop fighting with vector databases, graph databases, and document stores. Stop stitching together Pinecone + Neo4j + MongoDB. Brainy does all three, in one elegant API, from prototype to planet-scale.
const brain = new Brainy()
await brain.init()
// That's it. You now have semantic search, graph relationships,
// and document filtering. Zero configuration. Just works.Built by developers who were tired of:
- Spending weeks configuring embeddings, indexes, and schemas
- Choosing between vector similarity OR graph relationships OR metadata filtering
- Rewriting everything when you need to scale from 1,000 to 1,000,000,000 entities
Brainy makes the impossible simple: All three paradigms. One API. Any scale.
👉 Choose Your Path
New to Brainy? Pick your starting point:
🚀 Path 1: I want to build something NOW
→ Complete API Reference ⭐ Most developers start here ⭐
- Every method documented with examples
- Quick start in 60 seconds
- 1,870 lines of copy-paste ready code
- This is your primary resource
🧠 Path 2: I want to understand the big picture first
→ Keep reading below for demos, architecture, and use cases
📊 Path 3: I'm evaluating database options
→ Jump to Why Revolutionary or Benchmarks
See It In Action
30 seconds to understand why Brainy is different:
import { Brainy, NounType, VerbType } from '@soulcraft/brainy'
const brain = new Brainy()
await brain.init()
// Add knowledge with context
const reactId = await brain.add({
data: "React is a JavaScript library for building user interfaces",
type: NounType.Concept,
metadata: { category: "frontend", year: 2013 }
})
const nextId = await brain.add({
data: "Next.js framework for React with server-side rendering",
type: NounType.Concept,
metadata: { category: "framework", year: 2016 }
})
// Create relationships
await brain.relate({ from: nextId, to: reactId, type: VerbType.BuiltOn })
// NOW THE MAGIC: Query with natural language
const results = await brain.find({
query: "modern frontend frameworks", // 🔍 Vector similarity
where: { year: { greaterThan: 2015 } }, // 📊 Document filtering
connected: { to: reactId, depth: 2 } // 🕸️ Graph traversal
})
// ALL THREE PARADIGMS. ONE QUERY. 10ms response time.This is impossible with traditional databases. Brainy makes it trivial.
Quick Start
npm install @soulcraft/brainyYour First Knowledge Graph (60 seconds)
import { Brainy, NounType, VerbType } from '@soulcraft/brainy'
const brain = new Brainy()
await brain.init()
// Add knowledge
const jsId = await brain.add({
data: "JavaScript is a programming language",
type: NounType.Concept,
metadata: { category: "language", year: 1995 }
})
const nodeId = await brain.add({
data: "Node.js runtime environment",
type: NounType.Concept,
metadata: { category: "runtime", year: 2009 }
})
// Create relationships
await brain.relate({ from: nodeId, to: jsId, type: VerbType.Executes })
// Query with Triple Intelligence
const results = await brain.find({
query: "JavaScript", // 🔍 Vector
where: { category: "language" }, // 📊 Document
connected: { from: nodeId, depth: 1 } // 🕸️ Graph
})Done. No configuration. No complexity. Production-ready from day one.
→ Ready to dive deeper? Complete API Documentation has every method with examples.
Entity Extraction (NEW in v5.7.6)
Extract entities from text with AI-powered classification:
import { Brainy, NounType } from '@soulcraft/brainy'
const brain = new Brainy()
await brain.init()
// Extract all entities
const entities = await brain.extractEntities('John Smith founded Acme Corp in New York')
// Returns:
// [
// { text: 'John Smith', type: NounType.Person, confidence: 0.95 },
// { text: 'Acme Corp', type: NounType.Organization, confidence: 0.92 },
// { text: 'New York', type: NounType.Location, confidence: 0.88 }
// ]
// Extract with filters
const people = await brain.extractEntities(resume, {
types: [NounType.Person],
confidence: 0.8
})
// Advanced: Direct access to extractors
import { SmartExtractor } from '@soulcraft/brainy'
const extractor = new SmartExtractor(brain, { minConfidence: 0.7 })
const result = await extractor.extract('CEO', {
formatContext: { format: 'excel', columnHeader: 'Title' }
})Features:
- 🎯 4-Signal Ensemble - ExactMatch (40%) + Embedding (35%) + Pattern (20%) + Context (5%)
- 📊 Format Intelligence - Adapts to Excel, CSV, PDF, YAML, DOCX, JSON, Markdown
- ⚡ Fast - ~15-20ms per extraction with LRU caching
- 🌍 42 Types - Person, Organization, Location, Document, and 38 more
→ Neural Extraction Guide | Import Preview Mode
From Prototype to Planet Scale
The same API. Zero rewrites. Any scale.
👤 Individual Developer → Weekend Prototype
const brain = new Brainy() // Zero config, starts in memory
await brain.init()Perfect for: Hackathons, side projects, prototyping, learning
👥 Small Team → Production MVP
const brain = new Brainy({
storage: { type: 'filesystem', path: './data', compression: true }
})Scale: Thousands to hundreds of thousands • Performance: <5ms queries → Production Service Architecture — Singleton patterns, caching, and scaling for Express/Node.js services
🏢 Growing Company → Multi-Million Scale
const brain = new Brainy({
storage: { type: 's3', s3Storage: { bucketName: 'my-kb', region: 'us-east-1' } },
hnsw: { typeAware: true } // 87% memory reduction
})Scale: Millions of entities • Performance: <10ms queries, 12GB @ 10M entities
🌍 Enterprise → Billion+ Scale
const brain = new Brainy({
storage: { type: 'gcs', gcsStorage: { bucketName: 'global-kb' } },
hnsw: { typeAware: true, M: 32, efConstruction: 400 }
})Scale: Billions (tested @ 1B+) • Performance: 18ms queries, 50GB memory Cost: $138k/year → $6k/year with intelligent tiering (96% savings)
→ Capacity Planning Guide | Cost Optimization
🎯 The Point
Start simple. Scale infinitely. Never rewrite.
Most systems make you choose: Simple (SQLite) OR Scalable (Kubernetes + 7 databases). Brainy gives you both. Starts simple as SQLite. Scales like Google.
Why Brainy Is Revolutionary
🧠 Triple Intelligence™ — The Impossible Made Possible
The world's first to unify three database paradigms in ONE API:
| What You Get | Like Having | But Unified | |-------------|-------------|-------------| | 🔍 Vector Search | Pinecone, Weaviate | Find by meaning | | 🕸️ Graph Relationships | Neo4j, ArangoDB | Navigate connections | | 📊 Document Filtering | MongoDB, Elasticsearch | Query metadata |
Every other system makes you choose. Brainy does all three together.
Why this matters: Your data isn't just vectors or just documents or just graphs. It's all three at once. A research paper is semantically similar to other papers (vector), written by an author (graph), and published in 2023 (document). Brainy is the only system that understands this.
🎯 42 Noun Types × 127 Verb Types = Universal Protocol
Model any domain with mathematical completeness:
42 Nouns × 127 Verbs × ∞ Metadata = 5,334+ base combinations
Stage 3 CANONICAL: 96-97% coverage of all human knowledgeReal-world expressiveness:
- Healthcare:
Patient → diagnoses → Condition - Finance:
Account → transfers → Transaction - Manufacturing:
Product → assembles → Component - Education:
Student → completes → Course - YOUR domain → Your types + relationships = Your knowledge graph
⚡ Zero Configuration Philosophy
We hate configuration files. So we eliminated them.
const brain = new Brainy() // Auto-detects everything
await brain.init() // Optimizes for your environmentBrainy automatically:
- Detects optimal storage (memory/filesystem/cloud)
- Configures memory based on available RAM
- Optimizes for containers (Docker/K8s)
- Tunes indexes for your data patterns
- Manages embedding models and caching
You write business logic. Brainy handles infrastructure.
🚀 Git-Style Version Control — Database & Entity Level (v5.0.0+)
Clone your entire database in <100ms. Track every entity change. Full Git-style workflow.
// Fork instantly - Snowflake-style copy-on-write
const experiment = await brain.fork('test-migration')
// Make changes safely in isolation
await experiment.add({ type: 'user', data: { name: 'Test User' } })
await experiment.updateAll({ /* migration logic */ })
// Commit your work
await experiment.commit({ message: 'Add test user', author: '[email protected]' })
// Switch to experimental branch to make it active
await brain.checkout('test-migration')
// Time-travel: Query database at any past commit (read-only)
const commits = await brain.getHistory({ limit: 10 })
const snapshot = await brain.asOf(commits[5].id)
const pastResults = await snapshot.find({ query: 'historical data' })
await snapshot.close()
// Entity versioning: Track changes to individual entities (v5.3.0+)
const userId = await brain.add({ type: 'user', data: { name: 'Alice' } })
await brain.versions.save(userId, { tag: 'v1.0', description: 'Initial profile' })
await brain.update(userId, { data: { name: 'Alice Smith', role: 'admin' } })
await brain.versions.save(userId, { tag: 'v2.0', description: 'Added role' })
// Compare versions or restore previous state
const diff = await brain.versions.compare(userId, 1, 2) // See what changed
await brain.versions.restore(userId, 1) // Restore v1.0Database-level version control (v5.0.0):
- ✅
fork()- Instant clone in <100ms - ✅
merge()- Merge with conflict resolution - ✅
commit()- Snapshot state - ✅
asOf()- Time-travel queries (query at any commit) - ✅
getHistory()- View commit history - ✅
checkout(),listBranches()- Full branch management - ✅ CLI support for all features
Entity-level version control (v5.3.0):
- ✅
versions.save()- Save entity snapshots with tags - ✅
versions.restore()- Restore previous versions - ✅
versions.compare()- Diff between versions - ✅
versions.list()- View version history - ✅ Automatic deduplication (content-addressable storage)
How it works: Snowflake-style COW shares HNSW index structures, copying only modified nodes (10-20% memory overhead).
Perfect for: Safe migrations, A/B testing, feature branches, distributed development, time-travel debugging, audit trails, document versioning, compliance tracking
What Can You Build?
If your app needs to remember, understand, or connect information — Brainy makes it trivial.
🤖 AI Agents with Perfect Memory
Give your AI unlimited context that persists forever. Not just chat history — true understanding of relationships, evolution, and meaning over time.
Examples: Personal assistants, code assistants, conversational AI, research agents
📚 Living Documentation & Knowledge Bases
Documentation that understands itself. Auto-links related concepts, detects outdated information, finds connections across your entire knowledge base.
Examples: Internal wikis, research platforms, smart documentation, learning systems
🔍 Semantic Search at Any Scale
Find by meaning, not keywords. Search codebases, research papers, customer data, or media libraries with natural language.
Examples: Code search, research platforms, content discovery, recommendation engines
🏢 Enterprise Knowledge Management
Corporate memory that never forgets. Track every customer interaction, product evolution, and business relationship.
Examples: CRM systems, product catalogs, customer intelligence, institutional knowledge
🎮 Rich Interactive Experiences
NPCs that remember. Characters that persist across stories. Worlds that evolve based on real relationships.
Examples: Game worlds, interactive fiction, educational platforms, creative tools
🎨 Content & Media Platforms
Every asset knows its relationships. Intelligent tagging, similarity-based discovery, and relationship-aware management.
Examples: DAM systems, media libraries, writing assistants, content management
The pattern: Knowledge that needs to live, connect, and evolve. That's what Brainy was built for.
Core Features
🧠 Natural Language Queries
// Ask naturally - Brainy understands
await brain.find("recent React components with tests")
await brain.find("JavaScript libraries similar to Vue")
// Or use structured Triple Intelligence queries
await brain.find({
query: "React",
where: { type: "library", year: { greaterThan: 2020 } },
connected: { to: "JavaScript", depth: 2 }
})→ See all query methods in API Reference
🌐 Virtual Filesystem — Intelligent File Management
Build file explorers and IDEs that never crash:
const vfs = brain.vfs()
// Tree-aware operations prevent infinite recursion
const tree = await vfs.getTreeStructure('/projects', { maxDepth: 3 })
// Semantic file search
const reactFiles = await vfs.search('React components with hooks')📖 VFS Quick Start → | Common Patterns → | Neural Extraction →
🚀 Import Anything — CSV, Excel, PDF, URLs
await brain.import('customers.csv') // Auto-detects everything
await brain.import('sales-data.xlsx', { excelSheets: ['Q1', 'Q2'] })
await brain.import('research-paper.pdf', { pdfExtractTables: true })
await brain.import('https://api.example.com/data.json')🧠 Neural API — Advanced Semantic Analysis
// Clustering, similarity, outlier detection, visualization
const clusters = await brain.neural.clusters({ algorithm: 'kmeans' })
const similarity = await brain.neural.similar('item1', 'item2')
const outliers = await brain.neural.outliers(0.3)
const vizData = await brain.neural.visualize({ maxNodes: 100 })Framework Integration
Works with any modern framework. React, Vue, Angular, Svelte, Solid.js — your choice.
// React
const [brain] = useState(() => new Brainy())
useEffect(() => { brain.init() }, [])
// Vue
async mounted() { this.brain = await new Brainy().init() }
// Angular
@Injectable() export class BrainyService { brain = new Brainy() }Supports: All bundlers (Webpack, Vite, Rollup) • SSR/SSG • Edge runtimes • Browser/Node.js
📖 Framework Integration Guide → | Next.js → | Vue →
Storage — From Memory to Planet-Scale
Development → Just Works
const brain = new Brainy() // Memory storage, zero configProduction → Persistence with Compression
const brain = new Brainy({
storage: { type: 'filesystem', path: './data', compression: true }
})
// 60-80% space savings with gzipCloud → AWS, GCS, Azure, Cloudflare R2
// AWS S3 / Cloudflare R2
const brain = new Brainy({
storage: {
type: 's3',
s3Storage: {
bucketName: 'my-knowledge-base',
region: 'us-east-1'
}
}
})
// Enable Intelligent-Tiering: 96% cost savings
await brain.storage.enableIntelligentTiering('entities/', 'auto-tier')Cost optimization at scale:
| Scale | Standard | With Intelligent Tiering | Annual Savings | |-------|----------|--------------------------|----------------| | 5TB | $1,380 | $59 | $1,321 (96%) | | 50TB | $13,800 | $594 | $13,206 (96%) | | 500TB | $138,000 | $5,940 | $132,060 (96%) |
📖 Cloud Storage Guide → | AWS Cost Optimization → | GCS → | Azure →
Production Features
🎯 Type-Aware HNSW Indexing
Efficient type-based organization for large-scale deployments:
- Type-based queries: Faster via directory structure (measured at 1K-1M scale)
- Type count tracking: 284 bytes (Uint32Array, measured)
- Billion-scale projections: NOT tested at 1B entities (extrapolated from 1M)
const brain = new Brainy({ hnsw: { typeAware: true } })📖 How Type-Aware Indexing Works →
⚡ Enterprise-Ready Operations (v4.0.0)
- Batch operations with retry logic (1000x faster deletes)
- Gzip compression (60-80% space savings)
- OPFS quota monitoring (browser storage)
- Metadata/Vector separation (billion-entity scalability)
- Circuit breakers & backpressure (enterprise reliability)
// Batch operations
await brain.storage.batchDelete(keys, { maxRetries: 3 })
// Monitor storage
const status = await brain.storage.getStorageStatus()📊 Adaptive Memory Management
Auto-scales 2GB → 128GB+ based on environment:
- Container-aware (Docker/K8s cgroups)
- Environment-optimized (dev/staging/production)
- Built-in cache monitoring with tuning recommendations
const stats = brain.getCacheStats() // Performance insightsBenchmarks
| Operation | Performance | Memory | |-----------|-------------|--------| | Initialize | 450ms | 24MB | | Add entity | 12ms | +0.1MB | | Vector search (1K) | 3ms | - | | Metadata filter (10K) | 0.8ms | - | | Bulk import (1K) | 2.3s | +8MB | | 10M entities | 5.8ms | 12GB | | 1B entities | 18ms | 50GB |
🧠 Deep Dive: How Brainy Actually Works
Want to understand the magic under the hood?
🔍 Triple Intelligence & find() API
Understand how vector search, graph relationships, and document filtering work together in one unified query:
📖 Triple Intelligence Architecture → 📖 Natural Language Guide → 📖 API Reference: find() →
🗂️ Type-Aware Indexing & HNSW
Learn about our indexing architecture with measured performance optimizations:
📖 Data Storage Architecture → 📖 Architecture Overview →
📈 Scaling: Individual → Planet
Understand how the same code scales from prototype to billions of entities:
📖 Capacity Planning → 📖 Cloud Deployment Guide →
🎯 The Universal Type System
Explore the mathematical foundation: 42 nouns × 127 verbs = Stage 3 CANONICAL taxonomy:
CLI Tools
npm install -g brainy
brainy add "JavaScript is awesome" --metadata '{"type":"opinion"}'
brainy find "awesome programming languages"
brainy search "programming"47 commands available, including storage management, imports, and neural operations.
📖 Complete Documentation
For most developers: Start with the Complete API Reference ⭐
This comprehensive guide includes:
- ✅ Every method with parameters, returns, and examples
- ✅ Quick start in 60 seconds
- ✅ Core CRUD → Advanced features (branching, versioning, time-travel)
- ✅ TypeScript types and patterns
- ✅ 1,870 lines of copy-paste ready code
🎯 Essential Reading (Start Here)
📖 Complete API Reference ⭐ START HERE ⭐
- Your primary resource for building with Brainy
- Every method documented with working examples
- Complete reference for operators, compound filters, and optimization tips
- Master the
find()method and Triple Intelligence queries
- Master the
- Upgrading from v3 (100% backward compatible)
🧠 Core Concepts & Architecture
- Triple Intelligence Architecture — How vector + graph + document work together
- Noun-Verb Taxonomy — The universal type system (42 nouns × 127 verbs)
- Transactions — Atomic operations with automatic rollback
- Architecture Overview — System design and components
- Data Storage Architecture — Type-aware indexing and HNSW
☁️ Production & Operations
- Cloud Deployment Guide — Deploy to AWS, GCS, Azure
- Capacity Planning — Memory, storage, and scaling to billions
- Cost Optimization: AWS S3 | GCS | Azure | Cloudflare R2
🌐 Framework Integration
- Framework Integration Guide — React, Vue, Angular, Svelte
- Next.js Integration
- Vue.js Integration
🌳 Virtual Filesystem (VFS)
- VFS Quick Start — Build file explorers that never crash
- VFS Core Documentation
- Semantic VFS Guide — AI-powered file navigation
- Neural Extraction API
📦 Data Import & Processing
- Import Anything Guide — CSV, Excel, PDF, URLs with auto-detection
What's New in v4.0.0
Enterprise-scale cost optimization and performance improvements:
- 🎯 96% cloud storage cost savings with intelligent tiering (AWS, GCS, Azure)
- ⚡ 1000x faster batch deletions (533 entities/sec vs 0.5/sec)
- 📦 60-80% compression with gzip (FileSystem storage)
- 🔄 Enhanced metadata/vector separation for billion-scale deployments
📖 Full v4.0.0 Changelog → | Migration Guide → (100% backward compatible)
Requirements
Node.js 22 LTS (recommended) or Node.js 20 LTS
nvm use # We provide .nvmrcWhy Brainy Exists
The Vision: Traditional systems force you to choose between vector databases, graph databases, and document stores. You need all three, but combining them is complex and fragile.
Brainy solved the impossible: One API. All three paradigms. Any scale.
Like HTTP standardized web communication, Brainy standardizes knowledge representation. One protocol that any AI model understands. One system that scales from prototype to planet.
📖 Read the Mathematical Proof →
Enterprise & Support
🏢 Brain Cloud — Managed Brainy with team sync, persistent memory, and enterprise connectors. Visit soulcraft.com for more information.
💖 Support Development:
- ⭐ Star us on GitHub
- 💝 Sponsor via GitHub Sponsors
- 🐛 Report issues and contribute code
- 📣 Share with your team and community
Brainy is 100% free and open source. No paywalls, no premium tiers, no feature gates.
Contributing
We welcome contributions! See CONTRIBUTING.md for guidelines.
License
MIT © Brainy Contributors
