@memberjunction/ai-vectordb

v5.2.0

Published

4 days ago

MemberJunction: AI Vector Database Module

0High
0Medium
0Low

@memberjunction/ai-vectordb

A provider-agnostic abstraction layer for vector databases in MemberJunction. This package defines the abstract base class, type system, and query interfaces that concrete vector database implementations (such as Pinecone) must fulfill.

Architecture

graph TD
    subgraph Abstraction["@memberjunction/ai-vectordb"]
        VDBB["VectorDBBase (abstract)"]
        VR["VectorRecord"]
        QO["QueryOptions"]
        QR["QueryResponse / ScoredRecord"]
        IDX["IndexDescription / IndexList"]
        BR["BaseResponse"]
    end

    subgraph Implementations["Provider Implementations"]
        PC["PineconeDatabase"]
        CUSTOM["Custom Provider"]
    end

    subgraph Consumers["Consuming Packages"]
        SYNC["ai-vector-sync"]
        DUPE["ai-vector-dupe"]
        CORE["ai-vectors (VectorBase)"]
    end

    PC -->|extends| VDBB
    CUSTOM -->|extends| VDBB
    SYNC --> VDBB
    DUPE --> VDBB
    CORE --> VDBB

    style Abstraction fill:#2d6a9f,stroke:#1a4971,color:#fff
    style Implementations fill:#2d8659,stroke:#1a5c3a,color:#fff
    style Consumers fill:#7c5295,stroke:#563a6b,color:#fff

Installation

npm install @memberjunction/ai-vectordb

Overview

This package provides:

VectorDBBase -- an abstract class that defines a complete API for vector database operations (index management + record CRUD + similarity queries)
Record types -- VectorRecord, RecordValues, RecordSparseValues, RecordMetadata for representing vectors and their metadata
Query types -- QueryOptions, QueryResponse, ScoredRecord for similarity search
Index types -- IndexDescription, IndexList, IndexModelMetricEnum for index configuration
Response types -- BaseResponse for standardized success/failure responses

Concrete implementations like @memberjunction/ai-vectors-pinecone extend VectorDBBase to connect to specific vector database services.

Core Components

VectorDBBase (Abstract Class)

All vector database providers must extend this class. The constructor requires an API key, which is validated and stored for subclass access via a protected getter.

classDiagram
    class VectorDBBase {
        <<abstract>>
        #apiKey : string
        +constructor(apiKey: string)
        +listIndexes()* IndexList
        +getIndex(params)* BaseResponse
        +createIndex(params)* BaseResponse
        +deleteIndex(params)* BaseResponse
        +editIndex(params)* BaseResponse
        +queryIndex(params)* BaseResponse
        +createRecord(record)* BaseResponse
        +createRecords(records)* BaseResponse
        +getRecord(params)* BaseResponse
        +getRecords(params)* BaseResponse
        +updateRecord(record)* BaseResponse
        +updateRecords(records)* BaseResponse
        +deleteRecord(record)* BaseResponse
        +deleteRecords(records)* BaseResponse
    }

    class PineconeDatabase {
        +listIndexes() IndexList
        +queryIndex(params) BaseResponse
    }

    VectorDBBase <|-- PineconeDatabase

    style VectorDBBase fill:#2d6a9f,stroke:#1a4971,color:#fff
    style PineconeDatabase fill:#2d8659,stroke:#1a5c3a,color:#fff

The abstract methods support both synchronous and asynchronous return types via union types (BaseResponse | Promise<BaseResponse>), allowing implementations to choose the appropriate pattern.

Type System

Vector Records

// Core vector record with generic metadata support
type VectorRecord<T extends RecordMetadata = RecordMetadata> = {
    id: string;                          // Unique record identifier
    values: RecordValues;                // Dense vector (array of numbers)
    sparseValues?: RecordSparseValues;   // Optional sparse representation for hybrid search
    metadata?: T;                        // Arbitrary filterable metadata
};

type RecordValues = Array<number>;

type RecordSparseValues = {
    indices: Array<number>;   // Non-zero positions
    values: Array<number>;    // Corresponding values
};

type RecordMetadataValue = string | boolean | number | Array<string>;
type RecordMetadata = Record<string, RecordMetadataValue>;

Index Configuration

type IndexDescription = {
    name: string;                       // Index name (max 45 chars)
    dimension: number;                  // Vector dimensionality
    metric: IndexModelMetricEnum;       // 'cosine' | 'euclidean' | 'dotproduct'
    host: string;                       // Hosting URL
};

Query Types

graph LR
    QPB["QueryParamsBase<br/>topK, includeValues,<br/>includeMetadata, filter"]
    QBV["QueryByVectorValues<br/>+ vector"]
    QBID["QueryByRecordId<br/>+ id"]
    QO["QueryOptions"]

    QPB --> QBV
    QPB --> QBID
    QBV --> QO
    QBID --> QO

    style QPB fill:#2d6a9f,stroke:#1a4971,color:#fff
    style QBV fill:#2d8659,stroke:#1a5c3a,color:#fff
    style QBID fill:#2d8659,stroke:#1a5c3a,color:#fff
    style QO fill:#b8762f,stroke:#8a5722,color:#fff

// Base query parameters shared by all query types
type QueryParamsBase = {
    topK: number;               // Number of results to return
    includeValues?: boolean;    // Include vector values in results
    includeMetadata?: boolean;  // Include metadata in results
    filter?: object;            // Metadata filter
};

// Query by providing a vector directly
type QueryByVectorValues = QueryParamsBase & { vector: RecordValues };

// Query using an existing record's vector
type QueryByRecordId = QueryParamsBase & { id: string };

// Union type for all query configurations
type QueryOptions = QueryByRecordId | QueryByVectorValues;

Query Response

type QueryResponse<T extends RecordMetadata = RecordMetadata> = {
    matches: Array<ScoredRecord<T>>;  // Sorted by similarity
    namespace: string;                 // Execution namespace
    usage?: OperationUsage;            // Read unit consumption
};

interface ScoredRecord<T> extends VectorRecord<T> {
    score?: number;  // Similarity score (interpretation depends on metric)
}

Standardized Response

All operations return a BaseResponse:

type BaseResponse = {
    success: boolean;
    message: string;
    data: unknown;
};

Usage

Implementing a Provider

import {
    VectorDBBase,
    VectorRecord,
    BaseResponse,
    CreateIndexParams,
    QueryOptions,
    IndexList
} from '@memberjunction/ai-vectordb';

export class MyVectorDB extends VectorDBBase {
    constructor(apiKey: string) {
        super(apiKey); // Validates and stores the API key
    }

    async listIndexes(): Promise<IndexList> {
        // Call your vector DB API using this.apiKey
        return { indexes: [] };
    }

    async createIndex(params: CreateIndexParams): Promise<BaseResponse> {
        return { success: true, message: 'Created', data: { id: params.id } };
    }

    async queryIndex(params: QueryOptions): Promise<BaseResponse> {
        // Perform similarity search
        return { success: true, message: 'OK', data: { matches: [] } };
    }

    // ... implement remaining abstract methods
}

Consuming a Provider

import { VectorDBBase, VectorRecord } from '@memberjunction/ai-vectordb';

async function searchSimilar(vectorDB: VectorDBBase, embedding: number[]): Promise<void> {
    // Insert a record
    const record: VectorRecord = {
        id: 'doc-001',
        values: embedding,
        metadata: { entity: 'Products', recordId: '12345' }
    };
    await vectorDB.createRecord(record);

    // Query for similar vectors
    const result = await vectorDB.queryIndex({
        vector: embedding,
        topK: 10,
        includeMetadata: true
    });

    if (result.success) {
        for (const match of result.data.matches) {
            console.log(`Match: ${match.id} (score: ${match.score})`);
        }
    }
}

Distance Metrics

The IndexModelMetricEnum supports three metrics for similarity comparison:

| Metric | Description | Use Case | |---|---|---| | cosine | Measures angle between vectors (direction similarity) | Text embeddings, semantic search | | euclidean | Straight-line distance between points | Numeric features, specifications | | dotproduct | Measures both direction and magnitude alignment | Recommendation systems, weighted scoring |

Available Implementations

| Package | Vector Database | |---|---| | @memberjunction/ai-vectors-pinecone | Pinecone |

Create additional implementations by extending VectorDBBase and registering with MemberJunction's class factory.

Dependencies

| Package | Purpose | |---|---| | @memberjunction/core | Core MemberJunction functionality | | @memberjunction/global | Global utilities |

Development

# Build
npm run build

# Development mode
npm run start

License

ISC

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

@memberjunction/ai-vectordb

Architecture

Installation

Overview

Core Components

VectorDBBase (Abstract Class)

Type System

Vector Records

Index Configuration

Query Types

Query Response

Standardized Response

Usage

Implementing a Provider

Consuming a Provider

Distance Metrics

Available Implementations

Dependencies

Development

License