harper-fabric-embeddings

v0.2.3

Published

18 days ago

Minimal llama.cpp embedding wrapper for Harper Fabric. Talks directly to the native addon — no build tools, no CLI, no chat wrappers.

Downloads

3,947

0High
0Medium
0Low

heskew

llama embeddings harper fabric vector

harper-fabric-embeddings

Minimal llama.cpp embedding wrapper for Harper Fabric. Talks directly to the @node-llama-cpp native N-API addon — no build tools, no CLI, no chat wrappers, no model downloaders beyond a simple HuggingFace fetch.

~19 MB installed (native binary only) vs ~250 MB+ for node-llama-cpp.

Installation

npm install harper-fabric-embeddings

The package uses @node-llama-cpp platform-specific binaries. The linux-x64 binary is included as an optional dependency. For other platforms, install the appropriate package:

npm install @node-llama-cpp/mac-arm64-metal  # macOS Apple Silicon
npm install @node-llama-cpp/mac-x64          # macOS Intel
npm install @node-llama-cpp/linux-arm64      # Linux ARM64

Usage

import { init, embed, dimensions, dispose } from 'harper-fabric-embeddings';

// Initialize with a models directory (finds or downloads the model)
await init({ modelsDir: '/path/to/models' });

// Generate an embedding (L2-normalized)
const vector = await embed('Hello world');

// Get vector dimensionality
const dims = dimensions();

// Clean up native resources
await dispose();

API

`init(options)`

Initialize the embedding engine. Call once before using embed().

| Option | Type | Default | Description | | ------------- | ------ | -------------------- | ---------------------------------------- | | modelPath | string | — | Absolute path to a .gguf model file | | modelsDir | string | — | Directory to search/download model files | | modelName | string | "nomic-embed-text" | Model name from the built-in registry | | contextSize | number | 2048 | Token context window size | | batchSize | number | 512 | Batch processing size | | threads | number | 6 | CPU threads for inference | | gpuLayers | number | 0 | Layers to offload to GPU (0 = CPU only) | | addonPath | string | — | Override path to llama-addon.node |

Either modelPath or modelsDir is required.

`embed(text)`

Generate an L2-normalized embedding vector for the given text. Returns number[].

`dimensions()`

Returns the embedding vector dimensionality.

`dispose()`

Clean up native resources (model, context, binding).

`downloadModel(dir, modelName?)`

Download a model from HuggingFace. Called automatically by init() when using modelsDir and no local model is found.

Models

Two models are built in:

| Name | Source | Quantization | | ------------------------- | -------------------------------- | ------------ | | nomic-embed-text | nomic-ai/nomic-embed-text-v1.5 | Q4_K_M | | nomic-embed-text-v2-moe | nomic-ai/nomic-embed-text-v2-moe | Q4_K_M |

Models are resolved in order: HuggingFace-prefixed filename, bare filename, stem match scan, then download from HuggingFace.

Testing

# Unit tests (no model file needed)
npm test

# Integration tests (requires a model file)
MODEL_PATH=/path/to/model.gguf npm test

Requirements

Node.js 22+
A @node-llama-cpp platform package for your architecture

License

MIT

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

harper-fabric-embeddings

Installation

Usage

API

init(options)

embed(text)

dimensions()

dispose()

downloadModel(dir, modelName?)

Models

Testing

Requirements

License

`init(options)`

`embed(text)`

`dimensions()`

`dispose()`

`downloadModel(dir, modelName?)`