pixmap-engine

v1.0.1

Published

9 days ago

On-device image similarity search

0High
0Medium
0Low

pixmap

Local image similarity search using CLIP embeddings, HNSW indexing, and SQLite. Everything runs on your machine — no cloud, no API keys, your images stay private.

Install

npm install pixmap-engine

Requires Node.js 18+. Native modules (sharp, better-sqlite3, hnswlib-node) compile during install.

Usage

Command Line

pixmap add ./photo.jpg            # index a single image
pixmap add ./photos/              # index a directory (recursive)
pixmap search ./query.jpg         # find similar images
pixmap search ./query.jpg -k 10   # return more results
pixmap list                       # show indexed images
pixmap status                     # index stats

Terminal View

As a library

import { PixmapEngine } from "pixmap-engine";

const engine = new PixmapEngine({ dataDir: "./data" });
await engine.init();

await engine.add("./photos/dog.jpg");
await engine.add("./photos/cat.png");

const results = await engine.search("./query.jpg", 5);

for (const r of results) {
  console.log(`${r.path} — ${(r.score * 100).toFixed(1)}%`);
}

Options:

-d, --data-dir <path> — where to store index and db (default: ./data)
-k, --top-k <n> — number of results (default: 5)

API

`new PixmapEngine(options)`

{
  dataDir: string;          // where to store index.hnsw and metadata.db
  indexFileName?: string;   // default: "index.hnsw"
  dbFileName?: string;      // default: "metadata.db"
}

Methods

**init()** — loads the CLIP model, opens or creates the HNSW index and SQLite db. Must be called before anything else.
**add(imagePath)** — indexes an image. Returns { id, skipped, record }. If the image was already indexed, skipped is true.
**search(imagePath, topK?)** — embeds the query image and returns the top-K most similar indexed images. Default K is 5.
**listImages(limit?)** — returns indexed images, newest first.
**getImage(id)** — look up a single image by its id.
**getIndexedCount()** — total number of indexed images.

How it works

Each image is resized to 224x224 with Sharp, run through a CLIP vision model (ViT-B/32, via ONNX), producing a 512-dimensional embedding vector. That vector goes into an HNSW index for fast nearest-neighbor lookup. File paths, IDs, and timestamps are tracked in a SQLite database.

Searching works the same way: embed the query image, ask HNSW for the closest vectors, look up the metadata.

See docs/HOW.md for the full technical breakdown.

What gets stored

Everything lives in your dataDir:

index.hnsw — the vector index (binary, hnswlib format)
metadata.db — SQLite database with image paths and metadata

About ~2KB per indexed image. A 100K image index is roughly 220MB.

Supported formats

JPEG, PNG, WebP, BMP, GIF, AVIF, TIFF — anything Sharp can read.

Dependencies

@xenova/transformers — runs the CLIP model locally via ONNX
sharp — image resize and preprocessing
hnswlib-node — approximate nearest neighbor index
better-sqlite3 — metadata storage

License

MIT