@rajeevdesai/face-verification-api

v0.2.1

Published

13 days ago

Browser face verification library — custom pipeline (no wrapper libs)

0High
0Medium
0Low

face-recognition face-verification face-detection liveness anti-spoofing biometrics arcface mobilefacenet onnx mediapipe browser wasm

face-verification-api

Browser-only 1:1 face verification library. Custom pipeline — no wrapper libraries, fully auditable, open-sourceable.

Answers one question: "Is the face in image B the same person as the face in image A?" — entirely client-side, with no biometric data leaving the browser.

▶ Try the live demo — webcam capture, runs entirely in your browser; nothing is uploaded.

MediaPipe FaceLandmarker → Umeyama alignment → MobileFaceNet ONNX → cosine distance
                                                      (+ optional MiniFASNetV2 liveness)

⚠️ Not a sole authentication factor. The bundled liveness (a two-model anti-spoof ensemble) reliably rejects screen/video replay; print is the hard case and can still occasionally pass. Pair with another factor for anything security-critical. See Open Risks.

What this solves

1:1 verification — confirm two images are (or aren't) the same identity. Typical uses: matching a selfie to an ID photo, re-verifying a returning user, enrollment-frame comparison.
Privacy by default — detection, alignment, embedding, and matching all run in the browser via WASM. No image or biometric template is uploaded.
Auditable — the entire pipeline (alignment math, preprocessing, distance metric) lives in readable TypeScript. No opaque third-party recognition SDK.
Actionable fraud signals — every result carries structured flags (missing face, multiple faces, low confidence, liveness fail) so your app can branch on them.

What it does not do

No 1:N identification / search. It compares two faces; it does not search a gallery or database of identities.
No bulletproof liveness. Screen/video replay is reliably rejected; print is the hard case — even with the default two-model ensemble, a printed photo can occasionally pass (see Open Risks). Treat liveness as one signal, not a guarantee.
No calibrated probability. confidence is a margin below the threshold, not a true match probability.
No perfect accuracy. Recognition is bounded by the embedding model — facex_nano scores ~95.62% on LFW, and harder in-the-wild captures (pose, lighting, occlusion) do worse. Calibrate and expect some error.
No server / Node runtime. Requires browser APIs (createImageBitmap, OffscreenCanvas, WASM). It will not run under Node or SSR.
No bundled weights. Models are downloaded separately (license + size reasons).

How it works

Decode (image.ts) — normalize each input (HTMLImageElement | ImageData | string) to ImageData.
Detect (detect.ts) — MediaPipe FaceLandmarker returns 468/478 landmarks per face; faces are sorted largest-first.
Align (align.ts) — five ArcFace keypoints are extracted, then an Umeyama similarity transform warps the face to the canonical 112×112 ArcFace pose.
Embed & compare (embed.ts) — MobileFaceNet produces an L2-normalized embedding per face; identity distance is 1 − cosine similarity.
Liveness (liveness.ts, optional) — MiniFASNetV2 scores the matched face for spoofing.
Decide (compare.ts) — a match requires identity (distance ≤ threshold) and, when liveness is enabled, a passing liveness score; flags are emitted along the way.

Install

npm install @rajeevdesai/face-verification-api

# peer deps (often already present in your app):
npm install onnxruntime-web @mediapipe/tasks-vision

# download the 4 model weights straight into your served static dir:
npx @rajeevdesai/face-verification-api download public/models

public/models/ is served at /models/... in Vite / CRA / Next.js — point loadModels there. Pass any directory you serve (or a CDN; loadModels also accepts absolute URLs).

👉 Full step-by-step setup, model hosting, WASM hosting, and framework-specific notes: INSTALL.md.

Quick start

import { loadModels, compareFaces } from '@rajeevdesai/face-verification-api';

// Once at app startup — loads & caches the models (liveness optional).
await loadModels({
  faceLandmarkerPath: '/models/face_landmarker.task',
  recognitionModelPath: '/models/mobilefacenet.onnx',
  // Liveness ensemble (omit to disable); array → scores averaged.
  livenessModelPath: ['/models/minifasnet_v2.onnx', '/models/minifasnet_v1se.onnx'],
  liveness: [{ cropScale: 2.7 }, { cropScale: 4.0 }],
  // wasmBasePath: '/wasm/'  ← only if ort .wasm files aren't on the default CDN
});

// Per comparison:
const result = await compareFaces(baselineImage, currentImage);
console.log(result.match, result.confidence, result.flags);

API reference

`loadModels(config)`

Loads and caches the models as singletons (FaceLandmarker + recognition, plus liveness if a path is given). Safe to call multiple times — subsequent calls are no-ops once loaded. Must run before compareFaces.

| Field | Required | Default | Description | |-------|----------|---------|-------------| | faceLandmarkerPath | no | models/face_landmarker.task | MediaPipe FaceLandmarker .task | | recognitionModelPath | no | models/mobilefacenet.onnx | Embedding model ONNX | | livenessModelPath | no | — | Liveness ONNX path, or an array to ensemble (live scores averaged). Omit to disable liveness. | | wasmBasePath | no | jsDelivr CDN | Base URL for MediaPipe + onnxruntime-web .wasm | | warmup | no | true | Run a dummy inference to avoid first-call latency | | recognition | no | facex_nano spec | Preprocessing/metric overrides for a BYO recognition model — see Bring your own model | | liveness | no | MiniFASNetV2 spec | Preprocessing/class overrides for a BYO liveness model |

`compareFaces(baseline, current, options?)`

Each image accepts HTMLImageElement | ImageData | string. Safe string formats:

data: URLs
blob: URLs (e.g. URL.createObjectURL(file))
same-origin URLs
CORS-enabled cross-origin URLs

Cross-origin non-CORS URLs will fail — use data: or blob: for file inputs.

Options

| Option | Default | Description | |--------|---------|-------------| | threshold | 0.5 | Match cutoff in the configured distance metric (default cosine). Lower = stricter. Uncalibrated — see calibration. | | livenessThreshold | 0.5 | Liveness score must meet this to pass. Uncalibrated. | | checkLiveness | auto | Defaults to true when a liveness model is loaded, false otherwise. Set false to skip; passing true with no liveness model throws. |

Result shape

interface CompareResult {
  match: boolean;
  confidence: number;      // 0–1, margin below threshold (NOT a calibrated probability)
  flags: FraudFlag[];
  details: {
    baselineFacesFound: number;
    currentFacesFound: number;
    distance: number;        // configured metric (default cosine); lower = more similar
    threshold: number;
    livenessScore?: number;  // 0–1; present only when the liveness check ran
  };
}

Flags

| Flag | Meaning | |------|---------| | baseline_missing | 0 faces in baseline — match is false | | baseline_ambiguous | >1 face in baseline — largest used, comparison continues | | face_missing | 0 faces in current — match is false | | multiple_faces | >1 face in current — best match used (weakens anti-fraud, documented) | | identity_mismatch | distance > threshold | | low_confidence | matched, but distance sits in the grey zone (80–100% of threshold) | | liveness_fail | liveness score below threshold (spoof suspected) — match forced false |

Bring your own model

Both the recognition and liveness models are swappable, and their preprocessing is configurable — so a model that doesn't match the bundled defaults works without forking. Override only the fields that differ:

await loadModels({
  recognitionModelPath: '/models/my-arcface.onnx',
  recognition: {                 // defaults shown
    inputSize: 112,              // square input side
    layout: 'NCHW',              // 'NCHW' | 'NHWC'
    channelOrder: 'RGB',         // 'RGB' | 'BGR'
    mean: 127.5,                 // scalar or [r,g,b]; value = (pixel - mean) / std
    std: 127.5,
    l2normalize: true,           // L2-normalize embeddings before distance
    metric: 'cosine',            // 'cosine' | 'euclidean'
  },
  livenessModelPath: '/models/my-liveness.onnx',  // omit to disable liveness
  liveness: {
    inputSize: 80,
    layout: 'NCHW',
    channelOrder: 'BGR',
    mean: 0,
    std: 1,                      // raw [0,255]; use std 255 for [0,1]
    cropScale: 2.7,              // face-crop expansion factor
    liveClassIndex: 1,           // index of the "live" class in the output
    applySoftmax: true,          // false if the model already outputs probabilities
  },
});

Preprocessing contract. The aligned crop is mapped value = (pixel - mean) / std per channel (pixels in [0,255]), in channelOrder, laid out per layout. The recognition crop is the face warped to the ArcFace 5-point template scaled to inputSize, so any inputSize is supported. The embedding can be any fixed length (baseline and current use the same model). If distances look wrong for the same person, the usual culprit is channelOrder, layout, or mean/std.

Defaults (used when recognition / liveness are omitted):

| | Recognition (facex_nano) | Liveness (MiniFASNetV2) | |---|---|---| | inputSize | 112 | 80 | | layout | NCHW | NCHW | | channelOrder | RGB | BGR | | mean / std | 127.5 / 127.5 → [-1,1] | 0 / 1 → raw [0,255] | | other | l2normalize true, metric cosine | cropScale 2.7, liveClassIndex 1, applySoftmax true |

Threshold is model-specific. Distances are not comparable across models — recalibrate threshold (and livenessThreshold) on your own data whenever you swap the recognition model. See Threshold & calibration.

No liveness? Omit livenessModelPath; the liveness stage is skipped entirely (checkLiveness defaults to false, livenessScore absent).

Liveness

A liveness check runs on the matched face by default — an ensemble of MiniFASNetV2 (crop 2.7) and MiniFASNetV1SE (crop 4.0), each scored and averaged (minivision's Silent-Face approach). The score is 0–1; below livenessThreshold it sets the liveness_fail flag and forces match: false. Pass a single livenessModelPath string to use one model, or omit it to disable liveness.

await compareFaces(a, b, {
  livenessThreshold: 0.5,  // default; uncalibrated — tune on your own live/spoof set
  checkLiveness: false,    // skip the liveness model entirely
});

Reliable for replay, harder for print. The ensemble reliably rejects screen/video replay; print is the hard case and can still occasionally score as live. Do not treat liveness_fail as a complete spoof defense — see Open Risks.

Threshold & calibration

The default 0.5 (cosine distance) is a documented placeholder near common ArcFace LFW operating points (~0.4–0.5) — not calibrated for your model, capture conditions, or population. Calibrate before production, and re-calibrate whenever you swap the recognition model (distances aren't comparable across models).

Calibration harness

demo/calibrate.html does this locally — images never leave the browser. Serve the demo (npm run demo) and open it, then:

Point it at a folder laid out as identity/image.jpg (one sub-folder per person).
It builds same-person pairs (within an identity) and different-person pairs (across identities), runs the pipeline, and records the distance for each.
It reports the two distributions plus the EER threshold (where false-accept rate ≈ false-reject rate) and a best-accuracy threshold, with FAR/FRR.

Set threshold near the EER value, then bias lower (stricter — fewer false accepts) or higher (looser — fewer false rejects) per your risk tolerance: compareFaces(a, b, { threshold: 0.42 }). Point the harness at a different recognitionModelPath + recognition config to calibrate a BYO model.

⚠️ Use data you have the right to use. A face used for identification is biometric personal data (GDPR Art. 9, Illinois BIPA, and similar) — calibrate on consented, self-collected, or CC0 images, never scraped datasets. Most well-known "face benchmarks" (LFW, VGGFace2, CelebA, …) are research-only or scraped and are not appropriate for tuning a shipped product. A single global threshold also can't equalize error rates across demographics; audit per-group if fairness matters.

Liveness threshold

livenessThreshold (default 0.5) can't be calibrated from stored photos — it needs live captures vs spoof attempts (print/replay). Live faces score high and screen replay near zero, so 0.5 separates those; print can overlap the live range (see Open Risks).

Integration notes

Browser only. Guard against SSR — call loadModels from a client-side effect, never during server render.
Load once. Models are module singletons; calling loadModels repeatedly is cheap but only the first call does work.
No special headers. Inference is single-threaded (numThreads = 1), so you do not need cross-origin isolation (COOP/COEP).
Host the models. Serve models/ (and optionally the ort .wasm blobs) from a path your app can fetch. See INSTALL.md.
File inputs. For <input type="file">, pass URL.createObjectURL(file) (a blob: URL) or a data: URL.

React sketch:

useEffect(() => {
  loadModels({
    faceLandmarkerPath: '/models/face_landmarker.task',
    recognitionModelPath: '/models/mobilefacenet.onnx',
    livenessModelPath: '/models/minifasnet_v2.onnx',
  }).catch(console.error);
}, []);

async function onVerify(idPhoto: string, selfie: File) {
  const result = await compareFaces(idPhoto, URL.createObjectURL(selfie));
  // branch on result.match / result.flags
}

Demo

npm install
npm run demo   # builds, then serves at http://localhost:5299

Open demo/index.html, capture a baseline and a current frame from your webcam, and hit Compare to exercise the full pipeline end-to-end. For threshold calibration, open demo/calibrate.html — see Threshold & calibration.

Tests

npm test   # Node-safe unit tests: Umeyama math + flag logic

Integration tests require browser APIs and locally-provided consented fixtures, so they auto-skip in Node and whenever fixtures are absent. To run them, supply the images named in the integration test header (gitignored) in a browser/polyfilled env. See CONTRIBUTING.md for the testing strategy.

Open Risks

Standing limitations to weigh before production. (Verified during development and no longer tracked as risks: recognition-model extractability, preprocessing layout [1,3,112,112], landmark indices, and the liveness output format/class mapping — see git history.)

Thresholds are uncalibrated. threshold (0.5 cosine distance) and livenessThreshold (0.5) are placeholders. Calibrate on your own consented data — see Threshold & calibration and demo/calibrate.html. Distances aren't comparable across models, so recalibrate whenever you swap the recognition model.
Liveness is not a complete spoof defense. The default ensemble (MiniFASNetV2 @2.7 + MiniFASNetV1SE @4.0) reliably rejects screen/video replay, but print is the hard case — a printed photo can still occasionally pass (a single-model probe false-accepted one at 0.882, overlapping the live range). Passive RGB liveness can't fully eliminate this; pair with another factor for anything security-critical.
Calibration data is biometric. A face used for identification is personal data (GDPR Art. 9, Illinois BIPA, and similar). Use consented, self-collected, or CC0 images — never scraped datasets.
Accuracy is bounded by the model. facex_nano scores ~95.62% on LFW; harder in-the-wild captures (pose, lighting, occlusion) do worse. Expect some error and design for it.

Licensing

Our code: MIT (see LICENSE).
Model weights: Apache-2.0 (see NOTICE for attribution and the MS1M-RefineV2 data caveat).
Models are not bundled — downloaded separately via npx @rajeevdesai/face-verification-api download <dir>. Consumers are responsible for compliance with the applicable model licenses.

Contributing

Contributions welcome — bug fixes, calibration data, and especially help closing the Open Risks. See CONTRIBUTING.md.