facecap
v1.0.1
Face autocapture module with quality check, face angle estimation via solvePnP, and guideline UI overlay
Plug-and-play face autocapture library powered by Google's BlazeFace TFLite model (via
@litertjs/core) with solvePnP-based face-angle estimation, per-frame quality gating, and a built-in canvas overlay UI.
✨ Features
- 🎯 Face Detection — BlazeFace short-range TFLite model running in-browser via WASM
- 📐 Pose Estimation — solvePnP (opencv.js) converts 6 facial keypoints to yaw / pitch / roll angles
- ✅ Quality Gating — Rejects frames with multiple faces, low confidence, wrong size, or off-angle pose
- 📸 Auto-Capture — Captures a configurable number of face images after N consecutive quality-passing frames
- 🎨 Overlay UI — Draws a guide oval (or face silhouette), bounding boxes, landmarks, and a status banner
- 🔁 Full Lifecycle — init → start → stop → destroy with safe re-initialization support
- 📦 ESM + CJS — Ships both module formats with TypeScript declarations
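The quality gate described above can be sketched as a pure function over the default thresholds listed in the Configuration section below. This is an illustration of the checks, not the library's internal implementation:

```typescript
// Illustrative sketch of the per-frame quality gate (defaults from the
// Configuration section); a frame is captured only if every check passes.
interface FaceAngle { yaw: number; pitch: number; roll: number; }

interface QualityInput {
  faceCount: number;      // detections in the frame
  score: number;          // detection confidence, 0–1
  faceAreaRatio: number;  // face area / frame area
  angle: FaceAngle;       // degrees
}

function passesQualityGate(f: QualityInput): boolean {
  return (
    f.faceCount === 1 &&                 // reject multiple faces
    f.score >= 0.75 &&                   // minConfidence default
    f.faceAreaRatio >= 0.24 &&           // minFaceAreaRatio default
    f.faceAreaRatio <= 0.30 &&           // maxFaceAreaRatio default
    Math.abs(f.angle.yaw) <= 15 &&       // maxYawAngle default
    Math.abs(f.angle.pitch) <= 15 &&     // maxPitchAngle default
    Math.abs(f.angle.roll) <= 15         // maxRollAngle default
  );
}
```

In the real library, `stableFramesRequired` consecutive passing frames are needed before a capture fires.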
📦 Installation
npm install facecap
Peer runtime: opencv.js is loaded from CDN at runtime by default (configurable). No extra npm install needed.
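If you prefer not to hit a CDN in production, the runtime assets can be self-hosted via the configuration options shown below. The option names come from the Configuration section; the file paths here are hypothetical placeholders for your own hosting layout:

```typescript
import { FaceAutocapture } from 'facecap';

// Sketch: point the runtime dependencies at self-hosted copies
// instead of the default CDN. Paths are illustrative only.
const capture = new FaceAutocapture({
  container: '#camera',
  opencvJsUrl: '/vendor/opencv.js',       // self-hosted opencv.js
  wasmBasePath: '/vendor/litert-wasm/',   // @litertjs/core WASM files
  modelPath: '/models/blazeface.tflite',  // hypothetical model filename
});
```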
🚀 Quick Start
<!-- 1. Add a container element -->
<div id="camera" style="width: 320px;"></div>

import { FaceAutocapture, FaceStatus } from 'facecap';
const capture = new FaceAutocapture({
container: '#camera',
captureCount: 1,
});
capture.onCapture = (result) => {
console.log(`Captured image ${result.index}`, result.blob);
const url = URL.createObjectURL(result.blob);
document.querySelector<HTMLImageElement>('#preview')!.src = url;
};
capture.onStatusChange = (status, message) => {
console.log(`[${status}] ${message}`);
};
await capture.init(); // loads WASM + model + opencv.js
await capture.start(); // opens webcam + begins detection loop
⚙️ Configuration
All options are optional — sensible defaults are applied automatically.
new FaceAutocapture({
// Required
container: '#camera', // CSS selector or HTMLElement
// Model / runtime
modelPath?: string, // URL to .tflite model (default: bundled)
wasmBasePath?: string, // @litertjs/core WASM base URL
opencvJsUrl?: string, // opencv.js CDN URL
// Capture behaviour
captureCount?: number, // Number of images to capture (default: 1)
captureIntervalSec?: number, // Minimum seconds between captures (default: 0.5)
stableFramesRequired?: number, // Consecutive quality-OK frames before capture (default: 3)
captureMimeType?: string, // 'image/jpeg' | 'image/png' (default: 'image/jpeg')
captureQuality?: number, // 0–1 quality for lossy formats (default: 0.99)
// Quality thresholds
minConfidence?: number, // Minimum detection score 0–1 (default: 0.75)
maxYawAngle?: number, // Max left/right head turn ° (default: 15)
maxPitchAngle?: number, // Max up/down head tilt ° (default: 15)
maxRollAngle?: number, // Max clockwise head roll ° (default: 15)
minFaceAreaRatio?: number, // Min face area / frame area (default: 0.24)
maxFaceAreaRatio?: number, // Max face area / frame area (default: 0.30)
// Output
outputWidth?: number, // Capture canvas width px (default: 480)
outputHeight?: number, // Capture canvas height px (default: 640)
facingMode?: 'user' | 'environment',// Camera facing (default: 'user')
// Overlay UI
showFaceGuide?: 'oval' | 'face', // Guide shape style (default: 'oval')
showBoundingBox?: boolean, // Draw detection bbox (default: true)
showLandmarks?: boolean, // Draw 6 keypoint dots (default: false)
});
📡 Events / Callbacks
onCapture
Fired each time a quality frame is captured.
capture.onCapture = (result: CaptureResult) => {
result.blob // Blob — the captured face image
result.index // number — 1-based capture index
result.detection // Detection — raw bbox & keypoints
result.angle // FaceAngle — { yaw, pitch, roll } in degrees
result.quality // QualityResult — individual check flags
result.timestamp // number — ms since epoch
};
onStatusChange
Fired when the face status changes (debounced to avoid flicker).
capture.onStatusChange = (status: FaceStatus, message: string) => {
// status is one of:
// FaceStatus.NO_FACE | MULTIPLE_FACES | FACE_TOO_SMALL | FACE_TOO_LARGE
// FACE_NOT_STRAIGHT | LOW_CONFIDENCE | HOLD_STILL | CAPTURING
// READY | COMPLETE
};
onDetection
Fired every frame with raw detection data (useful for custom analytics).
capture.onDetection = (detections: Detection[], angle: FaceAngle | null) => {
// Called at the camera frame rate
};
🔄 Lifecycle
new FaceAutocapture(config)
│
▼
capture.init() ← loads WASM + TFLite model + opencv.js
│
▼
capture.start() ← opens webcam, begins requestAnimationFrame loop
│
▼
[detection loop] ← detects → quality check → overlay → capture
│
▼
capture.stop() ← pauses loop, releases webcam
│
▼
capture.destroy() ← removes DOM elements, releases model

You can safely call init() + start() again after destroy().
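The diagram above can be read as a small state machine. Here is an illustrative model of the valid transitions (our sketch, not the library's internals; in particular, restarting after stop() without re-init is our assumption):

```typescript
// Illustrative lifecycle state machine for the diagram above.
type LifecycleState = 'created' | 'initialized' | 'running' | 'stopped' | 'destroyed';

const transitions: Record<string, LifecycleState> = {
  'created:init': 'initialized',
  'initialized:start': 'running',
  'running:stop': 'stopped',
  'stopped:start': 'running',       // assumed: restart without re-init
  'running:destroy': 'destroyed',
  'stopped:destroy': 'destroyed',
  'destroyed:init': 'initialized',  // documented: safe re-init after destroy()
};

function next(state: LifecycleState, action: string): LifecycleState {
  const to = transitions[`${state}:${action}`];
  if (!to) throw new Error(`invalid transition: ${action} from ${state}`);
  return to;
}
```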
🗂️ Exported Types
import type {
AutocaptureConfig, // Constructor config object
Detection, // { box, keypoints, score }
BoundingBox, // { x, y, width, height }
Point2D, // { x, y }
FaceAngle, // { yaw, pitch, roll }
QualityResult, // { passed, checks, message }
CaptureResult, // { blob, index, detection, angle, quality, timestamp }
OnCaptureCallback,
OnStatusChangeCallback,
OnDetectionCallback,
} from 'facecap';
import { FaceStatus, KeypointIndex, STATUS_MESSAGES } from 'facecap';
🏗️ Architecture
The library is built with a 5-layer architecture for maintainability:
Application ──► Core (Orchestrator)
├── Domain Layer (business logic: capture, status, smoothing)
├── Infrastructure (I/O: DOM, webcam, ML pipeline)
├── Processors (pure math: preprocess, decode, NMS, pose)
└── UI Layer (canvas overlay rendering)
🧑‍💻 Full Example
<!DOCTYPE html>
<html>
<body>
<div id="camera" style="width: 320px; margin: auto;"></div>
<img id="preview" style="display:none; margin: auto; width: 320px;" />
<script type="module">
import { FaceAutocapture, FaceStatus } from 'facecap';
const cam = new FaceAutocapture({
container: '#camera',
captureCount: 3,
stableFramesRequired: 5,
showFaceGuide: 'oval',
showLandmarks: true,
});
cam.onCapture = ({ blob, index }) => {
console.log(`📸 Captured ${index}`);
if (index === 1) {
document.getElementById('preview').src = URL.createObjectURL(blob);
document.getElementById('preview').style.display = 'block';
}
};
cam.onStatusChange = (status, msg) => {
if (status === FaceStatus.COMPLETE) console.log('✅ All done!');
};
await cam.init();
await cam.start();
</script>
</body>
</html>
🔒 Browser Permissions
The library accesses the camera via navigator.mediaDevices.getUserMedia. The browser will prompt the user for permission. This requires:
- A secure context (https:// or localhost)
- A supported browser (Chrome, Firefox, Safari, Edge)
📄 License
MIT © Kuson
