mediapipe-nodejs

v1.2.1

Published

6 months ago

A Node.js library for running MediaPipe models that are typically browser-only. This package uses a local Express (web) server and Playwright (headless browser) to bridge the gap between Node.js and MediaPipe's browser-based APIs.

mediapipe-nodejs

Currently supports:

Face Landmarker - Detect facial landmarks and features

More MediaPipe models will be added over time.

Features

Face Landmarker Model - Detect facial landmarks and features with advanced options:
- Custom number of faces to detect
- Crop region for area of interest detection
- Image rotation support (applied after cropping)
- Landmark visualization for debugging
Node.js Support - Enables MediaPipe models to run in Node.js environments
Custom Image Directory - Serve images from specified directory with customizable URL paths
Auto-Installation - Playwright automatically installed and launched, or use existing browser instance
TypeScript Support - Full type definitions included

Installation

npm install mediapipe-nodejs

You can also install mediapipe-nodejs with pnpm, yarn, or slnpm

Usage Example

import { startClient } from 'mediapipe-nodejs'

async function detectFaces() {
  // Start the MediaPipe client
  const mediapipe = await startClient({
    port: 8560,
    headless: true,
  })

  // Serve images from a directory
  mediapipe.attachImageDirection({
    url_prefix: '/images',
    directory: './uploads',
  })

  // Detect face landmarks and face blendshapes
  const result = await mediapipe.detectFaceLandmarks({
    image_url: '/images/face.jpg',
    num_faces: 1,
    draw_landmarks: true,
  })

  console.log('Faces detected:', result.faceLandmarks.length)

  // Clean up
  await mediapipe.close()
}

Complete usage example see test.ts

Typescript Signature

import { FaceLandmarkerResult } from '@mediapipe/tasks-vision'
import { Browser, Page } from 'playwright'

interface StartMediaPipeClientOptions {
  port: number
  /** default: true */
  auto_install_playwright?: boolean
  /** default: true */
  headless?: boolean
  /** auto launch chromium browser if no instance provided */
  browser?: Browser
}

function startMediaPipeClient(
  options: StartMediaPipeClientOptions,
): Promise<MediaPipeClient>

interface MediaPipeClient {
  attachImageDirection(options: {
    /** e.g. '/images' */
    url_prefix: string
    /** e.g. './uploads/' */
    directory: string
  }): void

  detectFaceLandmarks(
    options: DetectFaceLandmarksOptions,
  ): Promise<FaceLandmarkerResult>

  /** alias: stop, close */
  stop(): Promise<void>
  close(): Promise<void>

  /** for extended reuse */
  browser: Browser
  page: Page
}

interface DetectFaceLandmarksOptions {
  /** local url or internet url */
  image_url: string
  /** default: 1 */
  num_faces?: number
  /** area to crop for detection, coordinates are in [0,1] with 'left' < 'right' and 'top' < 'bottom' */
  crop_region?: {
    left: number
    top: number
    right: number
    bottom: number
  }
  /** applied after crop region of interest, in degrees */
  rotation?: number
  /** for debugging, default: false */
  draw_landmarks?: boolean
  /** for debugging, default: false */
  draw_bounding_box?: boolean
  /** for debugging, default: 'red' */
  draw_style?: string
  /** for debugging, default: 5 */
  draw_size?: number
}

License

This project is licensed with BSD-2-Clause

This is free, libre, and open-source software. It comes down to four essential freedoms [ref]:

The freedom to run the program as you wish, for any purpose
The freedom to study how the program works, and change it so it does your computing as you wish
The freedom to redistribute copies so you can help others
The freedom to distribute copies of your modified versions to others

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

mediapipe-nodejs

Features

Installation

Usage Example

Typescript Signature

License