pyomatting

v1.3.6

Published

3 months ago

TypeScript package for closed-form alpha matting using Pyodide - bringing Python computer vision to the browser with memory-efficient TypedArray processing

Downloads

0High
0Medium
0Low

crbuch

pyodide python typescript wasm alpha-matting computer-vision image-processing matting foreground-extraction webassembly browser canvas imagedata

Pyomatting

A TypeScript package that implements closed-form alpha matting using Pyodide to run Python algorithms in the browser. This package brings the power of the closed-form matting algorithm from MarcoForte/closed-form-matting to web browsers via WebAssembly with optimized memory usage through TypedArrays and efficient data transfer.

Features

🎭 Closed-Form Alpha Matting: State-of-the-art alpha matting algorithm based on Levin et al. (2007)
🐍 Python in the Browser: Leverages Pyodide to run NumPy, SciPy, and OpenCV in WebAssembly
📦 TypeScript Support: Full type definitions and modern ES modules
🛠️ Modular Architecture: Clean separation of Python algorithms in separate files
⚡ Pre-initialization: Optional runtime pre-loading for reduced latency
🧠 Memory Efficient: Uses TypedArrays and transferable objects for zero-copy data transfer
🎯 Entropy Trimap: Smart trimap refinement for over-confident neural network predictions
📊 Progress Callbacks: Real-time progress updates during initialization and processing
🔧 Configurable Logging: Verbose logging support for debugging
🎨 Interactive Demo: Complete web interface for testing the algorithms
📱 Single Image Processing: Efficient processing of individual images with optimized memory usage

Installation

npm install pyomatting

Quick Start

import { closedFormMatting } from 'pyomatting';

// Basic usage with trimap in alpha channel
const result = await closedFormMatting(imageDataWithTrimap);

// With custom max dimension
const result = await closedFormMatting(imageDataWithTrimap, 512);

// With entropy trimap refinement for over-confident predictions
const result = await closedFormMatting(imageDataWithTrimap, 1024, { 
  band_ratio: 0.015, 
  mid_band: 0.25 
});

API Reference

Core Functions

`closedFormMatting(imageData: ImageData, maxDimension?: number, entropyTrimapParams?: object): Promise<ImageData>`

Performs closed-form alpha matting on a single image with trimap encoded in alpha channel.

Parameters:

imageData: ImageData from canvas containing the source image with trimap in alpha channel:
- RGB channels: Original image colors
- Alpha channel: Trimap where 0=background, 255=foreground, 128=unknown
maxDimension (optional): Maximum dimension for processing. Images larger than this will be downscaled. Default: 1024
entropyTrimapParams (optional): Object for entropy-based trimap refinement:
- band_ratio: Minimum band width as fraction of min(H,W). Default: 0.01
- mid_band: |p-0.5| <= mid_band becomes unknown region. Default: 0.2

Returns: ImageData containing the computed RGBA result image (with foreground colors and alpha)

`initializePyodide(): Promise<void>`

Pre-initializes the Pyodide runtime and packages. This is optional but recommended for better user experience.

import { initializePyodide } from 'pyomatting';

// Pre-initialize to reduce latency for first processing call
await initializePyodide();

Progress & Logging

`addProgressCallback(fn: (stage: string, progress: number, message?: string) => void): void`

Add a callback to receive progress updates during initialization and processing.

import { addProgressCallback } from 'pyomatting';

addProgressCallback((stage, progress, message) => {
  console.log(`${stage}: ${progress}% - ${message}`);
});

`setVerboseLogging(verbose: boolean): void`

Enable or disable verbose logging for debugging.

import { setVerboseLogging } from 'pyomatting';

setVerboseLogging(true); // Enable detailed logging

`removeProgressCallback(): void`

Remove the current progress callback.

`terminateWorker(): void`

Terminate the web worker (useful for cleanup).

Example Usage

import { 
  closedFormMatting, 
  addProgressCallback, 
  setVerboseLogging,
  initializePyodide 
} from 'pyomatting';

// Enable logging for development
setVerboseLogging(true);

// Set up progress tracking
addProgressCallback((stage, progress, message) => {
  document.getElementById('progress').textContent = `${message} (${progress}%)`;
});

// Optional: Pre-initialize for faster first run
await initializePyodide();

// Combine image and trimap into single ImageData
function combineImageWithTrimap(sourceImg: HTMLImageElement, trimapImg: HTMLImageElement): ImageData {
  const canvas = document.createElement('canvas');
  const ctx = canvas.getContext('2d')!;
  
  canvas.width = sourceImg.width;
  canvas.height = sourceImg.height;
  
  // Draw source image to get RGB data
  ctx.drawImage(sourceImg, 0, 0);
  const imageData = ctx.getImageData(0, 0, canvas.width, canvas.height);
  
  // Draw trimap to get alpha data
  ctx.clearRect(0, 0, canvas.width, canvas.height);
  ctx.drawImage(trimapImg, 0, 0);
  const trimapData = ctx.getImageData(0, 0, canvas.width, canvas.height);
  
  // Combine: RGB from source, Alpha from trimap
  for (let i = 0; i < imageData.data.length; i += 4) {
    imageData.data[i + 3] = trimapData.data[i]; // Alpha from trimap's red channel
  }
  
  return imageData;
}

// Process image
const combinedData = combineImageWithTrimap(sourceImage, trimapImage);
// Process images
const result = await closedFormMatting(combinedImageData);

// For over-confident neural network predictions (like U^2-Net)
const resultWithEntropy = await closedFormMatting(combinedImageData, 1024, true);

Entropy Trimap Refinement

When working with neural networks like U^2-Net that produce over-confident predictions, the trimap may have too thin unknown regions. Enable entropy trimap processing to:

Expand uncertain regions: Areas where the model probability is near 0.5 become unknown
Add geometric bands: Guaranteed minimum band width around foreground/background boundaries
Adaptive scaling: Band width adapts to image size for consistent results

// Enable entropy trimap refinement for better results with confident neural networks
const result = await closedFormMatting(combinedImageData, 1024, true);

Development

# Install dependencies
npm install

# Build the package
npm run build

# Run example demo
npm run example

The examples folder contains a complete interactive demo with:

Drag & drop image upload interface
Interactive trimap editor
Real-time progress tracking
Sequential processing of multiple images for memory efficiency
Side-by-side result comparison

Algorithm

This package implements the closed-form alpha matting algorithm described in:

"A Closed-Form Solution to Natural Image Matting" by Levin, Lischinski, and Weiss (2007)

The implementation is based on the Python version from MarcoForte/closed-form-matting, adapted to run in the browser using Pyodide.

How it works:

Laplacian Computation: Builds a sparse matting Laplacian matrix from local image neighborhoods
Quadratic Optimization: Formulates alpha estimation as a constrained quadratic optimization problem
Sparse Solving: Uses sparse linear algebra (SciPy) to solve for alpha values efficiently
Foreground Estimation: Computes separated foreground colors using additional constraints

Performance

First Run: ~10-30 seconds (downloads and initializes Pyodide + packages)
Subsequent Runs: ~1-5 seconds per image (cached runtime)
Memory: ~200-500MB (depending on image size and browser)

Use initializePyodide() to pre-load the runtime during app initialization for better UX.

Browser Requirements

Modern browser with WebAssembly support
Internet connection (for loading Pyodide and packages from CDN on first run)
Sufficient memory for image processing (recommend 4GB+ RAM for large images)

Credits

Original Algorithm: Levin, Lischinski, and Weiss (2007)
Python Implementation: MarcoForte/closed-form-matting
Web Adaptation: This package (Pyodide + TypeScript wrapper)

License

MIT

Contributing

Issues and pull requests are welcome! Please ensure any changes maintain compatibility with the existing API.

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme