process-in-chunks

v1.2.1

Published

2 months ago

Conveniently process data in chunks

thijskoerselman

process chunks throttling typescript

Process in Chunks

Efficiently process large collections of data in manageable chunks with built-in error handling, throttling, and TypeScript support.

Features

🚀 High Performance: Process the items within each chunk in parallel for high throughput
🛡️ Robust Error Handling: Choose between fail-fast or graceful error collection
⏱️ Built-in Throttling: Control processing rate to avoid overwhelming systems
📦 Flexible Processing: Handle items individually or process entire chunks
🔒 Type Safe: Full TypeScript support with discriminated unions
🎯 Zero Dependencies: Lightweight and focused

Installation

pnpm add process-in-chunks

Quick Start

import { processInChunks } from "process-in-chunks";

// Process items in parallel within each chunk
const results = await processInChunks(
  [1, 2, 3, 4, 5],
  async (item) => item * 2,
);
console.log(results); // [2, 4, 6, 8, 10]

Usage

Process with a single item handler

Process items individually with parallel execution within each chunk. This is the most common use case.

import { processInChunks } from "process-in-chunks";

const results = await processInChunks(
  [1, 2, 3, 4, 5, 6, 7, 8, 9, 10],
  async (item, index) => {
    console.log(`Processing item ${index}, value ${item}`);
    return item * 10;
  },
);

console.log(results); // [10, 20, 30, 40, 50, 60, 70, 80, 90, 100]

Process with a chunk-sized handler

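Process an entire chunk at once. This is useful for batch operations such as database inserts or API calls that accept multiple items. The handler receives the chunk as an array, and the result contains one entry per chunk.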

import { processInChunksByChunk } from "process-in-chunks";

const results = await processInChunksByChunk(
  [1, 2, 3, 4, 5, 6, 7, 8, 9, 10],
  async (chunk) => {
    // Process entire chunk at once
    return chunk.reduce((sum, item) => sum + item, 0);
  },
  { chunkSize: 3, throttleSeconds: 1 },
);

console.log(results); // [6, 15, 24, 10] (after ~4 seconds)
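
The four values in the output above follow from the chunk boundaries. A minimal sketch of the slicing, assuming chunks are taken in order from the start of the array; `toChunks` is a hypothetical helper written for illustration and is not an export of this package:

```typescript
// Hypothetical helper for illustration only; not part of process-in-chunks.
function toChunks<T>(items: readonly T[], chunkSize: number): T[][] {
  const chunks: T[][] = [];
  for (let start = 0; start < items.length; start += chunkSize) {
    // slice() clamps at the end of the array, so the last chunk may be shorter
    chunks.push(items.slice(start, start + chunkSize));
  }
  return chunks;
}

const chunks = toChunks([1, 2, 3, 4, 5, 6, 7, 8, 9, 10], 3);
// chunks: [[1, 2, 3], [4, 5, 6], [7, 8, 9], [10]]

// Same reducer as the chunk handler above, applied once per chunk
const sums = chunks.map((chunk) => chunk.reduce((sum, item) => sum + item, 0));
// sums: [6, 15, 24, 10]
```

With ten items and a `chunkSize` of 3, the last chunk holds the single leftover item, which is why the final result is `10`.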

Error Handling

Default Behavior (Fail-Fast)

By default, both functions throw as soon as a handler fails, so processing stops at the first error.

import { processInChunks } from "process-in-chunks";

try {
  const results = await processInChunks([1, 2, 3, 4, 5], async (item) => {
    if (item % 2 === 0) {
      throw new Error(`Failed to process even number: ${item}`);
    }
    return `Processed: ${item}`;
  });

  console.log("All items processed:", results);
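} catch (error) {
  // `error` is typed `unknown` under strict TypeScript settings, so narrow it first
  const message = error instanceof Error ? error.message : String(error);
  console.error("Processing failed:", message);
  // Stops on first error (item 2)
}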

Enhanced Error Handling with `noThrow`

Enable graceful error handling to collect partial results and error information.

import { processInChunks } from "process-in-chunks";

const result = await processInChunks(
  [1, 2, 3, 4, 5],
  async (item) => {
    if (item % 2 === 0) {
      throw new Error(`Failed to process even number: ${item}`);
    }
    return `Processed: ${item}`;
  },
  { noThrow: true },
);

if (result.hasErrors) {
  console.log("Some items failed:", result.errorMessages);
  console.log("Partial results:", result.results); // Contains undefined for failed items

  // Filter out failed items
  const successfulResults = result.results.filter((r) => r !== undefined);
  console.log("Successful results:", successfulResults);
} else {
  console.log("All items processed:", result.results);
}
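
To make the discriminated union more concrete, here is a rough sketch of what such a result type and its narrowing could look like. The type name `SketchResult` and the helpers `settleAll` and `successfulResults` are hypothetical; the field names `hasErrors`, `results`, and `errorMessages` are taken from the example above, but the package's actual exported types and internals may differ:

```typescript
// Hypothetical result shape inferred from the fields used above; an assumption,
// not the package's real type definition.
type SketchResult<R> =
  | { hasErrors: false; results: R[] }
  | { hasErrors: true; results: (R | undefined)[]; errorMessages: string[] };

// Illustrative only: collect outcomes with Promise.allSettled instead of throwing.
async function settleAll<T, R>(
  items: readonly T[],
  handler: (item: T) => Promise<R>,
): Promise<SketchResult<Awaited<R>>> {
  const settled = await Promise.allSettled(items.map((item) => handler(item)));

  // Keep positions aligned with the input; failed items become undefined
  const results = settled.map((s) =>
    s.status === "fulfilled" ? s.value : undefined,
  );
  const errorMessages = settled.flatMap((s) =>
    s.status === "rejected"
      ? [s.reason instanceof Error ? s.reason.message : String(s.reason)]
      : [],
  );

  return errorMessages.length > 0
    ? { hasErrors: true, results, errorMessages }
    : { hasErrors: false, results: results as Awaited<R>[] };
}

// Checking `hasErrors` narrows the union, so `results` is R[] in the first branch
function successfulResults<R>(result: SketchResult<R>): R[] {
  if (!result.hasErrors) return result.results;
  return result.results.filter((r): r is R => r !== undefined);
}
```

The benefit of this shape is that the compiler only forces the `undefined` check on the branch where failures are possible.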

Options

Both functions accept an optional configuration object with the following options:

`chunkSize`

Type: number
Default: 500
Description: Number of items to process in parallel within each chunk

await processInChunks(items, handler, { chunkSize: 100 });
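
The effect of `chunkSize` on concurrency can be pictured with the sketch below. It assumes that chunks run one after another while the items inside a chunk run concurrently, and that the handler receives the item's index in the full input array; both are assumptions drawn from the descriptions above, and `sketchProcessInChunks` is a hypothetical stand-in rather than the package's implementation:

```typescript
// Hypothetical stand-in for illustration; not the library's actual code.
async function sketchProcessInChunks<T, R>(
  items: readonly T[],
  handler: (item: T, index: number) => Promise<R>,
  chunkSize = 500,
): Promise<Awaited<R>[]> {
  if (!Number.isInteger(chunkSize) || chunkSize < 1) {
    throw new RangeError("chunkSize must be a positive integer");
  }

  const results: Awaited<R>[] = [];
  for (let start = 0; start < items.length; start += chunkSize) {
    const chunk = items.slice(start, start + chunkSize);

    // Every handler in this chunk is started together...
    const chunkResults = await Promise.all(
      chunk.map((item, i) => handler(item, start + i)),
    );

    // ...and the next chunk starts only after all of them have resolved
    results.push(...chunkResults);
  }
  return results;
}
```

Under these assumptions, `chunkSize` acts as an upper bound on the number of handlers in flight at any moment, so a smaller value trades throughput for lower load on the downstream system.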

`throttleSeconds`

Type: number
Default: 0
Description: Minimum time in seconds for processing each chunk. The throttle runs in parallel with processing, so it only adds delay if the chunk finishes faster than the specified time. Useful for rate limiting.

await processInChunks(items, handler, { throttleSeconds: 2 });
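
One way to picture a throttle that "runs in parallel with processing" is to start a timer alongside the work and wait for both. The sketch below is an illustration under that assumption; `sleep` and `runWithMinimumDuration` are hypothetical names and not part of this package:

```typescript
// Hypothetical helpers for illustration; not exports of process-in-chunks.
const sleep = (ms: number): Promise<void> =>
  new Promise((resolve) => setTimeout(resolve, ms));

async function runWithMinimumDuration<R>(
  work: () => Promise<R>,
  throttleSeconds: number,
): Promise<R> {
  // Work and timer start at the same moment. Promise.all resolves when the
  // slower of the two is done, so the timer only adds delay if work is faster.
  const [result] = await Promise.all([work(), sleep(throttleSeconds * 1000)]);
  return result;
}
```

With this approach a chunk that takes 3 seconds under a 2-second throttle finishes after about 3 seconds, while a chunk that takes 0.5 seconds is held until the 2 seconds have passed.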

`noThrow`

Type: boolean
Default: false
Description: Enable enhanced error handling. When true, returns a discriminated union with error information instead of throwing.

await processInChunks(items, handler, { noThrow: true });

Function Reference