recursive-llm-ts
TypeScript/JavaScript package for Recursive Language Models (RLM) - process unbounded context lengths with LLMs.
Based on the paper: Recursive Language Models by Alex Zhang and Omar Khattab (MIT, 2025)
Features
Core
- Unbounded Context - Process 10M+ tokens without degradation via recursive decomposition
- Structured Outputs - Extract typed data with Zod schemas, parallel execution, and instructor-style retry
- Streaming - Progressive text output and partial structured objects via async iterables
- Batch Operations - Process multiple queries in parallel with concurrency control
Performance & Resilience
- Pure Go Backend - 50x faster startup and 3x lower memory use than the Python implementation
- Context Overflow Recovery - Automatic detection and 6 reduction strategies (mapreduce, truncate, chunked, tfidf, textrank, refine)
- Lossless Context Management - Episodic memory, 5-level summarization escalation, SQLite persistence with FTS5 search
- BPE Token Counting - Accurate model-specific tokenization via tiktoken (o200k, cl100k) with cached counting
- Caching - Exact-match caching with in-memory and file-based backends
- Retry & Fallback - Exponential backoff, jitter, and multi-provider fallback chains
- AbortController - Cancel any operation mid-flight
Developer Experience
- Typed Errors - Rich error hierarchy with codes, retryable flags, and suggestions
- Event System - Monitor LLM calls, cache hits, retries, and errors in real-time
- Builder API - Fluent configuration with full IDE discoverability
- Factory Methods - RLM.fromEnv(), RLM.withDebug(), RLM.forAzure()
- Config Validation - Catches typos and invalid settings at construction time
- Result Formatters - prettyStats(), toJSON(), toMarkdown()
Ecosystem
- Provider Agnostic - Works with OpenAI, Anthropic, Azure, Bedrock, local models
- Meta-Agent Mode - Automatically optimize queries for better results
- Observability - OpenTelemetry tracing, Langfuse integration, and debug logging
- File Storage - Process local directories or S3/MinIO/LocalStack buckets as LLM context
- 200+ Tests - Comprehensive Vitest + Go test suites
Installation
npm install recursive-llm-ts
Prerequisites
- Node.js 16+
- Go 1.25+ (for building from source during install)
Go Binary (Automatic)
The postinstall script automatically builds the Go binary during installation. If Go is not available, the script will warn but not fail.
If you need to build manually:
# From the package directory
cd node_modules/recursive-llm-ts
node scripts/build-go-binary.js
# Or directly with Go
cd go && go build -o ../bin/rlm-go ./cmd/rlm
Override the binary path if needed:
export RLM_GO_BINARY=/custom/path/to/rlm-go
Usage
Quick Start
import { RLM } from 'recursive-llm-ts';
// Initialize RLM with a model
const rlm = new RLM('gpt-4o-mini', {
max_iterations: 15,
api_key: process.env.OPENAI_API_KEY
});
// Process a query with unbounded context
const result = await rlm.completion(
'What are the key points in this document?',
longDocument
);
console.log(result.result);
console.log('Stats:', result.stats);
Structured Outputs with Zod Schemas
Extract structured, typed data from any context using Zod schemas. Supports complex nested objects, arrays, enums, and automatic parallel execution for performance.
import { RLM } from 'recursive-llm-ts';
import { z } from 'zod';
const rlm = new RLM('gpt-4o-mini', {
api_key: process.env.OPENAI_API_KEY
});
// Define your schema
const sentimentSchema = z.object({
sentimentValue: z.number().min(1).max(5),
sentimentExplanation: z.string(),
keyPhrases: z.array(z.object({
phrase: z.string(),
sentiment: z.number()
})),
topics: z.array(z.enum(['pricing', 'features', 'support', 'competition']))
});
// Extract structured data
const result = await rlm.structuredCompletion(
'Analyze the sentiment and extract key information',
callTranscript,
sentimentSchema
);
// result.result is fully typed!
console.log(result.result.sentimentValue); // number
console.log(result.result.keyPhrases); // Array<{phrase: string, sentiment: number}>
Key Benefits:
- ✅ Type-safe - Full TypeScript types from your Zod schema
- ✅ Automatic validation - Retries with error feedback if schema doesn't match
- ✅ Parallel execution - Complex schemas processed in parallel with goroutines (3-5x faster)
- ✅ Deep nesting - Supports arbitrarily nested objects and arrays
- ✅ Enum support - Validates enum values automatically
Performance Options:
// Enable/disable parallel execution
const result = await rlm.structuredCompletion(
query,
context,
schema,
{
parallelExecution: true, // default: true for complex schemas
maxRetries: 3 // default: 3
}
);
Streaming
Get progressive output with async iterables and AbortController support:
// Stream text output
const stream = rlm.streamCompletion('Summarize', longDocument);
for await (const chunk of stream) {
if (chunk.type === 'text') process.stdout.write(chunk.text);
}
// Collect all text
const text = await rlm.streamCompletion('Summarize', doc).toText();
// Cancel mid-stream
const controller = new AbortController();
const abortable = rlm.streamCompletion('Summarize', doc, { signal: controller.signal });
setTimeout(() => controller.abort(), 5000);
Caching
Avoid redundant API calls with exact-match caching:
const rlm = new RLM('gpt-4o-mini', {
api_key: process.env.OPENAI_API_KEY,
cache: {
enabled: true,
strategy: 'exact', // 'exact' | 'none'
maxEntries: 1000,
ttl: 3600, // seconds
storage: 'memory', // 'memory' | 'file'
}
});
const r1 = await rlm.completion('Summarize', doc); // API call
const r2 = await rlm.completion('Summarize', doc); // Cache hit!
console.log(r2.cached); // true
console.log(rlm.getCacheStats()); // { hits: 1, misses: 1, hitRate: 0.5, ... }
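Conceptually, an exact-match cache keys each completion on everything that could change the output. A minimal sketch of how such a key might be derived (illustrative only; cacheKey is a hypothetical helper, not the package's internal key format):
import { createHash } from 'node:crypto';
// Any change to model, query, context, or generation parameters
// must yield a different key, so hash them all together.
function cacheKey(model: string, query: string, context: string, params: object): string {
  return createHash('sha256')
    .update(JSON.stringify({ model, query, context, params }))
    .digest('hex');
}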
Retry & Resilience
Automatic retry with exponential backoff and provider fallback:
const rlm = new RLM('gpt-4o-mini', {
api_key: process.env.OPENAI_API_KEY,
retry: {
maxRetries: 3,
backoff: 'exponential', // 1s, 2s, 4s with jitter
onRetry: (attempt, error, delay) => {
console.log(`Retry ${attempt} after ${delay}ms: ${error.message}`);
},
},
});
// Or use standalone retry/fallback utilities:
import { withRetry, withFallback } from 'recursive-llm-ts';
const result = await withFallback(
(model) => rlm.completion(query, context),
{ models: ['gpt-4o', 'claude-sonnet-4-20250514', 'gemini-2.0-flash'] }
);
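For intuition, with the defaults (baseDelay 1000 ms, maxDelay 30000 ms; see RetryConfig below) exponential backoff doubles the wait on each attempt. A minimal sketch of the schedule, assuming a full-jitter variant for illustration (the package's exact jitter formula may differ):
// Delay for attempt n (1-based): min(maxDelay, baseDelay * 2^(n-1)),
// optionally randomized ("full jitter") to avoid thundering herds.
function backoffDelay(attempt: number, baseDelay = 1000, maxDelay = 30000, jitter = true): number {
  const exp = Math.min(maxDelay, baseDelay * 2 ** (attempt - 1));
  return jitter ? Math.random() * exp : exp; // assumption: full jitter
}
// Attempts 1..4 without jitter: 1000, 2000, 4000, 8000 ms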
Event System
Monitor operations in real-time:
const rlm = new RLM('gpt-4o-mini', { api_key: process.env.OPENAI_API_KEY });
rlm.on('llm_call', (e) => console.log(`Calling ${e.model}...`));
rlm.on('llm_response', (e) => console.log(`Response in ${e.duration}ms`));
rlm.on('cache', (e) => console.log(`Cache ${e.action}`));
rlm.on('error', (e) => reportToSentry(e.error));
rlm.on('completion_start', (e) => showSpinner());
rlm.on('completion_end', (e) => hideSpinner());
Builder API
Fluent configuration with full IDE discoverability:
const rlm = RLM.builder('gpt-4o-mini')
.apiKey(process.env.OPENAI_API_KEY!)
.maxDepth(10)
.maxIterations(30)
.withMetaAgent({ model: 'gpt-4o' })
.withDebug()
.withCache({ strategy: 'exact' })
.withRetry({ maxRetries: 3 })
.withFallback(['gpt-4o', 'claude-sonnet-4-20250514'])
.build();
Factory Methods
Quick setup for common configurations:
// From environment variables
const rlmEnv = RLM.fromEnv('gpt-4o-mini');
// Debug mode
const rlmDebug = RLM.withDebug('gpt-4o-mini');
// Azure OpenAI
const rlmAzure = RLM.forAzure('my-deployment', {
apiBase: 'https://myresource.openai.azure.com',
apiVersion: '2024-02-15-preview',
});
Batch Operations
Process multiple queries in parallel:
const results = await rlm.batchCompletion([
{ query: 'Summarize chapter 1', context: ch1 },
{ query: 'Summarize chapter 2', context: ch2 },
{ query: 'Summarize chapter 3', context: ch3 },
], { concurrency: 2 });
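Because each entry resolves independently (the API section below types the return as Array<RLMCompletionResult | Error>), a failed query comes back as an Error value instead of rejecting the whole batch:
for (const [i, r] of results.entries()) {
  if (r instanceof Error) {
    console.error(`Query ${i} failed:`, r.message); // per-entry failure
  } else {
    console.log(`Query ${i}:`, r.result);
  }
}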
Error Handling
Rich error hierarchy with actionable information:
import {
RLMRateLimitError, RLMValidationError,
RLMTimeoutError, RLMContextOverflowError
} from 'recursive-llm-ts';
try {
const result = await rlm.completion(query, context);
} catch (err) {
if (err instanceof RLMContextOverflowError) {
console.log(`Context overflow: ${err.requestTokens} tokens > ${err.modelLimit} limit`);
// Enable context_overflow config to auto-recover from this
} else if (err instanceof RLMRateLimitError) {
console.log(`Rate limited. Retry after: ${err.retryAfter}s`);
} else if (err instanceof RLMValidationError) {
console.log(`Schema mismatch:`, err.zodErrors);
} else if (err instanceof RLMTimeoutError) {
console.log(`Timed out after ${err.elapsed}ms`);
}
// All RLM errors have: err.code, err.retryable, err.suggestion
}
Context Overflow Handling
Automatically detect and recover from context window overflows. When your input exceeds the model's token limit, RLM catches the error and applies a reduction strategy to fit the context within bounds.
const rlm = new RLM('gpt-4o-mini', {
api_key: process.env.OPENAI_API_KEY,
context_overflow: {
enabled: true, // Enable overflow recovery (default: true)
strategy: 'tfidf', // Reduction strategy (see table below)
max_model_tokens: 32768, // Override auto-detected limit (optional)
safety_margin: 0.15, // Reserve 15% for prompts/overhead (default: 0.15)
max_reduction_attempts: 3, // Max retry attempts (default: 3)
}
});
// Process a document that may exceed the model's context window
const result = await rlm.completion(
'Summarize the key findings',
veryLargeDocument // If too large, auto-reduces and retries
);
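As a worked example of the settings above, and assuming the margin is applied as a simple fraction of the limit: with max_model_tokens: 32768 and safety_margin: 0.15, roughly 32768 × 0.85 ≈ 27,852 tokens remain as the context budget, with the rest reserved for prompts and overhead.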
Builder API:
const rlm = RLM.builder('gpt-4o-mini')
.apiKey(process.env.OPENAI_API_KEY!)
.withContextOverflow({ strategy: 'textrank', max_model_tokens: 32768 })
.build();
Strategy Comparison:
| Strategy | API Calls | Speed | Quality | Best For |
|----------|-----------|-------|---------|----------|
| mapreduce | Many (parallel) | Medium | High | General-purpose, large documents |
| truncate | 0 | Fastest | Low | Quick-and-dirty, when beginning of doc matters |
| chunked | Many (sequential) | Slow | High | Detailed extraction from specific sections |
| tfidf | 0 | Fast | Medium | Fast first pass, keyword-rich documents |
| textrank | 0 | Fast | Medium-High | Documents with clear sentence structure |
| refine | Many (sequential) | Slow | Highest | When quality matters most, iterative refinement |
Strategy Details:
- mapreduce (default) - Splits context into chunks, summarizes each in parallel via LLM calls, then merges summaries. Good balance of quality and speed.
- truncate - Drops tokens from the end to fit the budget. Zero API calls, but loses information. Best when the beginning of the document is most important.
- chunked - Processes chunks sequentially, extracting relevant content from each. Higher quality than mapreduce for targeted extraction.
- tfidf - Pure Go, zero API calls. Uses TF-IDF scoring to select the most informative sentences, preserving original document order. Great for a fast, no-cost first pass (see the sketch below).
- textrank - Pure Go, zero API calls. Graph-based sentence ranking using PageRank over cosine similarity of TF-IDF vectors. Better at identifying structurally important sentences than plain TF-IDF.
- refine - Sequential iterative refinement. Processes chunks one at a time, building and refining an answer progressively. Highest quality but slowest, as each chunk sees the accumulated context.
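To make the zero-API-call strategies concrete, here is a minimal TypeScript sketch of TF-IDF sentence selection under simplifying assumptions (naive sentence splitting and tokenization; the Go implementation is more sophisticated):
// Score each sentence by the summed TF-IDF weight of its terms,
// then keep the top-scoring sentences in their original order.
function tfidfSelect(text: string, keep: number): string {
  const sentences = text.split(/(?<=[.!?])\s+/);
  const docs = sentences.map(s => s.toLowerCase().match(/[a-z0-9]+/g) ?? []);
  const df = new Map<string, number>();
  for (const doc of docs) {
    for (const term of new Set(doc)) df.set(term, (df.get(term) ?? 0) + 1);
  }
  const idf = (t: string) => Math.log(docs.length / (1 + (df.get(t) ?? 0)));
  const scores = docs.map(doc => {
    const tf = new Map<string, number>();
    for (const t of doc) tf.set(t, (tf.get(t) ?? 0) + 1);
    let s = 0;
    for (const [t, n] of tf) s += (n / Math.max(doc.length, 1)) * idf(t);
    return s;
  });
  return scores
    .map((s, i) => [s, i] as const)
    .sort((a, b) => b[0] - a[0])
    .slice(0, keep)
    .map(([, i]) => i)
    .sort((a, b) => a - b) // restore original document order
    .map(i => sentences[i])
    .join(' ');
}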
Config Validation
Catch configuration issues at construction time:
const rlm = new RLM('gpt-4o-mini', { max_detph: 5 }); // typo!
const result = rlm.validate();
// result.issues: [{ level: 'warning', field: 'max_detph', message: 'Unknown config key...' }]
Result Formatting
const result = await rlm.completion(query, context);
const formatted = rlm.formatResult(result);
console.log(formatted.prettyStats());
// "LLM Calls: 3 | Iterations: 12 | Depth: 2"
console.log(formatted.toMarkdown());
// Full markdown-formatted result with stats table
Agent Coordinator (Advanced)
For complex multi-field schemas, use the coordinator API:
import { RLMAgentCoordinator } from 'recursive-llm-ts';
const coordinator = new RLMAgentCoordinator(
'gpt-4o-mini',
{ api_key: process.env.OPENAI_API_KEY },
'auto',
{ parallelExecution: true }
);
const result = await coordinator.processComplex(
'Extract comprehensive call analysis',
transcript,
complexSchema
);
Bridge Selection
The package automatically uses the Go binary by default (if available). You can explicitly specify a bridge if needed:
import { RLM } from 'recursive-llm-ts';
// Default: Auto-detection (prefers Go if available)
const rlm = new RLM('gpt-4o-mini', {
max_iterations: 15,
api_key: process.env.OPENAI_API_KEY
});
// Explicit: Force Go binary
const rlmGo = new RLM('gpt-4o-mini', {
max_iterations: 15,
api_key: process.env.OPENAI_API_KEY
}, 'go');
Meta-Agent Mode
The meta-agent automatically optimizes queries before they are processed by the RLM engine. This is useful when queries are vague, non-specific, or not optimized for recursive processing.
import { RLM } from 'recursive-llm-ts';
const rlm = new RLM('gpt-4o-mini', {
api_key: process.env.OPENAI_API_KEY,
meta_agent: {
enabled: true,
model: 'gpt-4o', // Optional: model for optimization (defaults to main model)
max_optimize_len: 10000 // Optional: skip optimization for short contexts
}
});
// The meta-agent will automatically optimize this vague query
const result = await rlm.completion(
'what happened?',
longCallTranscript
);
// Also works with structured completions
const structured = await rlm.structuredCompletion(
'analyze this',
callTranscript,
sentimentSchema
);
The meta-agent (illustrated below):
- Rewrites vague queries to be specific and actionable
- Adds format specifications and extraction hints
- Optimizes for recursive decomposition patterns
- Falls back to the original query if optimization fails
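As a purely hypothetical illustration of the rewriting step (actual output depends on the optimization model):
// Original query passed to rlm.completion():
//   'what happened?'
// A plausible meta-agent rewrite:
//   'Summarize the main events of this call transcript in chronological
//    order. For each event, note the speaker, any decision or action taken,
//    and unresolved follow-ups. Return a short bulleted list.'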
Observability and Debugging
Built-in support for OpenTelemetry tracing, Langfuse integration, and debug logging.
Debug Mode
Enable verbose logging of all internal operations:
const rlm = new RLM('gpt-4o-mini', {
api_key: process.env.OPENAI_API_KEY,
debug: true // Shorthand for observability.debug
});
// Or with full config:
const rlm2 = new RLM('gpt-4o-mini', {
api_key: process.env.OPENAI_API_KEY,
observability: {
debug: true,
log_output: 'stderr' // "stderr" (default), "stdout", or a file path
}
});
OpenTelemetry Tracing
const rlm = new RLM('gpt-4o-mini', {
api_key: process.env.OPENAI_API_KEY,
observability: {
trace_enabled: true,
trace_endpoint: 'localhost:4317', // OTLP endpoint
service_name: 'my-rlm-service'
}
});
const result = await rlm.completion('Summarize', document);
// Access trace events programmatically
const events = rlm.getTraceEvents();
console.log('Trace events:', events);
Langfuse Integration
const rlm = new RLM('gpt-4o-mini', {
api_key: process.env.OPENAI_API_KEY,
observability: {
langfuse_enabled: true,
langfuse_public_key: process.env.LANGFUSE_PUBLIC_KEY,
langfuse_secret_key: process.env.LANGFUSE_SECRET_KEY,
langfuse_host: 'https://cloud.langfuse.com' // Optional, defaults to cloud
}
});
Environment Variable Configuration
Observability can also be configured via environment variables:
RLM_DEBUG=1 # Enable debug mode
OTEL_EXPORTER_OTLP_ENDPOINT=... # Auto-enable OTEL tracing
LANGFUSE_PUBLIC_KEY=pk-... # Auto-enable Langfuse
LANGFUSE_SECRET_KEY=sk-...
LANGFUSE_HOST=https://cloud.langfuse.com
File Storage Context
Process files from local directories or S3-compatible storage as LLM context. Supports recursive directory traversal, filtering, and automatic context formatting.
Local Files
import { RLM } from 'recursive-llm-ts';
const rlm = new RLM('gpt-4o-mini', {
api_key: process.env.OPENAI_API_KEY
});
// Process all TypeScript files in a directory
const result = await rlm.completionFromFiles(
'Summarize the architecture of this codebase',
{
type: 'local',
path: '/path/to/project/src',
extensions: ['.ts', '.tsx'],
excludePatterns: ['*.test.ts', '*.spec.ts'],
maxFileSize: 100_000, // Skip files over 100KB
}
);
console.log(result.result);
console.log(`Files processed: ${result.fileStorage?.files.length}`);
S3 / Object Storage
Works with AWS S3, MinIO, LocalStack, DigitalOcean Spaces, and Backblaze B2.
// AWS S3 with explicit credentials
const result = await rlm.completionFromFiles(
'What are the key findings in these reports?',
{
type: 's3',
path: 'my-bucket', // Bucket name
prefix: 'reports/2024/', // Folder prefix
extensions: ['.md', '.txt'],
credentials: {
accessKeyId: 'AKIA...',
secretAccessKey: '...',
},
region: 'us-west-2',
}
);
// S3 with environment variable credentials
// Uses AWS_ACCESS_KEY_ID, AWS_SECRET_ACCESS_KEY, AWS_SESSION_TOKEN
const result2 = await rlm.completionFromFiles(
'Summarize these documents',
{ type: 's3', path: 'my-bucket', prefix: 'docs/' }
);
// MinIO / LocalStack
const result3 = await rlm.completionFromFiles(
'Analyze the data',
{
type: 's3',
path: 'local-bucket',
endpoint: 'http://localhost:9000', // MinIO
credentials: { accessKeyId: 'minioadmin', secretAccessKey: 'minioadmin' },
}
);
Credential Resolution Order (sketched below):
1. Explicit credentials in the config
2. Environment variables: AWS_ACCESS_KEY_ID, AWS_SECRET_ACCESS_KEY, AWS_SESSION_TOKEN
3. AWS SDK default credential chain (IAM roles, ~/.aws/credentials, ECS task role, etc.)
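A minimal sketch of that precedence (illustrative; resolveCredentials is hypothetical and the package's internal resolution code is not public API):
type Creds = { accessKeyId: string; secretAccessKey: string; sessionToken?: string };
function resolveCredentials(config: { credentials?: Creds }): Creds | undefined {
  if (config.credentials) return config.credentials; // 1. explicit config wins
  const { AWS_ACCESS_KEY_ID, AWS_SECRET_ACCESS_KEY, AWS_SESSION_TOKEN } = process.env;
  if (AWS_ACCESS_KEY_ID && AWS_SECRET_ACCESS_KEY) {
    return { // 2. environment variables
      accessKeyId: AWS_ACCESS_KEY_ID,
      secretAccessKey: AWS_SECRET_ACCESS_KEY,
      sessionToken: AWS_SESSION_TOKEN,
    };
  }
  return undefined; // 3. fall through to the AWS SDK default chain
}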
Structured Extraction from Files
import { z } from 'zod';
const schema = z.object({
summary: z.string(),
mainTopics: z.array(z.string()),
sentiment: z.enum(['positive', 'negative', 'neutral']),
});
const result = await rlm.structuredCompletionFromFiles(
'Extract a summary, main topics, and overall sentiment',
{ type: 'local', path: './docs', extensions: ['.md'] },
schema
);
console.log(result.result.mainTopics); // string[]
Preview Files Before Processing
import { FileContextBuilder } from 'recursive-llm-ts';
const builder = new FileContextBuilder({
type: 'local',
path: './src',
extensions: ['.ts'],
excludePatterns: ['node_modules/**'],
});
// List matching files without reading content
const files = await builder.listMatchingFiles();
console.log('Would process:', files);
// Build full context
const ctx = await builder.buildContext();
console.log(`Total size: ${ctx.totalSize} bytes`);
console.log(`Files included: ${ctx.files.length}`);
console.log(`Files skipped: ${ctx.skipped.length}`);
API
RLM
Main class for recursive language model completions.
Constructor:
constructor(model: string, rlmConfig?: RLMConfig)
- model: The LLM model to use (e.g., 'gpt-4o-mini', 'claude-3-sonnet')
- rlmConfig: Optional configuration object
Methods:
completion(query: string, context: string): Promise<RLMResult>
Process a query with the given context using recursive language models.
Parameters:
- query: The question or task to perform
- context: The context/document to process (can be arbitrarily large)
Returns:
Promise<RLMResult>: Result containing the answer and statistics
structuredCompletion<T>(query: string, context: string, schema: ZodSchema<T>, options?): Promise<StructuredRLMResult<T>>
Extract structured, typed data from context using a Zod schema.
Parameters:
- query: The extraction task to perform
- context: The context/document to process
- schema: Zod schema defining the output structure
- options: Optional configuration
  - parallelExecution?: boolean - Enable parallel processing (default: true)
  - maxRetries?: number - Max validation retries (default: 3)
Returns:
Promise<StructuredRLMResult<T>>: Typed result matching your schema
Example:
const schema = z.object({ score: z.number(), summary: z.string() });
const result = await rlm.structuredCompletion('Analyze', doc, schema);
// result.result is typed as { score: number, summary: string }
completionFromFiles(query: string, fileConfig: FileStorageConfig): Promise<RLMResult & { fileStorage?: FileStorageResult }>
Process a query using files from local or S3 storage as context.
Parameters:
- query: The question or task to perform
- fileConfig: File storage configuration (local path or S3 bucket)
Returns:
- Result with fileStorage metadata (files included, skipped, total size)
structuredCompletionFromFiles<T>(query: string, fileConfig: FileStorageConfig, schema: ZodSchema<T>, options?): Promise<StructuredRLMResult<T> & { fileStorage?: FileStorageResult }>
Extract structured data from file-based context.
streamCompletion(query, context, options?): RLMStream
Stream a completion with progressive text output. Returns an async iterable.
streamStructuredCompletion<T>(query, context, schema, options?): RLMStream<T>
Stream a structured completion with partial object updates.
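Example (hedged: this README does not document the chunk shape for structured streams, so the loop below only assumes the stream is async-iterable; inspect the chunks in practice):
const partials = rlm.streamStructuredCompletion('Analyze sentiment', doc, sentimentSchema);
for await (const chunk of partials) {
  console.log('partial update:', chunk); // assumption: chunks carry the object built so far
}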
batchCompletion(queries, options?): Promise<Array<RLMCompletionResult | Error>>
Execute multiple completions in parallel with concurrency control.
batchStructuredCompletion<T>(queries, options?): Promise<Array<StructuredRLMResult<T> | Error>>
Execute multiple structured completions in parallel.
validate(): ValidationResult
Validate the current configuration without making API calls.
getCacheStats(): CacheStats
Get cache performance statistics (hits, misses, hit rate).
clearCache(): void
Clear the completion cache.
formatResult(result): RLMResultFormatter
Create a formatted result with prettyStats(), toJSON(), toMarkdown().
on(event, listener) / off(event, listener) / once(event, listener)
Register/remove event listeners. Events: llm_call, llm_response, error, cache, completion_start, completion_end, retry, validation_retry, meta_agent, recursion.
cleanup(): Promise<void>
Clean up the bridge and free resources.
await rlm.cleanup();
getTraceEvents(): TraceEvent[]
Returns trace events from the last operation (when observability is enabled).
const events = rlm.getTraceEvents();
for (const event of events) {
console.log(`${event.type}: ${event.name}`, event.attributes);
}
Types
interface RLMConfig {
// Model configuration
recursive_model?: string; // Model to use for recursive calls (defaults to main model)
// API configuration
api_base?: string; // Custom API base URL (for Azure, Bedrock, etc.)
api_key?: string; // API key for authentication
// Execution limits
max_depth?: number; // Maximum recursion depth (default: 5)
max_iterations?: number; // Maximum REPL iterations per call (default: 30)
go_binary_path?: string; // Override path for Go binary (optional)
// Meta-agent configuration
meta_agent?: MetaAgentConfig;
// Observability configuration
observability?: ObservabilityConfig;
// Shorthand for observability.debug
debug?: boolean;
// LiteLLM parameters
api_version?: string; // API version (e.g., for Azure)
timeout?: number; // Request timeout in seconds
temperature?: number; // Sampling temperature
max_tokens?: number; // Maximum tokens in response
// Context overflow recovery
context_overflow?: ContextOverflowConfig;
// Caching, retry, fallback
cache?: CacheConfig; // Cache configuration
retry?: RetryConfig; // Retry configuration
fallback?: FallbackConfig; // Fallback model configuration
}
interface CacheConfig {
enabled?: boolean; // Enable caching (default: false)
strategy?: 'exact' | 'none'; // Cache strategy (default: 'exact')
maxEntries?: number; // Max cached entries (default: 1000)
ttl?: number; // Time-to-live in seconds (default: 3600)
storage?: 'memory' | 'file'; // Storage backend (default: 'memory')
cacheDir?: string; // Dir for file cache (default: '.rlm-cache')
}
interface RetryConfig {
maxRetries?: number; // Max retries (default: 3)
backoff?: 'exponential' | 'linear' | 'fixed';
baseDelay?: number; // Base delay ms (default: 1000)
maxDelay?: number; // Max delay ms (default: 30000)
jitter?: boolean; // Add jitter (default: true)
onRetry?: (attempt: number, error: Error, delay: number) => void;
}
interface FallbackConfig {
models?: string[]; // Ordered fallback models
strategy?: 'sequential'; // Fallback strategy
}
interface ContextOverflowConfig {
enabled?: boolean; // Enable overflow recovery (default: true)
max_model_tokens?: number; // Override auto-detected model limit (0 = auto-detect)
strategy?: 'mapreduce' | 'truncate' | 'chunked' | 'tfidf' | 'textrank' | 'refine';
safety_margin?: number; // Fraction to reserve for overhead (default: 0.15)
max_reduction_attempts?: number; // Max reduction retries (default: 3)
}
interface MetaAgentConfig {
enabled: boolean; // Enable the meta-agent
model?: string; // Model for query optimization (defaults to main model)
max_optimize_len?: number; // Max context length before optimization (0 = always)
}
interface ObservabilityConfig {
debug?: boolean; // Enable verbose debug logging
trace_enabled?: boolean; // Enable OpenTelemetry tracing
trace_endpoint?: string; // OTLP endpoint (e.g., "localhost:4317")
service_name?: string; // Service name for traces (default: "rlm")
log_output?: string; // Log destination: "stderr", "stdout", or file path
langfuse_enabled?: boolean; // Enable Langfuse integration
langfuse_public_key?: string; // Langfuse public key
langfuse_secret_key?: string; // Langfuse secret key
langfuse_host?: string; // Langfuse API host
}
interface RLMResult {
result: string;
stats: RLMStats;
trace_events?: TraceEvent[]; // Observability events (when enabled)
}
interface RLMStats {
llm_calls: number;
iterations: number;
depth: number;
parsing_retries?: number;
}
interface TraceEvent {
timestamp: string;
type: string; // "span_start", "span_end", "llm_call", "error", "event"
name: string;
attributes: Record<string, string>;
duration?: number;
trace_id?: string;
span_id?: string;
}
interface FileStorageConfig {
type: 'local' | 's3'; // Storage provider type
path: string; // Local directory path or S3 bucket name
prefix?: string; // S3 key prefix (folder path)
region?: string; // AWS region (default: AWS_REGION env or 'us-east-1')
credentials?: { // S3 explicit credentials (optional)
accessKeyId: string;
secretAccessKey: string;
sessionToken?: string;
};
endpoint?: string; // Custom S3 endpoint (MinIO, LocalStack, etc.)
forcePathStyle?: boolean; // Force path-style S3 URLs (auto-enabled with endpoint)
extensions?: string[]; // File extensions to include (e.g., ['.ts', '.md'])
includePatterns?: string[]; // Glob patterns to include
excludePatterns?: string[]; // Glob patterns to exclude
maxFileSize?: number; // Max individual file size in bytes (default: 1MB)
maxTotalSize?: number; // Max total context size in bytes (default: 10MB)
maxFiles?: number; // Max number of files (default: 1000)
}
interface FileStorageResult {
context: string; // Built context string with file contents
files: Array<{ relativePath: string; size: number }>;
totalSize: number;
skipped: Array<{ relativePath: string; reason: string }>;
}
// Error hierarchy - all extend RLMError
class RLMError extends Error {
code: string; // Machine-readable: "RATE_LIMIT", "VALIDATION", etc.
retryable: boolean; // Whether caller should retry
suggestion?: string; // Human-readable fix suggestion
}
class RLMValidationError extends RLMError { expected; received; zodErrors; }
class RLMRateLimitError extends RLMError { retryAfter?: number; }
class RLMTimeoutError extends RLMError { elapsed; limit; }
class RLMProviderError extends RLMError { statusCode; provider; }
class RLMBinaryError extends RLMError { binaryPath; }
class RLMConfigError extends RLMError { field; value; }
class RLMContextOverflowError extends RLMError { modelLimit; requestTokens; }
class RLMSchemaError extends RLMError { path; constraint; }
class RLMAbortError extends RLMError {}
Environment Variables
Set your API key as an environment variable:
export OPENAI_API_KEY='your-api-key-here'
Or pass it in the configuration:
const rlm = new RLM('gpt-4o-mini', {
api_key: 'your-api-key-here'
});
S3 File Storage
When using S3 file storage without explicit credentials, these environment variables are checked:
export AWS_ACCESS_KEY_ID='your-access-key'
export AWS_SECRET_ACCESS_KEY='your-secret-key'
export AWS_SESSION_TOKEN='optional-session-token'
export AWS_REGION='us-east-1'
Custom Providers
The Go binary uses an OpenAI-compatible chat completion API and works seamlessly with a LiteLLM proxy or any provider that supports the OpenAI /chat/completions schema. This keeps the implementation provider-agnostic.
Quick Reference
The Go binary speaks the OpenAI chat completion schema, so you can:
- Use OpenAI directly with api_key
- Use an OpenAI-compatible endpoint (Azure OpenAI, vLLM, Ollama)
- Use a LiteLLM proxy to reach providers like Anthropic, Bedrock, or Cohere
Amazon Bedrock (via LiteLLM proxy)
import { RLM } from 'recursive-llm-ts';
const rlm = new RLM('bedrock/anthropic.claude-3-sonnet-20240229-v1:0', {
api_base: 'http://localhost:4000', // LiteLLM proxy URL
api_key: process.env.LITELLM_API_KEY,
max_iterations: 15
});
Azure OpenAI
const rlm = new RLM('gpt-4o', {
api_base: 'https://your-resource.openai.azure.com/openai/deployments/your-deployment',
api_key: process.env.AZURE_API_KEY,
api_version: '2024-02-15-preview' // Passed through to the OpenAI-compatible API
});
Custom OpenAI-Compatible APIs
For providers with OpenAI-compatible APIs (e.g., local models, vLLM, Ollama):
const rlm = new RLM('openai/your-model', {
api_base: 'https://your-custom-endpoint.com/v1',
api_key: 'your-key-here'
});
Long-Running Processes
For large documents or queue-based processing that may take longer than the default 100s timeout:
const rlm = new RLM('gpt-4o-mini', {
max_iterations: 50, // Allow more iterations for complex processing
timeout: 300 // 5 minutes timeout for LLM API calls
});
// Process very large document
const result = await rlm.completion(
'Summarize all key points from this document',
veryLargeDocument
);
Other Providers
See the LiteLLM documentation for the complete list of supported providers and their configuration.
Docker Deployment
Basic Dockerfile with Go Build
To containerize your application that uses recursive-llm-ts, install Go 1.25+ in your Docker image to build the binary during npm install:
FROM node:20-alpine
# Install Go 1.25+ for building the RLM binary
RUN apk add --no-cache go
# Set Go environment
ENV GOPATH=/go
ENV PATH=$PATH:$GOPATH/bin
WORKDIR /app
COPY package*.json ./
RUN npm install
COPY . .
ENV OPENAI_API_KEY=""
ENV NODE_ENV=production
CMD ["node", "your-app.js"]Multi-Stage Build (Recommended for Production)
For optimal image size and security, use a multi-stage build:
# Stage 1: Build the Go binary
FROM golang:1.25-alpine AS go-builder
WORKDIR /build
COPY go/go.mod go/go.sum ./
RUN go mod download
COPY go/ ./
RUN CGO_ENABLED=0 GOOS=linux go build -ldflags="-s -w" -o rlm-go ./cmd/rlm
# Stage 2: Build Node.js dependencies
FROM node:20-alpine AS node-builder
WORKDIR /app
COPY package*.json ./
RUN npm ci --omit=dev
# Stage 3: Final runtime image
FROM node:20-alpine
WORKDIR /app
COPY --from=node-builder /app/node_modules ./node_modules
COPY --from=go-builder /build/rlm-go ./bin/rlm-go
RUN chmod +x ./bin/rlm-go
COPY package*.json ./
COPY dist/ ./dist/
ENV NODE_ENV=production
ENV RLM_GO_BINARY=/app/bin/rlm-go
ENV OPENAI_API_KEY=""
CMD ["node", "dist/index.js"]Benefits: Smaller image (~150MB vs ~500MB), faster builds with caching, more secure.
Docker Compose
version: '3.8'
services:
app:
build: .
environment:
- OPENAI_API_KEY=${OPENAI_API_KEY}
- NODE_ENV=production
ports:
- "3000:3000"Installing Go in Different Base Images
# Alpine
RUN apk add --no-cache go
# Debian/Ubuntu
RUN apt-get update && apt-get install -y golang-1.25
# Or use pre-built binary (no Go required)
# Download from GitHub releases and copy to /app/bin/rlm-go
Using the Go Module Directly
The Go implementation can be used as a standalone library in Go projects.
Installation
go get github.com/howlerops/recursive-llm-ts/go
Usage
package main
import (
"fmt"
"os"
"github.com/howlerops/recursive-llm-ts/go/rlm"
)
func main() {
config := rlm.Config{
MaxDepth: 5,
MaxIterations: 30,
APIKey: os.Getenv("OPENAI_API_KEY"),
}
engine := rlm.New("gpt-4o-mini", config)
answer, stats, err := engine.Completion(
"What are the key points?",
"Your long document here...",
)
if err != nil {
fmt.Fprintf(os.Stderr, "Error: %v\n", err)
os.Exit(1)
}
fmt.Printf("Answer: %s\n", answer)
fmt.Printf("Stats: %d LLM calls, %d iterations\n",
stats.LlmCalls, stats.Iterations)
}
Structured Output
schema := &rlm.JSONSchema{
Type: "object",
Properties: map[string]*rlm.JSONSchema{
"summary": {Type: "string"},
"score": {Type: "number"},
},
Required: []string{"summary", "score"},
}
config := &rlm.StructuredConfig{
Schema: schema,
MaxRetries: 3,
}
result, stats, err := engine.StructuredCompletion(
"Summarize and score",
document,
config,
)Building from Source
cd go
# Standard build
go build -o rlm-go ./cmd/rlm
# Optimized (smaller binary)
go build -ldflags="-s -w" -o rlm-go ./cmd/rlm
# Cross-compile
GOOS=linux GOARCH=amd64 go build -o rlm-linux-amd64 ./cmd/rlm
GOOS=darwin GOARCH=arm64 go build -o rlm-darwin-arm64 ./cmd/rlm
Running Tests
cd go
go test -v ./rlm/...
How It Works
This package provides a TypeScript wrapper around a Go implementation of Recursive-LLM, enabling seamless integration into Node.js/TypeScript applications without Python dependencies. The Go binary is built locally (or supplied via RLM_GO_BINARY) and invoked for completions.
The recursive-llm approach breaks down large contexts into manageable chunks and processes them recursively, allowing you to work with documents of any size without hitting token limits.
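For intuition, the recursive core can be sketched in a few lines of TypeScript. This is a simplification under assumed helpers (tokenCount, splitIntoChunks, and llmAnswer are illustrative stand-ins, not the package's actual internals; the real engine adds a REPL loop, iteration limits, and overflow handling):
// Stub helpers, assumed for illustration only:
declare function tokenCount(text: string): number;
declare function splitIntoChunks(text: string, budget: number): string[];
declare function llmAnswer(query: string, context: string): Promise<string>;

async function recursiveCompletion(query: string, context: string, depth = 0): Promise<string> {
  const MAX_DEPTH = 5;  // mirrors the max_depth default
  const BUDGET = 8000;  // tokens the model comfortably handles
  if (depth >= MAX_DEPTH || tokenCount(context) <= BUDGET) {
    return llmAnswer(query, context); // base case: context fits, answer directly
  }
  // Recursive case: split, answer each chunk, then combine the partial answers.
  const chunks = splitIntoChunks(context, BUDGET);
  const partials = await Promise.all(chunks.map(c => recursiveCompletion(query, c, depth + 1)));
  return recursiveCompletion(query, partials.join('\n'), depth + 1);
}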
Key Features
- ✅ No Python dependency - Go binary handles the full recursive loop
- ✅ Provider-agnostic - Works with OpenAI-compatible APIs or LiteLLM proxy
- ✅ Type-safe - Full TypeScript type definitions
- Simple API - Just npm install and start using
Testing
# Run all tests (Vitest)
npm test
# Watch mode
npm run test:watch
# Coverage
npm run test:coverage
# Type-check
npm run typecheck
# Go tests
cd go && go test ./rlm/... -v
Documentation
Publishing
This package uses automated GitHub Actions workflows to publish to npm. See RELEASE.md for detailed instructions on publishing new versions.
Quick start:
npm version patch # Bump version
git push origin main --tags # Push tag
# Then create a GitHub release to trigger automatic npm publish
License
MIT
