@platformatic/ai-provider
Core implementation for AI communication with multiple providers, offering unified access to OpenAI, DeepSeek, and Google Gemini with advanced features like automatic fallback, session management, and intelligent rate limiting.
🚀 Features
- Multi-Provider Support: OpenAI, DeepSeek, and Google Gemini
- Automatic Fallback: Seamless switching between providers when one fails
- Session Management: Persistent conversation history with multiple storage backends
- Auto-Resume: Seamless stream resumption from interruption points using UUID event IDs
- Hash-Based Storage: Efficient O(1) event access with Redis hash operations
- Rate Limiting: Per-model rate limiting with automatic restoration
- Streaming Support: Real-time response streaming with UUID event identification
- Error Recovery: Intelligent error handling with configurable retry policies
📦 Installation
npm install @platformatic/ai-provider

🔧 Basic Usage
import { Ai } from '@platformatic/ai-provider'
import pino from 'pino'
const ai = new Ai({
  logger: pino({ level: 'info' }),
  providers: {
    openai: {
      apiKey: process.env.OPENAI_API_KEY
    },
    deepseek: {
      apiKey: process.env.DEEPSEEK_API_KEY
    },
    gemini: {
      apiKey: process.env.GEMINI_API_KEY
    }
  },
  models: [
    { provider: 'openai', model: 'gpt-4o-mini' },
    { provider: 'openai', model: 'gpt-4o' },
    { provider: 'deepseek', model: 'deepseek-chat' },
    { provider: 'gemini', model: 'gemini-2.5-flash' }
  ]
})

await ai.init()

// Simple request
const response = await ai.request({
  prompt: 'Hello, how are you today?',
  options: {
    temperature: 0.7,
    maxTokens: 150
  }
})

console.log(response.text)
console.log(response.sessionId)

// Streaming request
const streamResponse = await ai.request({
  prompt: 'Tell me a story',
  options: {
    stream: true,
    temperature: 0.8
  }
})

// Process the Node.js stream with a for await loop
try {
  for await (const chunk of streamResponse) {
    console.log('Chunk:', chunk.toString())
  }
  console.log('Stream finished')
} catch (err) {
  console.error('Stream error:', err)
}

await ai.close()

⚙️ Configuration Options
Configuration options are grouped as follows:
AiOptions
Main configuration object for the Ai class:
- logger (Logger, required): Pino logger instance
- providers (object, required): Provider configurations with API keys and optional custom clients
- models (array, required): Model definitions with providers and optional limits
- storage (object, optional): Session storage configuration (default: { type: 'memory' })
- limits (object, optional): Global limits and timeouts applied to all models
- restore (object, optional): Error recovery settings for automatic restoration
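For orientation, a configuration touching every top-level option might look like the following sketch (values are illustrative; imports as in Basic Usage):

const ai = new Ai({
  logger: pino({ level: 'info' }),
  providers: { openai: { apiKey: process.env.OPENAI_API_KEY } },
  models: [{ provider: 'openai', model: 'gpt-4o-mini' }],
  storage: { type: 'memory' },       // the default backend
  limits: { requestTimeout: 30000 }, // global limits (see below)
  restore: { rateLimit: '1m' }       // recovery settings (see below)
})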
providers
Configure AI provider settings:
- openai (object, optional): OpenAI provider configuration
  - apiKey (string, required): OpenAI API key
  - client (object, optional): Custom HTTP client for advanced configurations
- deepseek (object, optional): DeepSeek provider configuration
  - apiKey (string, required): DeepSeek API key
  - client (object, optional): Custom HTTP client for advanced configurations
- gemini (object, optional): Google Gemini provider configuration
  - apiKey (string, required): Gemini API key
  - client (object, optional): Custom HTTP client for advanced configurations
models
Define AI models with custom limits and restoration policies:
- provider (string, required): Provider name ('openai', 'deepseek', or 'gemini')
- model (string, required): Model name string
- limits (object, optional): Rate limiting and token limits for this model
  - maxTokens (number, optional): Maximum tokens per request
  - rate (object, optional): Rate limiting configuration
    - max (number, required): Maximum requests per time window
    - timeWindow (string|number, required): Time window ('1m', '30s', or milliseconds)
- restore (object, optional): Model-specific recovery settings
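For example, a model entry combining these settings might look like this (the specific values are illustrative):

const models = [
  {
    provider: 'openai',
    model: 'gpt-4o-mini',
    limits: {
      maxTokens: 1000,                     // per-request token cap
      rate: { max: 100, timeWindow: '1m' } // 100 requests per minute
    },
    restore: { rateLimit: '2m' }           // model-specific recovery time
  }
]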
storage
Configure session storage backend:
- type (string, required): Storage type ('memory' or 'valkey', default: 'memory')
- valkey (object, optional): Valkey/Redis configuration when type is 'valkey'
  - host (string, optional): Server host (default: 'localhost')
  - port (number, optional): Server port (default: 6379)
  - username (string, optional): Username for authentication
  - password (string, optional): Password for authentication
  - database (number, optional): Database number (default: 0)
limits
Set default limits applied to all models:
- maxTokens (number, optional): Default max tokens per request
- rate (object, optional): Default rate limiting configuration
  - max (number, optional): Maximum requests (default: 200)
  - timeWindow (string|number, optional): Time window (default: '30s')
- requestTimeout (number, optional): Request timeout in milliseconds (default: 30000)
- retry (object, optional): Retry configuration
  - max (number, optional): Max retry attempts (default: 1)
  - interval (number, optional): Retry interval in milliseconds (default: 1000)
- historyExpiration (string|number, optional): Session history expiration (default: '1d')
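Putting these together, a global limits block might look like this sketch (values mirror the defaults listed above):

const ai = new Ai({
  // ... logger, providers, models
  limits: {
    rate: { max: 200, timeWindow: '30s' },
    requestTimeout: 30000,
    retry: { max: 1, interval: 1000 },
    historyExpiration: '1d'
  }
})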
restore
Configure how failed models are restored:
- rateLimit (string|number, optional): Rate limit error recovery time (default: '1m')
- retry (string|number, optional): Retry error recovery time (default: '1m')
- timeout (string|number, optional): Timeout error recovery time (default: '1m')
- providerCommunicationError (string|number, optional): Communication error recovery time (default: '1m')
- providerExceededError (string|number, optional): Quota exceeded error recovery time (default: '10m')
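For example, to restore rate-limited models quickly but wait longer after quota errors (values are illustrative):

const ai = new Ai({
  // ... other options
  restore: {
    rateLimit: '30s',             // retry rate-limited models after 30 seconds
    retry: '1m',
    timeout: '1m',
    providerCommunicationError: '1m',
    providerExceededError: '10m'  // wait longer after quota errors
  }
})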
Time Windows
Time windows can be specified as:
- String: '30s', '5m', '1h', '2d'
- Number: milliseconds (e.g., 30000 for 30 seconds)
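For example, these two rate configurations specify the same window:

// '30s' as a string and 30000 milliseconds are equivalent
const rateAsString = { max: 100, timeWindow: '30s' }
const rateAsNumber = { max: 100, timeWindow: 30000 }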
📚 API Reference
Core Methods
ai.init()
Initialize the AI instance, storage, and providers. Must be called before making requests.
ai.request(request)
Make an AI request with automatic fallback and session management.
Options:
- prompt (string, required): User input prompt
- models (array, optional): Specific models to use for this request
- options (object, optional): Request configuration options
  - context (string, optional): System context/instructions
  - temperature (number, optional): Model temperature (0-1)
  - maxTokens (number, optional): Maximum tokens to generate
  - stream (boolean, optional): Enable streaming responses (default: false)
  - sessionId (string, optional): Session identifier for conversation history
  - history (array, optional): Previous conversation history
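For example, a request combining several of these options (values are illustrative):

const response = await ai.request({
  prompt: 'Summarize our conversation so far',
  models: [{ provider: 'openai', model: 'gpt-4o' }], // restrict this request to one model
  options: {
    context: 'You are a concise assistant',
    temperature: 0.5,
    maxTokens: 200,
    sessionId: 'conversation-123'
  }
})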
ai.close()
Close all provider connections and storage.
Session Management
Sessions are automatically created and managed:
// Automatic session creation
const response = await ai.request({
  prompt: 'Hello, I am Alice'
})

console.log(response.sessionId) // Auto-generated session ID

// Continue conversation with session ID
const followUp = await ai.request({
  prompt: 'What is my name?',
  options: { sessionId: response.sessionId }
})

Response Types
Content Response (Non-streaming)
{
  text: "Generated text",        // The generated text
  result: "COMPLETE",            // 'COMPLETE' | 'INCOMPLETE_MAX_TOKENS' | 'INCOMPLETE_UNKNOWN'
  sessionId: "session-id-string" // Session identifier
}

Stream Response (Streaming)
Node.js Readable stream with attached sessionId property for session management.
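For example, the session identifier can be read directly from the returned stream before consuming it:

const streamResponse = await ai.request({
  prompt: 'Tell me a story',
  options: { stream: true }
})

// The session identifier is attached to the stream object itself
console.log(streamResponse.sessionId)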
// Process streaming response with for await loop
try {
  for await (const chunk of streamResponse) {
    const data = chunk.toString()
    // Process chunk (may contain multiple SSE events)
    console.log('Received:', data)
  }
  console.log('Stream completed')
} catch (err) {
  console.error('Stream error:', err)
}
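Since a single chunk may carry several SSE events, consumers typically need to split them. The following is a minimal, hypothetical parser sketch; it assumes events are separated by blank lines and that data fields hold JSON, matching the format shown in this document:

// Minimal sketch of splitting an SSE chunk into events. Real SSE parsing
// has more cases (multi-line data, comments, events split across chunks).
function parseSseChunk (chunk) {
  return chunk
    .split('\n\n')
    .filter(block => block.trim().length > 0)
    .map(block => {
      const event = {}
      for (const line of block.split('\n')) {
        const separator = line.indexOf(':')
        if (separator === -1) continue
        const field = line.slice(0, separator).trim()
        const value = line.slice(separator + 1).trim()
        event[field] = field === 'data' ? JSON.parse(value) : value
      }
      return event
    })
}

// Usage inside the loop above: const events = parseSseChunk(data)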
🔄 Auto-Resume Functionality
The AI provider includes advanced auto-resume capabilities that automatically recover interrupted streaming conversations:
Automatic Stream Resumption
When a streaming request is interrupted, the system can automatically resume from the last successfully received event:
// First streaming request
const stream1 = await ai.request({
  prompt: 'Write a long story about space exploration',
  options: {
    stream: true,
    sessionId: 'conversation-123'
  }
})

// If interrupted, resume automatically with same sessionId
const stream2 = await ai.request({
  prompt: 'Continue the story', // This prompt is ignored for resume
  options: {
    stream: true,
    sessionId: 'conversation-123', // Same session triggers auto-resume
    resume: true // Explicitly enable resume (default: true)
  }
})

// Only missing events will be streamed

UUID Event Identification
All streaming events include unique UUID identifiers for precise resumption:
// Streaming events include UUID IDs; consume the Node.js Readable with for await
for await (const chunk of streamResponse) {
  const data = chunk.toString()
  // Example SSE format:
  // id: f47ac10b-58cc-4372-a567-0e02b2c3d479
  // event: content
  // data: {"response": "Text chunk"}
}

Resume Configuration
Control resume behavior per request:
// Disable resume for fresh response
const response = await ai.request({
  prompt: 'New conversation',
  options: {
    sessionId: 'existing-session',
    stream: true,
    resume: false // Force new request instead of resume
  }
})

// Resume is enabled by default when sessionId + stream = true
const autoResumeResponse = await ai.request({
  prompt: 'Continue',
  options: {
    sessionId: 'existing-session',
    stream: true
    // resume: true (default)
  }
})

🗄️ Storage Architecture
Hash-Based Event Storage
The storage system uses Redis hash operations for O(1) event access:
// Storage structure: sessionId -> { eventUUID: eventData }
{
  "session-123": {
    "f47ac10b-58cc-4372-a567-0e02b2c3d479": {
      "timestamp": 1642789200000,
      "type": "content",
      "data": "First chunk"
    },
    "6ba7b810-9dad-11d1-80b4-00c04fd430c8": {
      "timestamp": 1642789201000,
      "type": "content",
      "data": "Second chunk"
    }
  }
}

Storage Backends
Memory Storage (Default)
Uses EventEmitter for pub/sub operations:
const ai = new Ai({
  // ... other options
  storage: {
    type: 'memory' // Default storage type
  }
})

Valkey/Redis Storage
Production-ready with Redis hash commands and dedicated pub/sub:
const ai = new Ai({
  // ... other options
  storage: {
    type: 'valkey',
    valkey: {
      host: 'localhost',
      port: 6379,
      username: 'default',
      password: 'your-password',
      database: 0
    }
  }
})

Storage Operations
The storage interface provides hash-based operations:
- hashSet(sessionId, eventId, value, expiration): Store event with UUID key
- hashGetAll(sessionId): Retrieve all events for a session
- hashGet(sessionId, eventId): Get a specific event by UUID
- rangeFromId(sessionId, fromEventId): Get events starting from a UUID
- publish(channel, data): Publish real-time events
- subscribe(channel, callback): Subscribe to event streams
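A hypothetical direct use of this interface might look like the sketch below; in practice these calls are made internally by the library, and storage, eventId, and lastReceivedEventId stand in for values it manages:

// Hypothetical sketch of the hash-based storage operations
await storage.hashSet('session-123', eventId, { type: 'content', data: 'chunk' }, '1d')
const allEvents = await storage.hashGetAll('session-123')                     // every event in the session
const missed = await storage.rangeFromId('session-123', lastReceivedEventId) // events after a UUID
await storage.publish('session-123', { type: 'content', data: 'chunk' })
await storage.subscribe('session-123', event => {
  console.log('live event:', event)
})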
🔄 Advanced Features
Custom Provider Client
Implement a custom HTTP client for providers:
import { Pool } from 'undici'
const customClient = {
  pool: new Pool('https://api.openai.com', {
    pipelining: 4,
    connections: 10
  })
}

const ai = new Ai({
  // ... other options
  providers: {
    openai: {
      apiKey: process.env.OPENAI_API_KEY,
      client: customClient
    }
  }
})

Error Handling
The library provides detailed error types:
try {
  const response = await ai.request({ prompt: 'Hello' })
} catch (error) {
  switch (error.code) {
    case 'PROVIDER_RATE_LIMIT_ERROR':
      console.log(`Rate limited, retry in ${error.retryAfter}s`)
      break
    case 'PROVIDER_REQUEST_TIMEOUT_ERROR':
      console.log(`Request timed out after ${error.timeout}ms`)
      break
    case 'PROVIDER_NO_MODELS_AVAILABLE_ERROR':
      console.log('All models are currently unavailable')
      break
    default:
      console.error('Unexpected error:', error.message)
  }
}

Model Selection Strategy
Models are selected in the order defined, with automatic fallback:
const models = [
  { provider: 'openai', model: 'gpt-4o-mini' },      // Try this first
  { provider: 'gemini', model: 'gemini-2.5-flash' }, // Fallback to this
  { provider: 'deepseek', model: 'deepseek-chat' }   // Final fallback
]

Development
# Install dependencies
npm install
# Run tests
npm test
# Run tests with coverage
npm run test:coverage
# Type check
npm run typecheck
# Build
npm run build
# Lint
npm run lint
# Fix linting issues
npm run lint:fix
# Full check (lint + typecheck + test + build)
npm run check

License
Apache-2.0
