@ai-integrator/core
The lightest AI integration library with zero-config switching between OpenAI, Anthropic, and Google Gemini. Optimized for edge runtimes.
Why @ai-integrator/core?
- 🪶 Lightweight: Minimal dependencies, tree-shakable, ~17.6 KB minified / ~4.4 KB gzipped
- ⚡ Zero-config: Switch providers with a single parameter
- 🌐 Edge-ready: Works on Cloudflare Workers, Vercel Edge, Deno, Node.js
- 🔄 Auto-fallback: Automatic provider switching when APIs fail
- 📡 Streaming: First-class streaming support across all providers
- 🛠️ Tool calling: Unified function/tool calling API across all providers
- 🔌 Custom providers: Bring your own LLM backend or integrate any provider
- 🔒 Type-safe: Full TypeScript support with comprehensive types
- 🎯 Simple API: Unified interface across OpenAI, Anthropic, Gemini, and custom providers
Installation
npm install @ai-integrator/core
Then install the provider SDKs you need:
# OpenAI
npm install openai
# Anthropic (Claude)
npm install @anthropic-ai/sdk
# Google Gemini
npm install @google/generative-ai
Note: Provider SDKs are peer dependencies, so you only install what you use.
Quick Start
Basic Usage
import { AIClient } from '@ai-integrator/core';
const client = new AIClient({
provider: 'openai',
apiKey: process.env.OPENAI_API_KEY,
});
const response = await client.chat({
model: 'gpt-4o-mini',
messages: [
{ role: 'user', content: 'What is the capital of France?' }
],
});
console.log(response.message.content);
// Output: "The capital of France is Paris."
Switch Providers
// Use Anthropic instead
const client = new AIClient({
provider: 'anthropic',
apiKey: process.env.ANTHROPIC_API_KEY,
});
const response = await client.chat({
model: 'claude-3-5-sonnet-20241022',
messages: [
{ role: 'user', content: 'Explain quantum computing' }
],
});
// Or use Gemini
const client = new AIClient({
provider: 'gemini',
apiKey: process.env.GEMINI_API_KEY,
});
const response = await client.chat({
model: 'gemini-2.0-flash-exp',
messages: [
{ role: 'user', content: 'Write a haiku about code' }
],
});
Streaming
const stream = client.chatStream({
model: 'gpt-4o-mini',
messages: [
{ role: 'user', content: 'Write a short story about a robot' }
],
});
for await (const chunk of stream) {
process.stdout.write(chunk.delta.content || '');
}
Automatic Fallback
Configure fallback providers for high availability:
const client = new AIClient({
provider: 'openai',
apiKey: process.env.OPENAI_API_KEY,
fallbacks: [
{
provider: 'anthropic',
apiKey: process.env.ANTHROPIC_API_KEY,
priority: 1, // Lower = higher priority
},
{
provider: 'gemini',
apiKey: process.env.GEMINI_API_KEY,
priority: 2,
},
],
});
// If OpenAI fails, automatically tries Anthropic, then Gemini
const response = await client.chat({
model: 'gpt-4o-mini',
messages: [{ role: 'user', content: 'Hello!' }],
});
Advanced Configuration
Retry Logic
const client = new AIClient({
provider: 'openai',
apiKey: process.env.OPENAI_API_KEY,
retry: {
maxRetries: 3,
initialDelay: 1000, // 1 second
maxDelay: 60000, // 60 seconds
backoffMultiplier: 2, // Exponential backoff
},
timeout: 30000, // 30 seconds
});
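Assuming the conventional exponential-backoff formula, delay = initialDelay × backoffMultiplier^attempt (capped at maxDelay), these settings wait roughly 1 s, 2 s, then 4 s between attempts:
// Illustration only: the retry schedule implied by the config above,
// assuming a conventional exponential-backoff formula.
const delays = [0, 1, 2].map((attempt) => Math.min(1000 * 2 ** attempt, 60_000));
console.log(delays); // [1000, 2000, 4000] milliseconds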
Debug Mode
const client = new AIClient({
provider: 'openai',
apiKey: process.env.OPENAI_API_KEY,
debug: true, // Enables detailed logging
});
System Messages
const response = await client.chat({
model: 'gpt-4o-mini',
messages: [
{ role: 'system', content: 'You are a helpful coding assistant' },
{ role: 'user', content: 'How do I center a div?' },
],
});
Temperature & Other Options
const response = await client.chat({
model: 'gpt-4o-mini',
messages: [{ role: 'user', content: 'Be creative!' }],
temperature: 1.5,
max_tokens: 500,
top_p: 0.9,
stop: ['\n\n', 'END'],
});
Function/Tool Calling
Call external functions and APIs from your AI models across all providers.
Note: If you're upgrading from the legacy functions API, see the Migration Guide.
Basic Example
// Define the tools once so they can be reused in the follow-up call below
const tools = [{
type: 'function',
function: {
name: 'get_weather',
description: 'Get current weather for a location',
parameters: {
type: 'object',
properties: {
location: { type: 'string', description: 'City name' },
unit: { type: 'string', enum: ['celsius', 'fahrenheit'] }
},
required: ['location']
}
}
}];
const response = await client.chat({
model: 'gpt-4o-mini',
messages: [{ role: 'user', content: 'What is the weather in Tokyo?' }],
tools,
});
if (response.message.tool_calls) {
// Execute your function
const weatherData = getWeather('Tokyo');
// Send result back
const finalResponse = await client.chat({
model: 'gpt-4o-mini',
messages: [
{ role: 'user', content: 'What is the weather in Tokyo?' },
response.message,
{
role: 'tool',
content: JSON.stringify(weatherData),
tool_call_id: response.message.tool_calls[0].id,
}
],
tools,
});
console.log(finalResponse.message.content);
}
Provider Support
| Provider | Tool Calling | Parallel Calls | Streaming |
|----------|--------------|----------------|-----------|
| OpenAI | ✅ | ✅ | ✅ |
| Anthropic | ✅ | ✅ | ✅ |
| Gemini | ✅ | ❌ | ✅ |
Tool Choice Options
// Let model decide (default)
tool_choice: 'auto'
// Disable tools
tool_choice: 'none'
// Force tool use (OpenAI only)
tool_choice: 'required'
// Force specific tool
tool_choice: { type: 'function', function: { name: 'get_weather' } }
// Enable parallel tool calls (OpenAI only, default: true)
parallel_tool_calls: true
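For example, forcing a specific tool on a request (reusing the tools array and client from the basic example above):
// Force the model to call get_weather rather than answering directly
const response = await client.chat({
model: 'gpt-4o-mini',
messages: [{ role: 'user', content: 'What is the weather in Paris?' }],
tools,
tool_choice: { type: 'function', function: { name: 'get_weather' } },
});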
Custom Provider Support
You can create custom providers to integrate any LLM or backend API while keeping the unified interface and features like auto-fallback and retry logic.
Creating a Custom Provider
Extend the BaseProvider class to create your own provider:
import { BaseProvider, type ChatRequest, type ChatResponse, type StreamChunk, AIIntegratorError, ErrorType } from '@ai-integrator/core';
class MyCustomProvider extends BaseProvider {
readonly type = 'my-custom' as const;
protected getProviderDefaultModel(): string {
return 'my-default-model';
}
async chat(request: ChatRequest): Promise<ChatResponse> {
this.validateRequest(request);
const response = await fetch(`${this.config.baseURL}/chat`, {
method: 'POST',
headers: {
'Authorization': `Bearer ${this.config.apiKey}`,
'Content-Type': 'application/json',
},
body: JSON.stringify({
model: request.model || this.getDefaultModel(),
messages: request.messages,
}),
});
const data = await response.json();
return {
id: data.id,
provider: this.type,
model: data.model,
message: {
role: 'assistant',
content: data.content,
},
finish_reason: 'stop',
created_at: new Date(),
};
}
async *chatStream(request: ChatRequest): AsyncGenerator<StreamChunk> {
// Implement streaming logic
this.validateRequest(request);
// ... streaming implementation
}
protected handleError(error: unknown): AIIntegratorError {
return new AIIntegratorError(
ErrorType.API_ERROR,
error instanceof Error ? error.message : 'Unknown error',
undefined,
this.type,
false,
error
);
}
}
Using a Custom Provider
import { AIClient } from '@ai-integrator/core';
import { MyCustomProvider } from './my-custom-provider';
const client = new AIClient({
provider: 'my-custom',
customProvider: MyCustomProvider,
apiKey: process.env.MY_API_KEY,
baseURL: 'https://my-api.com/v1',
});
const response = await client.chat({
model: 'my-model',
messages: [{ role: 'user', content: 'Hello!' }],
});
Custom Provider with Fallbacks
Custom providers work seamlessly with the auto-fallback system:
const client = new AIClient({
provider: 'my-custom',
customProvider: MyCustomProvider,
apiKey: process.env.MY_API_KEY,
baseURL: 'https://my-api.com/v1',
fallbacks: [
{
provider: 'openai',
apiKey: process.env.OPENAI_API_KEY,
priority: 1,
},
{
provider: 'another-custom',
customProvider: AnotherCustomProvider,
apiKey: process.env.ANOTHER_API_KEY,
priority: 2,
},
],
});
Benefits of Custom Providers
- ✅ No vendor lock-in: Use any LLM provider or your own backend
- ✅ Unified interface: Same API across all providers (built-in and custom)
- ✅ Auto-fallback support: Custom providers work with the fallback system
- ✅ Type-safe: Full TypeScript support for custom providers
- ✅ Retry logic: Automatic retries work with custom providers
- ✅ Streaming support: Implement streaming for your custom provider
Use Cases
- Custom backend API: Integrate with your own LLM backend
- Fine-tuned models: Use fine-tuned models from custom endpoints
- New LLM providers: Add support for providers like Mistral, Cohere, etc.
- Proxy/Middleware: Add logging, caching, or rate limiting (see the sketch below)
See examples/custom-provider.ts for complete examples.
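As a minimal illustration of the proxy/middleware case, a subclass can log and time requests before delegating. This is a sketch: LoggingProvider and its log format are hypothetical, and it reuses the MyCustomProvider class defined above.
import { type ChatRequest, type ChatResponse } from '@ai-integrator/core';
import { MyCustomProvider } from './my-custom-provider';
// Hypothetical middleware provider: logs each request, then delegates upstream
class LoggingProvider extends MyCustomProvider {
async chat(request: ChatRequest): Promise<ChatResponse> {
const start = Date.now();
console.log(`[ai] -> ${request.model} (${request.messages.length} messages)`);
const response = await super.chat(request);
console.log(`[ai] <- ${response.finish_reason} after ${Date.now() - start}ms`);
return response;
}
}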
Edge Runtime Examples
Cloudflare Workers
export default {
async fetch(request: Request, env: Env) {
const client = new AIClient({
provider: 'openai',
apiKey: env.OPENAI_API_KEY,
});
const response = await client.chat({
model: 'gpt-4o-mini',
messages: [{ role: 'user', content: 'Hello from the edge!' }],
});
return new Response(response.message.content);
},
};
Vercel Edge Functions
import { AIClient } from '@ai-integrator/core';
export const config = {
runtime: 'edge',
};
export default async function handler(req: Request) {
const client = new AIClient({
provider: 'anthropic',
apiKey: process.env.ANTHROPIC_API_KEY,
});
const stream = client.chatStream({
model: 'claude-3-5-sonnet-20241022',
messages: [{ role: 'user', content: 'Stream me a story' }],
});
const encoder = new TextEncoder();
const readable = new ReadableStream({
async start(controller) {
for await (const chunk of stream) {
controller.enqueue(encoder.encode(chunk.delta.content || ''));
}
controller.close();
},
});
return new Response(readable, {
headers: { 'Content-Type': 'text/plain' },
});
}
Deno
import { AIClient } from 'npm:@ai-integrator/core';
const client = new AIClient({
provider: 'gemini',
apiKey: Deno.env.get('GEMINI_API_KEY')!,
});
const response = await client.chat({
model: 'gemini-2.0-flash-exp',
messages: [{ role: 'user', content: 'Hello from Deno!' }],
});
console.log(response.message.content);
API Reference
AIClient
Main client class for interacting with AI providers.
Constructor
new AIClient(config: AIClientConfig)
AIClientConfig:
| Property | Type | Required | Description |
|----------|------|----------|-------------|
| provider | 'openai' \| 'anthropic' \| 'gemini' \| string | Yes | Primary AI provider (built-in or custom identifier) |
| apiKey | string | Yes | API key for the provider |
| customProvider | class extending BaseProvider | No | Custom provider class (required when using custom provider identifier) |
| baseURL | string | No | Custom API endpoint |
| organization | string | No | Organization ID (OpenAI only) |
| defaultModel | string | No | Default model to use |
| fallbacks | FallbackConfig[] | No | Fallback providers (supports both built-in and custom providers) |
| retry | RetryConfig | No | Retry configuration |
| timeout | number | No | Request timeout in ms |
| debug | boolean | No | Enable debug logging |
Methods
chat(request: ChatRequest): Promise<ChatResponse>
Perform a chat completion.
ChatRequest:
| Property | Type | Required | Description |
|----------|------|----------|-------------|
| model | string | Yes | Model identifier |
| messages | Message[] | Yes | Conversation messages |
| temperature | number | No | Sampling temperature (0-2) |
| max_tokens | number | No | Maximum tokens to generate |
| top_p | number | No | Nucleus sampling parameter |
| stop | string \| string[] | No | Stop sequences |
| stream | boolean | No | Enable streaming |
| tools | ToolDefinition[] | No | Function/tool definitions for tool calling |
| tool_choice | ToolChoice | No | Control which tool to call ('auto', 'none', 'required', or specific tool) |
| parallel_tool_calls | boolean | No | Enable parallel tool execution (OpenAI only, default: true) |
chatStream(request: ChatRequest): AsyncGenerator<StreamChunk>
Perform a streaming chat completion.
getPrimaryProvider(): string
Get the current primary provider type.
getProviders(): string[]
Get all configured providers.
setDebug(enabled: boolean): void
Enable or disable debug logging.
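For example, with a fallback configured (the return values shown are illustrative):
const client = new AIClient({
provider: 'openai',
apiKey: process.env.OPENAI_API_KEY,
fallbacks: [{ provider: 'anthropic', apiKey: process.env.ANTHROPIC_API_KEY, priority: 1 }],
});
console.log(client.getPrimaryProvider()); // 'openai'
console.log(client.getProviders()); // ['openai', 'anthropic']
client.setDebug(true); // toggle verbose logging at runtime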
Feature Comparison
| Feature | OpenAI | Anthropic | Gemini | @ai-integrator/core |
|---------|--------|-----------|--------|---------------------|
| Unified API | ❌ | ❌ | ❌ | ✅ |
| Auto-fallback | ❌ | ❌ | ❌ | ✅ |
| Edge-ready | ⚠️ | ⚠️ | ⚠️ | ✅ |
| Zero-config switching | ❌ | ❌ | ❌ | ✅ |
| Built-in retry | ❌ | ❌ | ❌ | ✅ |
| Custom provider support | ❌ | ❌ | ❌ | ✅ |
| Tool/Function calling | ✅ | ✅ | ✅ | ✅ |
| Streaming | ✅ | ✅ | ✅ | ✅ |
| TypeScript | ✅ | ✅ | ✅ | ✅ |
Comparison with Other Libraries
When to Use Each Library
| You Need | Use This | Why |
|----------|----------|-----|
| Simple provider switching | @ai-integrator/core | Minimal overhead, focused scope |
| UI streaming components | Vercel AI SDK | React hooks, UI framework integration |
| Full AI framework | LangChain | Chains, agents, memory, retrievers |
| Self-hosted proxy | LiteLLM | Centralized routing, enterprise features |
vs. Vercel AI SDK
@ai-integrator/core:
- Focused on backend provider abstraction
- Wraps official provider SDKs (peer dependencies)
- 4.4 KB gzipped - optimized for edge
- No UI components or framework integration
Vercel AI SDK:
- Comprehensive UI and streaming toolkit
- Custom API client implementations
- ~50 KB gzipped - includes React hooks, Zod validation
- Built for full-stack Next.js applications
Best of both: Use Vercel AI SDK for UI, @ai-integrator/core for backend provider management.
vs. LangChain
@ai-integrator/core:
- Minimal provider abstraction layer
- Simple API for chat completions and streaming
- 4.4 KB gzipped - single-purpose library
- Perfect for straightforward LLM API calls
LangChain:
- Full AI application framework
- Chains, agents, memory, retrievers, vector stores
- 37+ KB gzipped (core) to 200+ KB (full) - comprehensive features
- Built for complex AI workflows and orchestration
Best of both: Use LangChain for complex AI workflows, @ai-integrator/core for simple provider switching.
vs. LiteLLM
@ai-integrator/core:
- Zero infrastructure required
- Edge-compatible npm package
- Direct integration in your codebase
- Developer-first, library approach
LiteLLM:
- Self-hosted proxy server
- Centralized routing and rate limiting
- Python-first with enterprise features
- Operations-first, infrastructure approach
Best of both: Use LiteLLM for centralized management, @ai-integrator/core for embedded integration.
Default Models
| Provider | Default Model |
|----------|---------------|
| OpenAI | gpt-4o-mini |
| Anthropic | claude-3-5-sonnet-20241022 |
| Gemini | gemini-2.0-flash-exp |
Error Handling
import { AIIntegratorError } from '@ai-integrator/core';
try {
const response = await client.chat({
model: 'gpt-4o-mini',
messages: [{ role: 'user', content: 'Hello!' }],
});
} catch (error) {
if (error instanceof AIIntegratorError) {
console.error('Error type:', error.type);
console.error('Provider:', error.provider);
console.error('Status code:', error.statusCode);
console.error('Retryable:', error.retryable);
}
}
Error Types:
- authentication_error: Invalid API key
- rate_limit_error: Rate limit exceeded
- invalid_request_error: Invalid request parameters
- api_error: API error from provider
- timeout_error: Request timeout
- network_error: Network connectivity issue
- unknown_error: Unknown error
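Assuming error.type carries the string values listed above, you can branch on it in a catch block; the built-in retry and fallback logic usually covers the retryable cases for you. A sketch:
try {
await client.chat({ model: 'gpt-4o-mini', messages: [{ role: 'user', content: 'Hi' }] });
} catch (error) {
if (error instanceof AIIntegratorError) {
if (error.type === 'authentication_error') throw error; // misconfiguration: surface immediately
if (error.retryable) {
// transient (rate limit, timeout, network): safe to retry or fall back
}
}
}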
Bundle Size
| Package | Minified | Gzipped | Focus Area | Dependency Strategy |
|---------|----------|---------|------------|---------------------|
| @ai-integrator/core | 17.6 KB | 4.4 KB ✨ | Provider abstraction | Peer deps (optional SDKs) |
| Vercel AI SDK (ai) | ~186 KB | ~50 KB† | UI + streaming toolkit | Custom API clients |
| LangChain Core | ~120 KB | ~37 KB | Framework base | Optional dependencies |
| LangChain (full) | 800+ KB | 200+ KB | Complete framework | Modular packages |
Measurement notes:
- Gzipped is the actual size delivered to users (HTTP compression) - the industry standard
- @ai-integrator/core: Measured from built package v0.2.0
- Vercel AI SDK: Based on community analysis (source)
- LangChain: Based on official docs and Bundlephobia
- † Estimated from typical gzip compression ratios
Why These Size Differences?
We're smaller because:
- Focused scope: Only provider switching, no UI, no chains, no agents
- Leverage official SDKs: We wrap existing libraries (peer deps)
- Minimal transformations: Only convert between formats when needed
- No validation library: Rely on provider SDKs for validation
Vercel AI SDK is larger because:
- Custom implementations: Reimplements API clients for all providers
- UI framework integration: React hooks, streaming components
- Zod validation: Runtime validation and deep type inference
- More features: Structured outputs, middleware, tooling
LangChain is larger because:
- Full framework: Chains, agents, memory, retrievers, tools
- 100+ integrations: Vector stores, document loaders, etc.
- Complex abstractions: LCEL, Runnables, callback systems
💡 Bottom line: Each library serves different needs. We're the smallest because we focus exclusively on provider abstraction.
Best Practices
- Use environment variables for API keys
- Enable fallbacks for production applications
- Configure timeouts appropriate for your use case
- Use streaming for better UX in chat applications
- Handle errors gracefully with try-catch
- Set appropriate temperature based on use case (lower for factual, higher for creative)
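Putting several of these together, a production-style setup might look like this (a sketch built only from options documented above):
const client = new AIClient({
provider: 'openai',
apiKey: process.env.OPENAI_API_KEY, // keys come from environment variables
timeout: 15000, // tune to your use case
fallbacks: [
{ provider: 'anthropic', apiKey: process.env.ANTHROPIC_API_KEY, priority: 1 },
],
});
try {
// Stream for better perceived latency in chat applications
const stream = client.chatStream({
model: 'gpt-4o-mini',
messages: [{ role: 'user', content: 'Summarize our release notes' }],
temperature: 0.2, // lower temperature for factual tasks
});
for await (const chunk of stream) {
process.stdout.write(chunk.delta.content || '');
}
} catch (error) {
// Handle AIIntegratorError as shown in Error Handling above
console.error(error);
}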
Requirements
- Node.js 20+
- TypeScript 5+ (for type support)
License
MIT
Contributing
Contributions are welcome! Please read our Contributing Guide before submitting a pull request.
Development Workflow
- Fork the repository
- Clone and install dependencies:
npm install
- Create a feature branch:
git checkout -b feat/my-feature
- Make your changes
- Run validation:
npm run validate
- Commit with conventional commits (enforced by git hooks)
- Submit a pull request
Quick links:
- Quick Start Guide - Fast setup and workflow
- Contributing Guide - Detailed contribution guidelines
Commit Message Format
This repository enforces commit message conventions using git hooks.
Use the interactive commit tool:
git add .
npm run commit
Or write commits manually following the format:
git commit -m "feat: add new feature"
git commit -m "fix: resolve bug"
git commit -m "docs: update readme"See our guides:
- Commit Convention - Format and versioning rules
- Commit Linting Guide - Validation and troubleshooting
Automated Release Process
When your PR is merged to main, the package will be automatically:
- Version bumped (based on commit messages)
- Published to npm
- Released on GitHub with bundle size info
See RELEASING.md for details.
Documentation
- Documentation Index - Complete guide to all documentation
- GitHub Issues - Report issues or request features
- Contributing Guide - How to contribute
Roadmap
✅ Completed (v0.2.0)
- [x] Function/tool calling support - Full support across OpenAI, Anthropic, and Gemini
- Unified tool API with backward compatibility
- Parallel tool execution (OpenAI)
- Streaming tool calls
- Comprehensive migration guide
✅ Completed (v0.3.0)
- [x] Custom provider support - Bring your own LLM backend
- Extend BaseProvider class for custom implementations
- Full type safety for custom providers
- Custom providers work with auto-fallback system
- Example implementations and documentation
- No breaking changes to existing API
🚀 Upcoming
Near-term (v0.3.x - v0.4.x)
- [ ] Vision support for multimodal models
- Image inputs for GPT-4 Vision, Claude 3, Gemini Pro Vision
- Unified image handling API
- [ ] Enhanced streaming capabilities
- Server-sent events (SSE) support
- WebSocket streaming option
- Better error handling in streams
Mid-term (v0.5.x - v0.7.x)
- [ ] Caching layer
- Built-in response caching
- Prompt caching (Anthropic)
- Cache invalidation strategies
- [ ] Community provider registry
- Pre-built custom providers for Mistral, Cohere, etc.
- Verified provider implementations
- Example providers and templates
- [ ] Observability & monitoring
- Custom hooks for logging
- Token usage tracking
- Cost estimation
- Performance metrics
Long-term (v0.8.x+)
- [ ] Advanced features
- Structured output modes
- JSON mode support across providers
- Audio input/output support
- Batch processing API
- [ ] Developer tooling
- Admin dashboard for monitoring
- CLI tool for testing providers
- Playground/sandbox environment
Made with ❤️ for developers who want simple AI integration
