@krutai/ai-provider

v0.3.8

Published

a month ago

AI provider package for KrutAI — fetch-based client for your deployed LangChain server with API key validation

0High
0Medium
0Low

satyabrat7805

ashwanikharwar

krutai ai langchain llm ai-provider fetch

@krutai/ai-provider

AI provider package for KrutAI — fetch-based client form our deployed server.

Features

🔑 API Key validation — validates your key against the server before use
🚀 Zero SDK dependencies — uses native fetch only
📡 Streaming — SSE-based streaming via async generator
💬 Multi-turn chat — full conversation history support
🎙️ Live Conversation — real-time voice via LiveKit
🔊 Text-to-Speech (TTS) — convert text to audio with high-quality voices
⚙️ Configurable — pass any model name to the server

Installation

npm install @krutai/ai-provider

Quick Start

import { krutAI } from '@krutai/ai-provider';

const ai = krutAI({
  apiKey: 'your-krutai-api-key',
  // Optional: omitted to use the default local dev server ('http://localhost:8000')
  // serverUrl: 'https://krut.ai',
});

await ai.initialize(); // validates key with your server

// Single response
const text = await ai.chat('Write a poem about TypeScript');
console.log(text);

Usage

Chat (single response)

const ai = krutAI({
  apiKey: process.env.KRUTAI_API_KEY!,
  serverUrl: 'https://krut.ai', // Override default for production
  model: 'gemini-3.1-pro-preview', // optional — server's default is used if omitted
});

await ai.initialize();

const text = await ai.chat('Explain async/await in JavaScript', {
  system: 'You are a helpful coding tutor.',
  maxTokens: 500,
  temperature: 0.7,
});

console.log(text);

Multi-turn Chat

const ai = krutAI({
  apiKey: process.env.KRUTAI_API_KEY!,
});

await ai.initialize();

const response = await ai.chat([
  { role: 'system', content: 'You are a helpful assistant.' },
  { role: 'user', content: 'What is the capital of France?' },
  { role: 'assistant', content: 'Paris.' },
  { role: 'user', content: 'What is it famous for?' },
]);

console.log(response);

Multimodal Messages (Images)

For vision-supported models, you can pass an array of ContentParts instead of a flat string:

const response = await ai.chat([
  {
    role: 'user',
    content: [
      { type: 'text', text: 'Describe this image for me.' },
      { 
        type: 'image_url', 
        image_url: { url: 'https://example.com/logo.png' } 
      }
    ]
  }
], { 
  model: 'gemini-3.1-pro-preview',
  // You can also pass images, documents, or pdfs via GenerateOptions
  images: ['https://example.com/photo.jpg'],
  documents: ['https://example.com/doc.docx'],
  pdf: ['https://example.com/report.pdf']
});

Streaming (Proxying SSE Streams)

If you are building an API route (e.g., in Next.js) and want to pipe the true Server-Sent Events (SSE) stream down to your backend component, use streamChatResponse.

streamChatResponse returns the raw fetch Response object containing the text/event-stream body from deployed LangChain server.

// app/api/chat/route.ts
export async function POST(req: Request) {
  const { messages } = await req.json();

  // Returns the native fetch Response (with text/event-stream headers and body)
  const response = await ai.streamChatResponse(messages);
  
  // Proxy it directly to the backend!
  return response;
}

If you need to consume the stream in a Node environment rather than proxying it, you can read from the response body directly:

const response = await ai.streamChatResponse([
  { role: 'user', content: 'Tell me a short story' }
]);

const reader = response.body?.getReader();
const decoder = new TextDecoder();

if (reader) {
  while (true) {
    const { done, value } = await reader.read();
    if (done) break;
    process.stdout.write(decoder.decode(value, { stream: true }));
  }
}

Structured Output

You can request the AI to return data in a specific JSON structure (e.g. for generating models, summaries, or profiles).

interface Profile {
  name: string;
  age: number;
}

const profile = await ai.chat<Profile>('Generate a profile for John Doe', {
  isStructure: true,
  // Pass an array of field names for simple string objects...
  output_structure: ['name', 'age'], 
  // ...or pass a full JSON Schema for complex objects
});

console.log(profile.name, profile.age);

Skip validation (useful for tests)

const ai = krutAI({
  apiKey: 'test-key',
  serverUrl: 'http://localhost:3000',
  validateOnInit: false, // skips the /validate round-trip
});

// No need to call initialize() when validateOnInit is false
const text = await ai.chat('Hello!');

Live Conversation (Real-time Voice)

This library supports real-time voice conversations using Gemini Live API via LiveKit. It provides a getLiveConnection method that returns LiveKit connection details (URL and token) which you can use with @livekit/components-react in your frontend.

Basic Usage

import { krutAI } from '@krutai/ai-provider';

const ai = krutAI({
  apiKey: process.env.KRUTAI_API_KEY!,
  serverUrl: 'http://localhost:8000',
});

await ai.initialize();

// Get LiveKit connection details
const { url, token } = await ai.getLiveConnection({
  room: 'my-voice-room',
  participant: 'user',
});

// Use with LiveKit components
// <LiveKitRoom serverUrl={url} token={token} ... />

With Custom Instructions and Voice

You can customize the AI's behavior and voice:

const ai = krutAI({
  apiKey: process.env.KRUTAI_API_KEY!,
});

await ai.initialize();

const { url, token } = await ai.getLiveConnection({
  room: 'support-agent-room',
  participant: 'customer',
  instructions: 'You are a helpful customer support agent.', // AI personality
  voice: 'Puck', // Voice name (see available voices below)
});

Available Voices

The following voices are supported:

| Voice Name | Description | |------------|-------------| | Puck | Default voice | | Zephyr | Light and airy | | Charon | Deep and authoritative | | Kore | Warm and conversational | | Fenrir | Strong and bold | | Leda | Soft and friendly | | Orus | Clear and professional | | Aoede | Expressive | | Callirrhoe | Rich and full | | Autonoe | Friendly | | Enceladus | Deep | | Iapetus | Professional | | Umbriel | Mellow | | Algieba | Warm | | Despina | Light | | Erinome | Soft | | Algenib | Clear | | Rasalgethi | Strong | | Laomedeia | Expressive | | Achernar | Deep | | Alnilam | Professional | | Schedar | Authoritative | | Gacrux | Warm | | Pulcherrima | Elegant | | Achird | Friendly |

Integration with LiveKit Components

Here's a complete example with Next.js and LiveKit:

// app/live/page.tsx
'use client';

import { useState } from 'react';
import { KrutAIProvider } from '@krutai/ai-provider';
import {
  LiveKitRoom,
  RoomAudioRenderer,
  BarVisualizer,
  VoiceAssistantControlBar,
  useVoiceAssistant,
  useConnectionState,
} from '@livekit/components-react';
import '@livekit/components-styles';

export default function LivePage() {
  const [connectionDetails, setConnectionDetails] = useState<{ url: string; token: string } | null>(null);

 const startCall = async () => {
    try {
      setError(null);
      setIsConnecting(true);
      
      const ai = new KrutAIProvider({
        apiKey: process.env.KRUTAI_API_KEY || '',
        serverUrl: process.env.KRUTAI_SERVER_URL || 'http://localhost:8000',
      });

      await ai.initialize();

      // Fetch LiveKit connection details instead of using websocket directly
      const details = await ai.getLiveConnection({
        room: `room-${Math.floor(Math.random() * 10000)}`,
        participant: 'user',
        instructions: 'You are a helpful assistant',
        voice: 'Puck',
      });

      setConnectionDetails(details);
    } catch (err: any) {
      setError(err.message);
    } finally {
      setIsConnecting(false);
    }
  };

  const endCall = () => {
    setConnectionDetails(null);
  };

  return (
    <div>
      {!connectionDetails ? (
        <button onClick={startCall}>Start Call</button>
      ) : (
      <LiveKitRoom
            serverUrl={connectionDetails.url}
            token={connectionDetails.token}
            connect={true}
            audio={true}
            video={false}
            onDisconnected={endCall}
          >
            <VoiceAssistantUI />
            <RoomAudioRenderer />
          </LiveKitRoom>
      )}
    </div>
  );
}

Text-to-Speech (TTS)

Convert text to speech using the Gemini TTS API. This returns base64-encoded audio content that can be played directly in the browser or saved to a file.

Basic Usage

const { audioContent, audioMimeType } = await ai.tts('Hello, how can I help you today?', {
  voice: 'Kore',
  speakingRate: 1.1,
});

// Play in browser
const audio = new Audio(`data:${audioMimeType};base64,${audioContent}`);
audio.play();

TTS Options

| Option | Type | Default | Description | |---|---|---|---| | voice | string | Charon | Voice name (see voices list in Live section) | | prompt | string | - | Style prompt (e.g., "Speak with excitement") | | encoding | string | MP3 | MP3, LINEAR16, or OGG_OPUS | | languageCode| string | en-US | Language code (e.g., en-US, es-ES) | | speakingRate| number | 1.0 | Rate (0.25 to 4.0) | | pitch | number | 0 | Pitch in semitones (-12.0 to 12.0) | | volumeGainDb| number | 0 | Volume gain in dB (-96.0 to 16.0) |

Server API Contract

Your LangChain server must expose these endpoints:

| Endpoint | Method | Auth | Body | |---|---|---|---| | /validate | POST | x-api-key header | { "apiKey": "..." } | | /generate | POST | Authorization: Bearer <key> | { "prompt": "...", "isStructure": boolean, "output_structure": any, ... } | | /stream | POST | Authorization: Bearer <key> | { "messages": [...], "model": "...", ... } | | /live | GET | Authorization: Bearer <key> | Query params: room, participant, instructions, voice | | /tts | POST | Authorization: Bearer <key> | { "text": "...", "voice": "...", ... } |

Validation response: { "valid": true } or { "valid": false, "message": "reason" }

AI response: { "text": "..." } or { "content": "..." } or { "message": "..." }

Stream: text/event-stream with data: <chunk> lines, ending with data: [DONE]

Live connection: { "url": "...", "token": "..." }

TTS response: { "audioContent": "...", "audioMimeType": "..." }

API Reference

`krutAI(config)`

Factory function — preferred way to create a provider.

const ai = krutAI({
  apiKey: string;           // required — KrutAI API key
  serverUrl?: string;       // optional — defaults to 'http://localhost:8000'
  model?: string;           // optional — passed to server (default: 'default')
  validateOnInit?: boolean; // optional — default: true
});

`KrutAIProvider`

Full class API with the same methods as above. Use when you need the class directly.

Exports

export { krutAI, KrutAIProvider, KrutAIKeyValidationError, validateApiKey, validateApiKeyFormat, DEFAULT_MODEL };
export type { KrutAIProviderConfig, GenerateOptions, ChatMessage, LiveConnectionOptions, TTSOptions, TTSResponse };

`getLiveConnection(options?)`

Get LiveKit connection details for real-time voice conversation.

interface LiveConnectionOptions {
  room?: string;          // Room name (default: 'gemini-room')
  participant?: string;  // Participant identity (default: random)
  instructions?: string; // AI system prompt (default: 'You are a helpful assistant')
  voice?: string;      // Voice name (default: 'Puck')
}

// Returns LiveKit connection details
const { url, token } = await ai.getLiveConnection({
  room: 'my-room',
  instructions: 'You are a helpful assistant.',
  voice: 'Kore',
});

`tts(text, options?)`

Convert text to speech.

interface TTSOptions {
  voice?: string;
  prompt?: string;
  encoding?: 'MP3' | 'LINEAR16' | 'OGG_OPUS';
  languageCode?: string;
  speakingRate?: number;
  pitch?: number;
  volumeGainDb?: number;
}

// Returns base64 audio and mime type
const { audioContent, audioMimeType } = await ai.tts("Hello world", {
  voice: "Puck"
});

License

MIT

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

@krutai/ai-provider

Features

Installation

Quick Start

Usage

Chat (single response)

Multi-turn Chat

Multimodal Messages (Images)

Streaming (Proxying SSE Streams)

Structured Output

Skip validation (useful for tests)

Live Conversation (Real-time Voice)

Basic Usage

With Custom Instructions and Voice

Available Voices

Integration with LiveKit Components

Text-to-Speech (TTS)

Basic Usage

TTS Options

Server API Contract

API Reference

krutAI(config)

KrutAIProvider

Exports

getLiveConnection(options?)

tts(text, options?)

License

`krutAI(config)`

`KrutAIProvider`

`getLiveConnection(options?)`

`tts(text, options?)`