@tetherai/mistral
Standalone Mistral provider for TetherAI - Everything you need in one package!
This package provides a complete, streaming-first solution for the Mistral AI Chat Completions API.
No external dependencies required - includes all types, utilities, and middleware built-in.
Think of it as Express for AI providers with everything included.
What's Included
- Mistral Provider: Streaming chat completions with full API support
- Enhanced Chat Options: Temperature, maxTokens, topP, frequencyPenalty, presencePenalty, stop sequences, system prompts
- Non-Streaming Chat: Complete response handling for simple requests
- Model Management: List models, validate model IDs, get token limits
- Retry Middleware: Automatic retries with exponential backoff
- Fallback Middleware: Multi-provider failover support
- Error Handling: Rich error classes with HTTP status codes
- Edge Runtime: Works everywhere from Node.js to Cloudflare Workers
- SSE Utilities: Built-in Server-Sent Events parsing
- TypeScript: 100% typed with zero `any` types
Quick Start
Installation
npm install @tetherai/mistral
# or
pnpm add @tetherai/mistral
# or
yarn add @tetherai/mistral
That's it! No additional packages needed - everything is included.
Basic Usage
Set your API key:
export MISTRAL_API_KEY=mist-...
Streaming Chat Example
import { mistral } from "@tetherai/mistral";
const provider = mistral({
apiKey: process.env.MISTRAL_API_KEY!,
timeout: 30000, // 30 second timeout
maxRetries: 2 // Built-in retry configuration
});
for await (const chunk of provider.streamChat({
model: "mistral-large-latest",
messages: [{ role: "user", content: "Hello!" }],
temperature: 0.7, // Enhanced chat options
maxTokens: 1000,
systemPrompt: "You are a helpful assistant."
})) {
if (chunk.done) break;
process.stdout.write(chunk.delta);
}
Non-Streaming Chat Example
const response = await provider.chat({
model: "mistral-large-latest",
messages: [{ role: "user", content: "Hello!" }],
temperature: 0.5,
maxTokens: 500,
responseFormat: "json_object" // Get structured responses
});
console.log(response.content);
console.log(`Used ${response.usage.totalTokens} tokens`);
Model Management Example
// Get available models
const models = await provider.getModels();
console.log("Available models:", models);
// Validate model ID
const isValid = provider.validateModel("mistral-large-latest");
console.log("Model valid:", isValid);
// Get token limits
const maxTokens = provider.getMaxTokens("mistral-large-latest");
console.log("Max tokens:", maxTokens);API Reference
Provider Configuration
interface MistralOptions {
apiKey: string; // Required: Your Mistral API key
baseURL?: string; // Optional: Custom API endpoint (default: https://api.mistral.ai/v1)
timeout?: number; // Optional: Request timeout in milliseconds (default: 30000)
maxRetries?: number; // Optional: Built-in retry attempts (default: 2)
fetch?: typeof fetch; // Optional: Custom fetch implementation
}
Streaming Chat
const stream = provider.streamChat({
model: "mistral-large-latest", // Required: Model to use
messages: [ // Required: Conversation history
{ role: "user", content: "Hello" }
],
temperature: 0.7, // Optional: Randomness (0.0 to 2.0)
maxTokens: 1000, // Optional: Max tokens to generate
topP: 0.9, // Optional: Nucleus sampling
frequencyPenalty: 0.1, // Optional: Repetition penalty
presencePenalty: 0.1, // Optional: Topic penalty
stop: ["\n", "END"], // Optional: Stop sequences
systemPrompt: "You are helpful" // Optional: System instructions
});
// Process streaming response
for await (const chunk of stream) {
if (chunk.done) break;
process.stdout.write(chunk.delta);
}
Non-Streaming Chat
const response = await provider.chat({
model: "mistral-large-latest",
messages: [{ role: "user", content: "Hello" }],
temperature: 0.7,
maxTokens: 1000,
responseFormat: "json_object" // Get structured responses
});
console.log(response.content);
console.log(`Model: ${response.model}`);
console.log(`Finish reason: ${response.finishReason}`);
console.log(`Usage: ${response.usage.totalTokens} tokens`);
Model Management
// List available models
const models = await provider.getModels();
// Returns: ['mistral-tiny', 'mistral-small', 'mistral-medium', 'mistral-large']
// Validate model ID
const isValid = provider.validateModel("mistral-large-latest"); // true
const isInvalid = provider.validateModel("gpt-4"); // false
// Get token limits
const maxTokens = provider.getMaxTokens("mistral-large-latest"); // 32768
const smallTokens = provider.getMaxTokens("mistral-small-latest"); // 32768
Supported Models
| Model | Context | Description |
|-------|---------|-------------|
| mistral-tiny | 32K | Fast and efficient model |
| mistral-small | 32K | Balanced performance and speed |
| mistral-medium | 32K | High-quality responses |
| mistral-large | 32K | Most capable model |
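The model helpers pair naturally with this table. Here is a minimal sketch of guarding a request: validate the model ID first, then cap maxTokens to the model's context window. The candidate model ID and the 500-token reply budget are placeholder values, not recommendations.
const candidateModel = "mistral-small-latest"; // placeholder model ID
if (!provider.validateModel(candidateModel)) {
  throw new Error(`Unknown model: ${candidateModel}`);
}
// Cap the reply budget to the model's context window from the table above
const replyBudget = Math.min(500, provider.getMaxTokens(candidateModel));
const summary = await provider.chat({
  model: candidateModel,
  messages: [{ role: "user", content: "Summarize streaming in one sentence." }],
  maxTokens: replyBudget
});
console.log(summary.content);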
Error Handling
The provider throws MistralError for API-related errors:
import { MistralError } from "@tetherai/mistral";
try {
const response = await provider.chat({
model: "mistral-large-latest",
messages: [{ role: "user", content: "Hello" }]
});
} catch (error) {
if (error instanceof MistralError) {
console.error(`Mistral API Error ${error.status}: ${error.message}`);
switch (error.status) {
case 401:
console.error("Invalid API key");
break;
case 429:
console.error("Rate limit exceeded");
break;
case 500:
console.error("Server error");
break;
default:
console.error("Unexpected error");
}
}
}
Error Types
- MistralError - Base error class with HTTP status and message
- 401 - Authentication failed (invalid API key)
- 429 - Rate limit exceeded
- 500 - Server error
- 400 - Bad request (invalid parameters)
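If you prefer to keep this mapping in one place, a small helper can translate a caught error into a user-facing message plus a retry hint. The function name describeMistralError and its return shape are illustrative, not part of the package:
import { MistralError } from "@tetherai/mistral";
function describeMistralError(error: unknown): { message: string; retryable: boolean } {
  if (!(error instanceof MistralError)) {
    return { message: "Unknown error", retryable: false };
  }
  switch (error.status) {
    case 400: return { message: "Bad request (invalid parameters)", retryable: false };
    case 401: return { message: "Authentication failed (invalid API key)", retryable: false };
    case 429: return { message: "Rate limit exceeded", retryable: true };
    default:  return { message: error.message, retryable: error.status >= 500 };
  }
}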
Middleware
Retry Middleware
import { withRetry } from "@tetherai/mistral";
const retryProvider = withRetry(provider, {
maxRetries: 3,
retryDelay: 1000,
shouldRetry: (error) => error.status >= 500
});
// Use with automatic retries
const response = await retryProvider.chat({
model: "mistral-large-latest",
messages: [{ role: "user", content: "Hello" }]
});
Fallback Middleware
import { withFallback } from "@tetherai/mistral";
const fallbackProvider = withFallback(provider, {
fallbackProvider: backupProvider,
shouldFallback: (error) => error.status === 429
});
// Automatically fallback on rate limits
const response = await fallbackProvider.chat({
model: "mistral-large-latest",
messages: [{ role: "user", content: "Hello" }]
});
Advanced Examples
Streaming with System Prompt
const stream = provider.streamChat({
model: "mistral-large-latest",
messages: [
{ role: "user", content: "Write a Python function to calculate fibonacci numbers" }
],
systemPrompt: "You are a helpful coding assistant. Always provide working code examples.",
temperature: 0.3,
maxTokens: 2000
});
let fullResponse = "";
for await (const chunk of stream) {
if (chunk.done) break;
fullResponse += chunk.delta;
process.stdout.write(chunk.delta);
}
console.log("\n\nFull response:", fullResponse);Error Recovery with Fallback
import { mistral } from "@tetherai/mistral";
import { withFallback } from "@tetherai/mistral";
const mistralProvider = mistral({ apiKey: process.env.MISTRAL_API_KEY! });
// openAI here stands for another TetherAI-compatible provider used as a backup,
// imported from its own provider package
const backupProvider = openAI({ apiKey: process.env.OPENAI_API_KEY! });
const fallbackProvider = withFallback(mistralProvider, {
fallbackProvider: backupProvider,
shouldFallback: (error) => error.status === 429 || error.status >= 500
});
try {
const response = await fallbackProvider.chat({
model: "mistral-large-latest",
messages: [{ role: "user", content: "Hello" }]
});
console.log("Response:", response.content);
} catch (error) {
console.error("Both providers failed:", error);
}
Custom Fetch Implementation
const customProvider = mistral({
apiKey: process.env.MISTRAL_API_KEY!,
fetch: async (url, options) => {
// Add custom headers
const customOptions = {
...options,
headers: {
...options?.headers,
'X-Custom-Header': 'value'
}
};
return fetch(url, customOptions);
}
});
TypeScript
Full TypeScript support with zero `any` types:
import {
mistral,
MistralOptions,
MistralError,
ChatResponse,
StreamChatOptions
} from "@tetherai/mistral";
const options: MistralOptions = {
apiKey: process.env.MISTRAL_API_KEY!,
baseURL: "https://api.mistral.ai/v1",
timeout: 30000
};
const provider = mistral(options);
async function chatWithMistral(options: StreamChatOptions): Promise<ChatResponse> {
try {
return await provider.chat(options);
} catch (error) {
if (error instanceof MistralError) {
console.error(`Mistral error: ${error.message}`);
}
throw error;
}
}
Edge Runtime Support
Works everywhere from Node.js to Cloudflare Workers:
// Cloudflare Worker
export default {
async fetch(request: Request, env: { MISTRAL_API_KEY: string }): Promise<Response> {
const provider = mistral({
apiKey: env.MISTRAL_API_KEY,
fetch: globalThis.fetch
});
const response = await provider.chat({
model: "mistral-large-latest",
messages: [{ role: "user", content: "Hello from Cloudflare!" }]
});
return new Response(response.content);
}
};
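As an alternative to buffering the full reply, the same Worker can stream tokens to the client as they arrive. This is a sketch built on the standard Web Streams API on top of streamChat; the env typing and plain-text content type are assumptions, not package requirements.
import { mistral } from "@tetherai/mistral";
export default {
  async fetch(request: Request, env: { MISTRAL_API_KEY: string }): Promise<Response> {
    const provider = mistral({ apiKey: env.MISTRAL_API_KEY });
    const encoder = new TextEncoder();
    const body = new ReadableStream({
      async start(controller) {
        // Forward each delta to the client as soon as it arrives
        for await (const chunk of provider.streamChat({
          model: "mistral-large-latest",
          messages: [{ role: "user", content: "Hello from Cloudflare!" }]
        })) {
          if (chunk.done) break;
          controller.enqueue(encoder.encode(chunk.delta));
        }
        controller.close();
      }
    });
    return new Response(body, { headers: { "Content-Type": "text/plain; charset=utf-8" } });
  }
};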
Performance Tips
- Use streaming for real-time responses
- Set appropriate timeouts for your use case
- Implement retry logic for production reliability
- Use fallback providers for high availability
- Batch requests when possible (see the sketch below)
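A minimal sketch combining these tips: compose the retry and fallback middleware, then run independent prompts concurrently. The backupProvider and the prompts array are placeholders; the middleware options mirror the examples above, and nesting withRetry around withFallback is one possible composition, not the only one.
import { mistral, withRetry, withFallback } from "@tetherai/mistral";

const base = mistral({ apiKey: process.env.MISTRAL_API_KEY! });

// backupProvider stands in for any other TetherAI-compatible provider
const resilient = withRetry(
  withFallback(base, {
    fallbackProvider: backupProvider,
    shouldFallback: (error) => error.status === 429
  }),
  { maxRetries: 3, retryDelay: 1000 }
);

// "Batch" independent prompts by running them concurrently
const prompts = ["Summarize doc A", "Summarize doc B", "Summarize doc C"]; // placeholders
const results = await Promise.allSettled(
  prompts.map((content) =>
    resilient.chat({
      model: "mistral-large-latest",
      messages: [{ role: "user", content }]
    })
  )
);

for (const result of results) {
  if (result.status === "fulfilled") console.log(result.value.content);
}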
