# @sisu-ai/adapter-openai
OpenAI‑compatible Chat adapter with tools support.
## Setup
```sh
npm i @sisu-ai/adapter-openai
```

- Env: `OPENAI_API_KEY` (preferred) or `API_KEY` is required.
- Optional: `DEBUG_LLM=1` to log redacted request/response summaries on errors.
- Base URL: `OPENAI_BASE_URL` or `BASE_URL` can override the base URL (or pass `baseUrl` in code, as shown below).
- Examples may support CLI flags to override env at runtime, e.g. `--openai-api-key`, `--openai-base-url`, `--openai-model`.
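If you prefer explicit configuration over environment variables, the same values can be passed in code. A minimal sketch, using only the `model`, `apiKey`, and `baseUrl` options shown elsewhere in this README:

```ts
import { openAIAdapter } from '@sisu-ai/adapter-openai';

// Explicit configuration; when these options are omitted, the adapter
// falls back to OPENAI_API_KEY / OPENAI_BASE_URL (or API_KEY / BASE_URL).
const model = openAIAdapter({
  model: 'gpt-4o-mini',
  apiKey: process.env.OPENAI_API_KEY,
  baseUrl: process.env.OPENAI_BASE_URL,
});
```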
## Documentation
Discover what you can do through examples or documentation. Check it out at https://github.com/finger-gun/sisu
## Tools
- Sends `tools` and `tool_choice` (or `function_call` for compatibility when needed).
- Maps `message.tool_calls` to a simplified `{ id, name, arguments }` shape for the middleware loop (see the sketch after this list).
- Assistant messages that carry only `tool_calls` use `content: null`.
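To make the mapping concrete, here is a sketch of what an assistant turn might look like after the adapter's simplification; the `{ id, name, arguments }` shape and `content: null` rule come from the list above, while the example values and the tool-result message shape are assumptions:

```ts
// Hedged sketch: an assistant message after the adapter simplifies tool_calls.
const assistantTurn = {
  role: 'assistant',
  content: null, // tool-call-only turns carry no text content
  tool_calls: [
    { id: 'call_1', name: 'get_weather', arguments: '{"city":"Oslo"}' },
  ],
};

// A middleware loop would run the tool and append a result message
// (this result shape is an assumption, not documented above).
const toolResult = { role: 'tool', tool_call_id: 'call_1', content: '{"tempC":7}' };
```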
## Usage
```ts
import { openAIAdapter } from '@sisu-ai/adapter-openai';

const model = openAIAdapter({ model: 'gpt-4o-mini' });
// or with a gateway
const gatewayModel = openAIAdapter({ model: 'gpt-4o-mini', baseUrl: 'https://openrouter.ai/api/' });
```

## Images (Vision)
- Supports OpenAI multi-part content arrays with `type: 'text' | 'image_url'`.
- You can pass OpenAI-style `content` parts directly, or use convenience fields like `images` / `image_url`.
OpenAI-style content parts:
```ts
const messages: any[] = [
  { role: 'system', content: 'You are concise.' },
  {
    role: 'user',
    content: [
      { type: 'text', text: 'What is in this image?' },
      { type: 'image_url', image_url: { url: 'https://example.com/pic.jpg' } },
    ],
  },
];
const res = await model.generate(messages, { toolChoice: 'none' });
```

Convenience shape (the adapter builds parts under the hood):
```ts
const messages: any[] = [
  { role: 'system', content: 'You are concise.' },
  { role: 'user', content: 'Describe the image.', images: ['https://example.com/pic.jpg'] },
];
const res = await model.generate(messages, { toolChoice: 'none' });
```

Cost estimation: with `@sisu-ai/mw-usage-tracker`, configure pricing per 1M tokens (a configuration sketch follows the examples below).
Examples:
- gpt-4o-mini: `inputPer1M` ≈ 0.15, `outputPer1M` ≈ 0.60
- Images: prefer `imagePer1K` (e.g., ≈ 0.217 per 1K images). Alternatively, use `imageInputPer1K` + `imageTokenPerImage`.
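A minimal configuration sketch follows. The option names `inputPer1M`, `outputPer1M`, and `imagePer1K` come from the examples above, but the `usageTracker` export name and the overall `prices` shape are assumptions; check the `@sisu-ai/mw-usage-tracker` docs for the real API:

```ts
import { usageTracker } from '@sisu-ai/mw-usage-tracker'; // assumed export name

// Hypothetical pricing map: USD per 1M tokens (per 1K for images).
const tracker = usageTracker({
  prices: {
    'gpt-4o-mini': { inputPer1M: 0.15, outputPer1M: 0.60, imagePer1K: 0.217 },
  },
});
```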
## Reasoning Models Support
The OpenAI adapter supports reasoning/thinking models (o1-preview, o1-mini, o3, GPT-5.1) that provide extended chain-of-thought capabilities through internal reasoning, which can be preserved across conversation turns.
### Quick Start
Enable reasoning with a simple boolean flag:
```ts
import { openAIAdapter } from '@sisu-ai/adapter-openai';

const llm = openAIAdapter({ model: 'o1-preview' });

const response = await llm.generate(
  [{ role: 'user', content: 'How many "r"s are in "strawberry"?' }],
  { reasoning: true }
);

console.log(response.message.content); // "There are 3 'r's in 'strawberry'"
console.log(response.message.reasoning_details); // Contains reasoning context
```

### Reasoning Parameter Formats
The adapter accepts two formats for the `reasoning` parameter:
```ts
// Boolean (recommended) - automatically normalized to { enabled: true }
{ reasoning: true }

// Object format (OpenAI native) - passed through as-is
{ reasoning: { enabled: true } }
```
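The normalization rule itself is tiny; here is a sketch of the behavior described above (not the adapter's actual source):

```ts
// Hedged sketch of the documented normalization rule:
// `true` becomes { enabled: true }; objects pass through as-is.
const normalizeReasoning = (reasoning: boolean | { enabled: boolean }) =>
  reasoning === true ? { enabled: true } : reasoning;
```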
### Understanding reasoning_details

When reasoning is enabled, the API returns a `reasoning_details` field in the assistant message. This field is opaque (provider-specific) and typically contains:
```ts
// Example structure (actual format may vary by provider)
{
  reasoning_details: [
    {
      type: 'reasoning.summary',
      summary: '...human-readable reasoning process...'
    },
    {
      type: 'reasoning.encrypted',
      data: '...encrypted internal state...'
    }
  ]
}
```

Important characteristics (a loose type sketch follows this list):
- Summary: Contains a human-readable explanation of the model's thinking process (when available)
- Encrypted contexts: Preserve internal model state for improved multi-turn coherence
- Opaque format: Treat as a black box; do not modify or parse
- Preservation required: Must be included in subsequent turns for optimal performance
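If you want some type safety around this opaque field, a deliberately loose type works well. The field names `type`, `summary`, and `data` come from the example above; everything else is kept open on purpose:

```ts
// Hedged sketch: keep the shape loose so provider-specific variants still fit.
interface ReasoningDetail {
  type: string;             // e.g. 'reasoning.summary' | 'reasoning.encrypted'
  summary?: string;         // human-readable, when available
  data?: string;            // encrypted internal state; never modify
  [extra: string]: unknown; // tolerate unknown provider fields
}
```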
### Multi-Turn Conversations
For multi-turn conversations with reasoning, you must preserve the entire assistant message, including `reasoning_details`:
```ts
const llm = openAIAdapter({ model: 'o1-preview' });

// First turn
const response1 = await llm.generate(
  [{ role: 'user', content: 'Solve this complex math problem...' }],
  { reasoning: true }
);

// Build conversation history - CRITICAL: include full message with reasoning_details
const messages = [
  { role: 'user', content: 'Solve this complex math problem...' },
  response1.message, // Complete message with reasoning_details
  { role: 'user', content: 'Now explain your reasoning step-by-step' }
];

// Second turn - reasoning context is automatically preserved
const response2 = await llm.generate(messages, { reasoning: true });
```

Why preservation matters (a helper sketch follows this list):
- Improves response accuracy on follow-up questions
- Maintains reasoning coherence across turns
- Enables the model to reference its previous thinking
- Required by some models for optimal performance
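A small wrapper can make this pattern hard to get wrong. A sketch assuming only the `generate` call shown above:

```ts
// Hedged sketch: a tiny conversation wrapper that always keeps the full
// assistant message (and therefore reasoning_details) in history.
const history: any[] = [];

async function ask(llm: any, content: string): Promise<string> {
  history.push({ role: 'user', content });
  const res = await llm.generate(history, { reasoning: true });
  history.push(res.message); // the whole message, never just .content
  return res.message.content;
}
```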
### Supported Models
| Model | Provider | Reasoning Support | Notes |
|-------|----------|-------------------|-------|
| o1-preview | OpenAI | ✅ Full | Extended reasoning, higher cost |
| o1-mini | OpenAI | ✅ Full | Faster reasoning, lower cost |
| o3-preview | OpenAI | ✅ Full | Next-gen reasoning (limited access) |
| gpt-5.1 | OpenRouter | ✅ Full | Via OpenRouter with baseUrl |
| gpt-4o | OpenAI | ⚠️ Partial | Accepts parameter but limited reasoning |
| gpt-4o-mini | OpenAI | ⚠️ Partial | Accepts parameter but limited reasoning |
Usage with OpenRouter:
```ts
const llm = openAIAdapter({
  model: 'gpt-5.1',
  baseUrl: 'https://openrouter.ai/api/v1',
  apiKey: process.env.OPENROUTER_API_KEY
});
```

### Cost Considerations
Reasoning models typically have higher costs due to extended thinking time:
| Model | Input (per 1M tokens) | Output (per 1M tokens) | Notes |
|-------|----------------------|------------------------|-------|
| o1-preview | ~$15-20 | ~$60-80 | Premium reasoning |
| o1-mini | ~$3-5 | ~$12-15 | Cost-effective reasoning |
| gpt-5.1 | Varies | Varies | Check OpenRouter pricing |
Tips for cost optimization:
- Use `o1-mini` for most reasoning tasks
- Reserve `o1-preview` for complex problems requiring deep analysis
- Enable reasoning only when needed, not for simple queries (see the sketch after this list)
- Use `temperature: 0.1` or lower for deterministic output
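As a sketch of the "only when needed" tip, you can gate the flag on a cheap heuristic. The classifier below is a placeholder assumption, not part of the adapter:

```ts
const question = 'Derive the closed form of the sum 1 + 2 + ... + n.';

// Hedged sketch: a placeholder heuristic for "complex" queries.
const looksComplex = (q: string) =>
  q.length > 200 || /prove|derive|optimi[sz]e|step[- ]by[- ]step/i.test(q);

const res = await llm.generate(
  [{ role: 'user', content: question }],
  { reasoning: looksComplex(question), temperature: 0.1 }
);
```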
### Streaming Support
Reasoning is fully supported in streaming mode:
```ts
const stream = await llm.generate(
  [{ role: 'user', content: 'Complex question...' }],
  { reasoning: true, stream: true }
);

for await (const event of stream) {
  if (event.type === 'token') {
    process.stdout.write(event.token);
  } else if (event.type === 'assistant_message') {
    // reasoning_details available in final message
    console.log('\nReasoning captured:', event.message.reasoning_details);
  }
}
```

### Troubleshooting
#### Error: 400/405 when enabling reasoning
Some models don't support the `reasoning` parameter. Try:
- Verify you're using a reasoning-capable model (o1-preview, o1-mini, etc.)
- Check that your API key has access to reasoning models
- Ensure `baseUrl` is correct (especially for OpenRouter)
```ts
try {
  const res = await llm.generate(messages, { reasoning: true });
} catch (error) {
  if (error.message.includes('405') || error.message.includes('400')) {
    console.error('Model may not support reasoning parameter');
    console.error('Try: o1-preview, o1-mini, or gpt-5.1 via OpenRouter');
  }
  throw error;
}
```

#### No reasoning_details in response
This is normal for non-reasoning models:
- Standard GPT-4/GPT-3.5 models accept the parameter but don't return reasoning details
- The adapter handles this gracefully: no error, just no `reasoning_details` field (guard for it as sketched below)
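Code that works across both model families can simply check for the field. A minimal sketch using only the calls shown above:

```ts
// Hedged sketch: reasoning_details is optional, so guard before using it.
const res = await llm.generate(messages, { reasoning: true });
if (res.message.reasoning_details) {
  console.log('Reasoning details captured');
} else {
  console.log('No reasoning details returned (non-reasoning model)');
}
```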
#### Multi-turn context lost
Ensure you're preserving the complete message:
```ts
// ✅ Correct - preserves reasoning_details
messages.push(response.message);

// ❌ Wrong - loses reasoning_details
messages.push({ role: 'assistant', content: response.message.content });
```

#### High costs
Reasoning models use more tokens:
- Monitor usage with `@sisu-ai/mw-usage-tracker`
- Use `o1-mini` instead of `o1-preview` when possible
- Enable reasoning only for complex tasks
### Complete Example
See the `openai-reasoning` example for a full working demonstration, including:
- Basic reasoning usage
- Multi-turn conversations with preserved context
- Error handling
- Usage tracking
- Trace visualization
## Debugging
- `DEBUG_LLM=1` prints sanitized payloads and error bodies.
- Combine with `LOG_LEVEL=debug` to see middleware events.
## Community & Support
Discover what you can do through examples or documentation. Check it out at https://github.com/finger-gun/sisu. Example projects live under `examples/` in the repo.
