ai-chat-toolkit-server

v1.2.1

Published

10 days ago

Plug-and-play AI chat backend for Express with LLM providers and tool calling

0High
0Medium
0Low

ai-chat-toolkit-server

Plug-and-play AI chat backend for Express apps. Connect any LLM provider and register custom tools — the widget handles the UI, this handles the intelligence.

Current release: 1.2.0 on npm.

Works with [email protected] or any client that follows the chat API contract.

Install

npm install ai-chat-toolkit-server@^1.2.0 express

Quick start

import express from "express";
import { AiChatServer } from "ai-chat-toolkit-server";

const app = express();

const aiChat = new AiChatServer({
  provider: "groq",
  apiKey: process.env.API_KEY,
  model: "llama-3.3-70b-versatile",
  cors: { origin: "http://localhost:5173" },
});

aiChat.attach(app);
app.listen(3000, () => console.log("Listening on http://localhost:3000"));

That's it. The server now accepts POST /ai-chat/custom and responds to chat messages.

Environment-based options

Use serverOptionsFromEnv() to map common env vars to AiChatServer options:

import { AiChatServer, serverOptionsFromEnv } from "ai-chat-toolkit-server";

const aiChat = new AiChatServer({
  path: "/my-chat",
  ...serverOptionsFromEnv({
    provider: process.env.PROVIDER,
    apiKey: process.env.API_KEY ?? process.env.OPENAI_API_KEY,
    model: process.env.MODEL,
    baseUrl: process.env.BASE_URL,
  }),
  cors: { origin: true },
});

| Input | Typical env var | Notes | |-------|-----------------|-------| | provider | PROVIDER | groq, openai / openai-compatible, gemini, ollama; defaults to groq | | apiKey | API_KEY | Optional on the options object | | model | MODEL | Falls back to per-provider default | | baseUrl | BASE_URL | For OpenAI-compatible or Ollama endpoints |

Exported defaults: DEFAULT_CHAT_PROVIDER, CHAT_PROVIDER_DEFAULTS.

Providers

Groq

new AiChatServer({
  provider: "groq",
  apiKey: process.env.GROQ_API_KEY,
  model: "llama-3.3-70b-versatile",
});

Groq uses the OpenAI-compatible format internally with https://api.groq.com/openai/v1. Supports tool calling.

OpenAI

new AiChatServer({
  provider: "openai-compatible",
  apiKey: process.env.OPENAI_API_KEY,
  model: "gpt-4o-mini",
});

OpenRouter (or any OpenAI-compatible API)

new AiChatServer({
  provider: "openai-compatible",
  apiKey: process.env.OPENROUTER_API_KEY,
  model: "deepseek/deepseek-r1:free",
  baseUrl: "https://openrouter.ai/api/v1",
});

Gemini

new AiChatServer({
  provider: "gemini",
  apiKey: process.env.GEMINI_API_KEY,
  model: "gemini-1.5-flash",
});

Tool calling is not yet implemented for Gemini (chat only).

Ollama (local models)

new AiChatServer({
  provider: "ollama",
  model: "llama3.1",
  baseUrl: "http://localhost:11434", // default
});

Tool calling is not yet implemented for Ollama (chat only). No API key required.

Tool registration

aiChat.addTools([
  {
    name: "get_products",
    description: "Get products by category. Use when the user asks to browse or list products.",
    inputSchema: {
      type: "object",
      properties: {
        category: { type: "string", description: "Category name, e.g. Electronics" },
      },
      required: ["category"],
    },
    handler: async ({ category }, context) => {
      // context.request gives you the Express Request for auth checks
      return [{ id: "p1", name: "Demo Product", category }];
    },
  },
]);

The LLM decides when to call a tool. Up to maxToolRounds tool-call loops happen per request (default: 3). The final text reply is returned to the widget.

Tool orchestration (LangChain, internal)

By default the server uses a lightweight native tool loop (orchestration: "native" — unchanged from 1.0.0).

For multi-step tasks (tool order, chaining outputs into later tools), opt in:

new AiChatServer({
  provider: "groq",
  apiKey: process.env.API_KEY,
  model: "llama-3.3-70b-versatile",
  orchestration: "langchain",
});

LangChain is bundled as an internal dependency and is not exported from this package. Your public API (addTools, chat routes, request/response shape) stays the same.

Requires a provider that supports tool calling (Groq / OpenAI-compatible). Gemini and Ollama are chat-only today.

Runnable demo: examples/langchain-orchestration.

Plugin support

Extend the server with optional plugins without changing the core API:

const plugin = {
  install(server) {
    server.registerBeforeLLMHook(async ({ message, history }) => {
      return {
        context: "Extra context before LLM call",
      };
    });
  },
};

const server = new AiChatServer({ /* ... */ });
server.use(plugin);

Contract

server.use(plugin) calls plugin.install(server)
server.registerBeforeLLMHook(fn) registers a hook that runs before each LLM call
Hooks receive { message, history, request }
Hooks may return { context?: string } — returned text is appended to the system prompt
Multiple hooks are supported; their context blocks are combined in registration order
Hook errors are logged and do not crash the request

Works with both native and LangChain orchestration paths.

System prompt

Shape the assistant's personality and behavior:

new AiChatServer({
  provider: "groq",
  apiKey: process.env.API_KEY,
  model: "llama-3.3-70b-versatile",
  systemPrompt: `You are a helpful support assistant for Acme Corp.
Keep answers concise and friendly. Only call tools when the user asks for specific data.`,
});

CORS configuration

CORS middleware is applied only to the AI chat routes — not your entire Express app.

// Single origin
cors: { origin: "http://localhost:5173" }

// Multiple allowed origins
cors: { origin: ["https://app.example.com", "https://admin.example.com"] }

// Allow all origins — development only
cors: { origin: true }

// Disable CORS headers entirely
cors: { origin: false }

Need Access-Control-Allow-Credentials? The built-in CORS helper does not set this header. Use the cors npm package on your Express app instead — it is the community standard for credentials scenarios and handles the full spec correctly:

import cors from "cors";

app.use(cors({ origin: "https://app.example.com", credentials: true }));
aiChat.attach(app); // attach after cors middleware

Routes

Calling aiChat.attach(app) registers these routes:

| Method | Path | Description | |--------|------|-------------| | POST | {path} (default /ai-chat/custom) | Send a chat message | | GET | /ai-chat/health | Health check | | GET | /ai-chat/tools | List registered tools |

Change the chat path:

new AiChatServer({
  path: "/my-chat",
  // ...
});

API contract

Request POST {path}

{
  "message": "What products do you have?",
  "history": [
    { "role": "user", "content": "Hi" },
    { "role": "assistant", "content": "Hello! How can I help?" }
  ]
}

Response

{ "message": "Here are our products..." }

history is optional. When provided, prior user / assistant turns are sent to the LLM so follow-up questions work. The server does not persist history — the client (widget) must resend it on every request.

Error response

{ "error": "Message cannot be empty." }

All options

| Option | Default | Description | |--------|---------|-------------| | provider | — | "groq", "openai-compatible", "gemini", "ollama" | | apiKey | — | Provider API key (not required for Ollama) | | model | — | Model name (e.g. "llama-3.3-70b-versatile") | | baseUrl | Provider default | Override the provider's API base URL | | path | /ai-chat/custom | Chat endpoint path | | systemPrompt | — | System message sent to the LLM on every request | | orchestration | "native" | "native" or "langchain" (internal multi-step tool orchestration) | | maxToolRounds | 3 | Max tool-call loops per request | | cors | — | CORS config (see above) |

Security notes

Keep API keys on the server. Never send them to the browser.
Tools run server-side only. Use context.request inside handlers for auth checks.
Restrict CORS origins in production. Use origin: "https://yourapp.com" not origin: true.
Use requiresConfirmation: true for any tool that writes data — the LLM will not be able to call it until a confirmation flow is implemented.
Do not expose unrestricted database access as a tool.

Roadmap

[ ] Streaming responses
[ ] Tool confirmation flow
[ ] Claude / Bedrock support
[ ] Gemini tool calling
[ ] Ollama tool calling
[ ] Fastify / NestJS adapters

License

MIT

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

ai-chat-toolkit-server

Install

Quick start

Environment-based options

Providers

Groq

OpenAI

OpenRouter (or any OpenAI-compatible API)

Gemini

Ollama (local models)

Tool registration

Tool orchestration (LangChain, internal)

Plugin support

System prompt

CORS configuration

Routes

API contract

All options

Security notes

Roadmap

License