tooled-prompt
Runtime LLM prompt library with smart tool recognition for TypeScript.
The Problem
LLM tool calling requires manual JSON schema authoring, tool registration boilerplate, and managing the request/execute/respond loop. Existing libraries add heavy abstractions and framework lock-in.
The Solution
Tagged template literals are the perfect API for LLM prompts. Functions in ${} are auto-detected as tools. No boilerplate, no schema authoring, no framework.
Installation
npm install tooled-prompt
Quick Start
import { prompt, setConfig } from 'tooled-prompt';
setConfig({
apiUrl: 'https://api.openai.com/v1',
apiKey: process.env.OPENAI_API_KEY,
modelName: 'gpt-5-nano',
});
function getWeather(cityName: string) {
return `Weather in ${cityName}: Sunny, 72°F`;
}
const { data } = await prompt`
What's the weather like in San Francisco?
Use ${getWeather} to find out.
`();
console.log(data);
Using Deno? See the Even Quicker Start guide.
Features
- Smart Tool Recognition — Functions in template literals are auto-detected as tools
- Multiple Schema Formats — Define tool args with strings, arrays, or Zod schemas
- Structured Output — Get typed responses with Zod schema validation
- Store Pattern — Capture structured output via tool calls with store() and prompt.return
- Deferred Tool Loading — Use toolSearch() so the LLM discovers tools on demand instead of sending all schemas upfront
- Code-Action Tool — Use toolEval() so the LLM writes JavaScript to orchestrate multiple tools in a single turn
- Usage Tracking — Per-call and cumulative token usage across conversation chains
- Image Support — Pass images (Buffer/Uint8Array) directly in templates
- Streaming Events — Subscribe to content, thinking, and tool events
- Multi-Provider — Built-in support for OpenAI, Anthropic, and Ollama
- Multiple Instances — Create isolated instances for different LLM providers
- TypeScript First — Full type safety with generics
Usage
Basic Inference
Functions in template literals are auto-detected as tools. Parameter names and optionality are inferred at runtime — no schema needed:
import { prompt, setConfig } from 'tooled-prompt';
import * as fs from 'fs/promises';
setConfig({
apiUrl: 'https://api.openai.com/v1',
apiKey: process.env.OPENAI_API_KEY,
modelName: 'gpt-5-nano',
});
async function readFile(filePath: string) {
return fs.readFile(filePath, 'utf-8');
}
const { data } = await prompt`
Use ${readFile} to read package.json and summarize it.
`();
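Parameters with default values can be embedded the same way. A minimal sketch (an assumption: optionality is inferred from default values, since TypeScript types are erased at runtime):
import { prompt } from 'tooled-prompt';
import * as fs from 'fs/promises';
// `extension` has a default value, so it is presumably reported to the
// LLM as optional (assumption: defaults are the runtime signal).
async function listFiles(dir: string, extension = '') {
  const entries = await fs.readdir(dir);
  return entries.filter((name) => name.endsWith(extension)).join('\n');
}
const { data } = await prompt`
  Use ${listFiles} to list the TypeScript files in src.
`();
Image Support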
Pass Buffer or Uint8Array values directly in templates. They are auto-detected and sent as base64 to vision-capable models:
import { readFileSync } from 'fs';
const image = readFileSync('photo.png');
const { data } = await prompt`Describe this image: ${image}`();
Multiple images work too:
const before = readFileSync('before.png');
const after = readFileSync('after.png');
const { data } = await prompt`
Compare these two images:
Before: ${before}
After: ${after}
`();
Multiple Tools
Embed multiple functions in a single template:
async function readDir() {
return fs.readdir('src', { recursive: true });
}
async function readFile(filePath: string) {
return fs.readFile(filePath, 'utf-8');
}
const { data } = await prompt`
Use ${readDir} to list files, then use ${readFile} to read each one.
Summarize what you find.
`();
Structured Output
For LLMs that support structured output, pass a Zod schema to get typed, validated responses:
import { z } from 'zod';
const MovieSchema = z.object({
title: z.string(),
year: z.number(),
rating: z.number().min(0).max(10),
});
const { data } = await prompt`
Tell me about the movie Inception
`(MovieSchema);
// data is typed as { title: string; year: number; rating: number }
console.log(data.title, data.year);
Or use a SimpleSchema for string-only fields (no Zod required):
const { data } = await prompt`Analyze this text: ${text}`({
sentiment: 'Overall sentiment (positive/negative/neutral)',
confidence: 'Confidence score if available',
});
// data is typed as { sentiment: string; confidence: string }
Store Pattern
prompt.return — Early-Exit Structured Output
Some LLMs don't allow using both tools and structured output. When prompt.return appears in a template and a schema is passed, the LLM gets a special tool to store the result. The tool loop exits as soon as the value is stored:
import { z } from 'zod';
const schema = z.object({
summary: z.string(),
files: z.array(
z.object({
path: z.string(),
description: z.string(),
}),
),
});
const { data } = await prompt`
Use ${readDir} and ${readFile} to analyze the project.
Save your analysis in ${prompt.return}.
`(schema);
console.log(data.summary);
store() — Explicit Store
For manual control, create a store and retrieve the value after execution:
import { store } from 'tooled-prompt';
import { z } from 'zod';
const changeLog = store(
z.object({
summary: z.string(),
entries: z.array(
z.object({
commit: z.string(),
description: z.string(),
}),
),
}),
);
await prompt`
Use ${gitLog} to read commits, then save a structured
changelog in ${changeLog}.
`();
const result = changeLog.get();
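The retrieved value is typed by the store's schema, so it can be consumed directly. A short sketch (hedged: whether get() returns undefined when nothing was stored isn't documented here):
for (const entry of result?.entries ?? []) {
  console.log(`${entry.commit}: ${entry.description}`);
}
Adding Descriptions with tool()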
For richer tool metadata, use tool() to add descriptions and explicit arg descriptors.
Plain array descriptors:
import { tool } from 'tooled-prompt';
import * as fs from 'fs';
function copyFile(src: string, dest: string) {
  fs.copyFileSync(src, dest);
}
tool(copyFile, {
description: 'Copy a file from source to destination',
args: ['Source file path', 'Destination file path'],
});
Zod descriptors for rich types:
import { z } from 'zod';
function createUser(name: string, email: string, age: number) {
// ...
}
tool(createUser, {
description: 'Create a new user',
args: [
z.string().describe('User full name'),
z.string().describe('User email address'),
z.number().describe('User age'),
],
});
Return types for better tool chaining (especially with toolEval):
// String description
tool(add, { returns: 'The numeric sum as a string' });
// Zod schema — auto-parses the return value at runtime
tool(searchUsers, {
returns: z.array(z.object({ name: z.string(), email: z.string() })),
});
Config Tools
Register tools via config so they're available to every prompt without embedding in the template:
import { createTooledPrompt } from 'tooled-prompt';
const { prompt, setConfig } = createTooledPrompt({
tools: [searchDocs, readFile],
});
// Both tools available even though they're not in the template
await prompt`Summarize the authentication docs`();
// setConfig replaces factory tools
setConfig({ tools: [searchCode] });
await prompt`Find the login handler`(); // only searchCode available
// Per-call tools are added alongside config tools
await prompt`Find and read the config`({ tools: [readFile] });
// Both searchCode (from setConfig) and readFile (per-call) available
Multiple Prompt Instances
Create isolated instances for different LLM providers or models with createTooledPrompt:
import { createTooledPrompt } from 'tooled-prompt';
const openai = createTooledPrompt({
apiUrl: 'https://api.openai.com/v1',
apiKey: process.env.OPENAI_API_KEY,
modelName: 'gpt-4o',
});
const local = createTooledPrompt({
apiUrl: 'http://localhost:11434/v1',
modelName: 'llama3.1',
});
const { data: summary } = await openai.prompt`Summarize this document`();
const { data: translation } = await local.prompt`Translate to French: ${text}`();
Use different models for different tasks within one workflow:
const imageLlm = createTooledPrompt({ modelName: 'gemma-3-27b-it' });
const toolLlm = createTooledPrompt({ modelName: 'gpt-4o' });
async function describeImage(path: string) {
const image = readFileSync(path);
const { data } = await imageLlm.prompt`Describe this image: ${image}`();
return data;
}
// Tool LLM orchestrates, delegates image work to image LLM
const { data } = await toolLlm.prompt`
Find images using ${listFiles} and describe each with ${describeImage}.
`();
Providers
Built-in support for OpenAI-compatible APIs, Anthropic, and Ollama. Set the provider config to switch:
import { createTooledPrompt } from 'tooled-prompt';
// OpenAI-compatible (default)
const openai = createTooledPrompt({
provider: 'openai',
apiUrl: 'https://api.openai.com/v1',
apiKey: process.env.OPENAI_API_KEY,
modelName: 'gpt-4o',
});
// Anthropic
const anthropic = createTooledPrompt({
provider: 'anthropic',
apiUrl: 'https://api.anthropic.com/v1',
apiKey: process.env.ANTHROPIC_API_KEY,
modelName: 'claude-sonnet-4-5',
maxTokens: 8192,
});
// Ollama
const ollama = createTooledPrompt({
provider: 'ollama',
apiUrl: 'http://localhost:11434',
modelName: 'llama3.1',
});
You can also register custom providers:
import { registerProvider } from 'tooled-prompt';
registerProvider('my-provider', myProviderAdapter);
System Prompt
Set a system prompt as a plain string or as a builder callback with tool references:
// Plain string
const { prompt } = createTooledPrompt({
systemPrompt: 'You are a helpful assistant.',
});
// Builder callback — tools in the system prompt are available to the LLM
const { prompt } = createTooledPrompt({
systemPrompt: (prompt) => prompt`
You are a code assistant. Use ${searchDocs} to find relevant documentation.
`,
});
Multi-Turn Conversations
Every PromptResult includes a next tagged template for continuing the conversation. History and tools carry over automatically, and the system prompt is preserved across turns:
const { data, next } = await prompt`Summarize this file: ${readFile}`();
const { data: followUp, next: next2 } = await next`What are the main concerns?`();
const { data: deeper } = await next2`Elaborate on the first concern`();
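Because each result carries its own next, a chain can also be driven in a loop. A minimal sketch (plain strings interpolate as prompt text, as in the earlier examples):
let turn = await prompt`Review this file: ${readFile}`();
for (const followUp of ['List the main concerns.', 'Suggest a fix for the first one.']) {
  // Each next continues the same conversation, so history accumulates.
  turn = await turn.next`${followUp}`();
  console.log(turn.data);
}
Injecting Conversation History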
Use prompt.messages() to inject external conversation history (e.g. from a database or chat app):
const history = [
{ role: 'user', content: "What's the weather?" },
{ role: 'assistant', content: "It's sunny in SF, 72°F." },
];
const { data } = await prompt`
${prompt.messages(history)}
Now tell me about tomorrow's forecast using ${getWeather}.
`();
The messages are prepended as conversation history. The template text becomes the new user message. Only one prompt.messages() is allowed per template, and it cannot be used inside next (which already carries history).
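For example, a conversation persisted elsewhere can be restored once and then continued through next (loadMessages and conversationId are hypothetical stand-ins for your own storage layer):
const saved = await loadMessages(conversationId); // hypothetical storage helper
const { data, next } = await prompt`
  ${prompt.messages(saved)}
  Pick up where we left off.
`();
// Follow-ups go through next, which already carries the history,
// so prompt.messages() is not used again.
await next`Anything else I should know?`();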
Deferred Tool Loading
When a prompt has many tools (50+), sending all schemas upfront bloats the LLM context. toolSearch() wraps your tools behind a single tool_search meta-tool. The LLM discovers tools on demand, and matched tools become natively available in the next loop iteration:
import { prompt, toolSearch } from 'tooled-prompt';
const search = toolSearch(add, multiply, readFile, writeFile, sendEmail, queryDB);
const { data } = await prompt`
Help the user: ${userMessage}
${search}
`();
The LLM sees only tool_search(query) initially. When it calls tool_search("file"), matching tools like read_file and write_file are activated and appear as native tools in subsequent requests. This keeps the initial context small while still giving the LLM access to all tools.
Code-Action Tool (toolEval)
Instead of one LLM round-trip per tool call, toolEval() lets the LLM write a JavaScript function body that orchestrates multiple tools in a single turn:
import { prompt, toolEval } from 'tooled-prompt';
const exec = toolEval(add, multiply, readFile, sendEmail);
const { data } = await prompt`
Read example.txt, multiply 123 by 456, and email the result.
${exec}
`();
The LLM sees a single tool_eval(code) tool with JSDoc signatures for all registered functions. When called, the code runs via AsyncFunction with those functions in scope. Accepts raw functions or tool()-wrapped functions.
Tools with a Zod returns schema have their return values auto-parsed, so the LLM's generated code gets real objects — no JSON.parse needed.
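For example, a sketch combining a tool() returns schema with toolEval (getUser and sendEmail are stand-in stubs):
import { prompt, tool, toolEval } from 'tooled-prompt';
import { z } from 'zod';
// Stub that returns JSON text, as a real API client might.
async function getUser(id: string) {
  return JSON.stringify({ name: 'Ada', email: 'ada@example.com' });
}
async function sendEmail(to: string, body: string) {
  return `sent to ${to}`;
}
tool(getUser, {
  description: 'Fetch a user record by id',
  args: [z.string().describe('User id')],
  // Zod returns schema: the generated code receives a parsed object.
  returns: z.object({ name: z.string(), email: z.string() }),
});
const exec = toolEval(getUser, sendEmail);
await prompt`
  Look up user 42 and email them a welcome note.
  ${exec}
`();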
Usage Tracking
Every PromptResult includes token usage with per-call and cumulative breakdowns:
const r1 = await prompt`What is the capital of France?`();
console.log(r1.usage?.call);
// => { promptTokens: 14, completionTokens: 8, totalTokens: 22 }
// On the first call, cumulative equals call
console.log(r1.usage?.cumulative);
// => { promptTokens: 14, completionTokens: 8, totalTokens: 22 }
Cumulative usage tracks totals across chained next calls:
const r2 = await r1.next`And what is its population?`();
console.log(r2.usage?.call); // tokens for this call only
console.log(r2.usage?.cumulative); // total across both calls
Use cumulative usage to implement context budgeting:
if ((r2.usage?.cumulative.totalTokens ?? 0) > 4000) {
console.log('Context getting large, consider compaction');
}
Usage is undefined when the provider doesn't include token counts in the response. When a single prompt invocation triggers multiple LLM requests (tool-loop iterations), the call usage is the sum across all iterations.
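One way to act on that budget is to compact the chain: ask for a summary, then start a fresh chain seeded with it. A sketch:
let result = await prompt`Review these files with ${readFile}.`();
if ((result.usage?.cumulative.totalTokens ?? 0) > 4000) {
  // Summarize the existing chain, then drop its history by starting over.
  const { data: summary } = await result.next`Summarize our conversation so far.`();
  result = await prompt`Context so far: ${summary ?? ''}. Continue the review with ${readFile}.`();
}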
API Reference
prompt
Tagged template literal for creating LLM prompts (default instance). Functions in ${} are auto-detected as tools.
import { prompt } from 'tooled-prompt';
// Without schema — returns PromptResult<string>
const { data } = await prompt`Your prompt here`();
// With Zod schema — returns PromptResult<T>
const { data } = await prompt`Your prompt here`(zodSchema);
// With SimpleSchema — returns PromptResult<{ field: string }>
const { data } = await prompt`Your prompt here`({ field: 'description' });
// Per-call config
const { data } = await prompt`Your prompt here`({ temperature: 0.9 });
setConfig
Update configuration for the default instance.
import { setConfig } from 'tooled-prompt';
setConfig({
apiUrl: 'https://api.openai.com/v1',
apiKey: process.env.OPENAI_API_KEY,
modelName: 'gpt-4o',
temperature: 0.7,
stream: true,
timeout: 30000,
silent: false,
showThinking: false,
tools: [myTool],
});
on / off
Subscribe to and unsubscribe from events on the default instance.
import { on, off } from 'tooled-prompt';
const handler = (content: string) => process.stdout.write(content);
on('content', handler);
off('content', handler);
createTooledPrompt
Create an isolated instance with its own configuration, event handlers, and tool scope.
import { createTooledPrompt } from 'tooled-prompt';
const instance = createTooledPrompt({ apiUrl: '...', apiKey: '...' });
// instance.prompt, instance.setConfig, instance.on, instance.off, instance.tool
tool
Wrap a function with explicit metadata (description, arg descriptors).
import { tool } from 'tooled-prompt';
// Named function
tool(myFunc, { description: '...', args: ['arg1 desc', 'arg2 desc'] });
// Arrow function via object syntax
tool({ myFunc }, { description: '...', args: ['arg1 desc'] });
// With return type (string or Zod schema)
tool(myFunc, { returns: 'A greeting string' });
tool(myFunc, { returns: z.object({ id: z.number() }) }); // auto-parses at runtime
toolSearch
Create a deferred tool loader. Accepts raw functions or tool()-wrapped functions. Returns a meta-tool that the LLM calls to discover tools by keyword.
import { toolSearch } from 'tooled-prompt';
const search = toolSearch(readFile, writeFile, sendEmail);
// Use in template: ${search}
// LLM calls tool_search("file") → read_file and write_file become available
toolEval
Create a code-action meta-tool that lets the LLM write JavaScript to orchestrate multiple tools in a single turn.
import { toolEval } from 'tooled-prompt';
const exec = toolEval(fn1, fn2, fn3);
// Use in template: ${exec}
// LLM calls tool_eval(code) → code is executed with fn1, fn2, fn3 in scope
// With options:
const execWithOptions = toolEval(fn1, fn2, { timeout: 10000 });
store
Create a typed store for capturing structured LLM output via tool calls.
import { store } from 'tooled-prompt';
const myStore = store(zodSchema);
// Use in template: ${myStore}
// Retrieve after execution: myStore.get()
Event Types
interface TooledPromptEvents {
thinking: (content: string) => void;
content: (content: string) => void;
tool_call: (name: string, args: Record<string, unknown>) => void;
tool_result: (name: string, result: string, duration: number) => void;
tool_error: (name: string, error: string) => void;
}
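For example, a sketch logging tool activity on the default instance:
import { on } from 'tooled-prompt';
// Handlers match the TooledPromptEvents signatures above.
on('tool_call', (name, args) => console.log(`→ ${name}(${JSON.stringify(args)})`));
on('tool_result', (name, _result, duration) => console.log(`← ${name} done in ${duration}ms`));
on('tool_error', (name, error) => console.error(`${name} failed: ${error}`));
Configuration Options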
interface TooledPromptConfig {
apiUrl?: string; // LLM API endpoint
apiKey?: string; // API key
modelName?: string; // Model name
provider?: string; // "openai" (default) | "anthropic" | "ollama" | custom
maxTokens?: number; // Max response tokens (required by Anthropic, defaults to 4096)
maxIterations?: number; // Max tool loop iterations
temperature?: number; // Generation temperature (0-2)
stream?: boolean; // Enable streaming (default: true)
timeout?: number; // Request timeout in ms (default: 60000)
silent?: boolean; // Suppress console output (default: false)
showThinking?: boolean; // Show full thinking content (default: false)
systemPrompt?: string | SystemPromptBuilder; // System prompt (string or builder callback)
maxToolResultLength?: number; // Max tool result chars before truncation (default: unlimited)
tools?: Function[];
}
PromptResult<T>
All prompt executions return a PromptResult<T> wrapper:
interface PromptResult<T> {
data?: T;
/** Token usage for this call and cumulative across the conversation chain */
usage?: PromptUsage;
/** Continue the conversation with a follow-up prompt, preserving history and tools */
next: PromptTaggedTemplate;
}
interface PromptUsage {
/** Tokens used in this prompt/next invocation (including all tool-loop iterations) */
call: Usage;
/** Total tokens across the entire conversation chain (prompt + all next calls) */
cumulative: Usage;
}
interface Usage {
promptTokens: number;
completionTokens: number;
totalTokens: number;
}
License
MIT — see LICENSE for details.
