@webmcp-auto-ui/agent

v2.5.40

Published

18 days ago

LLM agent loop + remote/WASM/local providers + MCP wrapper

0High
0Medium
0Low

@webmcp-auto-ui/agent

LLM agent loop that connects MCP and WebMCP servers to a UI. Given a user message and a set of tool layers, it runs a tool-use loop until the LLM signals it's done, calling widget tools to build the interface.

Providers

RemoteLLMProvider — proxies to a +server.ts endpoint that holds the API key. Compatible with any OpenAI-compatible API backend (Anthropic, OpenAI, Google, Mistral, etc.). Prompt caching enabled by default. Retry on 503 with exponential backoff. Returns stats in LLMResponse: tok/s, totalTokens, latencyMs.

GemmaProvider (LiteRT) — runs Gemma 4 models via @mediapipe/tasks-genai (LiteRT, formerly known as MediaPipe) directly on the main thread. Uses WebGPU when available. No API key required. Models are cached in OPFS (Origin Private File System) for instant reload after first download.

v0.5.0 migration: GemmaProvider was migrated from ONNX (@huggingface/transformers) to LiteRT (@mediapipe/tasks-genai). LiteRT is 2-4x faster on WebGPU and provides native Gemma 4 support. The provider now runs on the main thread because MediaPipe is incompatible with ES module workers.

LocalLLMProvider — runs against a local Ollama instance (or any OpenAI-compatible local server: vLLM, LM Studio, llamafile). No API key required. Converts messages and tools to the OpenAI chat completions format automatically.

Gemma 4 prompt format — uses <|turn>...<turn|> delimiters (instead of the Gemma 2/3 <start_of_turn>...<end_of_turn>).

Native tool calling — Gemma 4 tool calls are parsed from <|tool_call>call:name{args}<tool_call|> format. No regex heuristics needed.

WebMCP `autoui` server

The package ships a pre-configured WebMCP server named autoui with all built-in widget recipes (stat, table, chart, timeline, etc.). This replaces the previous componentRegistry / ComponentAdapter / COMPONENT_TOOL API.

import { autoui } from '@webmcp-auto-ui/agent';

// autoui is a WebMcpServer with all built-in widgets registered
const layer = autoui.layer();
// → { protocol: 'webmcp', serverName: 'autoui', tools: [...] }

System prompt & lazy tool loading

buildSystemPrompt(layers) generates a unified recipe-driven prompt with a strict 4-step workflow: discover recipes, read instructions, execute, display. Tool lists in the prompt are dynamic placeholders injected from the connected layers. Apps should not hardcode prompts — see docs/system-prompt.md for the full prompt text and design decisions.

Tools are loaded lazily via discovery. The agent initially receives only lightweight discovery tools, then activates full tool schemas on demand:

buildDiscoveryTools(layers) — creates search_recipes and get_recipe tools across all servers, plus WebMCP action tools (widget_display, canvas, recall)
activateServerTools(currentTools, layer) — loads the full tool set for a specific server

This keeps the initial prompt small when many servers/widgets are available. The system prompt references these discovery tools in its steps 1 and 2, forming a coherent pipeline.

Install

npm install @webmcp-auto-ui/agent

Usage

import { autoui, runAgentLoop, RemoteLLMProvider } from '@webmcp-auto-ui/agent';

const result = await runAgentLoop('Show me sales data', {
  provider: new RemoteLLMProvider({ proxyUrl: '/api/chat' }),
  layers: [mcpClient.layer(), autoui.layer()],
  maxIterations: 5,
  callbacks: {
    onWidget: (type, data) => {
      // Display the widget in your UI
      return { id: widgetId };
    },
    onClear: () => { /* clear canvas */ },
    onText: (text) => { /* update chat */ },
    onToolCall: (call) => { /* log tool use */ },
  },
});

TokenTracker

Real-time usage metrics tracking across requests:

import { TokenTracker } from '@webmcp-auto-ui/agent';

const tracker = new TokenTracker();
tracker.record({ inputTokens: 500, outputTokens: 120, cached: 400, latencyMs: 850 });

console.log(tracker.stats);
// { reqPerMin, inputPerMin, outputPerMin, cachedPerMin, totalRequests, totalInput, totalOutput }

Used by the TokenBubble UI component for live dashboard metrics.

summarizeChat

Generates an anonymized summary of a chat conversation for inclusion in HyperSkill exports:

import { summarizeChat } from '@webmcp-auto-ui/agent';

const summary = summarizeChat(messages);
// Returns a short text summary without PII or raw message content

Per-request configuration

temperature, topK, and maxTokens can be set per-request via provider options:

const response = await provider.chat(messages, tools, {
  temperature: 0.7,
  topK: 40,
  maxTokens: 2048,
});

Prompt clipping

sizeInTokens(text) estimates the token count for a string. Used internally to clip long prompts before sending to the LLM.

Gemma LiteRT

import { GemmaProvider } from '@webmcp-auto-ui/agent';

const provider = new GemmaProvider({
  model: 'gemma-e2b',
  onProgress: (pct, status, loaded, total) => console.log(status, pct),
  onStatusChange: (s) => console.log(s), // 'loading' | 'ready' | 'error'
});

Requires Cross-Origin-Opener-Policy: same-origin and Cross-Origin-Embedder-Policy: credentialless headers for WebGPU support. Models are cached in OPFS after first download for instant subsequent loads.

API proxy (`+server.ts`)

The RemoteLLMProvider sends requests to a local +server.ts endpoint that forwards them to the configured LLM API. The endpoint reads LLM_API_KEY from the environment, or from body.__apiKey as a fallback (for cases where the key is provided at runtime).

// src/routes/api/chat/+server.ts
import { env } from '$env/dynamic/private';
export const POST: RequestHandler = async ({ request }) => {
  const body = await request.json();
  const apiKey = body.__apiKey || env.LLM_API_KEY;
  delete body.__apiKey;
  // Forward to your LLM provider (Anthropic, OpenAI, Mistral, etc.)
  const res = await fetch(LLM_ENDPOINT, {
    method: 'POST',
    headers: { 'Authorization': `Bearer ${apiKey}`, 'Content-Type': 'application/json' },
    body: JSON.stringify(body),
  });
  return Response.json(await res.json());
};

License

AGPL-3.0-or-later

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

@webmcp-auto-ui/agent

Providers

WebMCP autoui server

System prompt & lazy tool loading

Install

Usage

TokenTracker

summarizeChat

Per-request configuration

Prompt clipping

Gemma LiteRT

API proxy (+server.ts)

License

WebMCP `autoui` server

API proxy (`+server.ts`)