@aws-blocks/bb-agent

v0.1.2

Published

2 days ago

AI agent with streaming, tool calling, and conversation persistence. Powered by [Strands Agents SDK](https://strandsagents.com/).

0High
0Medium
0Low

aws-blocks-npm-ops

@aws-blocks/bb-agent

AI agent with streaming, tool calling, and conversation persistence. Powered by Strands Agents SDK.

When to use: Conversational AI experiences — chatbots, copilots, data extraction, or any LLM-powered feature. Supports multi-turn conversations, tool calling with Zod schemas, and multiple model providers.

Requires: zod ^4.0.0 as a peer dependency. Tool parameters use Zod schemas for validation. If you see ZodType missing properties errors, check your zod version.

Quick Start

import { Scope } from '@aws-blocks/core';
import { Agent, BedrockModels } from '@aws-blocks/bb-agent';

const scope = new Scope('my-app');

const agent = new Agent(scope, 'support-agent', {
  model: { deployed: BedrockModels.DEFAULT },
  systemPrompt: 'You are a helpful support agent.',
});

// Create a conversation and stream a response
const conversationId = await agent.createConversationId('user-123');
const channel = await agent.getChannel(conversationId);
const sub = channel.subscribe((chunk) => { /* handle chunk */ });
await sub.established;
const result = await agent.stream('Until when are you open tomorrow?', { conversationId, userId: 'user-123' });
const done = await result.complete();
console.log(done.text); // "We're open until 6pm tomorrow."

See Tools for adding capabilities, Model Configuration for provider setup, and Local Development for running without AWS Bedrock.

API

const agent = new Agent(scope, id, config)

| Method | Returns | Description | |--------|---------|-------------| | stream(message, options?) | Promise<AgentStreamResult> | Submit a message. Returns immediately with { channelId, channel, complete }. | | resume(channelId, responses, options?) | Promise<void> | Resume an interrupted agent with user responses. Chunks publish to the same channel. | | createConversationId(userId) | Promise<string> | Generate a new conversation ID (UUID). | | getConversation(id, options?) | Promise<Message[]> | Get messages in a conversation. Pass { limit } for most recent N. | | listConversations(userId) | Promise<Conversation[]> | List all conversations for a user. | | deleteConversation(id, userId) | Promise<void> | Delete a conversation and its session data. | | getPendingInterrupts(conversationId) | Promise<Array<...>> | Get unanswered interrupts (for reload support). | | getChannel(channelId) | Promise<RealtimeChannel> | Get a Realtime channel for subscribing to chunks. |

stream() submits the message to AsyncJob and returns immediately — no API Gateway timeout risk. The agent runs asynchronously and publishes chunks to Realtime.

Important: Subscribe before sending. The agent starts emitting chunks immediately after stream() is called. If you subscribe to the channel after calling stream(), early chunks may be dropped. Always subscribe first, await established, then send:

// Correct: subscribe first, await established, then send
const channel = await agent.getChannel(conversationId);
const sub = channel.subscribe((chunk) => { /* handle chunk */ });
await sub.established;
await agent.stream(message, { conversationId, userId });

// Wrong: send first, subscribe after — early chunks lost
await agent.stream(message, { conversationId, userId });
const channel = await agent.getChannel(conversationId); // too late!

The useChat hook (see Client Hook) handles this ordering automatically. Use it instead of hand-rolling stream logic.

Authorization (caller responsibility)

The Agent BB scopes data by conversationId, which is an unguessable UUID, but it does not authorize the caller against a conversation on read paths. getConversation(id) and getPendingInterrupts(conversationId) take only an id, so any caller that supplies a valid conversation ID gets the messages back.

Your API handler owns authorization: derive userId from the authenticated session and verify the conversation belongs to that user before reading it. listConversations(userId) returns only the conversations a user owns, so it's the safe way to resolve which conversation IDs a caller may access:

export const api = new ApiNamespace(scope, 'api', (context) => ({
  async getMessages(conversationId: string) {
    const user = await auth.getCurrentUser(context);
    const owned = await agent.listConversations(user.userId);
    if (!owned.some(c => c.conversationId === conversationId)) {
      throw new Error('Not found');
    }
    return agent.getConversation(conversationId);
  },
}));

deleteConversation(id, userId) is owner-scoped internally — it verifies the conversation belongs to userId before deleting anything, so a non-owner call is a no-op.

AgentStreamResult

Returned by stream(). Provides the Realtime channel and convenience methods:

| Property/Method | Type | Description | |--------|------|-------------| | channelId | string | Realtime channel where chunks are published. | | channel | Promise<RealtimeChannel> | Realtime channel handle — await it, then call .subscribe(handler). | | complete() | Promise<AgentStreamChunk> | Wait for the done chunk (full text + token usage). |

AgentStreamChunk

Each chunk published to the Realtime channel has a type and type-specific fields:

| Type | Fields | Description | |------|--------|-------------| | text-delta | text: string | Incremental text token (in 'token' streaming mode) or full block (in 'block' mode). | | tool-call | toolName: string, input: JSONValue | Agent is calling a tool. | | tool-result | toolName: string, text: string | Tool returned a result. | | done | text: string, usage: TokenUsage | Agent finished. text contains the full response. usage has { inputTokens, outputTokens, totalTokens }. | | error | error: string | Agent encountered an error. | | interrupt | interrupts: Array<{ id, name, reason }> | Agent paused for approval. See Tool Approval. |

Message Roles

Messages stored in conversation history use these roles:

| Role | Description | |------|-------------| | user | User message. | | assistant | Agent response text. | | tool-call | Record of a tool invocation (stored for audit). | | tool-result | Record of a tool's return value. | | approval | User's approval/denial response to an interrupt. | | interrupt | Agent paused — snapshot of pending interrupts. |

The useChat hook only surfaces user, assistant, and approval messages to the UI. Use agent.getConversation() directly to access the full history including tool-call/tool-result records.

AgentConfig

| Option | Type | Description | |--------|------|----------------------------------------------------------------------| | model | { deployed, local? } | Model configuration (see below). | | systemPrompt | string | System prompt for the agent. | | tools | (tool) => Record<string, AgentTool> | Tools the agent can call during reasoning. | | toolContextSchema | z.ZodType | Optional schema for per-call tool context. When set, context is required and typed. | | inferenceOnly | boolean | Skip persistence infra. Default: false. | | conversation | ConversationManagerConfig | How the agent trims message history (sliding-window or summarizing). | | streamingMode | 'token' \| 'block' | How text chunks are published to the client. Default: 'block'. |

Model Configuration

Only deployed is required. Local development works out of the box — the canned provider (keyword-based mock) is used automatically when no local model is specified.

| Option | Type | Description | |--------|------|-------------| | provider | 'bedrock' \| 'openai-api' \| 'canned' | Model provider. | | modelId | string | Model ID. Required for bedrock and openai-api. | | endpoint | string | API endpoint. For openai-api (defaults to api.openai.com). | | apiKey | string \| () => Promise<string> | API key for openai-api. Accepts a string or async resolver. Falls back to OPENAI_API_KEY env var. | | inferenceConfig | { temperature?, topP?, maxTokens?, stopSequences? } | Optional inference parameters. |

import { Agent } from '@aws-blocks/bb-agent';

// Minimal — just deployed model, canned provider used locally automatically
const agent = new Agent(scope, 'agent', {
  model: {
    deployed: { provider: 'bedrock', modelId: '...' },
  },
  systemPrompt: '...',
});

Specify a model for local development to use instead of the canned provider:

const agent = new Agent(scope, 'agent', {
  model: {
    deployed: { provider: 'bedrock', modelId: '...' },
    local: { provider: 'openai-api', modelId: 'llama3.1:8b', endpoint: 'http://localhost:11434/v1', apiKey: 'ollama' },
  },
  systemPrompt: '...',
});

For fallback support, provide an array of candidates. They are tried in order — the first available model wins. Health checks verify each candidate before selecting it (see Health Checks):

model: {
  deployed: [
    { provider: 'bedrock', modelId: '...' },
    { provider: 'bedrock', modelId: '...' },
    { provider: 'canned' },  // canned can be used in deployed as a last resort
  ],
  local: [
    { provider: 'openai-api', modelId: 'llama3.2:3b', endpoint: 'http://localhost:11434/v1' },
    // canned is always appended implicitly as last fallback for local
  ],
}

Bedrock Presets

Pre-configured model presets for quick setup. Names are capability-based so the underlying model can be upgraded without breaking your code. These use cross-region inference profiles — work across all AWS regions:

import { Agent, BedrockModels} from '@aws-blocks/bb-agent';

const agent = new Agent(scope, 'agent', {
  model: {
    deployed: BedrockModels.DEFAULT,
  },
  systemPrompt: '...',
});

| Preset | Current Model | Notes | |--------|---------------|-------| | BedrockModels.DEFAULT | us.anthropic.claude-opus-4-8-20250610-v1:0 | Highest capability. Recommended default. | | BedrockModels.BALANCED | us.anthropic.claude-sonnet-4-20250514-v1:0 | Strong quality/cost balance. | | BedrockModels.FAST | us.anthropic.claude-haiku-4-5-20251001-v1:0 | Fastest, lowest latency. | | BedrockModels.BUDGET | us.amazon.nova-pro-v1:0 | Low cost per token with acceptable quality. | | BedrockModels.MICRO | us.amazon.nova-lite-v1:0 | Ultra-cheap for simple tasks. |

Override inference settings with spread:

model: { deployed: { ...BedrockModels.DEFAULT, inferenceConfig: { temperature: 0.9, maxTokens: 8192 } } }

Ollama Presets

Convenience shortcuts for local development using Ollama. Requires Ollama installed and running (ollama serve), model pulled (ollama pull <model-id>). Uses the default endpoint http://localhost:11434/v1.

import { Agent, BedrockModels, OllamaModels} from '@aws-blocks/bb-agent';

const agent = new Agent(scope, 'agent', {
  model: {
    deployed: BedrockModels.DEFAULT, 
    local: OllamaModels.SMALL,
  },
  systemPrompt: '...',
});

| Preset | Current Model | Size | Recommended VRAM | |--------|---------------|------|------------------| | OllamaModels.XSMALL | llama3.2:3b | 2 GB | 4 GB | | OllamaModels.SMALL | llama3.1:8b | 4.7 GB | 8 GB | | OllamaModels.MEDIUM | deepseek-r1:14b | 9 GB | 16 GB | | OllamaModels.LARGE | llama3.3:70b | 43 GB | 48 GB+ | | OllamaModels.XLARGE | llama4:16x17b | 67 GB | 80 GB+ |

Custom endpoint or specific model? Use openai-api directly:

model: { local: { provider: 'openai-api', modelId: 'llama3.1:8b', endpoint: 'http://custom-host:11434/v1', apiKey: 'ollama' } }

See Ollama Presets and Local Development for more options.

Health Checks

Before selecting a model, the agent verifies its availability:

Bedrock: Verifies model availability via @aws-sdk/client-bedrock (free, no inference cost).
OpenAI-compatible: Pings GET /v1/models and checks if the specified model ID is in the response.
Canned: Always available (no external dependency).

Health checks verify the model exists but cannot guarantee invoke access (e.g., EULA not accepted, quota limits). If all candidates fail, the agent throws AgentErrors.ModelUnavailable. Check logs for details.

To see detailed health check logs, pass a logger with info level:

import { Logger } from '@aws-blocks/bb-logger';

const agent = new Agent(scope, 'agent', {
  model: { deployed: BedrockModels.DEFAULT },
  systemPrompt: '...',
  logger: new Logger(scope, 'agent-log', { level: 'info' }),
});

API Key Management

// Recommended: AppSetting with secret (encrypted via SSM SecureString)
const openaiKey = new AppSetting(scope, 'openai-key', {
  name: '/myapp/openai-api-key',
  secret: true,
});

const agent = new Agent(scope, 'agent', {
  model: {
    deployed: {
      provider: 'openai-api',
      modelId: 'gpt-4',
      apiKey: () => openaiKey.get(),
    },
  },
});

// Alternative: environment variable (local dev)
// Set OPENAI_API_KEY — no apiKey needed in config

// Alternative: plain string (discouraged — leaks in source control)
// apiKey: 'sk-...'

AWS Credentials (Bedrock)

The bedrock provider uses your configured AWS credentials. See Strands quickstart for setup instructions.

Bedrock via Mantle

Amazon Bedrock exposes an OpenAI-compatible endpoint via Bedrock Mantle. Use it with provider: 'openai-api' and set the endpoint to https://bedrock-mantle.<region>.api.aws/v1.

Error Handling

import { isBlocksError } from '@aws-blocks/core';
import { AgentErrors } from '@aws-blocks/bb-agent';

try {
  await agent.getConversation(id);
} catch (e: unknown) {
  if (isBlocksError(e, AgentErrors.PersistenceRequired)) {
    // agent is in inferenceOnly mode
  }
}

| Error | When | |-------|------| | AgentErrors.PersistenceRequired | Conversation CRUD called on an inferenceOnly agent. | | AgentErrors.InvalidModelConfig | Missing modelId, apiKey, unknown provider, or needsApproval + interrupt both specified. | | AgentErrors.ModelUnavailable | All model candidates failed health checks. Check logs for details. | | AgentErrors.StreamFailed | Agent encountered an error during execution. | | AgentErrors.InterruptRequired | Agent paused for approval. Use InterruptError for typed access to pending interrupts. | | AgentErrors.BrowserNotSupported | Agent instantiated in the browser (server-side only). |

Streaming Mode

Controls how text is published to the client:

'block' (default) — buffers text and publishes when a full content block completes.
'token' — publishes every text delta immediately as it arrives. Use for typewriter-style UIs.

const agent = new Agent(scope, 'support', {
  streamingMode: 'token',
  ...
});

Conversation Management

Controls how the agent trims message history when the context window fills up:

// Sliding window — keep last 20 messages
const agent = new Agent(scope, 'support', {
  conversation: { strategy: 'sliding-window', windowSize: 20 },
  ...
});

// Summarizing — summarizes older messages, preserves 5 most recent
const agent = new Agent(scope, 'support', {
  conversation: { strategy: 'summarizing', preserveRecentMessages: 5 },
  ...
});

Tools

Tools let the agent take actions during its reasoning — query a database, call an API, send an email. The model decides when to call a tool based on the user's message and the tool's description. You define the tool's schema and handler; the framework handles the rest.

Adding Tools

Add tools to let the agent take actions. Each tool has a description, Zod schema for parameters, and a handler. The handler receives { input, context, interrupt }:

import { z } from 'zod';

const agent = new Agent(scope, 'support', {
  model: { deployed: { provider: 'bedrock', modelId: '...' } },
  systemPrompt: 'You are a customer support agent. Look up orders when asked.',
  tools: (tool) => ({
    getOrderStatus: tool({
      description: 'Get the status of a customer order by ID',
      parameters: z.object({ orderId: z.string() }),
      handler: async ({ input }) => {
        const order = await db.getOrder(input.orderId);
        return { orderId: input.orderId, status: order.status, total: order.total };
      },
    }),
  }),
});

Declaring tools (the `tools` callback)

tools is a callback that receives a tool() factory and returns a Record keyed by tool name:

tools: (tool) => ({
  getOrderStatus: tool({ /* ... */ }),
})

The callback form lets TypeScript infer each tool's input from its parameters. The Record key is the tool's name.

Tool Context — Scoping Tools to the Caller

Tools often need request-scoped information (e.g. the authenticated userId). Pass a context object on each stream()/resume() call; it's forwarded to every tool invocation:

const agent = new Agent(scope, 'support', {
  model: { deployed: { provider: 'bedrock', modelId: '...' } },
  systemPrompt: 'You are a support agent.',
  tools: (tool) => ({
    listMyOrders: tool({
      description: "List the current user's orders",
      parameters: z.object({}),
      handler: async ({ context }) => {
        return db.listOrders({ userId: context.userId });
      },
    }),
  }),
});

const user = await auth.getCurrentUser(requestContext);
await agent.stream(message, { conversationId, userId: user.userId, context: { userId: user.userId } });

To make context required and type-safe, declare a toolContextSchema:

const agent = new Agent(scope, 'support', {
  model: { deployed: { provider: 'bedrock', modelId: '...' } },
  systemPrompt: '...',
  toolContextSchema: z.object({ userId: z.string(), tenantId: z.string() }),
  tools: (tool) => ({
    listMyOrders: tool({
      description: "List the current user's orders",
      parameters: z.object({}),
      handler: async ({ context }) => {
        // context.userId and context.tenantId are typed as string
        return db.listOrders({ userId: context.userId, tenantId: context.tenantId });
      },
    }),
  }),
});

// context is now required and validated — omitting it throws InvalidModelConfig
await agent.stream(message, { conversationId, userId, context: { userId, tenantId } });

Using KnowledgeBase with the Agent

The KnowledgeBase BB can be used as an agent tool, giving the agent the ability to search documents on demand:

import { Agent } from '@aws-blocks/bb-agent';
import { KnowledgeBase } from '@aws-blocks/bb-knowledge-base';
import { z } from 'zod';

const kb = new KnowledgeBase(scope, 'docs', {
  source: './knowledge',
  description: 'Product documentation and FAQs',
});

const agent = new Agent(scope, 'assistant', {
  model: { deployed: { provider: 'bedrock', modelId: '...' } },
  systemPrompt: 'You are a helpful assistant. Search the knowledge base when the user asks about our product.',
  tools: (tool) => ({
    searchDocs: tool({
      description: 'Search product documentation for relevant information',
      parameters: z.object({
        query: z.string().describe('The search query'),
        maxResults: z.number().optional().describe('Max results to return (default: 5)'),
      }),
      handler: async ({ input }) => kb.retrieve(input.query, { maxResults: input.maxResults ?? 5 }),
    }),
  }),
});

Tool Approval (Human-in-the-Loop)

By default, tools run autonomously. Set needsApproval: true on tools that should pause for user approval — the agent publishes an interrupt chunk, the client shows a confirmation UI, the user responds, and the agent resumes.

| Configuration | Behavior | |---------------|----------| | needsApproval: false (default) | Tool runs autonomously | | needsApproval: true | Pauses for approval every time — user sees Yes / No | | needsApproval: true, trustable: true | Pauses for approval — user sees Yes / No / Trust. "Trust" auto-approves for the rest of the conversation |

Tools that modify state should require user approval. Set needsApproval: true:

tools: (tool) => ({
  getOrderStatus: tool({
    description: 'Look up an order',
    parameters: z.object({ orderId: z.string() }),
    needsApproval: false,  // read-only — safe to run
    handler: async ({ input }) => db.getOrder(input.orderId),
  }),
  cancelOrder: tool({
    description: 'Cancel a customer order',
    parameters: z.object({ orderId: z.string(), reason: z.string() }),
    needsApproval: true,   // destructive — ask first
    trustable: true,        // user can say "trust" to stop being asked
    handler: async ({ input }) => db.cancelOrder(input.orderId, input.reason),
  }),
})

When a tool is interrupted, the client receives an interrupt chunk. Resume with agent.resume():

// Client receives: { type: 'interrupt', interrupts: [{ id, name, reason }] }
// User approves → resume the agent:
await agent.resume(channelId, [{ interruptId: interrupt.id, approved: true }], { conversationId, userId });

Interrupt chunk format: name is approve:${toolName}:${toolUseId} and reason contains { tool: string, input: any, trustable: boolean }. Use reason.tool for display and reason.trustable to decide whether to show a Trust button.

Custom Interrupts

For tools that need input-level approval decisions or runtime-conditional pausing, use the interrupt field or call interrupt() inside the handler:

tools: (tool) => ({
  transferMoney: tool({
    description: 'Transfer money between accounts',
    parameters: z.object({ from: z.string(), to: z.string(), amount: z.number() }),
    interrupt: ({ input, interrupt }) => {
      if (input.amount > 100) {
        interrupt({ name: 'confirm-transfer', reason: { message: `Transfer $${input.amount}?` } });
      }
    },
    handler: async ({ input }) => ({ status: 'completed', amount: input.amount }),
  }),
})

Headless Usage (No UI)

The Agent BB works without a frontend — for scripts, background jobs, or server-to-server flows. Use complete() to wait for the full response. For UI-based flows, see Client Hook — useChat.

Without tool approval

import { Agent, BedrockModels } from '@aws-blocks/bb-agent';

const agent = new Agent(scope, 'summarizer', {
  model: { deployed: BedrockModels.DEFAULT },
  systemPrompt: 'Summarize the input concisely.',
});

const conversationId = await agent.createConversationId('system');
const result = await agent.stream('Summarize this quarter earnings report...', { conversationId, userId: 'system' });
const done = await result.complete();
console.log(done.text);

With tool approval

When tools have needsApproval: true, complete() throws an InterruptError. Handle it programmatically:

import { Agent, BedrockModels, InterruptError } from '@aws-blocks/bb-agent';
import { z } from 'zod';

const refundBot = new Agent(scope, 'refunds', {
  model: { deployed: BedrockModels.DEFAULT },
  systemPrompt: 'You process customer refund requests.',
  tools: (tool) => ({
    issueRefund: tool({
      description: 'Issue a refund to a customer',
      parameters: z.object({ orderId: z.string(), amount: z.number() }),
      needsApproval: true,
      handler: async ({ input }) => {
        await payments.refund(input.orderId, input.amount);
        return { refunded: true, amount: input.amount };
      },
    }),
  }),
});

const conversationId = await refundBot.createConversationId('system');
const result = await refundBot.stream('Refund order #456, item was damaged. Total was $75.', { conversationId, userId: 'system' });

while (true) {
  try {
    const done = await result.complete();
    console.log(done.text);
    break;
  } catch (err) {
    if (!(err instanceof InterruptError)) throw err;
    // Auto-approve refunds under $100, reject larger ones
    const responses = err.interrupts.map(i => ({
      interruptId: i.id,
      approved: i.reason?.input?.amount < 100,
    }));
    await refundBot.resume(result.channelId, responses, { conversationId, userId: 'system' });
  }
}

Inference-Only (No Persistence)

Set inferenceOnly: true for stateless tasks that don't need conversation history — classification, extraction, summarization. No DynamoDB tables or session storage are created.

const classifier = new Agent(scope, 'classifier', {
  inferenceOnly: true,
  model: { deployed: { provider: 'bedrock', modelId: '...' } },
  systemPrompt: 'Classify the sentiment of the input as positive, negative, or neutral.',
});

const result = await classifier.stream('I love this product!');
const done = await result.complete();
console.log(done.text); // "positive"

Local Development

The Agent BB works locally without any external dependencies. No AWS credentials, no API keys, no running services — just npm run dev. By default the agent uses the CannedProvider — a keyword-based mock that responds instantly without calling any real model. For real LLM calls locally, set model.local to an openai-api config (Ollama, vLLM, etc.) or use the Ollama presets. See Model Configuration for details.

Use Local LLM

For real model responses during development, use a fallback chain with your company's shared vLLM server and a local Ollama instance. The agent tries each in order — on the company network it uses the shared server, at home it falls through to your local Ollama:

const agent = new Agent(scope, 'support', {
  model: {
    deployed: { provider: 'bedrock', modelId: '...' },
    local: [
      { provider: 'openai-api', modelId: 'llama3.1:70b', endpoint: 'http://vllm.internal.company.com/v1' },
      { provider: 'openai-api', modelId: 'llama3.1:8b', endpoint: 'http://localhost:11434/v1', apiKey: 'ollama' },
      // canned is appended implicitly — if nothing is available, agent still works
    ],
  },
  systemPrompt: '...',
});

Canned Provider

The CannedProvider is a custom Strands model provider that requires no network or API keys:

Returns simple mock responses
Triggers tool calls when the prompt mentions a tool name (e.g., "get order" triggers getOrderStatus)
Generates valid tool inputs from Zod schemas using type-based placeholders
Streams responses word by word, matching the same protocol as real providers

Client Hook — `useChat`

Import from @aws-blocks/bb-agent/client. Manages conversation state, streaming subscriptions, and interrupt handling. Handles the subscribe-before-send ordering automatically.

import { useChat } from '@aws-blocks/bb-agent/client';

const chat = useChat({
  api: {
    sendMessage: (convId, msg, chId) => api.sendMessage(convId, msg, chId),
    createConversation: () => api.createConversation(userId),
    getConversation: (id) => api.getConversation(id),
    resume: (chId, responses, convId) => api.resume(chId, responses, convId),
  },
  subscribe: async (channelId, handler) => {
    const channel = await api.getChannel(channelId);
    return channel.subscribe(handler);
  },
  onMessagesChange: (msgs) => renderMessages(msgs),
  onLoadingChange: (loading) => updateSpinner(loading),
  onInterrupt: (interrupts) => showApprovalUI(interrupts),
});

await chat.sendMessage('Hello!');
await chat.respondToInterrupt([{ interruptId: 'x', approved: true }]);

Note: useChat is a factory function, not a React hook. Call it once (e.g., outside a component or in a ref) — not on every render. It returns a mutable singleton. Message history only includes user, assistant, and approval messages — tool-call/tool-result internals are filtered for UI clarity. Use getConversation() directly if you need the full history.

Full Examples

1. End-to-End: Backend + Frontend with `useChat`

Complete wiring showing the backend API and frontend useChat connected together.

Backend (aws-blocks/index.ts):

import { Scope, ApiNamespace } from '@aws-blocks/core';
import { Agent, BedrockModels } from '@aws-blocks/bb-agent';

const scope = new Scope('my-app');

const agent = new Agent(scope, 'chat', {
  model: { deployed: BedrockModels.DEFAULT },
  systemPrompt: 'You are a helpful assistant.',
});

export const api = new ApiNamespace(scope, 'api', (context) => ({
  async createConversation(userId: string) {
    return { conversationId: await agent.createConversationId(userId) };
  },
  async sendMessage(conversationId: string, message: string, channelId: string, userId: string) {
    await agent.stream(message, { conversationId, channelId, userId });
  },
  async getConversation(conversationId: string) {
    const messages = await agent.getConversation(conversationId);
    return { messages };
  },
  async getChannel(channelId: string) {
    return agent.getChannel(channelId);
  },
}));

Frontend (app.ts):

import { useChat } from '@aws-blocks/bb-agent/client';

const userId = getCurrentUserId();

const chat = useChat({
  api: {
    sendMessage: (convId, msg, chId) => api.sendMessage(convId, msg, chId, userId),
    createConversation: () => api.createConversation(userId),
    getConversation: (id) => api.getConversation(id),
  },
  subscribe: async (channelId, handler) => {
    const channel = await api.getChannel(channelId);
    return channel.subscribe(handler);
  },
  onMessagesChange: (msgs) => renderMessages(msgs),
  onLoadingChange: (loading) => updateSpinner(loading),
});

// Send a message — useChat handles subscribe-before-send automatically
await chat.sendMessage('Hello!');

// Load an existing conversation (subscribes first, then backfills history)
await chat.loadConversation('conv-123');

2. Support Agent with Tools

Agent with tools that can look up orders and search documentation. Uses tool context to scope queries to the authenticated user.

import { Scope, ApiNamespace } from '@aws-blocks/core';
import { Agent, BedrockModels } from '@aws-blocks/bb-agent';
import { KnowledgeBase } from '@aws-blocks/bb-knowledge-base';
import { z } from 'zod';

const scope = new Scope('my-app');

const kb = new KnowledgeBase(scope, 'docs', { source: './knowledge' });

const agent = new Agent(scope, 'support', {
  model: { deployed: BedrockModels.DEFAULT },
  systemPrompt: 'You are a customer support agent. Look up orders and search documentation to help the user.',
  toolContextSchema: z.object({ userId: z.string() }),
  tools: (tool) => ({
    getOrder: tool({
      description: 'Get order details by ID',
      parameters: z.object({ orderId: z.string() }),
      handler: async ({ input, context }) => {
        return db.getOrder(input.orderId, { userId: context.userId });
      },
    }),
    searchDocs: tool({
      description: 'Search product documentation',
      parameters: z.object({ query: z.string() }),
      handler: async ({ input }) => kb.retrieve(input.query, { maxResults: 5 }),
    }),
  }),
});

export const api = new ApiNamespace(scope, 'api', (context) => ({
  async chat(message: string, conversationId: string) {
    const user = await auth.getCurrentUser(context);
    return await agent.stream(message, {
      conversationId,
      userId: user.userId,
      context: { userId: user.userId },
    });
  },
}));

Best Practices

Keep system prompts focused — one agent per task, not one agent for everything
Define tools with descriptive names and descriptions — the model uses these to decide when to call them
Set model.local to an array of fallback candidates for flexible local dev
Set logging to info during development to surface health check and model resolution details

What It Provisions

The Agent BB composes several internal Building Blocks automatically:

| BB | AWS Resource | Purpose | |----|-------------|---------| | FileBucket | S3 | Session snapshot storage (Strands agent state between turns) | | DistributedTable × 2 | DynamoDB | Conversations table + messages table | | Realtime | API Gateway WebSocket | Streaming chunks to connected clients | | AsyncJob | SQS + Lambda | Runs the agent asynchronously (no API Gateway timeout) |

When inferenceOnly: true, the two DistributedTables are skipped (no conversation persistence).

Scaling & Cost (AWS)

Model: Bedrock pay-per-token pricing. See Bedrock pricing.
Persistence: DynamoDB (DistributedTable) — PAY_PER_REQUEST, single-digit ms latency.
Session storage: S3 (FileBucket) — ~$0.023 per GB/month.
Async execution: SQS (AsyncJob) — $0.40 per million messages.
Streaming: AppSync Events (Realtime) — $1.00 per million connection minutes.
No timeout limit: Agent runs in AsyncJob consumer Lambda (up to 15 min), not behind API Gateway.

Troubleshooting

"Access denied / Legacy model" — Some older model IDs may be marked as legacy. Switch to a cross-region inference profile.

"ValidationException" — Model ID not recognized. Use aws bedrock list-foundation-models --query "modelSummaries[].modelId" to see available models.

Health check passes but invocation fails — The health check verifies the model exists but cannot check EULA acceptance or account-level access.

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

@aws-blocks/bb-agent

Quick Start

API

Authorization (caller responsibility)

AgentStreamResult

AgentStreamChunk

Message Roles

AgentConfig

Model Configuration

Bedrock Presets

Ollama Presets

Health Checks

API Key Management

AWS Credentials (Bedrock)

Bedrock via Mantle

Error Handling

Streaming Mode

Conversation Management

Tools

Adding Tools

Declaring tools (the tools callback)

Tool Context — Scoping Tools to the Caller

Using KnowledgeBase with the Agent

Tool Approval (Human-in-the-Loop)

Custom Interrupts

Headless Usage (No UI)

Without tool approval

With tool approval

Inference-Only (No Persistence)

Local Development

Use Local LLM

Canned Provider

Client Hook — useChat

Full Examples

1. End-to-End: Backend + Frontend with useChat

2. Support Agent with Tools

Best Practices

What It Provisions

Scaling & Cost (AWS)

Troubleshooting

See Also

Declaring tools (the `tools` callback)

Client Hook — `useChat`

1. End-to-End: Backend + Frontend with `useChat`