@cuylabs/agent-server

v7.4.0

Published

20 days ago

Agent-agnostic app server SDK for sessions, turns, control-plane APIs, and streamed events

0High
0Medium
0Low

cyb3rpandah

cyb3rward0g

agent app-server sessions turns streaming

@cuylabs/agent-server

Agent-agnostic app server SDK for sessions, turns, control-plane APIs, and streamed events. It ships in-process, stdio, and websocket clients plus an explicit agent-core adapter.

See docs/README.md for the modular package docs.

This package adds an interactive host/control-plane layer above agent implementations:

the hosted agent implementation owns execution, tools, model behavior, and persistence behind an adapter
@cuylabs/agent-server provides stable protocol DTOs, session/turn APIs, live event fanout, and client-facing control flows
@cuylabs/agent-core support is provided by the explicit createAgentCoreServerAdapter(...) adapter
@cuylabs/agent-runtime still owns background scheduling/orchestration
@cuylabs/agent-runtime-dapr still owns Dapr durability and workflow hosting

Boundary

Use @cuylabs/agent-server when you want a long-lived local process to:

manage multiple persisted sessions
start and interrupt turns independently of a single UI client
stream RuntimeTurnEvents to multiple subscribers
let clients disconnect and reconnect without losing server-owned turn state

This package is intentionally not:

a replacement for agent-runtime or agent-runtime-dapr
a new plugin system
a renderer/UI extension layer

The current implementation ships three access patterns:

in-process client/server for the CLI and tests
stdio transport for one external client process
websocket transport for reconnectable multi-client use

Capability contract

@cuylabs/agent-server now exposes an explicit capability snapshot through getCapabilities().

That contract is split into:

protocol
- transport kind, reconnect semantics, multi-client support
sessions
- persistence and session management features such as branching
turns
- streaming, waiting, interruption, steering, follow-up queuing, follow-up management, and concurrency rules
interactive
- approval request and human input request routing
runtime
- whether execution is local, remote, or hybrid
- whether orchestration is direct, agent-runtime, or agent-runtime-dapr
- whether Dapr-backed workflow delegation is available
agent
- workspace summary snapshots for startup and welcome surfaces
- model inspection and model switching
- tool, skill, and sub-agent profile inspection
- session runtime status snapshots for model/session/context state
- context compaction plus turn undo / diff helpers
plugins
- the server-side plugin contract: headless behavior, command metadata, no client extension execution

This keeps the server/client boundary inspectable without coupling clients to implementation details.

What this enables

@cuylabs/agent-server is useful when one agent session should outlive one UI process.

It gives us a place to:

keep a turn running even if one TUI client disconnects
let multiple clients observe the same live session
expose session read/list/branch APIs without booting a fresh UI-owned loop
share one backend between TUI, automation, and future web or chat clients
keep plugin and tool execution on the server side while clients focus on rendering and input
expose steering and approval-request flows through a stable protocol instead of tying them to one UI

Core model

This package uses a small runtime vocabulary instead of tying the protocol to one agent framework:

session — a logical persisted conversation exposed by the server/adapter
turn — one user input plus the streamed agent work that follows
event — a RuntimeTurnEvent emitted during a turn

Adapters map their native storage and event models into this contract.

Quick start

import { createAgent } from "@cuylabs/agent-core";
import {
  InProcessAgentServer,
  InProcessAgentServerClient,
} from "@cuylabs/agent-server";
import { createAgentCoreServerAdapter } from "@cuylabs/agent-server/agent-core";

const agent = createAgent({
  model,
  cwd: process.cwd(),
  systemPrompt: "You are a helpful coding agent.",
});

const server = new InProcessAgentServer(createAgentCoreServerAdapter(agent));
const client = new InProcessAgentServerClient(server);

const session = await client.createSession({ title: "Refactor auth flow" });
const turn = await client.startTurn(session.id, "Find the auth bug and fix it.");

const off = client.subscribe((notification) => {
  if (notification.type === "turn/event") {
    console.log(notification.event.type);
  }
}, { turnId: turn.id });

const completed = await client.waitForTurn(turn.id);
off();

console.log(client.getCapabilities().runtime);
console.log(completed.status, completed.output);

Transport examples

WebSocket server

import {
  InProcessAgentServer,
  startAgentServerWebSocketServer,
} from "@cuylabs/agent-server";
import { createAgentCoreServerAdapter } from "@cuylabs/agent-server/agent-core";

const server = new InProcessAgentServer(createAgentCoreServerAdapter(agent));
const transport = startAgentServerWebSocketServer(server, {
  host: "127.0.0.1",
  port: 4647,
});

await transport.ready;
console.error(`listening on ${transport.address()}`);

Remote stdio client

import { connectStdioAgentServerClient } from "@cuylabs/agent-server";

const client = await connectStdioAgentServerClient({
  command: process.execPath,
  args: ["./bin/cuylabs.js", "server", "--transport", "stdio"],
});

Human Input

The built-in agent-core adapter maps direct agent-core human input into the server protocol when the agent is configured with both:

humanInput on createAgent(...)
at least one human-input tool such as createHumanInputTool()

Example:

import {
  createAgent,
  createHumanInputTool,
} from "@cuylabs/agent-core";
import { createAgentCoreServerAdapter } from "@cuylabs/agent-server/agent-core";

const agent = createAgent({
  model,
  humanInput: {},
  tools: [createHumanInputTool()],
});

const adapter = createAgentCoreServerAdapter(agent);

In that setup:

getCapabilities().interactive.humanRequests becomes true
human-input-request and human-input-resolved flow through turn events
respondToInputRequest(..., { kind: "human", response }) resolves the pending controller request

This keeps direct server mode aligned with the durable Dapr semantics:

request state is authoritative
events are live notifications
explicit response APIs resume execution

Why this exists

Without an app server, the CLI process usually owns the live turn:

UI starts the turn
UI consumes agent.chat(...)
UI dies, the live interaction dies

With an app server:

server starts the turn
server owns session and turn state
server fans out updates to subscribers
clients can reconnect or multiple clients can observe the same session

That is the key shift: the live agent loop stops belonging to one terminal window.

Adapters And Plugins

This package does not require a plugin API rewrite. Plugin behavior belongs to the hosted agent implementation and is exposed to clients only through the server protocol.

For agent-core-backed servers, agent-core plugins already contribute headless behavior:

tools
middleware
host commands
prompt sections
lifecycle hooks

That logic stays on the server side. The app server executes through the adapter, so plugin business logic continues to run where it already belongs.

Server-backed hosts can now also discover and execute plugin commands through the same agent-server contract:

listPluginCommands()
executePluginCommand(name, args)

That keeps plugin command routing attached to the hosted agent instead of duplicating command execution logic in every client.

If you later add client-specific UI extensions, that should be a separate client-layer extension surface. Do not mix TUI/web UI contributions into the server plugin contract.

Relationship to Dapr

agent-runtime-dapr and agent-server solve different problems:

Dapr gives durable workflows, checkpoints, state stores, and jobs
agent-server gives session/turn APIs, subscriptions, and live client semantics

They compose well:

agent-server can start turns locally
some turns can be delegated to a durable runtime later while still presenting the same client-facing capability contract
the client-facing protocol stays the same

Current status

The current package provides:

in-process session listing, reading, creation, deletion, and branching
background turn execution with startTurn(...)
per-turn interruption, steering, and follow-up queuing
follow-up management: list, get, resolve/discard queued follow-ups
approval request routing over server notifications
human input request routing over server notifications
multi-subscriber event fanout
waitForTurn(...) and turn inspection
capability-gated agent affordances such as runtime status, model switching, tool inventory, skill inspection, sub-agent profiles, compact, undo, and diff
workspace summary snapshots that let clients render startup state from the server instead of stitching local counters together
stdio and websocket transports for external clients

The next layer is not “add a server.” The server already exists. The next layer is broader client adoption and richer request/response flows on top of the same ownership model.

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

@cuylabs/agent-server

Boundary

Capability contract

What this enables

Core model

Quick start

Transport examples

WebSocket server

Remote stdio client

Human Input

Why this exists

Adapters And Plugins

Relationship to Dapr

Current status