@brainst0rm/endpoint-stub

v0.1.0

Published

7 days ago

justinjilg

Reference implementation of the endpoint side of the Brainstorm dispatch protocol. Connects to a relay over WebSocket, receives CommandEnvelope frames, executes tools via a pluggable executor, and emits CommandAck / ProgressEvent / CommandResult back to the relay.

What this is for

Test fixture for distributed dispatch flows (Stage 1.1+ in docs/endpoint-agent-plan.md). Stand it up alongside @brainst0rm/relay to exercise the full operator → relay → endpoint loop without needing a real sandboxed agent.
Reference for crd4sdom's production brainstorm-agent (Go). The TypeScript here pins the protocol semantics — CommandAck timing, signature verification order, lifecycle transitions — that the Go implementation must also satisfy.
Self-contained dev endpoint for local brainstorm dispatch smoke tests on a developer laptop.

What this is NOT

The stub is honest about being a stub:

No microVM sandbox isolation (P3 work in the production agent)
No real evidence-chain hashing of execution
No reset machinery between commands
No GuestQuery / GuestResponse integrity-monitor handling

Every result the stub produces includes { stub: true } in its stdout JSON so consumers can immediately see they're not running against a real isolated endpoint.
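A consumer that wants to refuse stub results outside of tests could check for that marker. This is a minimal sketch, assuming the marker is a top-level boolean field of the stdout JSON; the helper name `isStubResult` is hypothetical and not part of this package.

```typescript
// Hypothetical helper: returns true when stdout is a JSON object carrying `stub: true`.
// Assumption: the marker is a top-level boolean field of the stdout JSON.
function isStubResult(stdout: string): boolean {
  try {
    const parsed: unknown = JSON.parse(stdout);
    return (
      typeof parsed === "object" &&
      parsed !== null &&
      (parsed as Record<string, unknown>).stub === true
    );
  } catch {
    // Non-JSON stdout cannot carry the marker.
    return false;
  }
}
```

A custom executor that returns non-JSON stdout would make this helper return false, so treat it as a convenience check rather than a guarantee.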

Quick start

# 1. Start a relay (separate terminal)
brainstorm-relay

# 2. Have an admin issue a bootstrap token via the relay's HTTP API
curl -X POST http://127.0.0.1:8444/v1/admin/endpoint/enroll \
  -H "Authorization: Bearer $ADMIN_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"tenant_id":"tenant-dev"}'
# → { "bootstrap_token": "...", "endpoint_id": "uuid-..." }

# 3. Run the stub
export BRAINSTORM_RELAY_URL_WS=ws://127.0.0.1:8443
export BRAINSTORM_RELAY_URL_HTTP=http://127.0.0.1:8444
export BRAINSTORM_ENDPOINT_BOOTSTRAP=...        # from step 2
export BRAINSTORM_ENDPOINT_TENANT_ID=tenant-dev
export BRAINSTORM_ENDPOINT_ID=...               # from step 2
export BRAINSTORM_ENDPOINT_TENANT_PUBKEY_HEX=... # tenant's signing pubkey
brainstorm-endpoint-stub

The stub generates an Ed25519 keypair on first run and persists it to ~/.brainstorm/endpoint-stub/identity.json (mode 0600). Subsequent runs reuse the keypair, so the relay continues to recognize it.

Programmatic usage

import { EndpointStub, type ToolExecutor } from "@brainst0rm/endpoint-stub";

const myExecutor: ToolExecutor = async (ctx) => {
  return { exit_code: 0, stdout: `ran ${ctx.tool}`, stderr: "" };
};

const stub = new EndpointStub({
  relayUrl: "ws://127.0.0.1:8443",
  tenantId: "tenant-dev",
  identityPath: "/tmp/my-endpoint.json",
  endpointId: "uuid-...",
  tenantPublicKey: tenantPubKeyBytes,
  executor: myExecutor,
});

await stub.connect(); // EndpointHello + await EndpointHelloAck
await stub.run(); // Loop until close

connect() resolves once the session is established, so it's safe for an operator to immediately dispatch. run() resolves when the connection closes.
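The example above leaves `tenantPubKeyBytes` undefined. One way to derive it from the hex value used in the quick start is sketched below. It assumes `tenantPublicKey` accepts the raw 32-byte Ed25519 public key as a `Uint8Array`; the helper name `parseTenantPublicKeyHex` is hypothetical.

```typescript
// Hypothetical helper: decodes a hex-encoded Ed25519 public key.
// An Ed25519 public key is 32 bytes, i.e. exactly 64 hex characters.
function parseTenantPublicKeyHex(hex: string): Uint8Array {
  const trimmed = hex.trim();
  if (!/^[0-9a-fA-F]{64}$/.test(trimmed)) {
    throw new Error("tenant public key must be exactly 64 hex characters");
  }
  const bytes = new Uint8Array(32);
  for (let i = 0; i < 32; i++) {
    // Each byte is encoded as two hex digits.
    bytes[i] = parseInt(trimmed.slice(2 * i, 2 * i + 2), 16);
  }
  return bytes;
}
```

In a script this could be fed from `process.env.BRAINSTORM_ENDPOINT_TENANT_PUBKEY_HEX`; failing early on a malformed key is easier to debug than a signature check that rejects every envelope.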

Pluggable executor

The default stubExecutor echoes each command's params back as JSON. To exercise more interesting code paths, supply your own:

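const echoExecutor: ToolExecutor = async (ctx) => {
  // ctx: { command_id, tool, params, deadline_ms }
  // Echo the params back, like the default stubExecutor; replace with real work.
  return { exit_code: 0, stdout: JSON.stringify(ctx.params), stderr: "" };
};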

Returning exit_code !== 0 produces a failed CommandResult with code SANDBOX_TOOL_ERROR. Throwing an exception does the same with the error message in error.message.
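The mapping described above can be sketched as a small wrapper. This is an illustration, not the stub's actual code: the local types mirror the shapes shown in this README, the field types on the context are assumptions, and the `"completed"` status name and the `runExecutor` helper are hypothetical.

```typescript
// Local stand-ins for the package's types (field types are assumptions).
interface ToolContext { command_id: string; tool: string; params: unknown; deadline_ms: number }
interface ToolExecutorResult { exit_code: number; stdout: string; stderr: string }
type ToolExecutor = (ctx: ToolContext) => Promise<ToolExecutorResult>;

// "completed" is a hypothetical name for the success status.
type Outcome =
  | { status: "completed"; result: ToolExecutorResult }
  | { status: "failed"; code: "SANDBOX_TOOL_ERROR"; message: string; result?: ToolExecutorResult };

async function runExecutor(executor: ToolExecutor, ctx: ToolContext): Promise<Outcome> {
  try {
    const result = await executor(ctx);
    if (result.exit_code !== 0) {
      // Non-zero exit: failed, with the executor's output preserved.
      return {
        status: "failed",
        code: "SANDBOX_TOOL_ERROR",
        message: `tool exited with code ${result.exit_code}`,
        result,
      };
    }
    return { status: "completed", result };
  } catch (err) {
    // Thrown exception: failed, carrying the error message.
    const message = err instanceof Error ? err.message : String(err);
    return { status: "failed", code: "SANDBOX_TOOL_ERROR", message };
  }
}
```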

Real CHV sandbox executor (`BSM_USE_CHV_EXECUTOR=1`)

The stub ships with a built-in ChvSandboxExecutor that wires the pluggable executor seam to a real ChvSandbox from @brainst0rm/sandbox. When you set BSM_USE_CHV_EXECUTOR=1, the bin constructs a ChvSandboxExecutor from the same env contract first-light.sh uses and hands it to the EndpointStub instead of the default echo-style stubExecutor.

export BSM_USE_CHV_EXECUTOR=1
export BSM_KERNEL=/srv/bsm/sandbox/bsm-sandbox-kernel
export BSM_INITRAMFS=/srv/bsm/sandbox/bsm-sandbox-initramfs   # if modular kernel
export BSM_ROOTFS=/srv/bsm/sandbox/bsm-sandbox-rootfs.img
export BSM_VSOCK_SOCKET=/tmp/bsm-endpoint-stub.sock           # default
export BSM_API_SOCKET=/tmp/bsm-endpoint-stub-api.sock         # default
export BSM_GUEST_PORT=52000                                   # default; matches image-builder vsock-init
# optional: BSM_CH_BIN, BSM_CHREMOTE_BIN to override PATH lookup

# everything below is the standard stub config — unchanged
export BRAINSTORM_RELAY_URL_WS=ws://127.0.0.1:8443
export BRAINSTORM_RELAY_URL_HTTP=http://127.0.0.1:8444
export BRAINSTORM_ENDPOINT_BOOTSTRAP=...
export BRAINSTORM_ENDPOINT_TENANT_ID=tenant-dev
export BRAINSTORM_ENDPOINT_ID=...
export BRAINSTORM_ENDPOINT_TENANT_PUBKEY_HEX=...

brainstorm-endpoint-stub

When the env var is unset (or set to any value other than "1"), the stub falls back to stubExecutor, the existing echo-back behaviour. Turning the real sandbox on and off is therefore a single env flip; nothing else changes about the stub's wiring.
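The strictness of that flag is worth spelling out: only the exact string "1" enables the sandbox. A minimal sketch of the check, with a hypothetical helper name (`shouldUseChvExecutor`) that is not taken from the package:

```typescript
// Hypothetical helper: only the exact string "1" turns the real sandbox on.
// Values such as "true", "yes", or " 1" fall back to the echo-style executor.
function shouldUseChvExecutor(env: Record<string, string | undefined>): boolean {
  return env.BSM_USE_CHV_EXECUTOR === "1";
}
```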

Honest cost: cold-boot-per-dispatch (~600ms latency floor)

The MVP picks the simpler of the two patterns from the design space:

Cold-boot-per-dispatch (what's shipped): boot a fresh ChvSandbox per command, executeTool, shutdown. ~600ms latency floor on Hetzner node-2 per PR #277. Zero steady-state RAM. No shared-state-between-tools concerns. Failure modes are local — a boot failure on one dispatch does not poison subsequent dispatches.
Pool of N pre-booted sandboxes (deferred): take from pool → executeTool → reset → return to pool. ~2-30ms per dispatch (matches the steady-state numbers in PR #277). Higher steady-state RAM. Adds reset machinery on the critical path. We're holding off until we have real dispatch-rate data to size the pool.

Operators dispatching many commands in quick succession will feel the 600ms floor. If your workload needs sub-100ms dispatch latency, do not enable BSM_USE_CHV_EXECUTOR=1 until the pool variant lands.

Error mapping (executor → operator)

| Sandbox event | ToolExecutorResult.exit_code | stderr | EndpointStub maps to |
| --- | --- | --- | --- |
| boot() throws | 126 | chv-executor: sandbox boot failed: … | failed / SANDBOX_TOOL_ERROR |
| executeTool() throws | 125 | chv-executor: sandbox executeTool failed | failed / SANDBOX_TOOL_ERROR |
| executeTool() exit_code != 0 | preserved (faithful) | preserved (faithful) | failed / SANDBOX_TOOL_ERROR |
| shutdown() throws | n/a (logged + swallowed) | n/a | result already produced; not re-reported |

shutdown() always runs, even on the boot-failure path (the Sandbox interface documents shutdown() as idempotent).
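The cold-boot-per-dispatch flow and the error mapping above can be sketched as follows. This is not the shipped ChvSandboxExecutor: `SandboxLike` is a hypothetical stand-in for the real Sandbox interface, the `executeTool` signature is an assumption, and `coldBootDispatch` is an illustrative name. Only the exit codes and stderr prefixes come from the table.

```typescript
interface ToolExecutorResult { exit_code: number; stdout: string; stderr: string }

// Hypothetical stand-in for the Sandbox interface from @brainst0rm/sandbox.
interface SandboxLike {
  boot(): Promise<void>;
  executeTool(tool: string, params: unknown): Promise<ToolExecutorResult>;
  shutdown(): Promise<void>;
}

const describeError = (err: unknown): string =>
  err instanceof Error ? err.message : String(err);

async function coldBootDispatch(
  createSandbox: () => SandboxLike,
  tool: string,
  params: unknown,
  log: (msg: string) => void = console.error,
): Promise<ToolExecutorResult> {
  const sandbox = createSandbox(); // a fresh guest for every command
  try {
    try {
      await sandbox.boot();
    } catch (err) {
      return { exit_code: 126, stdout: "", stderr: `chv-executor: sandbox boot failed: ${describeError(err)}` };
    }
    try {
      // A non-zero exit_code from the tool is passed through unchanged.
      return await sandbox.executeTool(tool, params);
    } catch {
      return { exit_code: 125, stdout: "", stderr: "chv-executor: sandbox executeTool failed" };
    }
  } finally {
    // shutdown() runs on every path, including boot failure; errors are logged and swallowed.
    try {
      await sandbox.shutdown();
    } catch (err) {
      log(`chv-executor: shutdown failed (ignored): ${describeError(err)}`);
    }
  }
}
```

Because the `finally` block never returns a value, a shutdown failure cannot replace a result that was already produced.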

Programmatic usage of the executor

import { ChvSandboxExecutor, EndpointStub } from "@brainst0rm/endpoint-stub";

const executor = new ChvSandboxExecutor({
  config: {
    apiSocketPath: "/tmp/api.sock",
    kernel: { path: "/srv/bsm/sandbox/bsm-sandbox-kernel" },
    rootfs: { path: "/srv/bsm/sandbox/bsm-sandbox-rootfs.img" },
    vsock: { socketPath: "/tmp/vsock.sock", guestPort: 52000 },
  },
});

const stub = new EndpointStub({
  // ...
  executor: executor.execute,
});

Honest gaps in the executor

Per-tool timeout above the sandbox's deadline_ms: the executor does not add a parallel wall-clock fence; the sandbox itself enforces the deadline. If the sandbox's deadline machinery wedges, the executor will wait with it.
Queueing under load: 10 simultaneous dispatches → 10 parallel cold boots. Relay-side serialisation is the current backstop.
Shared image-pool / page-cache priming: every boot reads kernel + initramfs + rootfs from disk. A posix_fadvise(WILLNEED) warmer or shared image cache would reduce IO under burst.
Reset between commands: cold-boot-per-dispatch makes reset moot — each command gets a fresh guest. The pool variant will need to call reset() between dispatches.
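For the first gap, a caller who wants a wall-clock fence today could wrap any executor as sketched below. This is an illustration and not part of the package. It assumes `deadline_ms` is a relative duration in milliseconds, which this README does not confirm; the wrapper name `withWallClockFence`, the grace period, and exit code 124 (borrowed from the `timeout(1)` convention) are all hypothetical.

```typescript
// Local stand-ins for the package's types (field types are assumptions).
interface ToolContext { command_id: string; tool: string; params: unknown; deadline_ms: number }
interface ToolExecutorResult { exit_code: number; stdout: string; stderr: string }
type ToolExecutor = (ctx: ToolContext) => Promise<ToolExecutorResult>;

// Hypothetical wrapper: races the executor against a timer set to deadline + grace.
// Assumption: ctx.deadline_ms is a relative duration in milliseconds.
function withWallClockFence(executor: ToolExecutor, graceMs = 1000): ToolExecutor {
  return async (ctx) => {
    let timer: ReturnType<typeof setTimeout> | undefined;
    const fence = new Promise<ToolExecutorResult>((resolve) => {
      timer = setTimeout(
        () => resolve({ exit_code: 124, stdout: "", stderr: "wall-clock fence exceeded" }),
        ctx.deadline_ms + graceMs,
      );
    });
    try {
      return await Promise.race([executor(ctx), fence]);
    } finally {
      clearTimeout(timer); // do not leave the timer pending after a fast result
    }
  };
}
```

Note that `Promise.race` only stops waiting; it does not cancel the underlying executor, so a wedged sandbox would still need to be torn down separately.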

Protocol contract enforced

The stub verifies, in order, before executing any tool:

Ed25519 signature on the CommandEnvelope against the configured tenantPublicKey (per ed25519-jcs-sha256-v1).
Audience — endpoint: target_endpoint_id must equal this stub's endpoint_id (F5: cross-endpoint envelope replay defense).
Audience — tenant: tenant_id must match the stub's tenant.
Session epoch: session_id must match the current connection's session (F12: relay-restart stale-session defense).
Time skew: issued_at must be within ±60 s of the endpoint's wall clock.
Expiry: expires_at must be in the future.
Lifetime cap: expires_at − issued_at must not exceed 5 min.
Nonce uniqueness (in-memory only — see "out of scope" below): the same nonce cannot be replayed within a single stub process.

A failure emits an ErrorEvent with one of: ENDPOINT_SIGNATURE_INVALID, ENDPOINT_WRONG_AUDIENCE, ENDPOINT_SESSION_STALE, ENDPOINT_ENVELOPE_EXPIRED, ENDPOINT_NONCE_REPLAY.
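The non-cryptographic checks, in the order listed above, can be sketched as a single function. This is an illustration rather than the stub's source. It omits signature verification (which runs first), assumes `issued_at` and `expires_at` are ISO 8601 strings, and assumes that skew and lifetime-cap failures map to ENDPOINT_ENVELOPE_EXPIRED, which this README does not state. The type and function names are hypothetical.

```typescript
// Hypothetical shapes; timestamp fields are ASSUMED to be ISO 8601 strings.
interface EnvelopeFields {
  target_endpoint_id: string;
  tenant_id: string;
  session_id: string;
  issued_at: string;
  expires_at: string;
  nonce: string;
}
interface EndpointState { endpointId: string; tenantId: string; sessionId: string; seenNonces: Set<string> }
type CheckResult = { ok: true } | { ok: false; code: string };

const MAX_SKEW_MS = 60 * 1000;         // ±60 s
const MAX_LIFETIME_MS = 5 * 60 * 1000; // 5 min

function checkEnvelope(env: EnvelopeFields, state: EndpointState, nowMs: number = Date.now()): CheckResult {
  const fail = (code: string): CheckResult => ({ ok: false, code });
  // Signature verification would run before any of these checks; omitted here.
  if (env.target_endpoint_id !== state.endpointId) return fail("ENDPOINT_WRONG_AUDIENCE");
  if (env.tenant_id !== state.tenantId) return fail("ENDPOINT_WRONG_AUDIENCE");
  if (env.session_id !== state.sessionId) return fail("ENDPOINT_SESSION_STALE");
  const issued = Date.parse(env.issued_at);
  const expires = Date.parse(env.expires_at);
  // ASSUMPTION: malformed timestamps, skew, and lifetime-cap failures all use this code.
  if (Number.isNaN(issued) || Number.isNaN(expires)) return fail("ENDPOINT_ENVELOPE_EXPIRED");
  if (Math.abs(nowMs - issued) > MAX_SKEW_MS) return fail("ENDPOINT_ENVELOPE_EXPIRED");
  if (expires <= nowMs) return fail("ENDPOINT_ENVELOPE_EXPIRED");
  if (expires - issued > MAX_LIFETIME_MS) return fail("ENDPOINT_ENVELOPE_EXPIRED");
  if (state.seenNonces.has(env.nonce)) return fail("ENDPOINT_NONCE_REPLAY");
  state.seenNonces.add(env.nonce); // record the nonce only once every other check has passed
  return { ok: true };
}
```

Recording the nonce last means a rejected envelope does not consume a slot in the in-memory set.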

After verification the stub sends CommandAck before invoking the executor, matching the protocol's dispatched → started transition contract.

Explicit out-of-scope (production agent's job)

Persistent nonce store that survives restart. The stub uses an in-memory Set<string>; a process restart resets it. The production agent must use a SQLite-backed nonce store with a NONCE_CACHE_FULL fail-closed policy.
signing_key_id lookup / revocation. The stub trusts the single tenantPublicKey passed in. Production must look up the key by signing_key_id and check a revocation list.
Atomic identity-file writes. loadOrCreateIdentity uses writeFileSync(..., { mode: 0o600 }) — there is no temp-file + rename. A crash between writeFileSync start and OS sync could leave a partial JSON file. The threat model accepts this for the laptop-loopback host.
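For the last item, the temp-file + rename pattern that the stub skips could look roughly like the sketch below. It is not code from this package: `writeFileAtomic` is a hypothetical helper built only on Node's `fs` and `path` modules, and it assumes the temp file and the target live on the same filesystem so that the rename is atomic.

```typescript
import { closeSync, fsyncSync, openSync, renameSync, writeSync } from "node:fs";
import { basename, dirname, join } from "node:path";

// Hypothetical helper: write to a sibling temp file, flush it, then rename over the target.
// Readers see either the old file or the complete new one, never a partial write.
function writeFileAtomic(targetPath: string, contents: string): void {
  const tmpPath = join(dirname(targetPath), `.${basename(targetPath)}.${process.pid}.tmp`);
  // "wx" fails if the temp file already exists; 0o600 keeps the key material owner-only.
  const fd = openSync(tmpPath, "wx", 0o600);
  try {
    writeSync(fd, contents);
    fsyncSync(fd); // flush file data to disk before the rename makes it visible
  } finally {
    closeSync(fd);
  }
  renameSync(tmpPath, targetPath);
}
```

A fully durable variant would also fsync the containing directory after the rename, and would remove a leftover temp file from a previous crash before retrying.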

Tests

npm test --workspace=@brainst0rm/endpoint-stub

Tests stand up a real relay (WebSocket + enrollment HTTP) on loopback, point a real EndpointStub at it, drive a dispatch from a fake operator, and verify all 7 protocol-correctness invariants end-to-end.
