@agentmark-ai/sdk

v2.0.5

Published

a day ago

SDK for communicating with the Agentmark hosted platform

0High
0Medium
0Low

rrandall

kashi11

AgentMark SDK

The SDK for tracing LLM calls, running experiments, and integrating with AgentMark Cloud. Built on OpenTelemetry.

Installation

npm install @agentmark-ai/sdk

Quick Start

import { AgentMarkSDK, span } from "@agentmark-ai/sdk";

// Initialize the SDK with your API key
const sdk = new AgentMarkSDK({
  apiKey: process.env.AGENTMARK_API_KEY!,
  appId: process.env.AGENTMARK_APP_ID!,
});

// Start the OpenTelemetry tracer
sdk.initTracing();

// Wrap any LLM call in a span
const { result, traceId } = await span(
  { name: "customer-support", userId: "user-123" },
  async (ctx) => {
    // Your LLM call here — works with any SDK
    const response = await generateText({ /* ... */ });

    // Create child spans for sub-operations
    await ctx.span({ name: "save-to-db" }, async () => {
      await db.saveResponse(response);
    });

    return response;
  }
);

console.log(`Trace: ${traceId}`);

API

`AgentMarkSDK`

Main SDK class for initialization and cloud integration.

const sdk = new AgentMarkSDK({
  apiKey: string;    // Your AgentMark API key
  appId: string;     // Your AgentMark app ID
  baseUrl?: string;  // Custom API URL (default: https://api.agentmark.co)
  mask?: MaskFunction; // Redact sensitive data before spans are exported
});

Methods:

sdk.initTracing(options?) — Start the OpenTelemetry tracer. Options: { disableBatch?: boolean, registerGlobally?: boolean }.
sdk.getApiLoader() — Get an ApiLoader instance for loading prompts from AgentMark Cloud.
sdk.score(props) — Submit an evaluation score for a trace.
sdk.runExperiment(options) — Run a dataset through a task with evaluators, score thresholds, and a regression gate. Use experimentResultToJUnit to emit a JUnit report for CI.

`span(options, fn)`

Create a root span. Returns { result, traceId }.

const { result, traceId } = await span(
  {
    name: "my-span",           // Required
    userId: "user-123",       // Optional: associate with a user
    sessionId: "session-456", // Optional: group related traces
    sessionName: "chat",      // Optional: human-readable session name
    metadata: { env: "prod" }, // Optional: key-value metadata
  },
  async (ctx) => {
    // ctx.traceId — the trace ID
    // ctx.spanId — the root span ID
    // ctx.setAttribute(key, value) — set span attributes
    // ctx.addEvent(name, attributes?) — add span events
    // ctx.span(options, fn) — create child spans
    return await doWork();
  }
);

`ctx.span(options, fn)`

Create a child span within a trace. Available on the SpanContext passed to span() and nested span() callbacks.

await span({ name: "request" }, async (ctx) => {
  const user = await ctx.span({ name: "fetch-user" }, async (spanCtx) => {
    return await db.getUser(id);
  });

  await ctx.span({ name: "generate-response" }, async (spanCtx) => {
    return await llm.generate({ user });
  });
});

`ApiLoader`

Re-exported from @agentmark-ai/loader-api for convenience. Load prompts from AgentMark Cloud or a local dev server.

// Cloud loader (via SDK)
const loader = sdk.getApiLoader();

// Or create directly
import { ApiLoader } from "@agentmark-ai/sdk";

const cloudLoader = ApiLoader.cloud({
  apiKey: "...",
  appId: "...",
});

const localLoader = ApiLoader.local({
  port: 9418,
});

Also exported

trace(options, fn) — alias for span(); use it for root spans when the name reads better.
observe(fn, options?) / SpanKind — wrap a function so every call is traced.
streamWithSpan(...) — trace streaming LLM responses.
createPiiMasker(config) — build a mask function that redacts emails, phone numbers, and other PII before spans leave the process. See PII masking.
createWebhookRunner(options) — handle cloud-dispatched prompt and experiment runs in your own infrastructure.

Documentation

Full documentation at docs.agentmark.co.

License

AGPL-3.0-or-later

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme