# @goatlab/delphi-core

v0.6.0 · Distributed agent workflow engine for the Goat ecosystem
Durable workflow engine for TypeScript. Runs step DAGs over Postgres with exactly-once external actions, human-in-the-loop, and queue-first ingestion capable of 5k+ req/s on modest hardware. No Redis required.
## What it is
A TypeScript workflow engine designed for agent orchestration. You define workflows as DAGs of steps; the engine handles scheduling, retries, step-level state, human approval gates, distributed execution, event ingestion, budgets, and lineage tracking.
Postgres is the only required infrastructure — it holds all durable state and serves as the dispatch queue via FOR UPDATE SKIP LOCKED. For workloads above ~5k req/s, Redis can be added with a single config line.
## Install

```sh
pnpm add @goatlab/delphi-core pg
```

Requires Postgres 14+. Redis is optional (install `@goatlab/tasks-adapter-bullmq` if needed).
## Quick start

```ts
import { Workflow, FunctionStep, step, createEngine } from '@goatlab/delphi-core'

// 1. Define steps — just input and output types
class VerifyStep extends FunctionStep<{ orderId: string }, { token: string; amount: number }> {
  stepName = 'verify' as const
  async handle(input) {
    return { output: { token: 'tok_abc', amount: 4200 } }
  }
}

class ChargeStep extends FunctionStep<{ token: string; amount: number }, { chargeId: string }> {
  stepName = 'charge' as const
  async handle(input) {
    return { output: { chargeId: `ch_${input.token}` } }
  }
}

// 2. Define a workflow — steps as classes, auto-pass wires outputs → inputs
class PaymentWorkflow extends Workflow<{ orderId: string }> {
  workflowName = 'payment' as const
  steps = [
    VerifyStep,                                    // root — gets trigger input
    step(ChargeStep, { dependsOn: [VerifyStep] }), // auto-pass — gets verify's output
  ] as const
  // TypeScript verifies: VerifyStep output { token, amount } satisfies ChargeStep input { token, amount } ✓
  // If types don't match, the compiler requires mapInput to bridge them.
}

// 3. Create the engine — one connection string
const engine = createEngine({
  database: process.env.DATABASE_URL,
  workflows: [PaymentWorkflow] as const,
  tenantId: 'default',
})

// 4. Start — fire-and-forget, returns a handle
const { runId } = await engine.payment.start({ orderId: 'ord_123' })
// Workflow runs in the background. Use runId to check status later.
```

Just Postgres, classes, and types.
## What `database` accepts

```ts
// Connection string (engine creates the pool)
createEngine({ database: 'postgresql://user:pass@localhost:5432/mydb', ... })

// Existing pg.Pool (shares your backend's pool — no duplicate connections)
createEngine({ database: existingPool, ... })
```

## Adding Redis for high throughput

```ts
createEngine({
  database: existingPool,
  redis: existingRedisConnection, // opt-in — dispatch goes through BullMQ
  workflows: [...],
  tenantId: 'default',
})
```

## How Postgres dispatch works
The engine uses `FOR UPDATE SKIP LOCKED` to turn the `workflow_steps` table into a work queue — no external broker needed. On startup, it auto-detects whether LISTEN/NOTIFY is available:

```
listen() called
  ↓
Self-test: LISTEN → NOTIFY → wait 3s
  ↓
Received?     → "LISTEN/NOTIFY available — low-latency dispatch (~1ms)"
                Dedicated connection listens for step-queued notifications.
  ↓
Not received? → "Proxy detected — polling dispatch (~500ms)"
                Pure FOR UPDATE SKIP LOCKED polling with adaptive backoff.
```

Zero config. If you're connecting directly to Postgres, you get instant dispatch. If PgBouncer (transaction mode) or another proxy sits in between, the self-test fails silently and the engine falls back to polling. You don't need to know your infra topology.
## Tuning dispatch

```ts
createEngine({
  database: existingPool,
  dispatch: {
    pollingIntervalMs: 200,     // how often to poll for queued steps (default 500)
    maxPollingIntervalMs: 5000, // adaptive backoff ceiling (default 30_000)
  },
  workflows: [...],
  tenantId: 'default',
})
```

Under load, the poller adapts automatically — the interval increases on contention, decreases on activity, and grows slowly when idle.
## Library vs service mode

**Library mode** (default): import @goatlab/delphi-core into your app, construct the engine once, and call its methods from your handlers. No new HTTP surface, no extra process.

```ts
app.post('/api/orders/:id/checkout', requireAuth, async (req, res) => {
  const { runId } = await engine.payment_critical.startCommitted(
    { orderId: req.params.id, amountCents: req.body.amountCents },
    { idempotencyKey: `checkout-${req.params.id}` },
  )
  res.json({ runId })
})
```

**Service mode**: for when multiple client apps need to start workflows against a shared engine. Mount via @goatlab/delphi-express. The HTTP boundary becomes the contract; wire your own auth middleware.

| | Library mode | Service mode |
|---|---|---|
| Single Node app | use this | overkill |
| Multiple client apps | — | use this |
| Auth | reuse your app's | required at the edge |
## Core concepts

### Workflow & step

A workflow is a DAG of steps. Steps are pure — they declare their input/output types and know nothing about their DAG position. Workflows wire steps together via `dependsOn`.

```ts
// Simple: raw classes, no wiring needed (single step or linear chain with auto-pass)
steps = [FetchStep, TransformStep, SaveStep] as const

// DAG: step() wrapper only when you need dependsOn or mapInput
steps = [
  VerifyStep,                                    // root — gets trigger input
  step(ChargeStep, { dependsOn: [VerifyStep] }), // auto-pass — gets verify's output
  step(NotifyStep, {                             // explicit mapping — field names differ
    dependsOn: [VerifyStep, ChargeStep],
    mapInput: (up) => ({ email: up.verify.email, chargeId: up.charge.chargeId }),
  }),
] as const
```

Auto-pass is type-safe: if the upstream output doesn't satisfy the downstream input, TypeScript requires mapInput — you get a compile error, not a runtime surprise.
### State

Runs and steps live in Postgres. The state machine enforces valid transitions (`PENDING → QUEUED → RUNNING → COMPLETED | FAILED | WAITING_HUMAN | SLEEPING`). `deriveWorkflowStatus` computes run status from step statuses.
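The derivation can be pictured as a precedence scan over step statuses. This is a sketch of one plausible precedence order, not `deriveWorkflowStatus` itself: the exact rules (e.g. how `WAITING_HUMAN` ranks against `FAILED`) are assumptions.

```typescript
// Sketch: derive a run-level status from its steps' statuses.
// Precedence order is an assumption, not delphi-core's actual logic.
type StepStatus =
  | 'PENDING' | 'QUEUED' | 'RUNNING'
  | 'COMPLETED' | 'FAILED' | 'WAITING_HUMAN' | 'SLEEPING'

function deriveStatusSketch(steps: StepStatus[]): string {
  if (steps.some((s) => s === 'FAILED')) return 'FAILED'               // any terminal failure fails the run
  if (steps.some((s) => s === 'WAITING_HUMAN')) return 'WAITING_HUMAN' // blocked on a human gate
  if (steps.length > 0 && steps.every((s) => s === 'COMPLETED')) return 'COMPLETED'
  if (steps.some((s) => s === 'RUNNING' || s === 'QUEUED' || s === 'SLEEPING')) return 'RUNNING'
  return 'PENDING'
}
```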
### Execution

On `engine.start()`, the engine inserts the run + steps and dispatches root steps. In Postgres-only mode, the PgConnector polls for QUEUED steps via `FOR UPDATE SKIP LOCKED`, marks them RUNNING, and calls the handler. The engine then advances the DAG by finding the next ready steps.
### Starting workflows (fire-and-forget)

All three start methods return a runId handle immediately — none wait for the workflow to complete. The actual step execution happens in the background.

```ts
// start() — waits for PG INSERT (~5-20ms), then returns
const { runId } = await engine.payment.start({ amount: 42 })

// startBuffered() — returns from in-memory buffer (~0-1ms, no await needed)
const { runId } = engine.event_ingest.startBuffered({ payload: 'hi' })

// startCommitted() — waits for PG COMMIT + fsync (~20-80ms)
const { runId } = await engine.payment.startCommitted({ amount: 42 })
```

### Getting results
```ts
// Poll
const status = await engine.payment.getStatus(runId)
if (status.status === 'COMPLETED') { /* done */ }

// SSE (realtime push)
const es = new EventSource(`/api/workflows/events/${runId}`)

// Callback (on the workflow class)
class PaymentWorkflow extends Workflow<{ amount: number }> {
  workflowName = 'payment' as const
  steps = [ChargeStep] as const
  onComplete = async (ctx) => {
    // fires when all steps finish
  }
}
```

### Workflow durability (buffered vs committed)
Each workflow declares its durability guarantee — what "accepted" means when the start call returns:

```ts
// Buffered (default) — ~1-2ms response, PG write is async
class EventWorkflow extends Workflow<{ payload: string }> {
  workflowName = 'event_ingest' as const
  steps = [step(processStep)] as const
}

// Committed — response means "row is fsync'd to Postgres"
class PaymentWorkflow extends Workflow<{ amount: number }> {
  workflowName = 'payment' as const
  override durability = 'committed' as const
  steps = [step(chargeStep)] as const
}
```

### Input validation
Workflows can declare a schema (Zod-compatible) for compile-time + runtime input validation:

```ts
import { z } from 'zod'

class PaymentWorkflow extends Workflow<{ amount: number }> {
  workflowName = 'payment' as const
  override inputSchema = z.object({ amount: z.number().positive() })
  steps = [step(chargeStep)] as const
}
```

### External actions (exactly-once)
`engine.externalActions.run({...})` wraps calls to external systems (GitHub, Linear, Stripe) with idempotency dedup, rate limiting, and an audit trail. Safe to retry.
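The dedup half of "exactly-once" can be sketched in a few lines. This is not the ExternalActionExecutor's implementation (which persists results to Postgres and adds rate limiting and auditing); the class name, API shape, and in-memory store here are assumptions for illustration.

```typescript
// Sketch of idempotency dedup: the first call with a given key runs the
// side effect and records its result; any retry with the same key replays
// the recorded result without re-running the side effect.
class DedupSketch {
  private results = new Map<string, unknown>() // real implementation would persist this

  async run<T>(idempotencyKey: string, action: () => Promise<T>): Promise<T> {
    if (this.results.has(idempotencyKey)) {
      return this.results.get(idempotencyKey) as T // replay, skip the side effect
    }
    const result = await action()
    this.results.set(idempotencyKey, result)
    return result
  }
}
```

This is why retrying a failed workflow is safe: a step that already charged a card replays the recorded charge id instead of charging again.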
### Transactional steps

When a step is marked transactional, the engine wraps the step handler and the result recording in a single Postgres transaction. App writes via `ctx.tx` are atomic with step completion — COMMIT means both happened, ROLLBACK means neither did.
```ts
class CreateOrderStep extends FunctionStep<
  { productId: string; qty: number },
  { orderId: string }
> {
  stepName = 'create_order' as const
  transactional = true // class-level default

  async handle(input, ctx: StepExecutionContext) {
    // ctx.tx is a pg PoolClient inside the same transaction
    const { rows } = await ctx.tx!.query(
      'INSERT INTO orders (product_id, qty) VALUES ($1, $2) RETURNING id',
      [input.productId, input.qty],
    )
    return { output: { orderId: rows[0].id } }
  }
}
```

The transactional flag can be set at two levels, with step() overriding the class:
```ts
// Class-level: this step is always transactional
class LedgerStep extends FunctionStep<...> {
  transactional = true
}

// step()-level: override per workflow wiring
steps = [
  step(LedgerStep, { transactional: false }), // override class default
  step(NotifyStep, { transactional: true }),  // make a non-transactional step transactional
]
```

Precedence: step() flag > class flag > false.

Constraints: transactional steps must be short-lived DB operations (the transaction stays open for the entire handler execution). Not suitable for external HTTP calls, AI steps, or anything that takes seconds — those should use the default path with externalActions for idempotency.
### Durability × transactional: independent axes

Workflow durability (buffered/committed) and step transactional are orthogonal — one governs ingestion, the other execution:

| Concept | What it controls | Layer |
|---|---|---|
| buffered | How the workflow starts — HTTP returns before PG COMMIT | Ingestion |
| committed | How the workflow starts — HTTP returns after PG fsync | Ingestion |
| transactional | How a step executes — app writes + step result in one PG tx | Execution |

They compose naturally:

| Combination | Behavior |
|---|---|
| buffered + normal | Default. Fast start, separate execution. |
| buffered + transactional | Start is async (could be lost in crash window), but once the step runs, app writes + completion are atomic. |
| committed + normal | Start is durable (fsync'd), but step execution and recording are separate operations. |
| committed + transactional | Strongest. Start is durable, step execution is atomic. Zero gaps. |

Use committed + transactional for financial flows where both "trigger accepted" and "step executed" must be all-or-nothing. Use buffered + transactional for high-volume events where ingestion loss is acceptable but processing must be atomic.
### Retry backoff

By default, failed steps retry immediately. For steps calling external APIs (rate-limited endpoints, push notifications, social APIs), configure backoff to avoid hammering a failing service:

```ts
class PushNotifyStep extends FunctionStep<{ userId: string }, { sent: boolean }> {
  stepName = 'push_notify' as const
  retries = 5
  backoff = { type: 'exponential' as const, delayMs: 1000, maxDelayMs: 60_000 }

  async handle(input) {
    await expo.sendPushNotification(input.userId)
    return { output: { sent: true } }
  }
}
```

| Option | Default | Description |
|--------|---------|-------------|
| type | — | 'exponential' (doubles each attempt) or 'fixed' (constant delay) |
| delayMs | 1000 | Base delay before first retry |
| maxDelayMs | 60000 | Cap for exponential growth |
| multiplier | 2 | Exponential multiplier per attempt |

Delays include ±25% jitter to prevent thundering herd. Under the hood, retryAfterMs is stored on the step row — the PgConnector poll skips steps whose retry time hasn't arrived yet. No extra polling loop or scheduler needed.
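The delay math described by the option table can be sketched as follows. This is an illustration of the documented behavior (exponential growth, cap, ±25% jitter), not the exported `computeRetryDelay`, whose exact signature may differ.

```typescript
// Sketch of backoff delay: base grows by `multiplier` per attempt (or stays
// fixed), is capped at maxDelayMs, then gets ±25% jitter. The formula is
// inferred from the option table above, not copied from delphi-core.
type BackoffCfg = {
  type: 'exponential' | 'fixed'
  delayMs: number
  maxDelayMs: number
  multiplier: number
}

function backoffDelaySketch(
  attempt: number, // 1-based retry attempt
  cfg: BackoffCfg = { type: 'exponential', delayMs: 1000, maxDelayMs: 60_000, multiplier: 2 },
  random: () => number = Math.random, // injectable for testing
): number {
  const base =
    cfg.type === 'exponential'
      ? Math.min(cfg.delayMs * cfg.multiplier ** (attempt - 1), cfg.maxDelayMs)
      : cfg.delayMs
  const jitter = 0.75 + random() * 0.5 // uniform factor in [0.75, 1.25)
  return Math.round(base * jitter)
}
```

With the defaults, retries land near 1s, 2s, 4s, 8s, 16s, capped at 60s, each nudged by jitter so a fleet of failing steps doesn't retry in lockstep.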
### Saga rollback

When a workflow fails terminally, you often need to undo side effects from steps that already completed — refund a charge, unreserve inventory, revoke an API key. Define a rollback() method on any step:

```ts
class ChargeCardStep extends FunctionStep<
  { customerId: string; amount: number },
  { chargeId: string }
> {
  stepName = 'charge_card' as const

  async handle(input) {
    const charge = await stripe.charges.create({ amount: input.amount, customer: input.customerId })
    return { output: { chargeId: charge.id } }
  }

  async rollback(input, output) {
    await stripe.refunds.create({ charge: output.chargeId })
  }
}
```

When the workflow transitions to FAILED (a downstream step exhausted its retries), the engine automatically calls rollback() on all completed steps that define it, in reverse topological order — last-completed first.
History is append-only. Step statuses stay COMPLETED — the engine never rewrites history. Rollback actions are logged as rollback_started, rollback_completed, or rollback_failed events in workflow_step_logs. If a rollback itself throws, the error is logged and remaining rollbacks continue (best-effort).
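The compensation pass above can be sketched as a best-effort loop over completed steps. The types and log format here are illustrative, matching the described event names but not delphi-core's internal data structures.

```typescript
// Sketch of the rollback pass: completed steps are compensated
// last-completed first; a throwing rollback is logged and the pass
// continues (best-effort), mirroring the behavior described above.
interface CompletedStep {
  name: string
  completedAt: number // epoch ms
  rollback?: () => Promise<void>
}

async function rollbackSketch(steps: CompletedStep[]): Promise<string[]> {
  const log: string[] = []
  const ordered = [...steps].sort((a, b) => b.completedAt - a.completedAt) // reverse completion order
  for (const s of ordered) {
    if (!s.rollback) continue // steps without rollback() keep their side effects
    try {
      await s.rollback()
      log.push(`rollback_completed:${s.name}`)
    } catch {
      log.push(`rollback_failed:${s.name}`) // logged; remaining rollbacks still run
    }
  }
  return log
}
```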
### Human-in-the-loop

A step returning `{ waitForHuman: { prompt, schema } }` transitions to WAITING_HUMAN. Resume via `engine.submitHumanInput(...)`.
### Durable sleep

`engine.durableSleep(runId, stepName, tenantId, durationMs)` persists the wake time to Postgres. Survives process restarts — on recovery, sleeps only the remaining duration.
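The recovery math is simple but worth making explicit: because the absolute wake time (not the duration) is persisted, a restart can recompute what's left. A sketch, with names that are illustrative rather than delphi-core's:

```typescript
// Sketch: persist wakeAtMs = now + durationMs; on recovery, sleep only
// max(0, wakeAtMs - now). A sleep whose wake time has already passed
// resumes immediately instead of sleeping again.
function remainingSleepSketch(wakeAtMs: number, nowMs: number): number {
  return Math.max(0, wakeAtMs - nowMs)
}
```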
### Workflow forking

`engine.forkWorkflow(runId, tenantId, fromStepName)` creates a new run preserving completed step outputs up to the fork point. Enables retry-from-failure and A/B testing.
### Workflow versioning

In-flight workflows are frozen to their original definition. When you deploy new code that changes a workflow's steps, retries, or config, running workflows continue with the definition they started with — no mid-flight surprises.

The engine stores a `definitionSnapshot` at workflow start. On step completion/failure, `getDefinitionForRun()` deserializes the snapshot instead of reading the live registry. Non-serializable callbacks (`condition`, `mapInput`, `onComplete`, `onFail`) are merged from the live registry if the workflow still exists, so code changes to those callbacks take effect immediately.
### Workflow streaming

`engine.writeStream(runId, key, value)` appends to a durable, offset-based stream. Consumers read via `engine.readStream(runId, key, fromOffset)`. Close with `engine.closeStream(runId, key)`.
### Delayed execution

Start workflows in the future: `engine.start({ ..., delaySeconds: 60 })`. The run enters DELAYED status and auto-transitions to RUNNING when the delay expires.
### Cron scheduling

Schedule recurring workflows directly from the typed engine proxy. Multi-pod safe — uses FOR UPDATE SKIP LOCKED so only one instance fires each schedule. Timezone-aware — DST transitions are handled automatically.

```ts
// Schedule a workflow to run every day at 9am New York time
await engine.daily_report.schedule({
  cron: '0 9 * * *',
  timezone: 'America/New_York', // IANA timezone — DST-aware
  input: { region: 'us-east' },
})

// Fire immediately on deploy, then follow the cron
await engine.daily_report.schedule({
  cron: '0 9 * * *',
  timezone: 'America/New_York',
  runOnInit: true,
  input: { region: 'us-east' },
})

// List active schedules
const schedules = await engine.daily_report.listSchedules()

// Remove a schedule
await engine.daily_report.unschedule(scheduleId)

// Start the scheduler polling loop (call once at app startup)
engine.scheduler.start()
```

## Cross-tenant dispatch
When a single backend serves multiple tenants (each with its own database), the dispatcher coordinates work across all tenants with O(1) persistent connections — no matter how many tenants you have.

**The problem:** traditional BullMQ requires one persistent Worker per tenant per queue. At 100 tenants × 5 queues = 500 Redis connections. That doesn't scale.

**The solution:** a lightweight hint-based dispatch system. One listener catches all hints and fires HTTP requests to drain each tenant's queue on demand. Connections are ephemeral — created per dispatch cycle and closed immediately.

```
Job enqueued → onAfterQueue fires hint → single listener → HTTP POST → resolveTenant → drain queue → done
```

### Two APIs, two lifecycles
`createDispatcher()` — process-level singleton (one per backend):

```ts
import { createDispatcher } from '@goatlab/delphi-core'

const dispatcher = createDispatcher({
  // Hint transport: Redis (BullMQ) or Postgres (LISTEN/NOTIFY)
  redis: platformRedisConnection,
  // OR: database: platformDbClient,

  // Where the dispatch HTTP endpoint lives (external URL for Cloud Run scaling)
  dispatchUrl: 'https://api.myapp.com/dispatch/worker',

  // Consumer provides tenant resolution (DI boundary)
  resolveTenant: async (tenantId) => {
    return getEngineForTenant(tenantId) // your DI / container logic
  },
  listTenants: async () => getTenantIds(),
})
```

`createEngine({ dispatcher })` — per-tenant, wires hints automatically:
```ts
// Each tenant's engine auto-fires hints through the dispatcher
const engine = createEngine({
  database: tenantPool,
  workflows: [PaymentWorkflow, OnboardingWorkflow] as const,
  tenantId: 'tenant-abc',
  redis: tenantRedisConnection, // or omit for PG-only
  dispatcher,                   // wires onAfterQueue automatically
})
```

### Dispatch handler
Mount on your HTTP server — it responds 202 immediately and processes in the background:

```ts
app.post('/dispatch/worker', dispatcher.handler)
```

### Startup sequence
```ts
// 1. Start HTTP server
app.listen(port)

// 2. Start dispatcher (AFTER server is accepting requests)
await dispatcher.start()

// 3. Sync cron schedules across all tenants
await dispatcher.syncSchedules('prod')

// Shutdown
await dispatcher.stop()
```

### Declarative cron schedules
Instead of centralized job config arrays, declare schedules on workflows:

```ts
class DailyReportWorkflow extends Workflow<{ region: string }> {
  workflowName = 'daily_report' as const
  schedule = {
    cron: '0 9 * * *',            // cron expression
    timezone: 'America/New_York', // IANA timezone (default: 'UTC')
    runOnInit: true,              // fire on first deploy
    input: { region: 'us-east' }, // default input per run
    environments: ['prod'],       // only in production
    tenants: ['platform'],        // only for specific tenants
  }
  steps = [step(GenerateReportStep)] as const
}
```

`dispatcher.syncSchedules()` reads these declarations and registers them across all tenants. The timezone field accepts any IANA timezone identifier (America/New_York, Europe/Stockholm, etc.) with full autocomplete support via the IANATimezone type.
### Hint transport options
| Transport | Config | When to use |
|-----------|--------|-------------|
| Redis | redis: connection | You already have Redis. Lower latency (~1ms). |
| Postgres | database: dbClient | No Redis in your stack. Uses LISTEN/NOTIFY + polling. |
Both transports are independent of each tenant's dispatch backend — a tenant can use PgConnector while hints travel through Redis, or vice versa.
### When to use the dispatcher
| Setup | Use dispatcher? |
|-------|----------------|
| Single tenant, single process | No — createEngine() alone is enough |
| Single tenant, multiple processes | No — PgConnector's FOR UPDATE SKIP LOCKED handles it |
| Multi-tenant, shared backend | Yes — avoids O(N×M) persistent connections |
| Multi-tenant, Cloud Run | Yes — HTTP dispatch triggers auto-scaling |
## Signals & events

- `engine.signal(runId, signalName, data)` — signal a running workflow
- `EventIngestionService` — idempotent event ingest with ordering, trigger matching, stale-event skipping
## Budgets

Per-run limits on tokens, cost, step count, and task executions. Enforced atomically on every step completion.
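The enforcement invariant can be sketched as a single check-and-consume. In delphi-core the update happens atomically in Postgres (see the `jsonb_set` note under Key optimizations); the in-memory version below only illustrates the invariant, and its types are assumptions.

```typescript
// Sketch: budget check and consumption in one operation, so a step that
// would push usage over the limit is rejected before its cost is recorded.
interface BudgetSketch {
  maxTokens: number
  usedTokens: number
}

function consumeTokensSketch(budget: BudgetSketch, tokens: number): boolean {
  if (budget.usedTokens + tokens > budget.maxTokens) return false // over budget — reject
  budget.usedTokens += tokens
  return true
}
```

Doing the check and the update as one atomic operation (rather than SELECT then UPDATE) is what prevents two concurrent step completions from each seeing headroom and jointly blowing the budget.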
## Lineage

Every run carries a traceId. `engine.getTrace(traceId)` returns the full set of runs, events, and external actions sharing that trace.
## Timeout enforcement

`engine.sweepTimedOutSteps()` and `engine.sweepTimedOutWorkflows()` find and fail steps/runs that exceed their configured timeouts.
## Worker nodes

Registered WorkerNode instances can pull step payloads via HTTP long-poll, execute remotely, and post results back. Label-based routing (`requiresLabels`) ensures steps land on capable workers.
## API surface

### Typed proxy (compile-time safe)

```ts
// Start (fire-and-forget — returns runId, not completion)
const { runId } = await engine.payment.start(input, { idempotencyKey })
const { runId } = engine.event_ingest.startBuffered(input)   // no await
const { runId } = await engine.payment.startCommitted(input) // waits for PG fsync

// Check status / interact with running workflows
await engine.payment.getStatus(runId)
await engine.payment.cancel(runId)
await engine.payment.signal(runId, 'approved', { reviewer: 'alice' })
await engine.payment.submitHumanInput(runId, 'review', { approved: true })
```

### String-based handlers (for HTTP adapters)
```ts
import { createWorkflowHandlers } from '@goatlab/delphi-core'

const handlers = createWorkflowHandlers(engine)
await handlers.start({ workflowName, tenantId, input })
await handlers.getStatus({ runId, tenantId })
await handlers.cancel({ runId, tenantId })
await handlers.signal({ runId, tenantId, signalName, data })
await handlers.listWorkflows({ tenantId })
```

## Performance
All numbers from test-server/server.ts against a fresh stack (Docker Desktop, M-series Mac, PG + Redis colocated, CLUSTER_MODE=2). Production with managed PG will be faster.
### Postgres-only vs Redis dispatch
delphi-core defaults to Postgres-only dispatch (no Redis required). Redis can be added via redis option for workloads above ~5k req/s.
### Buffered durability (CLUSTER_MODE=2)

| Metric | PG-only | Redis | Notes |
|---|---|---|---|
| Avg RPS | 1,965/s | 1,975/s | identical |
| p50 | 1ms | 1ms | identical |
| p95 | 38ms | 63ms | PG-only lower tail |
| p99 | 134ms | 216ms | PG-only lower tail |
| Errors | 0% | 0% | |
### Committed durability (CLUSTER_MODE=2)

| Metric | PG-only | Redis | Notes |
|---|---|---|---|
| Avg RPS | 741/s | 631/s | PG-only 17% faster |
| p50 | 27ms | 74ms | PG-only 2.7x faster |
| p95 | 273ms | 1,310ms | PG-only 4.8x faster |
| Errors | 0% | 0% | |
### Breakpoint — where does Redis start winning?

| Target | Redis (sustained) | PG-only (sustained) | Winner |
|---|---|---|---|
| 1k | 2,063/s | 4,240/s | PG-only |
| 3k | 2,791/s | 2,841/s | tie |
| 5k | 4,821/s | 4,818/s | crossover |
| 8k | 6,976/s | 4,118/s | Redis +69% |
| 10k | 7,806/s | 5,319/s | Redis +47% |
### When to use what
| Scale | Recommendation | Why |
|---|---|---|
| < 5,000 req/s | Postgres-only (default) | Same performance, one less dependency |
| 5,000 - 10,000 req/s | Either | Redis starts showing marginal benefit |
| > 10,000 req/s | Add Redis (redis option) | PG pool becomes the bottleneck |
Most agent/workflow workloads are well under 5k req/s (each AI step takes 2-30s). Postgres-only is the right default.
### Reproduce

```sh
# PG-only
DISPATCH_MODE=pg CLUSTER_MODE=2 npx tsx test-server/server.ts
k6 run packages/delphi-core/loadtest/k6-workflow-pgonly.js
k6 run packages/delphi-core/loadtest/k6-workflow-pgonly-committed.js

# Redis
DISPATCH_MODE=redis CLUSTER_MODE=2 npx tsx test-server/server.ts
k6 run packages/delphi-core/loadtest/k6-workflow-buffered.js
k6 run packages/delphi-core/loadtest/k6-workflow-committed.js

# Breakpoint (find the ceiling)
k6 run packages/delphi-core/loadtest/k6-breakpoint.js
```

### Key optimizations
- `FOR UPDATE SKIP LOCKED` polling with auto-detected LISTEN/NOTIFY wakeup
- `COPY FROM` for bulk inserts (Hatchet-style, 60-90k writes/sec)
- Debounced NOTIFY under load (coalesces into 1 per 10ms)
- `SET LOCAL synchronous_commit = OFF` on buffered ingest (~45% PG TPS gain)
- Per-workflow definition snapshot cached (avoids re-stringifying)
- Bounded concurrent flushes to preserve PG pool headroom
- Log buffering (50-entry / 50ms flush threshold)
- Atomic budget updates via `jsonb_set` (no SELECT+UPDATE race)
- N+1-free `listWorkflows` via aggregate JOIN
## Testing

```sh
pnpm test                                                   # full suite (needs Docker)
npx vitest run src/__tests__/engine/lifecycle.spec.ts       # targeted
npx vitest run src/__tests__/engine/pg-dispatcher.spec.ts   # PG-only dispatch
npx vitest run src/__tests__/engine/dbos-parity.spec.ts     # DBOS-parity features
```

## Key exports
| Export | Purpose |
|---|---|
| createEngine | Factory: typed engine with per-workflow proxy properties |
| Workflow, Step, FunctionStep, step | Class-based workflow/step authoring |
| createDbClient, createPool | Database connection helpers |
| PgConnector | Postgres-only dispatch (default, used internally) |
| WorkflowStepTask | Step handler that bridges dispatch → engine |
| IngestBuffer, IngestWorker | Queue-first ingestion (buffered + committed) |
| ExternalActionExecutor | Exactly-once external side effects |
| EventIngestionService | Event ingest + trigger matching |
| SchedulerService | Cron-based recurring workflow triggers (multi-pod safe) |
| createDispatcher | Cross-tenant dispatch singleton (hint queue + HTTP handler) |
| createDispatchHandler | Express-compatible dispatch endpoint (202 + background drain) |
| PgHintTransport | Postgres-based hint transport (alternative to Redis) |
| PgNotifier | LISTEN/NOTIFY with auto-detection |
| computeRetryDelay | Backoff delay calculator (exponential/fixed with jitter) |
| IANATimezone | Type-safe IANA timezone identifiers with autocomplete |
| BackoffConfig | Retry backoff configuration type (exponential/fixed) |
| dbRetry | Connection-resilient retry decorator |
| runMigrations | Schema migration runner |
| AdaptivePoller | Dynamic polling interval with backoff |
| CREATE_TABLES_SQL | Schema bootstrap |
## Schema
Plain TypeScript interfaces in src/entities/Database.ts. No ORM, no decorators, no reflect-metadata. All queries are parameterized SQL via pg.Pool. JSON columns stored as TEXT with toJson() / fromJson() helpers. All timestamp columns use TIMESTAMPTZ (not plain TIMESTAMP) — stores absolute instants, immune to server timezone changes. Time-critical fields (deadlines, delays, retry backoff) use BIGINT epoch milliseconds for comparison safety.
## Inspiration
delphi-core stands on the shoulders of several excellent projects:
| Project | What we learned |
|---|---|
| DBOS | Postgres-only architecture. FOR UPDATE SKIP LOCKED as a work queue. LISTEN/NOTIFY with self-test fallback. Durable sleep. Workflow forking. Schema designed by Mike Stonebraker's team. |
| Hatchet | COPY FROM bulk inserts for high-throughput ingestion. Buffered vs committed durability. Worker-side fan-out pattern. |
| Temporal | DAG-based workflow definitions. Step-level retry/timeout. Human-in-the-loop approval gates. Signals and queries. Workflows as code, not YAML. |
| Inngest | Event-driven workflow triggers. Idempotent event ingestion with sequence ordering. Step functions as the programming model. |
| PgQue | Postgres-native event queue (reimplements Skype's PgQ in pure PL/pgSQL). Avoids SKIP LOCKED dead-tuple bloat via snapshot-based batching + three-table TRUNCATE rotation — zero VACUUM pressure by construction. Worth watching as an alternative queue primitive if dead-tuple accumulation becomes a concern at sustained high throughput. |
## How delphi-core differs

- Postgres-only by default — no Redis, no external orchestrator, no message broker. One peer dependency: `pg`.
- Typed class API — compile-time safety on inputs, outputs, and dependencies. No string handler keys.
- Multi-tenant — `tenantId` on every table, scoped queries, per-tenant isolation. Built-in cross-tenant dispatch with O(1) connections.
- AI-native — budget guardrails, label-based worker routing, Claude Code executor, skill registry.
- Flexible dispatch — Postgres handles ~5k req/s. Add Redis with one config line for more.
## License
MIT
