@ryanfw/prompt-orchestration-pipeline

v1.3.5

Published

6 days ago

A Prompt-orchestration pipeline (POP) is a framework for building, running, and experimenting with complex chains of LLM tasks.

0High
0Medium
0Low

ryan-fw

llm prompt-engineering pipeline orchestration chatgpt ai workflow automation

Prompt Orchestration Pipeline (POP)

A local-first, process-isolated framework for building reliable, autonomous AI agents.

This system allows engineers to compose complex chains of LLM tasks into resilient pipelines. Instead of relying on fragile "mega-prompts" or black-box scripts, POP decomposes work into standardized stages, enforcing quality through autonomous critique loops and robust architecture.

Documentation

Architecture Deep-Dive: Detailed breakdown of the orchestrator, runner, and state management.
Task Development Guide: How to build valid pipeline tasks using the 11-stage lifecycle.
LLM Provider Requirements: Standards for implementing new model providers.
Feature Audit: Comprehensive list of current system capabilities.
Jobs to be Done (JTBD): User motivations and desired outcomes for the framework.

What It Does (and Why It Matters)

This framework is designed for AI Engineers and Systems Integrators who need to move beyond simple chatbots to build reliable, multi-step workflows.

1. Orchestrate Reliable, Multi-Stage Workflows

Running long-duration AI tasks locally can be fragile. A single script crash or API timeout can waste hours of execution and dollars in token costs.

Process Isolation: Every pipeline runs in its own dedicated child process. If one agent crashes, your orchestrator stays alive.
Resumability: Pause, stop, and resume jobs from any specific task. Fix a bug in step 5 and restart exactly where you left off.
Run Control: Tasks can pause for human approval, skip downstream work, or add follow-up tasks to the current run.
Atomic State: Every stage transition is saved to disk instantly. You never lose progress.

2. Gain Radical Observability

"Black box" agents are impossible to debug. POP provides deep visibility into the "thought process" of your pipelines.

Real-Time Dashboard: Watch jobs progress stage-by-stage via a built-in UI (Server-Sent Events).
Gate Decisions: Approve or reject waiting jobs directly from the job detail view.
Granular Logging: Every input, output, and internal thought is captured in dedicated log files.
Cost Tracking: See exact token usage and cost breakdown for every task and model call.

3. Enforce Quality Autonomously

LLMs are probabilistic and prone to errors. POP enforces a rigid 11-stage lifecycle for every task to catch mistakes before they propagate.

Standardized Flow: Ingestion → Inference → Validation → Critique → Refinement.
Self-Correction: Configurable loops allow the system to detect invalid outputs, critique itself, and refine the answer automatically.
Structured Output: Built-in validators ensuring JSON schema compliance.

4. Avoid Vendor Lock-In

Switch models globally or per-task without rewriting your logic.

Supported Providers:
- OpenAI (GPT-4, GPT-4o)
- Anthropic (Claude 3.5 Sonnet, Haiku, Opus)
- DeepSeek (V3, R1)
- Google Gemini (Pro, Flash)
- Moonshot (Kimi)
- Zhipu (GLM-4)
- Claude Code (CLI integration)
CLI Agents: Tasks can also drive tool-using CLI coding agents (Claude, Codex, OpenCode) via the injected runAgent() helper — for file-aware, multi-turn work alongside single LLM calls. See the Task Development Guide.

System Architecture

The system follows a "Watch Folder" architectural pattern, designed for simplicity and ease of integration.

Level 1: System Context

The Prompt Orchestration Pipeline sits between your external triggers (users, scripts, cron jobs) and the LLM Providers.

Input: A JSON "seed" file dropped into a watched folder.
Processing: The system orchestrates the flow of data through defined tasks.
Output: Completed artifacts and logs written to disk.

Level 2: Containers

The system comprises three main runtime containers:

Orchestrator (Daemon)
- Role: The long-running supervisor.
- Responsibility: Watches the pipeline-data/pending/ directory. When a file appears, it initializes the job and spawns a Runner.
Pipeline Runner (Process)
- Role: An ephemeral, isolated worker.
- Responsibility: Loads the specific pipeline configuration (e.g., "Content Gen") and executes the sequence of tasks.
UI Server (Optional)
- Role: Observability layer.
- Responsibility: Reads state files and pushes real-time updates to the web dashboard via SSE.

Level 3: Components

Inside the Pipeline Runner, the logic is structured into:

Task Runner: The engine that drives a specific task (e.g., "Research") through the standardized lifecycle.
Run Control Contract: A task can write tasks/{taskName}/control.json to append tasks, skip pending tasks, or create an approval gate after it succeeds.
LLM Layer: A unified abstraction for all model providers, handling retries, cost calculation, and normalization.
Symlink Bridge: A specialized component that ensures every task has deterministic access to node_modules and shared utilities, regardless of where it is defined on disk.
Status Writer: An atomic file-writer that updates tasks-status.json safely, preventing data corruption.

Level 4: Code (The Task Lifecycle)

Every task implementation enforces a strict 11-stage flow to ensure consistency:

flowchart LR
    Ingest[1. Ingestion] --> Templ[3. Templating]
    Templ --> Infer[4. Inference]
    Infer --> Parse[5. Parsing]
    Parse --> Val[6. Validate]
    Val -->|Pass| Integ[11. Integration]
    Val -->|Fail| Crit[8. Critique]
    Crit --> Refine[9. Refine]
    Refine --> Templ

(Note: PreProcessing, ValidateQuality, and FinalValidation stages are also available but optional)

Getting Started

Choose one runtime path:

Bun CLI path: for Bun-based projects (bunx pipeline-orchestrator ...).
Standalone binary path: download a prebuilt release binary, no Bun runtime required.

1A. Install for Bun projects

Add the orchestrator as a dependency:

bun add @ryanfw/prompt-orchestration-pipeline

1B. Install standalone binary (no Bun required)

Download a release asset from: https://github.com/ryan-mahoney/prompt-orchestration-pipeline/releases

Available assets:

pipeline-orchestrator-linux-x64
pipeline-orchestrator-macos-x64
pipeline-orchestrator-macos-arm64
pipeline-orchestrator-windows-x64.exe

Linux/macOS example:

curl -L -o pipeline-orchestrator \
  https://github.com/ryan-mahoney/prompt-orchestration-pipeline/releases/latest/download/pipeline-orchestrator-macos-arm64
chmod +x pipeline-orchestrator

2. Initialize Structure

Use one of:

# Bun path
bunx pipeline-orchestrator init --root ./pipelines

# Standalone binary path
./pipeline-orchestrator init --root ./pipelines

3. Configure Scripts (Bun path)

For Bun projects, add:

{
  "scripts": {
    "pipeline": "bunx pipeline-orchestrator start --root pipelines --port 3010"
  }
}

4. Start the System

Use one of:

# Bun path
bun run pipeline

# Standalone binary path
./pipeline-orchestrator start --root pipelines --port 3010

This starts the file watcher and the web dashboard at http://localhost:3010.

Note: npx pipeline-orchestrator ... requires Bun to be installed because the npm bin entry runs through a Bun shebang.

5. Run a Job

Drop a JSON file into pipelines/pipeline-data/pending/:

{
  "name": "my-first-run",
  "pipeline": "default",
  "data": { "topic": "AI Architecture" }
}

The Orchestrator will pick it up, move it to current/, and start processing.

Building a Custom Frontend

The orchestrator exposes a documented HTTP/SSE API at http://localhost:<port>/api/* that any frontend can consume. See HTTP/SSE Protocol Reference for the full contract.

Custom frontends should run against a configured fixed port, usually the default 4000 or an explicit --port value. Treat bind failures or port conflicts as startup failures; the protocol does not provide dynamic port discovery.

Integration Modes

Subprocess mode (recommended): Run the orchestrator as a child process via pipeline-orchestrator start. The orchestrator runs in its own process — if it crashes, your application stays alive. This is the default and recommended approach.

Programmatic mode: Import startServer or startOrchestrator directly to embed the server in your process. This is simpler but forfeits process isolation — a crash in the orchestrator will take down your host process.

Cross-Origin Access

By default, the server only accepts same-origin requests. To allow a custom frontend on a different origin:

pipeline-orchestrator start --cors-origins "https://my-app.example,views://mainview"

For desktop webviews that report Origin: null, add --cors-allow-null-origin. Prefer allowlisting the concrete origin (e.g. views://mainview) when possible.

Protocol Versioning

The API follows a client-robustness contract: clients must ignore unknown JSON fields and unknown SSE event types. This means additive changes (new routes, new event types, new fields) are backward-compatible. Breaking changes (removing or renaming fields, changing types) require a protocolVersion bump and are documented in docs/http-api.md.

Capability negotiation is not built. Custom frontends should use the documented protocol and GET /api/meta instead of expecting negotiated feature flags or per-client capability exchange.