@gethelio/proxy

v0.6.0

Published

4 days ago

Open-source MCP governance proxy for AI agents


mcp ai-agents governance proxy audit policy

Helio is an MCP proxy that sits between your AI agents and the tools they use. Every tool call passes through Helio, which enforces policies, checks evidence, routes approvals, tracks spend, and records everything, all without changing your agent code or your MCP servers.

npx @gethelio/proxy init

Why Helio?

Your agent just called an API you didn't expect. It spent money you didn't authorize. It modified a production record you can't easily undo.

Model providers are building governance for their own platforms, but your agents run across Claude, ChatGPT, LangChain, CrewAI, and custom frameworks. No single platform governs the full picture, and none of them governs what happens in downstream systems like Stripe, Salesforce, or GitHub.

Helio governs what agents do to the rest of the world, across any MCP-compatible agent, any tool, and any platform.

How It Works

┌──────────────┐     ┌──────────────────────┐     ┌──────────────┐
│              │     │       Helio           │     │              │
│  MCP Client  │────▶│                       │────▶│  MCP Server  │
│  (Agent)     │◀────│  • Policy engine      │◀────│  (Tools)     │
│              │     │  • Evidence grounding  │     │              │
└──────────────┘     │  • Approval workflows  │     └──────────────┘
                     │  • Rate & spend limits │
        Optional     │  • Audit trail         │
      ┌─────────┐    │  • Self-repair feedback│
      │  SDK    │───▶│                       │
      │ (thin)  │    └──────────────────────┘
      └─────────┘

Two integration paths:

Proxy only: Point your MCP client at Helio instead of your MCP server. Zero code changes. Immediate governance.
Proxy + SDK: Add the thin Python SDK to annotate tool calls with evidence context and action dependencies. Richer governance, from an SDK that is under 500 lines of code.

Quick Start (5 minutes)

1. Install

npx @gethelio/proxy init

2. Configure

Create a helio.yaml in your project root:

version: '1'

upstream:
  url: 'http://localhost:3001/mcp' # Your existing MCP server

listen:
  port: 3000 # Helio listens here

policies:
  default: allow

  rules:
    # Require approval for write operations
    - match:
        tool: '*'
        annotations:
          readOnlyHint: false
      action: require_approval
      approval:
        channel: dashboard
        timeout: 300s

    # Block destructive operations
    - match:
        tool: 'delete_*'
      action: deny
      feedback:
        message: 'Destructive operations are disabled'

    # Rate limit expensive API calls
    - match:
        tool: 'search_*'
      action: rate_limit
      limits:
        max_calls: 100
        window: 1h
        key: tool

    # Spend limit on payment tools
    - match:
        tool: 'create_payment'
      action: spend_limit
      limits:
        max_spend:
          field: '$.amount'
          limit: 5000
          currency: 'GBP'
          window: 24h

audit:
  storage: sqlite
  retention: 90d
  include_responses: true

dashboard:
  enabled: true
  port: 3100
  # Required whenever any rule uses `require_approval`. Generate with:
  #   openssl rand -hex 32
  api_secret: '${HELIO_DASHBOARD_SECRET}'

If you use the ${HELIO_DASHBOARD_SECRET} placeholder above, set it before start:

export HELIO_DASHBOARD_SECRET="$(openssl rand -hex 32)"
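
Before starting the proxy, it can help to confirm that every placeholder in helio.yaml has a value. The following is a minimal pre-flight sketch, not part of Helio: the function name unset_placeholders is hypothetical, and it only assumes the ${NAME} placeholder syntax shown above.

```python
import os
import re
from typing import List, Mapping, Optional

# Matches ${NAME} placeholders such as ${HELIO_DASHBOARD_SECRET}.
PLACEHOLDER = re.compile(r"\$\{([A-Za-z_][A-Za-z0-9_]*)\}")


def unset_placeholders(
    config_text: str, env: Optional[Mapping[str, str]] = None
) -> List[str]:
    """Return placeholder names in config_text that are missing or empty in env."""
    env = os.environ if env is None else env
    names = PLACEHOLDER.findall(config_text)
    # dict.fromkeys keeps first-seen order while dropping duplicates.
    return [name for name in dict.fromkeys(names) if not env.get(name)]
```

Reading helio.yaml into a string and passing it to unset_placeholders returns an empty list when every referenced variable is set. Placeholders inside YAML comments are also reported, since the sketch does not parse YAML.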

Host binding defaults. helio start binds listen.host and dashboard.host to 127.0.0.1 by default (the helio init template writes this value, and the schema defaults to it). Do not flip either to 0.0.0.0 without putting an authenticating reverse proxy in front — the MCP edge has no authentication at all, and the dashboard sideband's bearer is a shared secret, not a user session. The Docker quickstart inverts this layering: inside the container the bundled config binds 0.0.0.0 (correct for the container's virtual network) and Compose's ports: map publishes both ports back to 127.0.0.1 on the host.

3. Start Helio

npx @gethelio/proxy start

4. Point your agent at Helio

{
  "mcpServers": {
    "my-tools": {
      "url": "http://localhost:3000/mcp"
    }
  }
}

5. Open the dashboard

http://localhost:3100

That's it. Every tool call now passes through Helio, with a full audit trail, approval workflows, rate limits, and spend controls applied.

Notification transport semantics (v0.1). For JSON-RPC notifications (requests with no id), Helio's streamable-http endpoint returns HTTP 202 Accepted with an empty body after forwarding the message upstream fire-and-forget. For the legacy sse transport, POST requests also return 202 with an empty body, and response payloads are delivered on the SSE stream. For correlated (non-notification) requests, Helio now requires upstream JSON-RPC responses to include an id; upstream replies with a missing id are wrapped as protocol-invalid upstream errors.
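
The distinction above can be sketched in a few lines. This is an illustration of the described behavior, not Helio's implementation: proxy_outcome and its return labels are hypothetical, and "missing id" is assumed to mean that the id key is absent from the upstream reply.

```python
from typing import Any, Dict, Optional


def is_notification(message: Dict[str, Any]) -> bool:
    """JSON-RPC 2.0: a request object with no "id" member is a notification."""
    return "method" in message and "id" not in message


def proxy_outcome(
    message: Dict[str, Any], upstream_reply: Optional[Dict[str, Any]] = None
) -> str:
    """Label what the client should see for a given message and upstream reply."""
    if is_notification(message):
        # Forwarded fire-and-forget; no upstream reply is awaited.
        return "202 Accepted, empty body"
    if upstream_reply is None or "id" not in upstream_reply:
        # Correlated request, but the reply cannot be matched to it.
        return "protocol-invalid upstream error"
    return "forward upstream reply"
```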

Features

Policy Engine

Declarative YAML rules that match on tool name, annotations, input parameters, environment, and cumulative state. Policies hot-reload without restart.

rules:
  - match:
      tool: 'create_payment'
      input:
        '$.amount': { gt: 1000 }
    action: require_approval
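
To make the matching model concrete, here is a rough Python sketch of how such rules could be evaluated. It is not Helio's engine. It assumes that the first matching rule wins, that tool patterns are shell-style globs, and that input paths are top-level fields written as $.field. Only the gt operator from the example is handled, and annotation, environment, and cumulative-state matching are left out.

```python
from fnmatch import fnmatchcase
from typing import Any, Dict, List


def input_matches(conditions: Dict[str, Dict[str, Any]], tool_input: Dict[str, Any]) -> bool:
    """Check every '$.field': {op: operand} condition against the tool input."""
    for path, condition in conditions.items():
        if not path.startswith("$."):
            raise ValueError(f"unsupported path in this sketch: {path}")
        value = tool_input.get(path[2:])
        for op, operand in condition.items():
            if op != "gt":
                raise ValueError(f"unsupported operator in this sketch: {op}")
            if value is None or not value > operand:
                return False
    return True


def decide(rules: List[Dict[str, Any]], default: str, tool: str, tool_input: Dict[str, Any]) -> str:
    """Return the action of the first matching rule (assumed), else the default."""
    for rule in rules:
        match = rule["match"]
        if fnmatchcase(tool, match.get("tool", "*")) and input_matches(
            match.get("input", {}), tool_input
        ):
            return rule["action"]
    return default
```

With the rule above and default: allow, a create_payment call with amount 1500 would resolve to require_approval, while amount 1000 would fall through to allow.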

Evidence Grounding

Require proof before high-stakes actions. A refund requires a prior order lookup. A deployment requires a passing test run. The optional SDK marks tool outputs as evidence; the proxy enforces evidence requirements.

rules:
  - match:
      tool: 'process_refund'
    action: deny
    evidence:
      requires: ['orders.lookup']

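# Optional SDK enrichment
from helio import HelioContext

# `orders` and `order_id` stand in for your own tool client and its argument.
# Mark a tool output as evidence so the proxy can enforce evidence requirements.
with HelioContext() as ctx:
    result = orders.lookup(order_id)
    ctx.mark_evidence("orders.lookup", "order_data", result)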

The SDK talks to the proxy over the sideband API (default 127.0.0.1:3200; bind host is configurable via sdk.host). On every helio start the proxy generates a fresh 32-byte hex token and prints it to stderr:

SDK sideband listening on http://127.0.0.1:3200
SDK token (pass as HELIO_SDK_TOKEN env var to your SDK clients):
  3f9c2b...d8a1

Pass the same value to the SDK process via HELIO_SDK_TOKEN and the SDK automatically attaches Authorization: Bearer <token> to every sideband call. The sideband also rejects any request carrying a non-null Origin header, so a malicious local HTML file cannot talk to it through a browser. Operators who need a stable token across restarts can set HELIO_SDK_TOKEN explicitly in the proxy's environment — the proxy respects a pre-set value instead of regenerating one.
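
The two checks described above, bearer token and Origin rejection, can be sketched as follows. This is an illustration only: sideband_headers and accept_request are hypothetical names, and the sketch assumes that any request carrying an Origin header at all is rejected.

```python
import hmac
import os
from typing import Dict, Mapping, Optional


def sideband_headers(env: Optional[Mapping[str, str]] = None) -> Dict[str, str]:
    """Client side: build the Authorization header from HELIO_SDK_TOKEN."""
    env = os.environ if env is None else env
    token = env.get("HELIO_SDK_TOKEN")
    if not token:
        raise RuntimeError("HELIO_SDK_TOKEN is not set")
    return {"Authorization": f"Bearer {token}"}


def accept_request(headers: Mapping[str, str], expected_token: str) -> bool:
    """Server side: reject browser-originated calls, then verify the bearer token."""
    lowered = {key.lower(): value for key, value in headers.items()}
    if "origin" in lowered:
        # Assumption: the presence of an Origin header is enough to reject.
        return False
    scheme, _, presented = lowered.get("authorization", "").partition(" ")
    # compare_digest avoids leaking the match position through timing.
    return scheme == "Bearer" and hmac.compare_digest(
        presented.encode(), expected_token.encode()
    )
```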

Self-Repair Feedback

When Helio blocks an action, it returns structured feedback explaining what failed and what the agent should do next. The agent can then self-correct and retry.

{
  "blocked": true,
  "reason": "evidence_missing",
  "missing_evidence": ["orders.lookup"],
  "suggestion": "Call orders.lookup with the order ID before retrying"
}
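
On the agent side, feedback of this shape can drive a simple repair loop. The sketch below is one possible approach, not a Helio API: call_with_repair is a hypothetical helper, call_tool stands for whatever function your agent uses to invoke a tool, and reusing the same arguments for the evidence tool is an assumption that will not hold for every tool.

```python
from typing import Any, Callable, Dict

ToolCaller = Callable[[str, Dict[str, Any]], Dict[str, Any]]


def call_with_repair(
    call_tool: ToolCaller, tool: str, args: Dict[str, Any], max_attempts: int = 2
) -> Dict[str, Any]:
    """Call a tool; if blocked for missing evidence, gather it and retry."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    result: Dict[str, Any] = {}
    for _ in range(max_attempts):
        result = call_tool(tool, args)
        if not result.get("blocked"):
            return result
        if result.get("reason") != "evidence_missing":
            break  # This sketch only knows how to repair missing evidence.
        for evidence_tool in result.get("missing_evidence", []):
            # Assumption: the evidence tool accepts the same arguments.
            call_tool(evidence_tool, args)
    return result  # Still blocked; surface the feedback to the caller.
```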

Action Dependency Chains

Declare prerequisite actions in policy. The proxy tracks completed actions per session and blocks anything where prerequisites aren't met.

rules:
  - match:
      tool: 'process_refund'
    requires: ['orders.lookup', 'customer.verify']
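
Per-session prerequisite tracking can be pictured with a small in-memory structure. This is a sketch of the idea only: SessionTracker is a hypothetical class, and how Helio stores or expires session state is not covered here.

```python
from collections import defaultdict
from typing import Dict, List, Set


class SessionTracker:
    """Tracks completed actions per session and reports unmet prerequisites."""

    def __init__(self, requirements: Dict[str, List[str]]) -> None:
        self.requirements = requirements
        self.completed: Dict[str, Set[str]] = defaultdict(set)

    def record(self, session_id: str, tool: str) -> None:
        """Mark a tool call as completed for this session."""
        self.completed[session_id].add(tool)

    def missing(self, session_id: str, tool: str) -> List[str]:
        """Return prerequisites of `tool` not yet completed in this session."""
        done = self.completed[session_id]
        return [req for req in self.requirements.get(tool, []) if req not in done]
```

A call would be blocked whenever missing() returns a non-empty list, and forwarded (then recorded) otherwise.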

Approval Workflows

Route sensitive actions to Slack, webhook, or the Helio dashboard. Configurable timeout and escalation, plus a dashboard-only break-glass override (REST API and dashboard UI; not exposed as a Slack button). If a channel delivery fails, Helio logs an operational warning and emits an approval_notification_failed dashboard event so operators can investigate without losing the underlying pending ticket.

Transaction Controls

Rate limits per tool and per session. Spend limits with cumulative tracking. Irreversible action detection. Dry-run mode that executes the full pipeline without forwarding to the MCP server.
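
Rate limits and cumulative spend limits can both be modeled as a budget over a time window. The sketch below uses a sliding window; whether Helio uses sliding or fixed windows is not stated here, so treat that as an assumption. WindowLimit is a hypothetical class, and timestamps are passed in explicitly to keep the example deterministic.

```python
from collections import deque
from typing import Deque, Tuple


class WindowLimit:
    """Allows amounts to accumulate up to `limit` within `window_seconds`."""

    def __init__(self, limit: int, window_seconds: int) -> None:
        self.limit = limit
        self.window = window_seconds
        self.events: Deque[Tuple[int, int]] = deque()  # (timestamp, amount)
        self.total = 0

    def try_add(self, amount: int, now: int) -> bool:
        """Record `amount` at time `now` if it fits in the window budget."""
        # Evict events that have aged out of the window.
        while self.events and self.events[0][0] <= now - self.window:
            _, old_amount = self.events.popleft()
            self.total -= old_amount
        if self.total + amount > self.limit:
            return False  # Would exceed the budget; reject without recording.
        self.events.append((now, amount))
        self.total += amount
        return True
```

A rate limit of 100 calls per hour corresponds to WindowLimit(100, 3600) with amount 1 per call; a spend limit of 5000 per 24 hours corresponds to WindowLimit(5000, 86400) with the payment amount.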

Audit Trail

Every tool call is recorded with its timestamp, agent identity, tool name, inputs, policy decision, evidence chain, approval status, downstream response, and latency. Searchable dashboard. Export to JSON or CSV.

How Helio Compares

| | Helio | Obot | Cerbos | Built-in (Anthropic / OpenAI) | Framework (LangChain / CrewAI) |
| --- | --- | --- | --- | --- | --- |
| What it governs | Per-call actions with cross-call state | Which tools/MCPs are reachable | App-level authorization decisions | Agent permissions inside one platform | Agent behavior inside one framework |
| Architecture | Out-of-process MCP proxy | Out-of-process MCP gateway | Sidecar / library | In-platform | In-framework |
| Open source | ✅ Apache 2.0 | ✅ Apache 2.0 | ✅ Apache 2.0 | ❌ | Varies |
| Time to value | 5 minutes | Setup-dependent | Hours | Built-in | Built-in |
| No agent code changes | ✅ | ✅ | ❌ | ✅ (within platform) | ❌ |
| Governs agents you didn't build | ✅ Any MCP agent | ✅ Any MCP agent | ✅ (any app) | ❌ One platform only | ❌ One framework only |
| Evidence grounding | ✅ Cumulative across calls | ❌ | ❌ | ❌ | Limited |
| Self-repair feedback | ✅ Structured retry hints | ❌ | ❌ | ❌ | Limited |
| Stateful spend / rate limits | ✅ Per-tool, per-session¹ | Basic | ❌ | ❌ | Limited |
| Approval workflows | ✅ Slack, webhook, dashboard | ✅ | ❌ | Limited | Limited |
| Audit trail (incl. downstream responses) | ✅ Captures upstream MCP responses | Decision logs | Decision logs | Platform telemetry | Framework logs |

¹ Per-tool and per-session spend limits ship in v0.1. Cross-tool spend aggregation is planned for v0.2.

Works With

Helio works with any MCP-compatible agent or framework:

Claude (Anthropic)
ChatGPT (OpenAI)
LangChain / LangGraph
CrewAI
AutoGen
Custom agents using any MCP client SDK

Documentation

The full docs live in the monorepo and are not bundled into this package tarball.

Getting Started: Install and configure in 5 minutes
Configuration Reference: Every YAML option explained
Policy Guide: How to write rules with examples
Approval Workflows: Slack, webhook, and dashboard approvals
Audit Trail: What's recorded, how to search, how to export

Examples

Ready-made configurations for common patterns:

Basic: Deny destructive operations, allow everything else
Slack Approvals: Route destructive actions to Slack
Spend Limits: Govern payment tool usage

Contributing

We welcome contributions. See CONTRIBUTING.md for setup instructions, coding standards, and PR process.

Good first issues are labeled good-first-issue.

Community

GitHub Issues: Bug reports and feature requests
Twitter/X: Updates and announcements

License

Apache 2.0 - see LICENSE.