@gethelio/proxy
v0.6.0
Published
Open-source MCP governance proxy for AI agents
Maintainers
Readme
Helio is an MCP proxy that sits between your AI agents and the tools they use. Every tool call passes through Helio, which enforces policies, checks evidence, routes approvals, tracks spend, and records everything - without changing your agent code or your MCP servers.
npx @gethelio/proxy initWhy Helio?
Your agent just called an API you didn't expect. It spent money you didn't authorize. It modified a production record you can't easily undo.
Model providers are building governance for their own platforms but your agents run across Claude, ChatGPT, LangChain, CrewAI, and custom frameworks. No single platform governs the full picture. And none of them govern what happens in downstream systems like Stripe, Salesforce, or GitHub.
Helio governs what agents do to the rest of the world across any MCP-compatible agent, any tool, any platform.
How It Works
┌──────────────┐ ┌──────────────────────┐ ┌──────────────┐
│ │ │ Helio │ │ │
│ MCP Client │────▶│ │────▶│ MCP Server │
│ (Agent) │◀────│ • Policy engine │◀────│ (Tools) │
│ │ │ • Evidence grounding │ │ │
└──────────────┘ │ • Approval workflows │ └──────────────┘
│ • Rate & spend limits │
Optional │ • Audit trail │
┌─────────┐ │ • Self-repair feedback│
│ SDK │───▶│ │
│ (thin) │ └──────────────────────┘
└─────────┘Two integration paths:
- Proxy only: Point your MCP client at Helio instead of your MCP server. Zero code changes. Immediate governance.
- Proxy + SDK: Add the thin Python SDK to annotate tool calls with evidence context and action dependencies. Richer governance, under 500 lines of code.
Quick Start (5 minutes)
1. Install
npx @gethelio/proxy init2. Configure
Create a helio.yaml in your project root:
version: '1'
upstream:
url: 'http://localhost:3001/mcp' # Your existing MCP server
listen:
port: 3000 # Helio listens here
policies:
default: allow
rules:
# Require approval for write operations
- match:
tool: '*'
annotations:
readOnlyHint: false
action: require_approval
approval:
channel: dashboard
timeout: 300s
# Block destructive operations
- match:
tool: 'delete_*'
action: deny
feedback:
message: 'Destructive operations are disabled'
# Rate limit expensive API calls
- match:
tool: 'search_*'
action: rate_limit
limits:
max_calls: 100
window: 1h
key: tool
# Spend limit on payment tools
- match:
tool: 'create_payment'
action: spend_limit
limits:
max_spend:
field: '$.amount'
limit: 5000
currency: 'GBP'
window: 24h
audit:
storage: sqlite
retention: 90d
include_responses: true
dashboard:
enabled: true
port: 3100
# Required whenever any rule uses `require_approval`. Generate with:
# openssl rand -hex 32
api_secret: '${HELIO_DASHBOARD_SECRET}'If you use the ${HELIO_DASHBOARD_SECRET} placeholder above, set it before start:
export HELIO_DASHBOARD_SECRET="$(openssl rand -hex 32)"Host binding defaults.
helio startbindslisten.hostanddashboard.hostto127.0.0.1by default (thehelio inittemplate writes this value, and the schema defaults to it). Do not flip either to0.0.0.0without putting an authenticating reverse proxy in front — the MCP edge has no authentication at all, and the dashboard sideband's bearer is a shared secret, not a user session. The Docker quickstart inverts this layering: inside the container the bundled config binds0.0.0.0(correct for the container's virtual network) and Compose'sports:map publishes both ports back to127.0.0.1on the host.
3. Start Helio
npx @gethelio/proxy start4. Point your agent at Helio
{
"mcpServers": {
"my-tools": {
"url": "http://localhost:3000/mcp"
}
}
}5. Open the dashboard
http://localhost:3100That's it. Every tool call now passes through Helio. This includes full audit trail, approval workflows, rate limits, and spend controls.
Notification transport semantics (v0.1). For JSON-RPC notifications (requests with no
id), Helio'sstreamable-httpendpoint returnsHTTP 202 Acceptedwith an empty body after forwarding upstream fire-and-forget. For the legacyssetransport, POST requests also return202with empty body and response payloads are delivered on the SSE stream. For correlated (non-notification) requests, Helio now requires upstream JSON-RPC responses to include anid; missing-id upstream replies are wrapped as protocol-invalid upstream errors.
Features
Policy Engine
Declarative YAML rules that match on tool name, annotations, input parameters, environment, and cumulative state. Policies hot-reload without restart.
rules:
- match:
tool: 'create_payment'
input:
'$.amount': { gt: 1000 }
action: require_approvalEvidence Grounding
Require proof before high-stakes actions. A refund requires a prior order lookup. A deployment requires a passing test run. The optional SDK marks tool outputs as evidence; the proxy enforces evidence requirements.
rules:
- match:
tool: 'process_refund'
action: deny
evidence:
requires: ['orders.lookup']# Optional SDK enrichment
from helio import HelioContext
# Mark a tool output as evidence
with HelioContext() as ctx:
result = orders.lookup(order_id)
ctx.mark_evidence("orders.lookup", "order_data", result)The SDK talks to the proxy over the sideband API (default 127.0.0.1:3200; bind host is configurable via sdk.host). On every helio start the proxy generates a fresh 32-byte hex token and prints it to stderr:
SDK sideband listening on http://127.0.0.1:3200
SDK token (pass as HELIO_SDK_TOKEN env var to your SDK clients):
3f9c2b...d8a1Pass the same value to the SDK process via HELIO_SDK_TOKEN and the SDK automatically attaches Authorization: Bearer <token> to every sideband call. The sideband also rejects any request carrying a non-null Origin header, so a malicious local HTML file cannot talk to it through a browser. Operators who need a stable token across restarts can set HELIO_SDK_TOKEN explicitly in the proxy's environment — the proxy respects a pre-set value instead of regenerating one.
Self-Repair Feedback
When Helio blocks an action, it returns structured feedback explaining what failed and what the agent should do next. The agents can then self-correct and retry.
{
"blocked": true,
"reason": "evidence_missing",
"missing_evidence": ["orders.lookup"],
"suggestion": "Call orders.lookup with the order ID before retrying"
}Action Dependency Chains
Declare prerequisite actions in policy. The proxy tracks completed actions per session and blocks anything where prerequisites aren't met.
rules:
- match:
tool: 'process_refund'
requires: ['orders.lookup', 'customer.verify']Approval Workflows
Route sensitive actions to Slack, webhook, or the Helio dashboard. Configurable timeout and escalation, plus a dashboard-only break-glass override (REST API and dashboard UI; not exposed as a Slack button).
If a channel delivery fails, Helio logs an operational warning and emits an approval_notification_failed dashboard event so operators can investigate without losing the underlying pending ticket.
Transaction Controls
Rate limits per tool and per session. Spend limits with cumulative tracking. Irreversible action detection. Dry-run mode that executes the full pipeline without forwarding to the MCP server.
Audit Trail
Every tool call recorded: timestamp, agent identity, tool name, inputs, policy decision, evidence chain, approval status, downstream response, and latency. Searchable dashboard. Export to JSON or CSV.
How Helio Compares
| | Helio | Obot | Cerbos | Built-in (Anthropic / OpenAI) | Framework (LangChain / CrewAI) | | -------------------------------------------- | -------------------------------------- | ------------------------------ | --------------------------------- | ------------------------------------- | ----------------------------------- | | What it governs | Per-call actions with cross-call state | Which tools/MCPs are reachable | App-level authorization decisions | Agent permissions inside one platform | Agent behavior inside one framework | | Architecture | Out-of-process MCP proxy | Out-of-process MCP gateway | Sidecar / library | In-platform | In-framework | | Open source | ✅ Apache 2.0 | ✅ Apache 2.0 | ✅ Apache 2.0 | ❌ | Varies | | Time to value | 5 minutes | Setup-dependent | Hours | Built-in | Built-in | | No agent code changes | ✅ | ✅ | ❌ | ✅ (within platform) | ❌ | | Governs agents you didn't build | ✅ Any MCP agent | ✅ Any MCP agent | ✅ (any app) | ❌ One platform only | ❌ One framework only | | Evidence grounding | ✅ Cumulative across calls | ❌ | ❌ | ❌ | Limited | | Self-repair feedback | ✅ Structured retry hints | ❌ | ❌ | ❌ | Limited | | Stateful spend / rate limits | ✅ Per-tool, per-session¹ | Basic | ❌ | ❌ | Limited | | Approval workflows | ✅ Slack, webhook, dashboard | ✅ | ❌ | Limited | Limited | | Audit trail (incl. downstream responses) | ✅ Captures upstream MCP responses | Decision logs | Decision logs | Platform telemetry | Framework logs |
* Per-tool and per-session spend limits ship in v0.1. Cross-tool spend aggregation is planned for v0.2.
Works With
Helio works with any MCP-compatible agent or framework:
- Claude (Anthropic)
- ChatGPT (OpenAI)
- LangChain / LangGraph
- CrewAI
- AutoGen
- Custom agents using any MCP client SDK
Documentation
The full docs live in the monorepo and are not bundled into this package tarball.
- Getting Started: Install and configure in 5 minutes
- Configuration Reference: Every YAML option explained
- Policy Guide: How to write rules with examples
- Approval Workflows: Slack, webhook, and dashboard approvals
- Audit Trail: What's recorded, how to search, how to export
Examples
Ready-made configurations for common patterns:
- Basic: Deny destructive operations, allow everything else
- Slack Approvals: Route destructive actions to Slack
- Spend Limits: Govern payment tool usage
Contributing
We welcome contributions. See CONTRIBUTING.md for setup instructions, coding standards, and PR process.
Good first issues are labeled good-first-issue.
Community
- GitHub Issues: Bug reports and feature requests
- Twitter/X: Updates and announcements
License
Apache 2.0 - see LICENSE.
