decoy-tripwire
v0.13.2
Published
Security tripwires and local MCP proxy for AI agents. Detect prompt injection and enforce tool-call policy.
Maintainers
Readme
Know when your agents are compromised. Decoy Tripwire drops decoy MCP tools alongside your real ones — tools like execute_command, make_payment, access_credentials that no legitimate agent should ever call. When a prompt injection triggers one, the proxy pauses the compromised agent immediately and alerts you.
Every decoy returns a realistic error (timeout, permission denied). The agent sees a broken real tool, not a detection. Attackers don't know they've been caught.
Works with: Claude Desktop, Cursor, Windsurf, VS Code, Claude Code
🚀 Get Started
npx decoy-tripwire initThat's it. init signs you up, installs the local proxy, wraps your existing MCP servers, and drops the tripwires. Restart your MCP host — tripwires are live.
When a tripwire fires:
- The compromised agent is paused for 10 minutes (auto-expires)
- A desktop notification surfaces which tool was tripped
- Every wrapped MCP server denies subsequent calls from that agent in sub-ms
- Full context appears in your dashboard
Clear the pause early with npx decoy-tripwire resume <agent-id>.
🧑💻 Install
npx decoy-tripwire init # Sign up and install (wraps upstreams by default)
npx decoy-tripwire init --no-wrap # Install without wrapping existing MCP servers
npx decoy-tripwire login --token=xxx # Log in with an existing tokenRequires Node.js 18+. Zero runtime dependencies.
🎓 Docs
🛠 Commands
# Monitor
npx decoy-tripwire test # Fire a test trigger
npx decoy-tripwire status # Local pauses + hosted triggers
npx decoy-tripwire watch # Live tail of triggers
# When a tripwire fires
npx decoy-tripwire resume <agent-id> # Clear an auto-pause immediately
npx decoy-tripwire resume --all # Clear every pause
npx decoy-tripwire lock <agent-id> # Turn an auto-pause into a permanent block
npx decoy-tripwire lockdown on # Any tripwire hit pauses every agent
# Manage
npx decoy-tripwire agents # List connected agents
npx decoy-tripwire config # View alert configuration
npx decoy-tripwire upgrade # Upgrade to Team (via dashboard)
npx decoy-tripwire uninstall --confirm # Remove from all MCP hostsAll commands support --json for scripting and --token=xxx to override the stored token.
🪤 Tripwire Tools
| Tool | What it traps | Severity |
|------|--------------|----------|
| execute_command | Shell execution | Critical |
| write_file | File system persistence | Critical |
| make_payment | Unauthorized payments | Critical |
| authorize_service | Trust grants to external services | Critical |
| modify_dns | DNS record hijacking | Critical |
| read_file | Credential theft | High |
| http_request | Data exfiltration | High |
| database_query | SQL execution | High |
| access_credentials | API key theft | High |
| send_email | Phishing via agent | High |
| install_package | Supply chain attack | High |
| get_environment_variables | Secret harvesting | High |
Plus dynamically generated tools from 6 threat categories (cloud infrastructure, secrets management, payments, CI/CD, identity, network). Each deployment gets a unique, deterministic set.
🧠 How auto-block works
init rewrites each MCP host config so upstream servers run through node proxy.mjs -- <original command>. The proxy intercepts every tools/call:
- Checks the shared pause registry at
~/.decoy/pause.json— if the agent is paused, denies immediately. - If the call is a tripwire, returns a fake error and writes a 10-min pause entry for the agent.
- Otherwise forwards to upstream.
Every proxy instance reads the registry on its hot path, so one tripwire hit blocks every wrapped server in the same process lifecycle. Sub-ms. Works offline. Dashboard sync is fire-and-forget.
Turn on lockdown mode to escalate — any tripwire pauses every agent, not just the one that tripped.
📦 Plans
| | Free | Team ($29/user/mo) | Business ($99/user/mo) | |---|---|---|---| | Tripwires (12+ dynamic) | Yes | Yes | Yes | | Auto-block via local proxy | Yes | Yes | Yes | | Email alerts | Yes | Yes | Yes | | 7-day history | Yes | | | | SARIF/JSON export | Yes | Yes | Yes | | Slack/webhook alerts | | Yes | Yes | | Agent profiles + fingerprinting | | Yes | Yes | | 90-day history | | Yes | Yes | | Threat intel API | | Yes | Yes | | Security testing | | Yes | Yes | | OWASP compliance reports | | | Yes | | Custom detection rules | | | Yes | | Gateway integrations | | | Yes |
🚢 Release Notes
See CHANGELOG.md or the hosted changelog.
🤝 Contribute
See CONTRIBUTING.md.
🔗 Related
- decoy-scan — Find security risks in your MCP servers
- decoy-redteam — Autonomous red team for MCP servers
- Decoy Guard — Dashboard, threat intel, compliance reports
- OWASP Agentic Top 10
📝 License
MIT — see LICENSE.
