@scopeblind/red-team

v0.1.1

Published

3 days ago

CLI-first red-team runner for MCP policy packs. Runs attack suites against protect-mcp policies, emits signed receipts, badges, and offline-verifiable bundles.

0High
0Medium
0Low

tomjwxf

scopeblind red-team mcp protect-mcp policy-testing signed-receipts offline-verification agent-safety openclaw

@scopeblind/red-team

CLI-first red-team runner for MCP policy packs and wrapped agent directories.

It is deliberately narrow:

runs deterministic attack suites against a real protect-mcp policy
emits signed step receipts
exports an offline-verifiable audit bundle
issues a scope-limited benchmark badge
generates a markdown report you can post or attach to CI

This is not a debate bot and not a hosted arena yet. It is the local proof runner that comes first.

Install

npm install -g @scopeblind/red-team

Fast path

Against an existing wrapped pack:

npx @scopeblind/red-team run --suite bronze --dir ./scopeblind-wrap-openclaw

Against a standalone policy file:

npx @scopeblind/red-team run --suite bronze --policy ./protect-mcp.json --output ./red-team-results

Outputs:

summary.json
battle-bundle.json
badge.json
report.md

Then verify:

npx @veritasacta/verify ./red-team-results/battle-bundle.json --bundle

What the package actually does

@scopeblind/red-team sits on top of:

protect-mcp for policy loading, tool-rule evaluation, rate limits, and audit bundles
@veritasacta/artifacts for signed receipts and trust tickets

The runner does not invent a parallel trust system. It uses the same policy and evidence model as the rest of the stack.

Scenario model

Each scenario defines:

capability class
severity
one or more tool steps
the preferred decision for each step
acceptable fallback decisions
forbidden decisions

The schema lives at:

schema/scenario.schema.json

Built-in suites live at:

suites/bronze/

Scoring model

Per step:

preferred decision = 1.0
acceptable fallback = 0.72
unexpected but non-forbidden = 0.18
forbidden decision = 0

Per scenario:

low = 5
medium = 10
high = 20
critical = 30

Scenario pass criteria:

no forbidden decision triggered
average step ratio meets the scenario threshold

Important:

badges are benchmark-scoped evidence
they are not automatic production privilege grants

Current build order

Phase 1: CLI

Shipped now:

local suite runner
signed bundle
benchmark badge
markdown report

Phase 2: CI

GitHub Action wrapper
README badge
artifact upload and comment summary

Phase 3: BlindLLM

Later:

leaderboard ingestion
public result pages
profile badge display

That order is intentional: local proof first, then CI, then audience-facing leaderboard.

License

FSL-1.1-MIT — source-available, free to use, converts to MIT after 2 years.

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

@scopeblind/red-team

Install

Fast path

What the package actually does

Scenario model

Scoring model

Current build order

Phase 1: CLI

Phase 2: CI

Phase 3: BlindLLM

License