aegis-audit

v3.3.1

Published

5 days ago

AI-powered smart contract security auditor — OWASP SC Top 10 (2026), MITRE ATT&CK, CWE, and NIST SSDF compliance outputs with a benchmarked detection engine

Downloads

708

🛡️ Aegis

AI-Powered Smart Contract Security Auditor

Detect vulnerabilities before attackers do — mapped to OWASP, MITRE ATT&CK, CWE & NIST.

Aegis is a command-line security auditor for Solidity smart contracts. It combines a deterministic static-analysis engine with an AI semantic layer, maps every finding to industry frameworks, and produces enterprise-grade compliance artifacts (SARIF, SBOM, signed audit logs). Its detection accuracy is measured against an academic benchmark — not asserted.

npm install -g aegis-audit
aegis audit ./contracts/MyToken.sol

Why Aegis

Most scanners hand you a list of bugs. Aegis is built for teams that ship to production:

Full OWASP SC Top 10 (2026) coverage — including the categories that actually cause losses. Access control alone was $953M of $1.42B in 2024 losses, far ahead of reentrancy.
Framework traceability — every finding carries its SC0X:2026, CWE-XXX, and MITRE TXXXX identifiers for audit and compliance reporting.
Red-team attack-path synthesis — chains individual findings into the multi-step exploits an APT would actually run (flash-loan price manipulation, proxy takeover, recursive drain).
Offline mode — --offline runs all static detectors without your source code ever leaving the machine. Built for proprietary and regulated codebases.
CI/CD native — SARIF 2.1.0 output, configurable fail thresholds, proper exit codes.
NIST SSDF (SP 800-218) outputs — CycloneDX SBOM generation, encrypted key storage, and a tamper-evident hash-chained audit log.
Measured, not claimed — ships with a benchmark harness scored against the SmartBugs Curated dataset.

Install

npm install -g aegis-audit
aegis config        # set encrypted API key (or use --offline)

Get a free Anthropic API key at console.anthropic.com. Enterprises should prefer setting ANTHROPIC_API_KEY via a secrets manager, or use --offline.

Usage

# Audit a local file, a folder, or a verified on-chain address
aegis audit ./contracts/MyToken.sol
aegis audit ./contracts/
aegis audit 0x1f9840a85d5aF5bf1D1762F925BDADdC4201F984 --network ethereum

# Enterprise / regulated: never transmit source code
aegis audit ./contracts/ --offline

# Generate compliance + CI artifacts
aegis audit ./contracts/ --sarif results.sarif --sbom sbom.json --output report.md

# CI gate (non-zero exit on high+ findings)
aegis audit ./contracts/ --ci --fail-on high

# Measure detector accuracy against labeled datasets
aegis benchmark
aegis benchmark --fetch-smartbugs

What it detects — OWASP Smart Contract Top 10 (2026)

| ID | Category | Detectors | |----|----------|-----------| | SC01 | Access Control | Missing modifiers, tx.origin auth, unprotected selfdestruct | | SC02 | Business Logic | Precision loss, unbounded loops (DoS), weak randomness, timestamp logic | | SC03 | Price Oracle Manipulation | Spot-price-as-oracle detection | | SC04 | Flash Loan | Callback-invariant flags | | SC05 | Input Validation | Missing zero-address checks | | SC06 | Unchecked External Calls | Unchecked .call, unsafe ERC20 transfer | | SC07 / SC09 | Arithmetic / Overflow | Pre-0.8 Solidity, unchecked blocks | | SC08 | Reentrancy | External-call-before-state, missing guards | | SC10 | Proxy & Upgradeability | Unprotected initializer, delegatecall |

Plus a Claude AI semantic layer for business-logic and economic attacks that pattern matching misses.

Accuracy

Aegis ships with a benchmark harness so detection is evidence-based. On the academic SmartBugs Curated dataset (143 labeled contracts), the deterministic static layer alone scores:

| Metric | Value | |--------|-------| | Overall recall | 62.6% | | Reentrancy precision | 94.1% | | Unchecked-call recall | 76.9% | | Full-dataset scan time | < 1 second |

The AI layer adds semantic recall on top of this floor. Full per-category numbers and methodology are in benchmark/README.md. For comparison, the ICSE 2020 study found individual mature tools each detect only a fraction of the dataset — which is why running multiple tools, plus a human audit, is the recommended practice.

CI example (GitHub Actions)

- name: Aegis audit
  env:
    ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}
  run: |
    npm install -g aegis-audit
    aegis audit ./contracts/ --ci --fail-on high --sarif results.sarif
- uses: github/codeql-action/upload-sarif@v3
  with:
    sarif_file: results.sarif

Security & threat model

This tool is itself part of your software supply chain. State-sponsored groups (e.g. Lazarus/BlueNoroff, MITRE G0032) actively target Web3 developer toolchains via malicious packages (T1195). Accordingly:

API keys are encrypted at rest (AES-256-GCM); enterprises should prefer a secrets manager.
--offline guarantees no source transmission.
The audit log is append-only and hash-chained — any edit to history is detectable via aegis config.
All detector patterns are bounded against regex denial-of-service.

Disclaimer

Aegis is an AI-assisted automated scanner. It is not a substitute for a professional manual audit, formal verification, or economic review. Automated analysis produces both false positives and false negatives. For high-value or production deployments, commission an independent human audit and, where applicable, formal verification of critical invariants.

Contributing

Contributions are welcome — see CONTRIBUTING.md. The most valuable contributions right now are new detectors (front-running/MEV, improved DoS) with corresponding benchmark fixtures.

License

Multi-AI Panel Mode (v3.1)

Run several AI auditors in parallel and have their findings cross-referenced by a consensus engine — agreement across independent models is a strong precision signal.

# Configure provider keys (stored encrypted)
aegis config

# Run the full panel: every configured provider + the local semantic engine
aegis audit ./contracts/MyToken.sol --panel

# Or pick specific providers
aegis audit ./contracts/MyToken.sol --provider anthropic,openai

Supported provider types: anthropic (Claude), openai (GPT), and any openai-compatible gateway (set a baseURL). Keys come from aegis config or the ANTHROPIC_API_KEY / OPENAI_API_KEY environment variables.

The panel produces:

Consensus clusters — each issue with how many scanners agreed and a confidence rating (HIGH = 3+ scanners, MEDIUM = 2, LOW = unique).
Panel discussion — a summary of where scanners converged, where they were split (review these), and unique leads.

The local semantic engine counts as one independent scanner, so the panel works even with a single AI provider configured.

Advanced Analysis (v3.2)

Aegis now combines five analysis techniques in one pass:

| Technique | Tier | What it adds | |---|---|---| | AST semantic model | 1–2 | Per-function guards, data flow, self-scope awareness | | Cross-function taint | 2 | Traces attacker-controlled input across function calls to dangerous sinks; only flags unsanitized flows | | Inheritance resolution | — | Recognizes guards (onlyOwner, nonReentrant) defined in base contracts, including OpenZeppelin | | Confidence calibration | — | Every finding carries an empirical precision score learned from the benchmark | | Formal verification | 5 | --formal runs solc SMTChecker to mathematically check assertions/overflow | | Fuzz scaffolding | 4 | --fuzz generates Foundry + Echidna invariant test files for your contract | | Multi-AI panel | — | --panel runs multiple AI auditors + consensus adjudication |

# Everything: semantic + taint + formal + fuzz scaffolds + AI panel
aegis audit ./contracts/Vault.sol --formal --fuzz --panel

# Fast deterministic pass (no AI, no external tools)
aegis audit ./contracts/Vault.sol --offline

What requires external tools

--formal needs solc installed (degrades gracefully if absent).
--fuzz generates test files; you run them with forge / echidna.

Everything else (semantic, taint, inheritance, calibration) runs with zero external dependencies.

Local AI Models via Ollama (v3.3) — free & private

Run the AI panel against local models (gemma, deepseek, llama, mistral, qwen…) with zero cost, no API key, and your source code never leaving the machine.

# 1. Install Ollama (ollama.com), then pull any model you like:
ollama pull qwen2.5-coder    # code-tuned, recommended; or gemma2, llama3.2, mistral

# 2. Run Aegis against it. Aegis auto-detects your installed model,
#    or name one explicitly with --model:
aegis audit ./contracts/Vault.sol --provider ollama
aegis audit ./contracts/Vault.sol --provider ollama --model qwen2.5-coder

# 3. Mix local + cloud in one panel for cross-model consensus:
aegis audit ./contracts/Vault.sol --panel   # runs every configured provider

No model name is hardcoded: with --provider ollama and no --model, Aegis asks Ollama which models you have installed and uses one (telling you the others, so you can pick with --model). A small model like gemma2:2b runs the pipeline but may miss or misclassify subtle bugs; a code-tuned model like qwen2.5-coder is stronger.

Ollama uses an OpenAI-compatible endpoint at http://localhost:11434/v1, so no key is required. To use a different local model, set it in your config (default is gemma2). Aegis gives a clear hint if Ollama isn't running or the model isn't pulled.

Why this matters: the local model becomes an independent scanner in the consensus panel. In testing, a local model caught a signature-replay bug (missing nonce) that the deterministic static layer alone could not — for free, fully offline.