muaddib-scanner

v2.6.1

Published

2 days ago

Supply-chain threat detection & response for npm & PyPI/Python


security npm pypi python supply-chain malware scanner typosquatting shai-hulud detection ast sarif sandbox threat-intelligence

Why MUAD'DIB?

npm and PyPI supply-chain attacks are exploding. Shai-Hulud compromised 25K+ repos in 2025. Existing tools detect threats but don't help you respond.

MUAD'DIB detects threats AND guides your response, even before they appear in any IOC database. It combines:

static analysis
deobfuscation engine (v2.2.5)
inter-module dataflow (v2.2.6)
per-file max scoring (v2.2.11)
dynamic analysis (Docker sandbox with monkey-patching preload for time-bomb detection, v2.4.9)
behavioral anomaly detection (v2.0)
ground truth validation (v2.1)
security audit (41 issues remediated, v2.5.0–v2.5.6)
audit hardening (v2.5.13–v2.5.14)
FP reduction P5/P6 (v2.5.15–v2.5.16)

Positioning

MUAD'DIB is an educational tool and a free first line of defense. It detects known npm and PyPI threats (225,000+ IOCs) and basic suspicious patterns.

For enterprise protection, use:

Socket.dev - ML behavioral analysis, cloud sandboxing
Snyk - Massive vulnerability database, CI/CD integrations
Opengrep - Advanced dataflow analysis, Semgrep rules

MUAD'DIB does not replace these tools. It complements them for devs who want a quick, free check before installing an unknown package.

Installation

npm (recommended)

npm install -g muaddib-scanner

From source

git clone https://github.com/DNSZLSK/muad-dib
cd muad-dib
npm install
npm link

Usage

Basic scan

muaddib scan .
muaddib scan /path/to/project

Scans both npm (package.json, node_modules) and Python (requirements.txt, setup.py, pyproject.toml) dependencies.

Interactive mode

muaddib

Launches an interactive menu to guide you through all features.

Safe install

muaddib install <package>
muaddib install lodash axios --save-dev
muaddib i express -g
muaddib install suspicious-pkg --force    # Force install despite threats

Scans packages for threats BEFORE installing. Blocks known malicious packages.

Risk score

Each scan displays a 0-100 risk score:

[SCORE] 58/100 [***********---------] HIGH
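
The bar in the sample line appears to be 20 cells wide, filled in proportion to the score. A minimal Python sketch of such a renderer follows. The function name `render_score`, the floor rounding, and passing the severity label in as a parameter are assumptions made for illustration, not MUAD'DIB's actual implementation.

```python
def render_score(score: int, label: str, width: int = 20) -> str:
    """Format a 0-100 risk score as a fixed-width text bar (hypothetical helper)."""
    score = max(0, min(100, score))   # clamp to the documented 0-100 range
    filled = score * width // 100     # floor rounding: 58 -> 11 of 20 cells
    bar = "*" * filled + "-" * (width - filled)
    return f"[SCORE] {score}/100 [{bar}] {label}"


print(render_score(58, "HIGH"))
# [SCORE] 58/100 [***********---------] HIGH
```

The label is passed in rather than derived, because the score thresholds behind each severity label are not stated here.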

Explain mode (full details)

muaddib scan . --explain

Shows for each detection:

Rule ID
MITRE ATT&CK technique
References (articles, CVEs)
Response playbook

Export

muaddib scan . --json > results.json     # JSON
muaddib scan . --html report.html        # HTML
muaddib scan . --sarif results.sarif     # SARIF (GitHub Security)

Severity threshold

muaddib scan . --fail-on critical  # Fail only on CRITICAL
muaddib scan . --fail-on high      # Fail on HIGH and CRITICAL (default)
muaddib scan . --fail-on medium    # Fail on MEDIUM, HIGH, CRITICAL
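
The threshold semantics above amount to "fail if any finding is at or above the chosen level". A small Python sketch of that comparison, assuming findings are reduced to a list of severity strings (the names `SEVERITY_ORDER` and `should_fail` are hypothetical):

```python
SEVERITY_ORDER = ["low", "medium", "high", "critical"]  # ascending severity


def should_fail(finding_severities, fail_on="high"):
    """Return True if any finding is at or above the fail-on threshold."""
    threshold = SEVERITY_ORDER.index(fail_on.lower())
    return any(
        SEVERITY_ORDER.index(severity.lower()) >= threshold
        for severity in finding_severities
    )


print(should_fail(["MEDIUM", "HIGH"], "critical"))  # False
print(should_fail(["MEDIUM", "HIGH"], "high"))      # True
print(should_fail(["MEDIUM"], "medium"))            # True
```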

Paranoid mode

muaddib scan . --paranoid

Ultra-strict detection with lower tolerance. Useful for critical projects. Detects any network access, subprocess execution, dynamic code evaluation, and sensitive file access.

Discord/Slack webhook

muaddib scan . --webhook "https://discord.com/api/webhooks/..."

Sends an alert with score and threats to Discord or Slack. Strict filtering (v2.1.2): alerts are only sent for IOC matches, sandbox-confirmed threats, or canary token exfiltration — reducing noise from heuristic-only detections.

Real-time monitoring

muaddib watch .

Daemon mode

muaddib daemon
muaddib daemon --webhook "https://discord.com/api/webhooks/..."

Automatically monitors all npm install commands and scans new packages.

Update IOCs (fast, ~5 seconds)

muaddib update

Loads the 225,000+ IOCs shipped in the package, then merges in YAML IOCs and additional GitHub sources (GenSecAI, DataDog). Run this after npm install for an instant IOC refresh.

Scrape IOCs (full, ~5 minutes)

muaddib scrape

Full refresh from all primary sources. Downloads OSV bulk dumps for npm and PyPI (~100-200MB), OSSF, and all other sources. Run this when you want the absolute latest data.

Sources:

OSV.dev npm dump - Bulk download of all MAL-* entries
OSV.dev PyPI dump - Bulk download of all PyPI MAL-* entries
GenSecAI Shai-Hulud 2.0 Detector - Consolidated list of 700+ Shai-Hulud packages
DataDog Security Labs - Consolidated IOCs from multiple vendors
OSSF Malicious Packages - OpenSSF database (8000+ reports via OSV.dev)
GitHub Advisory Database - Malware-tagged advisories
Snyk Known Malware - Historical malware packages
Static IOCs - Socket.dev, Phylum, npm-removed packages

Docker Sandbox

muaddib sandbox <package-name>
muaddib sandbox <package-name> --strict

Dynamic analysis: installs the package in an isolated Docker container and monitors runtime behavior via strace, tcpdump, and filesystem diffing.

Multi-layer monitoring:

System tracing (strace): file access, process spawns, syscall monitoring
Network capture (tcpdump): DNS resolutions with resolved IPs, HTTP requests (method, host, path, body), TLS SNI detection
Filesystem diff: snapshot before/after install, detects files created in suspicious locations
Data exfiltration detection: 16 sensitive patterns (tokens, credentials, SSH keys, private keys, .env)
CI-aware environment (v2.1.2): simulates CI environments (GITHUB_ACTIONS, GITLAB_CI, TRAVIS, CIRCLECI, JENKINS) to trigger CI-aware malware that would otherwise stay dormant
Enriched canary tokens (v2.1.2): 6 honeypot credentials injected as env vars (GITHUB_TOKEN, NPM_TOKEN, AWS_ACCESS_KEY_ID, AWS_SECRET_ACCESS_KEY, SLACK_WEBHOOK_URL, DISCORD_WEBHOOK_URL). If exfiltrated via network, DNS, or filesystem, triggers CRITICAL alert with +50 score
Monkey-patching preload (v2.4.9): Runtime instrumentation via NODE_OPTIONS=--require /opt/preload.js. Patches time APIs (Date.now, setTimeout→0, setInterval→immediate), intercepts network/filesystem/process/env calls. Multi-run mode at [0h, 72h, 7d] offsets to detect time-bomb malware (MITRE T1497.003)
Scoring engine: 0-100 risk score based on behavioral severity

Use --strict to block all non-essential outbound network traffic via iptables.

Requires Docker Desktop to be installed.

muaddib sandbox lodash          # Safe package
muaddib sandbox suspicious-pkg  # Analyze unknown package

Sandbox network report

muaddib sandbox-report <package-name>
muaddib sandbox-report <package-name> --strict

Same as sandbox but displays a detailed network report: DNS resolutions, HTTP requests, TLS connections, blocked connections (strict mode), and data exfiltration alerts.

Diff (compare versions)

muaddib diff <ref> [path]

Compare threats between the current version and a previous commit/tag. Shows only NEW threats introduced since the reference point.

muaddib diff HEAD~1             # Compare with previous commit
muaddib diff v1.2.0             # Compare with tag
muaddib diff main               # Compare with branch
muaddib diff abc1234            # Compare with specific commit

Example output:

[MUADDIB DIFF] Comparing abc1234 -> def5678

  Risk Score: 25 -> 45 (+20 worse)
  Threats:    3 -> 5

  NEW threats:     2
  REMOVED threats: 0
  Unchanged:       3

  NEW THREATS (introduced since v1.2.0)
  ------------------------------------
  1. [HIGH] suspicious_dependency
     Known malicious package detected
     File: package.json
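
The NEW / REMOVED / Unchanged counts in this output can be understood as set operations over two scans. The sketch below assumes each threat is identified by a `(rule, file)` pair, which is an illustrative choice, and all rule names other than `suspicious_dependency` are made up for the example.

```python
def diff_threats(old, new):
    """Split threats into new, removed, and unchanged, keyed by (rule, file)."""
    old_keys, new_keys = set(old), set(new)
    return {
        "new": sorted(new_keys - old_keys),
        "removed": sorted(old_keys - new_keys),
        "unchanged": sorted(old_keys & new_keys),
    }


before = [("eval_usage", "a.js"), ("env_access", "b.js"), ("obfuscation", "c.js")]
after = before + [
    ("suspicious_dependency", "package.json"),
    ("lifecycle_script", "package.json"),
]
result = diff_threats(before, after)
print(len(result["new"]), len(result["removed"]), len(result["unchanged"]))  # 2 0 3
```

In diff mode only the `new` bucket would count toward the `--fail-on` decision, which is what lets CI ignore pre-existing findings.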

Use in CI to only fail on new threats, not existing technical debt:

- run: muaddib diff ${{ github.event.pull_request.base.sha }} --fail-on high

Pre-commit hooks

muaddib init-hooks [options]

Automatically scan before each commit. Supports multiple hook systems:

muaddib init-hooks                        # Auto-detect (husky/pre-commit/git)
muaddib init-hooks --type husky           # Force husky
muaddib init-hooks --type pre-commit      # Force pre-commit framework
muaddib init-hooks --type git             # Force native git hooks
muaddib init-hooks --mode diff            # Only block NEW threats

With pre-commit framework

Add to .pre-commit-config.yaml:

repos:
  - repo: https://github.com/DNSZLSK/muad-dib
    rev: v2.5.17
    hooks:
      - id: muaddib-scan        # Scan all threats
      # - id: muaddib-diff      # Or: only new threats
      # - id: muaddib-paranoid  # Or: ultra-strict mode

With husky

npx husky add .husky/pre-commit "npx muaddib scan . --fail-on high"
# Or for diff mode:
npx husky add .husky/pre-commit "npx muaddib diff HEAD --fail-on high"

Remove hooks

muaddib remove-hooks [path]

Removes all MUAD'DIB hooks (husky and git native).

Native git hooks

muaddib init-hooks --type git
# Creates .git/hooks/pre-commit

Zero-Day Monitor

MUAD'DIB continuously monitors the npm and PyPI registries for new packages in real time, scanning each one automatically with Docker sandbox analysis and webhook alerting. This runs internally on our infrastructure; detected threats feed into the IOC database and threat feed API.

Score breakdown

muaddib scan . --breakdown

Shows explainable score breakdown: how each finding contributes to the final risk score, with per-rule weights and severity multipliers.

Ground truth replay

muaddib replay
muaddib ground-truth

Replay real-world supply-chain attacks against the scanner to validate detection coverage. Current results: 46/49 detected (93.9% TPR) from 51 samples (49 active).

3 out-of-scope misses: lottie-player, polyfill-io, trojanized-jquery (browser-only DOM attacks).

Version check

MUAD'DIB automatically checks for new versions on startup and notifies you if an update is available.

Features

Python / PyPI support

MUAD'DIB automatically detects and scans Python projects:

requirements.txt - All formats including -r recursive includes, extras, environment markers
setup.py - Extracts install_requires and setup_requires
pyproject.toml - PEP 621 dependencies and Poetry dependencies

Python packages are checked against 14,000+ known malicious PyPI packages (from OSV.dev) and tested for typosquatting against popular PyPI packages (requests, numpy, flask, django, pandas, etc.) using PEP 503 name normalization.

[PYTHON] Detected Python project (3 dependency files)
  requirements.txt: 12 packages
  setup.py: 3 packages
  pyproject.toml: 8 packages

[CRITICAL] PyPI IOC match: malicious-pkg (all versions)
[HIGH] PyPI typosquat: "reqeusts" looks like "requests"
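
PEP 503 normalization collapses runs of `-`, `_`, and `.` into a single `-` and lowercases the name, so differently spelled variants of the same project compare equal before IOC or typosquat matching. The one-liner below follows the PEP; the function name `normalize_name` is only illustrative.

```python
import re


def normalize_name(name: str) -> str:
    """Normalize a project name as described in PEP 503."""
    return re.sub(r"[-_.]+", "-", name).lower()


print(normalize_name("Flask_SQLAlchemy"))  # flask-sqlalchemy
print(normalize_name("zope.interface"))    # zope-interface
```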

Typosquatting detection

MUAD'DIB detects packages with names similar to popular packages (npm and PyPI):

[HIGH] Package "lodahs" looks like "lodash" (swapped_chars). Possible typosquatting.
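
The techniques table lists Levenshtein distance as the basis for this check. The sketch below is a textbook edit-distance implementation with a tiny illustrative package list. The `max_distance` threshold of 2 and the helper name `closest_popular` are assumptions, not MUAD'DIB's actual settings. Under plain Levenshtein, two swapped characters cost 2 edits.

```python
def levenshtein(a: str, b: str) -> int:
    """Classic dynamic-programming edit distance (insert, delete, substitute)."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution
            ))
        previous = current
    return previous[-1]


POPULAR = ["lodash", "express", "requests"]  # illustrative list only


def closest_popular(name: str, max_distance: int = 2):
    """Return the first popular package within max_distance of name, if any."""
    for target in POPULAR:
        if name != target and levenshtein(name, target) <= max_distance:
            return target
    return None


print(closest_popular("lodahs"))    # lodash
print(closest_popular("reqeusts"))  # requests
```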

Dataflow analysis

Detects when code reads credentials AND sends them over the network:

[CRITICAL] Suspicious flow: credential read (readFileSync, GITHUB_TOKEN) + network send (fetch)
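
As a rough illustration of source-sink pairing, the sketch below flags a file when its identifiers include both a credential source and a network sink. This is a simple co-occurrence check, far cruder than real taint tracking over an AST, and the source and sink sets and the function name `suspicious_flows` are assumptions for the example.

```python
CREDENTIAL_SOURCES = {"readFileSync", "GITHUB_TOKEN", "NPM_TOKEN"}  # sample sources
NETWORK_SINKS = {"fetch", "https.request"}                          # sample sinks


def suspicious_flows(identifiers_by_file):
    """Flag files that contain both a credential source and a network sink."""
    flagged = {}
    for path, identifiers in identifiers_by_file.items():
        sources = sorted(CREDENTIAL_SOURCES & set(identifiers))
        sinks = sorted(NETWORK_SINKS & set(identifiers))
        if sources and sinks:
            flagged[path] = (sources, sinks)
    return flagged


report = suspicious_flows({
    "index.js": ["readFileSync", "GITHUB_TOKEN", "fetch"],
    "util.js": ["fetch"],  # network use alone is not flagged
})
print(report)  # {'index.js': (['GITHUB_TOKEN', 'readFileSync'], ['fetch'])}
```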

GitHub Actions scanning

Detects malicious patterns in .github/workflows/ YAML files, including Shai-Hulud 2.0 backdoor indicators.

Detected attacks

| Campaign | Packages | Status |
|----------|----------|--------|
| Shai-Hulud v1 (Sept 2025) | @ctrl/tinycolor, ng2-file-upload | Detected |
| Shai-Hulud v2 (Nov 2025) | @asyncapi/specs, posthog-node, kill-port | Detected |
| Shai-Hulud v3 (Dec 2025) | @vietmoney/react-big-calendar | Detected |
| event-stream (2018) | flatmap-stream, event-stream | Detected |
| eslint-scope (2018) | eslint-scope | Detected |
| Protestware | node-ipc, colors, faker | Detected |
| Typosquats | crossenv, mongose, babelcli | Detected |

Detected techniques

| Technique | MITRE | Detection |
|-----------|-------|-----------|
| Credential theft (.npmrc, .ssh) | T1552.001 | AST |
| Env var exfiltration | T1552.001 | AST |
| Remote code execution | T1105 | Pattern |
| Reverse shell | T1059.004 | Pattern |
| Dead man's switch | T1485 | Pattern |
| Obfuscated code | T1027 | Heuristics |
| JS obfuscation patterns | T1027.002 | Pattern detection |
| Shannon entropy (strings) | T1027 | Entropy calculation |
| Typosquatting (npm + PyPI) | T1195.002 | Levenshtein |
| Supply chain compromise | T1195.002 | IOC matching |
| PyPI malicious package | T1195.002 | IOC matching |
| Sandbox dynamic analysis | Multiple | Docker + strace + tcpdump |
| Sudden lifecycle script addition | T1195.002 | Temporal analysis |
| Dangerous API injection between versions | T1195.002 | Temporal AST diff |
| Publish frequency anomaly | T1195.002 | Registry metadata |
| Maintainer/publisher change | T1195.002 | Registry metadata |
| Canary token exfiltration | T1552.001 | Sandbox honey tokens |
| AI agent weaponization | T1059.004 | AST (s1ngularity/Nx flags) |
| AI config prompt injection | T1059.004 | File scanning (.cursorrules, CLAUDE.md) |
| Credential CLI theft (gh, gcloud, aws) | T1552.001 | AST |
| Binary dropper (chmod + exec /tmp) | T1105 | AST |
| Prototype hooking (fetch, XMLHttpRequest) | T1557 | AST |
| Workflow injection (.github/workflows) | T1195.002 | AST |
| Crypto wallet harvesting | T1005 | Dataflow |
| Require cache poisoning | T1574.001 | AST |
| Staged eval decode (eval+atob/Buffer) | T1140 | AST |
| Deobfuscation (string concat, charcode, base64, hex) | T1140 | AST pre-processing |
| Cross-file dataflow (inter-module exfiltration) | T1041 | Module graph |
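
The "Shannon entropy (strings)" row relies on the observation that encoded or encrypted payloads have a flatter character distribution than ordinary source text. A minimal sketch follows, using the 5.5-bit and 50-character thresholds quoted in the Architecture section. The function names are hypothetical, and whether the real scanner applies the thresholds exactly this way is an assumption.

```python
import math
from collections import Counter


def shannon_entropy(text: str) -> float:
    """Entropy in bits per character of the string's character distribution."""
    if not text:
        return 0.0
    total = len(text)
    return -sum(
        (n / total) * math.log2(n / total) for n in Counter(text).values()
    )


def looks_encoded(text: str, min_bits: float = 5.5, min_length: int = 50) -> bool:
    """Flag long strings whose entropy meets the threshold."""
    return len(text) >= min_length and shannon_entropy(text) >= min_bits


print(shannon_entropy("abab"))   # 1.0
print(looks_encoded("a" * 100))  # False
```

Reaching 5.5 bits per character requires at least 46 distinct symbols, which is why a minimum length matters: short strings cannot reach the threshold at all.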

Supply Chain Anomaly Detection (v2.0)

MUAD'DIB 2.0 introduces a paradigm shift: from IOC-based detection (reactive, requires known threats) to behavioral anomaly detection (proactive, detects unknown threats by spotting suspicious changes).

Traditional supply-chain scanners rely on blocklists of known malicious packages. The problem: they can only detect threats AFTER they've been identified and reported. Attacks like ua-parser-js (2021), event-stream (2018), and Shai-Hulud (2025) went undetected for hours, days, or (in the case of event-stream) weeks because no IOC existed yet.

MUAD'DIB 2.0 adds 5 behavioral detection features that can catch these attacks before they appear in any IOC database, by analyzing what changed between package versions.

New features

1. Sudden Lifecycle Script Detection (`--temporal`)

Detects when preinstall, install, or postinstall scripts suddenly appear in a new version of a package that never had them before. This is the #1 attack vector for supply-chain attacks.

muaddib scan . --temporal
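
The core of this check can be pictured as a comparison of the `scripts` field across two versions of a package.json manifest. The sketch below assumes both manifests are already loaded as dictionaries; the function name `added_lifecycle_scripts` and the sample manifests are made up for illustration.

```python
LIFECYCLE_HOOKS = ("preinstall", "install", "postinstall")


def added_lifecycle_scripts(previous_manifest: dict, current_manifest: dict) -> dict:
    """Return lifecycle scripts present in the current manifest but absent before."""
    before = previous_manifest.get("scripts", {})
    after = current_manifest.get("scripts", {})
    return {
        hook: after[hook]
        for hook in LIFECYCLE_HOOKS
        if hook in after and hook not in before
    }


old = {"name": "demo", "version": "1.0.0", "scripts": {"test": "jest"}}
new = {
    "name": "demo",
    "version": "1.0.1",
    "scripts": {"test": "jest", "postinstall": "node setup.js"},
}
print(added_lifecycle_scripts(old, new))  # {'postinstall': 'node setup.js'}
```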

2. Temporal AST Diff (`--temporal-ast`)

Downloads the two latest versions of each dependency and compares their AST (Abstract Syntax Tree) to detect newly added dangerous APIs: child_process, eval, Function, net.connect, process.env, fetch, etc.

muaddib scan . --temporal-ast

3. Publish Frequency Anomaly (`--temporal-publish`)

Detects abnormal publishing patterns: burst of versions in 24h, dormant package suddenly updated after 6+ months, rapid version succession (multiple releases in under 1h).

muaddib scan . --temporal-publish
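
The three patterns above can be expressed as checks over a package's publish timestamps. In the sketch below, the 24-hour window, 6-month gap, and 1-hour gap come from the description; the burst size of 3 versions, the 183-day approximation of six months, and the label strings are assumptions.

```python
from datetime import datetime, timedelta

DORMANT_GAP = timedelta(days=183)  # roughly six months (assumed cutoff)


def publish_anomalies(publish_times, burst_count=3):
    """Classify publish timestamps (sorted ascending) into anomaly labels."""
    anomalies = set()
    for earlier, later in zip(publish_times, publish_times[1:]):
        gap = later - earlier
        if gap < timedelta(hours=1):
            anomalies.add("rapid_succession")
        if gap >= DORMANT_GAP:
            anomalies.add("dormant_spike")
    for i, start in enumerate(publish_times):
        # Count releases falling within 24 hours of this one.
        window = [t for t in publish_times[i:] if t - start <= timedelta(hours=24)]
        if len(window) >= burst_count:
            anomalies.add("publish_burst")
    return anomalies


times = [
    datetime(2024, 1, 1),
    datetime(2024, 9, 1, 12, 0),
    datetime(2024, 9, 1, 12, 20),
    datetime(2024, 9, 1, 18, 0),
]
print(sorted(publish_anomalies(times)))
# ['dormant_spike', 'publish_burst', 'rapid_succession']
```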

4. Maintainer Change Detection (`--temporal-maintainer`)

Detects changes in package maintainers between versions: new maintainer added, sole maintainer replaced (event-stream pattern), suspicious maintainer names, new publisher.

muaddib scan . --temporal-maintainer

5. Canary Tokens / Honey Tokens (sandbox)

Injects fake credentials into the sandbox environment before installing a package. If the package attempts to exfiltrate these honey tokens via HTTP, DNS, filesystem, or stdout, it's flagged as confirmed malicious.

6 honeypot credentials are injected:

GITHUB_TOKEN / NPM_TOKEN — Package registry tokens
AWS_ACCESS_KEY_ID / AWS_SECRET_ACCESS_KEY — Cloud credentials
SLACK_WEBHOOK_URL / DISCORD_WEBHOOK_URL — Messaging webhooks

Both dynamic tokens (random per session, from canary-tokens.js) and static fallback tokens (in sandbox-runner.sh) are used for defense in depth.

muaddib sandbox suspicious-package
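
The idea behind dynamic canary tokens can be sketched in a few lines: generate a unique fake value per credential for each session, then search everything the sandbox captured for those values. The `canary-` value format and the function names are assumptions; the actual canary-tokens.js may generate and match values differently.

```python
import secrets

CANARY_NAMES = [
    "GITHUB_TOKEN", "NPM_TOKEN", "AWS_ACCESS_KEY_ID",
    "AWS_SECRET_ACCESS_KEY", "SLACK_WEBHOOK_URL", "DISCORD_WEBHOOK_URL",
]


def make_canaries() -> dict:
    """Generate one random, session-unique fake value per credential name."""
    return {name: f"canary-{secrets.token_hex(16)}" for name in CANARY_NAMES}


def leaked_canaries(canaries: dict, captured_output: str) -> list:
    """Return the names of canaries whose value appears in captured output."""
    return [name for name, value in canaries.items() if value in captured_output]


canaries = make_canaries()
capture = f"POST /collect HTTP/1.1\r\n\r\ntoken={canaries['NPM_TOKEN']}"
print(leaked_canaries(canaries, capture))  # ['NPM_TOKEN']
```

A plain substring match like this would miss a token that is encoded (for example base64) before being sent, so it is best read as the simplest possible version of the check.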

Full temporal scan

Enable all temporal analysis features at once:

muaddib scan . --temporal-full

Usage examples

# Full temporal scan (all four --temporal* checks)
muaddib scan . --temporal-full

# Only lifecycle script detection
muaddib scan . --temporal

# AST diff + maintainer change
muaddib scan . --temporal-ast --temporal-maintainer

# Sandbox with canary tokens (enabled by default)
muaddib sandbox suspicious-package

# Sandbox without canary tokens
muaddib sandbox suspicious-package --no-canary

New detection rules (v2.0)

| Rule ID | Name | Severity | Feature |
|---------|------|----------|---------|
| MUADDIB-TEMPORAL-001 | Sudden Lifecycle Script Added (Critical) | CRITICAL | --temporal |
| MUADDIB-TEMPORAL-002 | Sudden Lifecycle Script Added | HIGH | --temporal |
| MUADDIB-TEMPORAL-003 | Lifecycle Script Modified | MEDIUM | --temporal |
| MUADDIB-TEMPORAL-AST-001 | Dangerous API Added (Critical) | CRITICAL | --temporal-ast |
| MUADDIB-TEMPORAL-AST-002 | Dangerous API Added (High) | HIGH | --temporal-ast |
| MUADDIB-TEMPORAL-AST-003 | Dangerous API Added (Medium) | MEDIUM | --temporal-ast |
| MUADDIB-PUBLISH-001 | Publish Burst Detected | HIGH | --temporal-publish |
| MUADDIB-PUBLISH-002 | Dormant Package Spike | HIGH | --temporal-publish |
| MUADDIB-PUBLISH-003 | Rapid Version Succession | MEDIUM | --temporal-publish |
| MUADDIB-MAINTAINER-001 | New Maintainer Added | HIGH | --temporal-maintainer |
| MUADDIB-MAINTAINER-002 | Suspicious Maintainer Detected | CRITICAL | --temporal-maintainer |
| MUADDIB-MAINTAINER-003 | Sole Maintainer Changed | HIGH | --temporal-maintainer |
| MUADDIB-MAINTAINER-004 | New Publisher Detected | MEDIUM | --temporal-maintainer |
| MUADDIB-CANARY-001 | Canary Token Exfiltration | CRITICAL | sandbox |

Why it matters

These features detect attacks like:

Shai-Hulud (2025): Would be caught by temporal lifecycle + AST diff (sudden postinstall + child_process added)
ua-parser-js (2021): Would be caught by maintainer change + lifecycle script detection
event-stream (2018): Would be caught by sole maintainer change + AST diff (new flatmap-stream dependency with eval)
coa/rc (2021): Would be caught by publish burst + lifecycle script detection

All without needing a single IOC entry.

IOC Sources

MUAD'DIB aggregates threat intelligence from verified sources only:

| Source | Type | Coverage |
|--------|------|----------|
| OSV.dev npm dump | Bulk zip | 200,000+ npm MAL-* entries |
| OSV.dev PyPI dump | Bulk zip | 14,000+ PyPI MAL-* entries |
| GenSecAI Shai-Hulud Detector | GitHub | 700+ Shai-Hulud packages |
| DataDog Security Labs | GitHub | Consolidated IOCs from 7 vendors |
| OSSF Malicious Packages | OSV API | 8000+ malware reports |
| GitHub Advisory | OSV API | Malware-tagged advisories |
| Snyk Known Malware | Static | Historical attacks |
| Socket.dev / Phylum | Static | Manual additions |

VS Code

The VS Code extension automatically scans your npm projects.

Installation

Search "MUAD'DIB" in VS Code Extensions, or:

code --install-extension dnszlsk.muaddib-vscode

Commands

MUAD'DIB: Scan Project - Scan entire project
MUAD'DIB: Scan Current File - Scan current file

Settings

muaddib.autoScan - Auto-scan on project open (default: true)
muaddib.webhookUrl - Discord/Slack webhook URL
muaddib.failLevel - Alert level (critical/high/medium/low)

CI/CD

GitHub Actions (Marketplace)

Use the official MUAD'DIB action from the GitHub Marketplace:

name: Security Scan

on: [push, pull_request]

jobs:
  scan:
    runs-on: ubuntu-latest
    permissions:
      security-events: write
      contents: read
    steps:
      - uses: actions/checkout@v4
      - uses: DNSZLSK/muad-dib@v1
        with:
          path: '.'
          fail-on: 'high'
          sarif: 'results.sarif'

Action Inputs

| Input | Description | Default |
|-------|-------------|---------|
| path | Path to scan | . |
| fail-on | Minimum severity to fail (critical/high/medium/low) | high |
| sarif | Path for SARIF output file | `` |
| paranoid | Enable ultra-strict detection | false |

Action Outputs

| Output | Description |
|--------|-------------|
| sarif-file | Path to generated SARIF file |
| risk-score | Risk score (0-100) |
| threats-count | Number of threats detected |
| exit-code | Exit code (0 = clean) |

Alerts appear in Security > Code scanning alerts.

Architecture

MUAD'DIB 2.6.1 Scanner
|
+-- IOC Match (225,000+ packages, JSON DB)
|   +-- OSV.dev npm dump (200K+ MAL-* entries)
|   +-- OSV.dev PyPI dump (14K+ MAL-* entries)
|   +-- GenSecAI Shai-Hulud Detector
|   +-- DataDog Consolidated IOCs
|   +-- OSSF Malicious Packages (via OSV)
|   +-- GitHub Advisory (malware)
|   +-- Snyk Known Malware
|   +-- Static IOCs (Socket, Phylum)
|
+-- Deobfuscation Pre-processing (v2.2.5, --no-deobfuscate to disable)
|   +-- String concat folding, CharCode reconstruction
|   +-- Base64 decode, Hex array resolution
|   +-- Const propagation (Phase 2)
|
+-- Inter-module Dataflow (v2.2.6, --no-module-graph to disable)
|   +-- Module dependency graph, tainted export annotation
|   +-- 3-hop re-export chains, class method analysis
|   +-- Cross-file credential read -> network sink detection
|
+-- Intent Coherence Analysis (v2.6.0)
|   +-- Intra-file source-sink pairing (credential read + eval/network in same file)
|   +-- Cross-file detection delegated to module-graph (proven taint paths only)
|   +-- LOW severity threats excluded (respects FP reductions)
|
+-- 14 Parallel Scanners (129 rules)
|   +-- AST Parse (acorn) — eval/Function, credential CLI theft, binary droppers, prototype hooks
|   +-- Pattern Matching (shell, scripts)
|   +-- Obfuscation Detection (skip .min.js, ignore hex/unicode alone)
|   +-- Typosquat Detection (npm + PyPI, Levenshtein)
|   +-- Python Scanner (requirements.txt, setup.py, pyproject.toml)
|   +-- Shannon Entropy (string-level, 5.5 bits + 50 chars min)
|   +-- JS Obfuscation Patterns (_0x* vars, encoded arrays, eval+entropy)
|   +-- GitHub Actions Scanner
|   +-- AI Config Scanner (.cursorrules, CLAUDE.md, copilot-instructions.md)
|   +-- Package, Dependencies, Hash, npm-registry, Dataflow scanners
|
+-- Supply Chain Anomaly Detection (v2.0)
|   +-- Temporal Lifecycle Script Detection (--temporal)
|   +-- Temporal AST Diff (--temporal-ast)
|   +-- Publish Frequency Anomaly (--temporal-publish)
|   +-- Maintainer Change Detection (--temporal-maintainer)
|   +-- Canary Tokens / Honey Tokens (sandbox)
|
+-- Validation & Observability (v2.1)
|   +-- Datadog 17K Benchmark (88.2% raw, ~100% JS/Node.js adjusted)
|   +-- Ground Truth Dataset (51 real-world attacks, 93.9% TPR)
|   +-- Detection Time Logging (first_seen tracking, lead time metrics)
|   +-- FP Rate Tracking (daily stats, false positive rate)
|   +-- Score Breakdown (explainable per-rule scoring)
|   +-- Threat Feed API (HTTP server, JSON feed for SIEM)
|
+-- FP Reduction Post-processing (v2.2.8-v2.3.1, v2.5.7-v2.5.8, v2.5.15-v2.5.16)
|   +-- Count-based severity downgrade (dynamic_require, dataflow, module_compile, etc.)
|   +-- Framework prototype scoring cap + HTTP client whitelist
|   +-- Obfuscation in dist/build/.cjs/.mjs/.js >100KB → LOW
|   +-- Safe env var + prefix filtering + DATAFLOW_SAFE_ENV_VARS
|   +-- Dataflow telemetry source categorization (os.platform/arch → telemetry_read)
|   +-- DEP whitelist (es5-ext, bootstrap-sass) + npm alias skip
|   +-- IOC wildcard audit (v2.5.8): FPR 10.8% → 6.0%
|   +-- P5 heuristic precision (v2.5.15): 7 fixes
|   +-- P6 compound detection precision (v2.5.16): 6 fixes
|
+-- Per-File Max Scoring (v2.2.11)
|   +-- Score = max(file_scores) + package_level_score
|   +-- Eliminates score accumulation across many files
|   +-- Package-level threats (lifecycle, typosquat, IOC) scored separately
|
+-- Sandbox Monkey-Patching Preload (v2.4.9)
|   +-- Runtime time manipulation (Date.now, setTimeout→0, setInterval→immediate)
|   +-- Network/filesystem/process/env interception and logging
|   +-- Multi-run [0h, 72h, 7d] for time-bomb detection (T1497.003)
|
+-- Security Audit (v2.5.0-v2.5.6)
|   +-- 41 issues remediated (14 CRITICAL, 18 HIGH, 9 MEDIUM)
|   +-- Native addon path traversal, atomic writes, AST bypasses
|
+-- Audit Hardening (v2.5.13-v2.5.14)
|   +-- Scoring: plugin loader threshold, lifecycle CRITICAL floor, percentage guard 40%
|   +-- AST: eval alias, globalThis indirect, require(obj.prop), variable reassignment
|   +-- Dataflow: Promise .then() tainting, JSON taint propagation
|   +-- Shell: mkfifo+nc, base64|bash, wget+base64 (3 new patterns)
|   +-- Entropy: fragment cluster, windowed analysis
|   +-- 8 new rules (SHELL-013 to 015, ENTROPY-004, +4 audit fixes)
|
+-- Paranoid Mode (ultra-strict)
+-- Docker Sandbox (behavioral analysis, network capture, canary tokens, CI-aware, preload)
+-- Zero-Day Monitor (internal: npm + PyPI RSS polling, Discord alerts, daily report)
|
v
Dataflow Analysis (credential read -> network send)
|
v
Threat Enrichment (rules, MITRE ATT&CK, playbooks)
|
v
Output (CLI, JSON, HTML, SARIF, Webhook, Threat Feed)

Evaluation Metrics

| Metric | Result | Details |
|--------|--------|---------|
| Wild TPR (Datadog 17K) | 88.2% raw · ~100% adjusted | 17,922 real malware samples. 2,077 misses are all out-of-scope (see below) |
| TPR (Ground Truth) | 93.9% (46/49) | 51 real-world attacks (49 active). 3 out-of-scope: browser-only (3) |
| FPR (Benign, global) | 12.3% (65/532) | 532 npm packages, real source code via npm pack, threshold > 20 |
| ADR (Adversarial + Holdout) | 97.3% (73/75) | 53 adversarial + 40 holdout evasive samples (75 available on disk). 2 misses: require-cache-poison (P3 trade-off), getter-defineProperty-exfil |

Datadog 17K benchmark — DataDog Malicious Software Packages Dataset, 17,922 real malware samples (npm). Raw TPR: 88.2% (15,810/17,922). The 2,077 misses (score=0) were manually categorized:

| Category | Count | Reason |
|----------|-------|--------|
| Phishing pages (HTML/CSS/JS frontend) | 1,233 | No Node.js APIs (no require, child_process, fs, process.env). Fake login pages, redirects, captchas. |
| Native binaries (no JS files) | 824 | Platform-specific binaries (darwin-arm64, linux-x64, etc.). 201 from @42ailab alone. |
| Corrected libraries | 20 | Temporarily compromised then fixed. Malicious code removed before scan. |

All 2,077 misses lack Node.js malware patterns. MUAD'DIB performs AST-based Node.js static analysis — phishing HTML and native binaries are out of scope. Adjusted TPR on JS/Node.js malware: ~100% (15,810/~15,845). See Evaluation Methodology.
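
The headline rates can be recomputed from the counts quoted in this section. The snippet below is only a worked check of that arithmetic; the helper name `rate` is illustrative.

```python
def rate(hits: int, total: int) -> float:
    """Percentage rounded to one decimal place."""
    return round(100 * hits / total, 1)


print(rate(15_810, 17_922))  # 88.2  raw TPR on the Datadog dataset
print(rate(46, 49))          # 93.9  ground-truth TPR
print(rate(73, 75))          # 97.3  adversarial detection rate

in_scope = 17_922 - 2_077    # 15,845 samples once out-of-scope ones are excluded
print(rate(15_810, in_scope))  # 99.8  adjusted TPR, reported as ~100%
```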

FPR by package size: FPR rises with package size, from 6.2% for small packages to 40.3% for very large ones. Per-file max scoring (v2.2.11) reduces FP on medium/large packages:

| Category | Packages | FP | FPR |
|----------|----------|-----|-----|
| Small (<10 JS files) | 290 | 18 | 6.2% |
| Medium (10-50 JS files) | 135 | 16 | 11.9% |
| Large (50-100 JS files) | 40 | 10 | 25.0% |
| Very large (100+ JS files) | 62 | 25 | 40.3% |
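
The Architecture section gives the per-file max formula as `Score = max(file_scores) + package_level_score`. A small sketch shows why this stops large packages from accumulating score across many files. The clamp to 100 and the function name `package_score` are assumptions, based only on the documented 0-100 range.

```python
def package_score(file_scores: dict, package_level_score: int = 0) -> int:
    """Score = max(file_scores) + package_level_score, clamped to 0-100."""
    worst_file = max(file_scores.values(), default=0)
    return min(100, worst_file + package_level_score)


scores = {"index.js": 10, "lib/a.js": 15, "lib/b.js": 12}
print(package_score(scores))      # 15, where summing the files would give 37
print(package_score(scores, 25))  # 40
```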

FPR progression:

| Version | FPR | Notes |
|---------|-----|-------|
| v2.2.0-v2.2.6 | 0% | Invalid measurement (empty dirs) |
| v2.2.7 | 38% | First real measurement |
| v2.2.8 | 19.4% | |
| v2.2.9 | 17.5% | |
| v2.2.11 | ~13% | Per-file max scoring |
| v2.3.0 | 8.9% | P2 |
| v2.3.1 | 7.4% | P3 |
| v2.5.8 | 6.0% | P4 + IOC wildcard audit |
| v2.5.14 | ~13.6% | Audit hardening added stricter detection |
| v2.5.16 | 12.3% | P5 + P6 |
| v2.6.0 | 12.3% | Intent graph v2, zero FP added |
| v2.6.1 | 12.3% | Module-graph bounded path, zero FP added |

Note on FPR evolution: The historic 6.0% FPR (v2.5.8) relied on a BENIGN_PACKAGE_WHITELIST that excluded certain known packages from scoring — a data leakage bias removed in v2.5.10. The current 12.3% FPR is an honest measurement without whitelisting, against 532 real benign packages. The intent graph (v2.6.0) adds zero false positives by using intra-file pairing only and excluding LOW-severity threats.

Holdout progression (pre-tuning scores, rules frozen):

| Holdout | Score | Focus |
|---------|-------|-------|
| v1 | 30% (3/10) | General patterns |
| v2 | 40% (4/10) | Env charcode, lifecycle, prototype |
| v3 | 60% (6/10) | Require cache, DNS TXT, reverse shell |
| v4 | 80% (8/10) | Deobfuscation effectiveness |
| v5 | 50% (5/10) | Inter-module dataflow (new scanner) |

Wild TPR (Datadog Benchmark): detection rate on 17,922 real malware packages from the DataDog Malicious Software Packages Dataset. Raw 88.2% (15,810/17,922). Adjusted ~100% on JS/Node.js malware when excluding out-of-scope samples (1,233 phishing HTML pages, 824 native binaries, 20 corrected libraries). See Evaluation Methodology.
TPR (True Positive Rate): detection rate on 49 real-world supply-chain attacks (event-stream, ua-parser-js, coa, flatmap-stream, eslint-scope, solana-web3js, and 43 more). 3 misses are browser-only (lottie-player, polyfill-io, trojanized-jquery) — see Threat Model.
FPR (False Positive Rate): packages scoring > 20 out of 529 real npm packages (source code scanned, not empty dirs).
ADR (Adversarial Detection Rate): detection rate on 120 evasive malicious samples — 53 adversarial + 40 holdout (6 adversarial waves + 4 holdout batches). 75 available on disk. 2 misses on available samples: require-cache-poison (P3 trade-off), getter-defineProperty-exfil.
Holdout (pre-tuning): detection rate on 10 unseen samples with rules frozen (measures generalization)

Datasets: 17,922 Datadog malware samples, 532 npm + 132 PyPI benign packages, 120 adversarial/holdout samples (75 available on disk), 51 ground-truth attacks (65 documented malware packages). 1932 tests, 86% code coverage.

See Evaluation Methodology for the full experimental protocol.

Contributing

Add IOCs

Edit YAML files in iocs/:

- id: NEW-MALWARE-001
  name: "malicious-package"
  version: "*"
  severity: critical
  confidence: high
  source: community
  description: "Threat description"
  references:
    - https://example.com/article
  mitre: T1195.002

Development

git clone https://github.com/DNSZLSK/muad-dib
cd muad-dib
npm install
npm test

Testing

1932 unit/integration tests across 44 modular test files - 86% code coverage via Codecov
56 fuzz tests - Malformed YAML, invalid JSON, binary files, ReDoS, unicode, 10MB inputs
Datadog 17K benchmark - 17,922 real malware samples, 88.2% raw TPR, ~100% on JS/Node.js malware (2,077 out-of-scope misses: phishing, binaries, corrected libs)
120 adversarial/holdout samples - 53 adversarial + 40 holdout (75 available on disk), 73/75 detection rate (97.3% ADR). 2 misses: require-cache-poison (P3 trade-off), getter-defineProperty-exfil
Ground truth validation - 51 real-world attacks (46/49 detected = 93.9% TPR). 3 out-of-scope: browser-only (lottie-player, polyfill-io, trojanized-jquery)
False positive validation - 12.3% FPR global (65/532) on real npm source code via npm pack
ESLint security audit - eslint-plugin-security with 14 rules enabled

Community

Discord: https://discord.gg/y8zxSmue

Documentation

Evaluation Methodology - Experimental protocol, raw holdout scores, attack sources
Threat Model - What MUAD'DIB detects and doesn't detect
Security Audit Report v1.4.1 - Full security audit (58 issues fixed)
IOCs YAML - Threat database

License

MIT
