codedrift

v1.2.12

Published

2 months ago

Guardrails for AI-assisted development - Detects IDOR, missing input validation, hardcoded secrets, and other critical bugs in AI-generated code

Downloads

170

CodeDrift

The integrity layer for AI-generated code. Catches the bugs that Copilot, Cursor, ChatGPT and Claude silently ship to production.

AI coding assistants are writing more production code every month. They're fast, fluent and confidently wrong. They generate async functions that never complete, import packages that don't exist, leak secrets in error responses, introduce taint flows across module boundaries, and pass code review because the code looks correct.

CodeDrift is the safety layer between AI-generated code and production. It performs cross-file taint analysis, CFG-driven data flow tracking, and semantic pattern detection to catch the class of bugs that are syntactically valid but semantically dangerous — the ones that ESLint, TypeScript and human reviewers miss because the code reads fine.

Why This Exists

AI Code Is Scaling Faster Than Human Review

Every team using Copilot, Cursor, Claude or ChatGPT is accumulating structural debt they can't see. Missing awaits silently corrupt data. Hallucinated dependencies pass CI and crash in production. Tainted user input flows through AI-generated service layers without a single validation check. Stack traces with API keys leak through error handlers that look perfectly reasonable.

This isn't a tooling gap. It's a trust gap. AI assistants generate code that compiles, passes linting, and reads correctly — but carries semantic bugs that only surface in production. CodeDrift closes that gap not by slowing down AI adoption but by making it safe.

Quick Start

npx codedrift

Installation

# Run directly without installation
npx codedrift

# Install as dev dependency
npm install --save-dev codedrift

# Install globally
npm install -g codedrift

Requirements: Node.js 16+

The Problem

You ask GitHub Copilot or ChatGPT to create an API endpoint:

app.get('/api/documents/:id', async (req, res) => {
  const doc = await db.documents.findById(req.params.id);
  res.json(doc);
});

TypeScript compiles. ESLint passes. Tests pass. Code review approved. Deployed to production.

Three days later: any user can access any document by changing the URL.

You just shipped an IDOR vulnerability.

What CodeDrift Catches

CodeDrift detects this before deployment:

npx codedrift

CRITICAL Issues (1)

  src/api/documents.ts:23
  Database query using user-supplied ID without authorization check
  IDOR vulnerability allows any user to access any document
  Fix: Add authorization check before query

Detection Engines

| Engine | Detects | Severity | |--------|---------|----------| | IDOR | Database queries without authorization checks (Express, Koa, Hapi) | Critical | | Missing Input Validation | Request data used without validation (Express, Koa, Hapi, NestJS) | Critical | | Hardcoded Secrets | API keys and tokens in source code | Critical | | Stack Trace Exposure | Error stacks leaked in API responses | Critical | | Missing Await | Async functions called without await | Critical | | Async forEach | forEach/map/filter with async callbacks — skips Promise.all/allSettled/any/race wrapping | Critical | | Hallucinated Dependencies | Imports of packages that do not exist | Critical | | Unsafe Regex | Regular expressions vulnerable to ReDoS | Error | | Console in Production | console.log in production code | Warning | | Empty Catch | Empty catch blocks that hide errors | Warning |

Common Patterns

IDOR Vulnerability

AI generates database queries that skip authorization:

app.get('/orders/:id', async (req, res) => {
  const order = await db.orders.findById(req.params.id);
  res.json(order);
});

CodeDrift detects:

Database query using user-supplied ID without authorization check
Fix: Verify order.userId === req.user.id before returning data

Missing Input Validation

AI uses request data without validation:

app.post('/users', async (req, res) => {
  const { email, role } = req.body;
  await db.users.create({ email, role });
});

CodeDrift detects:

API route uses req.body without validation
Vulnerable to privilege escalation and injection attacks
Fix: Add input validation with zod, joi, or yup

Async forEach Bug

AI creates loops that silently fail:

async function syncInventory(products) {
  products.forEach(async (p) => {
    await updateStock(p.id, p.quantity);
  });
  console.log('Sync complete');
}

The function returns immediately. Updates happen asynchronously and may never complete.

CodeDrift detects:

forEach with async callback does not await
90% of updates will fail silently
Fix: Use for...of loop or Promise.all

Hardcoded Secrets

AI embeds credentials from training data:

const stripe = new Stripe('sk_live_51HqK2KDj3...');
const sgMail = require('@sendgrid/mail');
sgMail.setApiKey('SG.1234567890abcdef...');

CodeDrift detects:

Hardcoded Stripe API key detected
Hardcoded SendGrid API key detected
Fix: Use environment variables

Stack Trace Exposure

AI copies debugging patterns that leak secrets:

catch (error) {
  res.status(500).json({
    error: error.message,
    stack: error.stack
  });
}

Exposes internal paths, database credentials, and API keys to users.

CodeDrift detects:

Stack trace exposed in API response
Production secrets and file paths visible to attackers
Fix: Only log stack traces server-side

Confidence Levels

CodeDrift assigns confidence levels to every issue to help you prioritize fixes:

High: Clear security vulnerabilities or bugs in production code (fix immediately)
Medium: Issues in test files or generated code (review and fix)
Low: Potential false positives or edge cases (investigate when convenient)

Confidence levels are automatically adjusted based on file context:

// src/api/users.ts - HIGH confidence
app.get('/users/:id', async (req, res) => {
  const user = await db.users.findById(req.params.id); // IDOR: High confidence
  res.json(user);
});

// tests/api.test.ts - MEDIUM confidence (downgraded)
test('should fetch user', async () => {
  const user = await db.users.findById('123'); // Same pattern, medium confidence
});

// dist/bundle.js - MEDIUM confidence (generated file)
// Generated code automatically gets reduced confidence

Filtering by Confidence

Use --confidence-threshold to filter issues:

# Show only high confidence issues
npx codedrift --confidence-threshold high

# Show high and medium confidence (default)
npx codedrift --confidence-threshold medium

# Show all issues including low confidence
npx codedrift --confidence-threshold low

Configuration

Create codedrift.config.json in your project root:

{
  "exclude": [
    "node_modules/**",
    "dist/**",
    "**/*.test.ts"
  ],
  "rules": {
    "idor": "error",
    "missing-input-validation": "error",
    "hardcoded-secret": "error",
    "stack-trace-exposure": "error",
    "missing-await": "warn",
    "async-foreach": "error",
    "hallucinated-deps": "warn",
    "unsafe-regex": "error",
    "console-in-production": "warn",
    "empty-catch": "warn"
  },
  "failOn": "error",
  "excludeTestFiles": true,
  "confidenceThreshold": "high",
  "respectGitignore": true
}

Options:

rules.<name>: Set to "error", "warn", or "off"
failOn: Exit with code 1 on "error" or "warn"
exclude: Array of glob patterns to skip
excludeTestFiles: Skip test files entirely (default: true)
confidenceThreshold: Minimum confidence level to report ("high", "medium", "low", default: "high")
respectGitignore: Honor .gitignore patterns when scanning (default: true)

Monorepo and Workspace Support

CodeDrift automatically detects and supports monorepo configurations for npm, yarn, and pnpm workspaces.

Automatic Workspace Detection

// Root package.json
{
  "name": "my-monorepo",
  "workspaces": [
    "packages/*",
    "apps/*"
  ]
}

CodeDrift will:

Detect workspace packages automatically
Resolve dependencies from both workspace and root package.json
Include workspace name in error messages for context
Handle scoped packages (@org/package) correctly

Workspace-Aware Error Messages

CRITICAL Issues (2)

  packages/api/src/server.ts:15
  Hallucinated dependency: 'express-rate-limiter' not found in workspace '@myorg/api' package.json
  Fix: Run 'npm install express-rate-limiter' or remove import if AI hallucinated this package

  apps/web/src/App.tsx:3
  Hallucinated dependency: 'react-super-hooks' not found in workspace '@myorg/web' package.json
  Fix: Run 'npm install react-super-hooks' or remove import if AI hallucinated this package

Supported Workspace Formats

npm workspaces: "workspaces": ["packages/*"]
yarn workspaces: "workspaces": ["packages/*"]
pnpm workspaces: via pnpm-workspace.yaml
Lerna: via lerna.json (experimental)

Running in Monorepos

# Run from monorepo root (analyzes all workspaces)
npx codedrift

# Run in specific workspace
cd packages/api && npx codedrift

# Exclude specific workspaces
npx codedrift --exclude "packages/legacy/**"

CI/CD Integration

GitHub Actions

name: Security Check
on: [push, pull_request]

jobs:
  codedrift:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v3
      - uses: actions/setup-node@v3
      - run: npx codedrift --confidence-threshold high

  # Advanced: separate jobs for different confidence levels
  high-priority:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v3
      - uses: actions/setup-node@v3
      - run: npx codedrift --confidence-threshold high --exclude-tests

GitLab CI

codedrift:
  stage: test
  image: node:18
  script:
    - npx codedrift --format json --output codedrift-report.json --confidence-threshold medium
  artifacts:
    reports:
      codequality: codedrift-report.json

CircleCI

version: 2.1

jobs:
  security-scan:
    docker:
      - image: cimg/node:18.0
    steps:
      - checkout
      - run: npx codedrift --format json --output report.json --confidence-threshold high
      - store_artifacts:
          path: report.json

Output Formats

Terminal

npx codedrift

Smart summary: severity breakdown, top 5 issues, and timing:

📊 Analysis Complete

  🔴 Critical:                 3    ← Fix these first!
  🟠 High:                     5
  🟡 Medium:                   0
  🔵 Low:                      0
  ──────────────────────────────
  Total:                       8 (high confidence only)

🎯 Top 5 Issues to Fix:

  1.  🔴 Missing Input Validation (src/api/users.ts:23)
      API route uses req.body without validation
  2.  🔴 IDOR (src/api/orders.ts:45)
      Database query using user-supplied ID without authorization check
  3.  🔴 Hardcoded Secret (src/config/aws.ts:12)
      Hardcoded AWS access key detected

💡 Run with --details to see all issues

Analyzed 42 files in 0.41s (102 files/sec)

JSON

npx codedrift --format json --output report.json

Machine-readable format for CI/CD pipelines and custom tooling.

HTML

Automatically generated on every local run no flags needed:

npx codedrift
# → Report: codedrift-report.html (open in browser)

Or write to a custom path:

npx codedrift --output my-report.html

Interactive report with filtering, grouping, and search. In CI (any CI env var or non-TTY), HTML is not auto-generated — use --output explicitly if you need it as an artifact.

SARIF

For GitHub Code Scanning and other SAST integrations:

npx codedrift --format sarif --output results.sarif

Upload to GitHub:

- run: npx codedrift --format sarif --output results.sarif
- uses: github/codeql-action/upload-sarif@v3
  with:
    sarif_file: results.sarif

Results appear under Security → Code scanning alerts in your repository.

Baseline Mode

For existing projects with many issues:

# Save current state as baseline
npx codedrift --baseline

# Only new issues will fail CI
npx codedrift --compare-baseline

This allows incremental adoption without blocking development on existing issues.

Suppressing False Positives

// Disable next line
// codedrift-disable-next-line
void backgroundTask();

// Disable current line
dangerousOperation(); // codedrift-disable-line

CLI Reference

codedrift [options]

Options:
  --format <type>               Output format: summary, detailed, compact, json, html, sarif
  --output <file>               Write report to file
  --baseline                    Save current issues as baseline
  --compare-baseline            Show only new issues since baseline
  --baseline-file <path>        Custom baseline file path
  --full                        Force full scan, ignore cache
  --confidence-threshold <lvl>  Minimum confidence: high, medium, low (default: high)
  --exclude-tests               Skip test files entirely (default: true)
  -h, --help                    Show help
  -v, --version                 Show version

Examples:
  # Basic scan
  codedrift

  # Only show high-confidence issues
  codedrift --confidence-threshold high

  # Skip test files for faster CI
  codedrift --exclude-tests --confidence-threshold high

  # Generate HTML report
  codedrift --output report.html

  # Baseline workflow
  codedrift --baseline
  codedrift --compare-baseline

  # JSON for custom processing
  codedrift --format json --output report.json

  # Monorepo: analyze specific workspace
  cd packages/api && codedrift --confidence-threshold high

How It Works

CodeDrift uses the TypeScript Compiler API to parse source code into an AST with full JSX/TSX support. Ten detection engines traverse the AST looking for semantically dangerous patterns — not just style issues.

Architecture

┌─────────────────────────────────────────────────────────────────┐
│  CLI Layer                                                      │
│  Interactive dashboard · Progress tracking · CI/CD integration  │
├─────────────────────────────────────────────────────────────────┤
│  Core Analyzer                                                  │
│  File discovery · Engine orchestration · Smart filters          │
├─────────────────────────────────────────────────────────────────┤
│  Detection Engines (10)                                         │
│  IDOR · Input validation · Secrets · Missing await · Regex ...  │
├─────────────────────────────────────────────────────────────────┤
│  Cross-File Taint Analysis                                      │
│  Project graph · Module resolver · Function summaries           │
│  Multi-hop flow resolution · Summary store                      │
├─────────────────────────────────────────────────────────────────┤
│  Data Flow Analysis (CFG-driven)                                │
│  Control flow graphs · Abstract interpretation · Fixpoint       │
│  Path sensitivity · Field sensitivity · Def-use chains          │
├─────────────────────────────────────────────────────────────────┤
│  Post-Processing                                                │
│  Smart auto-ignore · Confidence scoring · Severity adjustment   │
│  Risk scoring · Deduplication                                   │
├─────────────────────────────────────────────────────────────────┤
│  Output                                                         │
│  Terminal · JSON · HTML · SARIF · Baseline diff                 │
└─────────────────────────────────────────────────────────────────┘

Parser: Converts code to AST; files are parsed once and cached — never read twice per run
Engines: Pattern detectors with multi-level confidence scoring. Each engine walks the AST and classifies findings by severity and confidence before they surface
Cross-file taint analysis: Builds a project-wide module graph, computes function summaries with taint transfer semantics, and resolves multi-hop data flows across file boundaries — catches vulnerabilities where user input in one file reaches a sink in another
Data flow analysis: CFG-driven forward solver using abstract interpretation with worklist algorithm, reverse postorder traversal, path-sensitive branch refinement, field-sensitive heap modeling, and def-use chain computation — augments AST-based detection with precise flow tracking
Interactive dashboard: Live terminal UI during analysis showing phase progress, file throughput, issue counts (raw during scan, final after filtering), recent findings, and phase timeline
Formatter: Outputs results as terminal (summary/detailed/compact), JSON, HTML, or SARIF
CLI: Orchestrates analysis, handles exit codes, and integrates with CI/CD pipelines

Engine Highlights

IDOR / Input Validation: Multi-framework — detects Express (req.params), Koa (ctx.params), Hapi (request.payload), and NestJS patterns. Recognises global validation middleware (app.use(validate()))
Stack trace: Distinguishes res.json({ error: err.stack }) (report) from logger.error({ stack: err.stack }) (skip) — skips development guards (if (NODE_ENV !== 'production')) and detects GraphQL formatError stack leaks
async-forEach: Recognises await Promise.allSettled/any/race(array.map(...)) and chained .then() as safe patterns
missing-await: Scope-aware with cross-file heuristics — detects likely-async imports by naming convention, module origin, and .then()/await usage elsewhere in the file
Secrets: Entropy-filtered, path-aware — detects secrets in template literals, ignores migration filenames, file path arguments, and test fixtures

Cross-File Taint Tracking

AI-generated code often introduces vulnerabilities that span multiple files:

// routes/user.ts
import { processUser } from '../services/user-service';
app.post('/user', (req, res) => {
  processUser(req.body);  // tainted input crosses file boundary
});

// services/user-service.ts
export function processUser(data: any) {
  db.query(`SELECT * FROM users WHERE id = ${data.id}`);  // SQL injection
}

CodeDrift traces req.body from the route handler through the import boundary into processUser, detecting the SQL injection sink even though no single file contains the full vulnerability.

Performance: 100+ files per second on typical projects Privacy: 100% local analysis, no telemetry, code never leaves your machine

Contributing

Contributions are welcome. Open an issue first to discuss proposed changes.

Adding a new detector:

// src/engines/my-detector.ts
import { BaseEngine } from './base-engine.js';
import { AnalysisContext, Issue } from '../types/index.js';

export class MyDetector extends BaseEngine {
  readonly name = 'my-detector';

  async analyze(context: AnalysisContext): Promise<Issue[]> {
    // Detection logic here
    return issues;
  }
}

FAQ

Does this replace ESLint or TypeScript?

No. CodeDrift complements them. ESLint catches syntax and style issues. TypeScript catches type errors. CodeDrift catches the semantic security vulnerabilities that AI code generators introduce — the ones those tools were never designed to find.

Does my code get sent anywhere?

No. Analysis runs entirely locally. Zero telemetry. Your code never leaves your machine. Unlike cloud-based SAST tools, CodeDrift is fully offline.

Can I use this with hand-written code?

Yes. These bugs occur in all code but are significantly more common in AI-generated code. If your team uses Copilot, Cursor, Claude, or ChatGPT for any part of your codebase, CodeDrift should be in your CI pipeline.

How is this different from Semgrep or SonarQube?

CodeDrift is purpose-built for AI-generated code patterns. It includes cross-file taint tracking, hallucinated dependency detection, and async correctness checks that general-purpose SAST tools miss. It runs in seconds with zero configuration — npx codedrift and you're done.

Will this catch all bugs?

No tool catches everything. CodeDrift focuses on the most critical patterns that AI assistants generate — the ones that pass code review because they look correct but fail in production.

Roadmap

~~Cross-file taint analysis~~ — shipped in v1.2.x
~~CFG-driven data flow analysis~~ — shipped in v1.2.x
~~Interactive CLI dashboard~~ — shipped in v1.2.x
VS Code extension for real-time detection
Auto-fix suggestions
Python and Go support
Custom rule engine
GitHub App for PR comments

License

MIT

Support

Issues and bugs: GitHub Issues
Feature requests: GitHub Discussions
Security vulnerabilities: Report privately via GitHub Security Advisories

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

CodeDrift

Why This Exists

AI Code Is Scaling Faster Than Human Review

Quick Start

Installation

The Problem

What CodeDrift Catches

Detection Engines

Common Patterns

IDOR Vulnerability

Missing Input Validation

Async forEach Bug

Hardcoded Secrets

Stack Trace Exposure

Confidence Levels

Filtering by Confidence

Configuration

Monorepo and Workspace Support

Automatic Workspace Detection

Workspace-Aware Error Messages

Supported Workspace Formats

Running in Monorepos

CI/CD Integration

GitHub Actions

GitLab CI

CircleCI

Output Formats

Terminal

JSON

HTML

SARIF

Baseline Mode

Suppressing False Positives

CLI Reference

How It Works

Architecture

Engine Highlights

Cross-File Taint Tracking

Contributing

FAQ

Roadmap

License

Support