e2e-ai
v1.5.1
AI-powered test automation pipeline — record, transcribe, generate, heal, and ship Playwright tests from a single CLI.
e2e-ai
AI-powered E2E test automation pipeline. Takes you from a manual browser recording all the way to a stable, documented Playwright test — with optional Zephyr integration.
Includes a codebase scanner that builds a QA map of your application's features, workflows, and test scenarios using AI analysis.
Quick Start
# Install
npm install e2e-ai
# Initialize config + copy agents
npx e2e-ai init
# Generate project context (use init-agent in your AI tool)
# → produces .e2e-ai/context.md
# Show all commands
npx e2e-ai --help
# Full pipeline for an issue
npx e2e-ai run --key PROJ-101
# Scan your codebase and generate a QA map
npx e2e-ai scan
npx e2e-ai analyze

Architecture
e2e-ai has two main workflows:
Test Pipeline
record -> transcribe -> scenario -> generate -> refine -> test -> heal -> qa
- record — Playwright codegen + audio capture
- transcribe — Whisper transcription + comment merge
- scenario — transcript agent + scenario agent
- generate — Playwright generator agent
- refine — refactor agent
- test — Playwright runner
- heal — self-healing agent
- qa — QA doc + export

Each step produces an artifact that feeds the next. You can run the full pipeline or any step individually.
Scanner Pipeline
scan -> analyze -> push
- scan — AST scan
- analyze — feature analyzer + scenario planner
- push — remote API upload

Scans your codebase structure (routes, components, hooks), uses AI to identify features and workflows, generates test scenarios, and optionally pushes the QA map to a remote API.
Commands
init - Project Setup
Interactive wizard that creates .e2e-ai/config.ts and copies agent prompts to .e2e-ai/agents/.
npx e2e-ai init
# Non-interactive (use defaults)
npx e2e-ai init --non-interactive

After init, use the init-agent in your AI tool (Claude Code, Cursor, etc.) to generate .e2e-ai/context.md. This context file teaches all downstream agents about your project's test conventions. If you have the MCP server configured, the AI tool can call e2e_ai_scan_codebase to analyze your project automatically.
record - Browser Recording
Launches Playwright codegen with optional voice recording and trace capture.
# Record with voice + trace (default)
npx e2e-ai record --key PROJ-101
# Silent mode (no mic, no trace replay)
npx e2e-ai record --key PROJ-101 --no-voice --no-trace
# Generic session (no issue key)
npx e2e-ai record my-session

What it does: Spawns Playwright codegen. When --key is provided, files go to <workingDir>/<KEY>/. Press R during recording to pause/resume audio.
Output: codegen-<timestamp>.ts + voice-<timestamp>.wav
transcribe - Voice Transcription
Sends the .wav recording to OpenAI Whisper and merges voice comments into the codegen file.
npx e2e-ai transcribe --key PROJ-101
npx e2e-ai transcribe path/to/recording.wav

What it does: Calls the Whisper API for timestamped segments, generates a markdown summary table, and injects // [Voice HH:MM - HH:MM] "text" comments into the codegen file.
Output: <session>-transcript.json + <session>-transcript.md
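The shape of the injected comments can be sketched with a small helper (hypothetical code, not the package's actual merge logic; the segment shape loosely mirrors Whisper's timestamped output, and the HH:MM rendering follows the format string above):

```typescript
// Illustrative sketch of the voice-comment format. All names are hypothetical.
interface VoiceSegment {
  start: number; // seconds from recording start
  end: number;
  text: string;
}

// Render seconds as HH:MM, matching the "[Voice HH:MM - HH:MM]" style above.
function formatTimestamp(totalSeconds: number): string {
  const h = Math.floor(totalSeconds / 3600).toString().padStart(2, "0");
  const m = Math.floor((totalSeconds % 3600) / 60).toString().padStart(2, "0");
  return `${h}:${m}`;
}

function toVoiceComment(seg: VoiceSegment): string {
  return `// [Voice ${formatTimestamp(seg.start)} - ${formatTimestamp(seg.end)}] "${seg.text}"`;
}
```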
scenario - AI Scenario Generation
Analyzes the codegen + voice transcript and produces a structured YAML test scenario.
npx e2e-ai scenario --key PROJ-101

What it does: Two-step LLM pipeline:
- transcript-agent: Maps voice comments to codegen actions, translates multilingual speech to English, classifies relevance
- scenario-agent: Converts the narrative into a YAML scenario with numbered steps, semantic actions, selectors, and expected results
If no transcript exists, it generates the scenario from codegen actions alone.
Output: <testsDir>/<KEY>/<KEY>.yaml
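The exact schema isn't documented here, but a generated scenario might look something like this (an illustrative sketch only; all field names are hypothetical):

```yaml
# Hypothetical example of a generated scenario file. Field names are illustrative.
id: PROJ-101
title: User can log in with valid credentials
steps:
  - step: 1
    action: navigate
    target: /login
    expected: Login form is visible
  - step: 2
    action: fill
    selector: input[name="email"]
    value: user@example.com
  - step: 3
    action: click
    selector: button[type="submit"]
    expected: Dashboard page loads
```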
generate - AI Test Generation
Converts a YAML scenario into a complete Playwright test file following your project conventions.
npx e2e-ai generate --key PROJ-101
npx e2e-ai generate path/to/scenario.yaml

What it does: The playwright-generator-agent receives the scenario + project context (from .e2e-ai/context.md) and produces a test using your project's fixtures, helpers, and conventions.
Output: <testsDir>/<KEY>/<KEY>.test.ts (+ optional Zephyr XML if configured)
refine - AI Test Refactoring
Improves an existing test by replacing raw locators with project helpers and applying conventions.
npx e2e-ai refine --key PROJ-101
npx e2e-ai refine path/to/test.test.ts

What it does: The refactor-agent scans the test for:
- Hardcoded selectors that could use semantic alternatives
- CSS class chains that should be replaced with stable selectors
- Missing timeouts on assertions
- Raw Playwright code that could use existing project helpers
- waitForTimeout() calls that should be proper waits
Output: Updated test file in-place
test - Test Runner
Runs a Playwright test with full trace/video/screenshot capture.
npx e2e-ai test --key PROJ-101
npx e2e-ai test path/to/test.test.ts

What it does: Spawns npx playwright test with trace, video, and screenshot capture enabled. Captures exit code, stdout/stderr, and trace path.
Output: Test results + trace files
heal - AI Self-Healing
Automatically diagnoses and fixes a failing test using up to 3 retry attempts.
npx e2e-ai heal --key PROJ-101

What it does: The self-healing-agent receives the failing test + error output and:
- Classifies the failure: SELECTOR_CHANGED, TIMING_ISSUE, ELEMENT_NOT_INTERACTABLE, ASSERTION_MISMATCH, NAVIGATION_FAILURE, STATE_NOT_READY
- Produces a patched test with // HEALED: <reason> comments
- Re-runs the test
- If still failing, tries a different approach (up to 3 attempts)

It never removes assertions and never changes test structure.
Output: Patched test file + healing log
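The retry loop can be sketched as follows (a minimal illustration, not the package's actual code: the AI patching and Playwright execution are abstracted into callbacks):

```typescript
// Minimal sketch of the heal retry loop: patch, re-run, and try a
// different approach on the next attempt, up to a fixed maximum.
// Callbacks stand in for the real AI patching and test execution.
function heal(
  runTest: () => boolean,
  applyPatch: (attempt: number) => void,
  maxAttempts = 3
): boolean {
  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    applyPatch(attempt);        // a different fix strategy each attempt
    if (runTest()) return true; // healed: the test now passes
  }
  return false; // still failing after all attempts
}
```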
qa - QA Documentation
Generates formal QA documentation from a test.
npx e2e-ai qa --key PROJ-101

What it does: The qa-testcase-agent produces:
- QA Markdown: Test ID, Title, Preconditions, Steps Table, Postconditions, Automation Mapping
- Zephyr XML (optional): import-ready XML if outputTarget includes zephyr
Output: <qaDir>/<KEY>.md (+ optional Zephyr XML)
run - Full Pipeline
Executes all steps in sequence: record -> transcribe -> scenario -> generate -> refine -> test -> heal -> qa
# Full pipeline
npx e2e-ai run --key PROJ-101
# Skip recording, start from scenario
npx e2e-ai run --key PROJ-101 --from scenario
# Skip voice and healing
npx e2e-ai run --key PROJ-101 --no-voice --skip heal
# Skip interactive steps, just do AI generation from existing data
npx e2e-ai run --key PROJ-101 --from scenario --skip test,heal

Options:
- --from <step>: Start from a specific step (skips all prior steps)
- --skip <steps>: Comma-separated list of steps to skip
Each step's output feeds the next via PipelineContext. If a step fails, the pipeline stops (unless the step is marked nonBlocking).
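The chaining behavior can be sketched like this (a minimal illustration; the real PipelineContext and step interfaces are internal and certainly richer):

```typescript
// Illustrative sketch of sequential step execution with a shared context.
// Interfaces are hypothetical, not e2e-ai's actual internals.
interface PipelineContext {
  artifacts: Record<string, string>; // step name -> produced artifact path
}

interface Step {
  name: string;
  nonBlocking?: boolean;
  run(ctx: PipelineContext): string | null; // artifact path, or null on failure
}

function runPipeline(steps: Step[], ctx: PipelineContext): string[] {
  const completed: string[] = [];
  for (const step of steps) {
    const artifact = step.run(ctx);
    if (artifact === null) {
      if (step.nonBlocking) continue; // non-blocking failures are skipped
      break;                          // a blocking failure stops the pipeline
    }
    ctx.artifacts[step.name] = artifact; // output feeds later steps via ctx
    completed.push(step.name);
  }
  return completed;
}
```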
scan - Codebase AST Scanner
Scans your codebase and extracts structural information: routes, components, hooks, imports, and dependencies.
# Scan using config defaults (scans src/)
npx e2e-ai scan
# Scan a specific directory
npx e2e-ai scan --scan-dir app
# Write output to a custom path
npx e2e-ai scan --output my-scan.json
# Disable file-level caching
npx e2e-ai scan --no-cache

What it does:
- Collects all .ts, .tsx, .js, .jsx files matching the configured include/exclude patterns
- Parses each file with a regex-based TypeScript parser to extract imports, exports, components, and hooks
- Extracts routes from Next.js App Router (app/) and Pages Router (pages/) conventions
- Caches results per file (by content hash) for fast incremental re-scans
Output: ASTScanResult JSON containing:
- files — all scanned files with imports, exports, and line counts
- routes — extracted routes with paths, methods, dynamic segments, and layout files
- components — React components with props, hook calls, and export status
- hooks — custom hook definitions with dependencies
- dependencies — import graph edges between files
Default output location: .e2e-ai/ast-scan.json
Options:
| Flag | Description | Default |
|------|-------------|---------|
| --output <file> | Write output to specific file | .e2e-ai/ast-scan.json |
| --scan-dir <dir> | Directory to scan | from config (src) |
| --no-cache | Disable file-level caching | cache enabled |
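The content-hash caching idea can be sketched as follows (illustrative only; the actual on-disk cache format under .e2e-ai/scan-cache is not documented here):

```typescript
import { createHash } from "node:crypto";

// Illustrative per-file parse cache keyed by content hash:
// unchanged files hit the cache, edited files are re-parsed.
const cache = new Map<string, object>();

function hashContent(source: string): string {
  return createHash("sha256").update(source).digest("hex");
}

function parseWithCache(source: string, parse: (s: string) => object): object {
  const key = hashContent(source);
  const hit = cache.get(key);
  if (hit) return hit;          // cache hit: skip parsing
  const result = parse(source); // cache miss: parse and store
  cache.set(key, result);
  return result;
}
```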
analyze - AI QA Map Generation
Analyzes an AST scan result using AI to identify features, workflows, components, and test scenarios.
# Analyze the default scan output
npx e2e-ai analyze
# Analyze a specific scan file
npx e2e-ai analyze path/to/ast-scan.json
# Only extract features (skip scenario generation)
npx e2e-ai analyze --skip-scenarios
# Write output to a custom path
npx e2e-ai analyze --output my-qa-map.json

What it does — a two-stage AI pipeline:
- Stage 1 — Feature Analysis (feature-analyzer-agent): receives the AST scan and groups routes, components, and hooks into logical features and user-facing workflows. It identifies component roles (form, display, navigation, modal, layout, feedback) and maps API routes to workflow steps.
- Stage 2 — Scenario Planning (scenario-planner-agent): receives the feature map and generates test scenarios for each workflow. It covers happy paths, validation, error handling, edge cases, and permission checks, assigns priorities (critical/high/medium/low), and links scenarios to specific workflow steps.
Output: QAMapV2Payload JSON containing:
- features — logical application features with routes and source files
- workflows — user journeys with ordered steps, API calls, and conditional branches
- components — UI components with types, props, and workflow references
- scenarios — test scenarios with steps, preconditions, expected outcomes, and priorities
Default output location: .e2e-ai/qa-map.json
Options:
| Flag | Description | Default |
|------|-------------|---------|
| --output <file> | Write QA map to specific file | .e2e-ai/qa-map.json |
| --skip-scenarios | Only run feature analysis | both stages |
push - Push QA Map to API
Pushes a generated QA map to a remote API endpoint.
# Push the default QA map
npx e2e-ai push
# Push a specific file
npx e2e-ai push path/to/qa-map.json
# Associate with a git commit
npx e2e-ai push --commit-sha abc123

What it does: Sends the QAMapV2Payload JSON as a POST request to the configured API endpoint with Bearer token authentication. The API responds with version info and auto-linking statistics.
Requires push.apiUrl and push.apiKey in config (or E2E_AI_API_URL / E2E_AI_API_KEY env vars).
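The request shape can be sketched as follows (a hypothetical helper; the URL is your configured endpoint, and any field beyond the documented options is illustrative):

```typescript
// Hypothetical sketch of the push request: the QA map as a JSON POST body
// with a Bearer token. Option and field names are illustrative.
interface PushOptions {
  apiUrl: string;
  apiKey: string;
  commitSha?: string;
}

interface PushRequest {
  url: string;
  method: string;
  headers: Record<string, string>;
  body: string;
}

function buildPushRequest(qaMap: object, opts: PushOptions): PushRequest {
  const payload = opts.commitSha ? { ...qaMap, commitSha: opts.commitSha } : qaMap;
  return {
    url: opts.apiUrl,
    method: "POST",
    headers: {
      "Content-Type": "application/json",
      Authorization: `Bearer ${opts.apiKey}`,
    },
    body: JSON.stringify(payload),
  };
}
```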
Options:
| Flag | Description | Default |
|------|-------------|---------|
| --commit-sha <sha> | Git commit SHA to associate | - |
Typical Workflows
Workflow 1: Full Recording Pipeline
Record from scratch for a new issue:
npx e2e-ai run --key PROJ-101

Opens the browser for recording, transcribes your voice, generates a scenario + test, runs it, heals failures, and produces QA docs.
Workflow 2: AI-Only (No Recording)
When you already have a codegen file or want to write the scenario manually:
# 1. Write/edit the scenario YAML manually
# 2. Generate the test
npx e2e-ai generate --key PROJ-101
# 3. Run and iterate
npx e2e-ai test --key PROJ-101
# 4. Auto-heal if failing
npx e2e-ai heal --key PROJ-101
# 5. Generate QA docs
npx e2e-ai qa --key PROJ-101

Workflow 3: Existing Test Improvement
# Refactor to use project helpers + conventions
npx e2e-ai refine --key PROJ-101
# Generate updated QA documentation
npx e2e-ai qa --key PROJ-101

Workflow 4: From Existing Recording Data
When codegen + voice already exist:
# Transcribe the voice recording
npx e2e-ai transcribe --key PROJ-101
# Generate scenario from codegen + transcript
npx e2e-ai scenario --key PROJ-101
# Continue with generate -> test -> qa
npx e2e-ai run --key PROJ-101 --from generate

Workflow 5: Codebase Scanner Pipeline
Generate a full QA map from your codebase structure:
# Step 1: Scan the codebase AST
npx e2e-ai scan
# Step 2: AI analysis → features, workflows, scenarios
npx e2e-ai analyze
# Step 3: Push to remote API (optional)
npx e2e-ai push

Or run stages selectively:
# Just extract features (no scenarios)
npx e2e-ai analyze --skip-scenarios
# Re-scan after code changes (cached, fast)
npx e2e-ai scan
npx e2e-ai analyze

Configuration
Run npx e2e-ai init to generate .e2e-ai/config.ts:
import { defineConfig } from 'e2e-ai/config';
export default defineConfig({
// --- General ---
inputSource: 'none', // 'jira' | 'linear' | 'none'
outputTarget: 'markdown', // 'zephyr' | 'markdown' | 'both'
// --- LLM ---
llm: {
provider: 'openai', // 'openai' | 'anthropic'
model: null, // null = use agent defaults (gpt-4o)
agentModels: {}, // per-agent overrides: { 'scenario-agent': 'gpt-4o-mini' }
},
// --- Paths ---
paths: {
tests: 'e2e/tests',
scenarios: 'e2e/scenarios',
recordings: 'e2e/recordings',
transcripts: 'e2e/transcripts',
traces: 'e2e/traces',
workingDir: '.e2e-ai',
qaOutput: 'qa',
},
// --- Scanner ---
scanner: {
scanDir: 'src', // root directory to scan
include: ['**/*.ts', '**/*.tsx', '**/*.js', '**/*.jsx'],
exclude: [
'**/node_modules/**', '**/dist/**', '**/build/**',
'**/.next/**', '**/*.test.*', '**/*.spec.*',
'**/__tests__/**', '**/*.d.ts',
],
cacheDir: '.e2e-ai/scan-cache', // file-level parse cache
},
// --- Push (remote QA map API) ---
push: {
apiUrl: null, // or set E2E_AI_API_URL env var
apiKey: null, // or set E2E_AI_API_KEY env var
},
// --- Recording ---
voice: { enabled: true },
playwright: {
browser: 'chromium',
timeout: 120_000,
retries: 0,
traceMode: 'on',
},
});

All configuration lives inside the .e2e-ai/ directory — no files at project root.
Global Options
| Flag | Description | Default |
|------|-------------|---------|
| -k, --key <KEY> | Issue key (e.g. PROJ-101, LIN-42) | - |
| --provider <p> | LLM provider: openai or anthropic | openai (or AI_PROVIDER env) |
| --model <m> | Override LLM model | gpt-4o / claude-sonnet-4-20250514 |
| -v, --verbose | Show debug output (API calls, file paths) | off |
| --no-voice | Disable voice recording in record step | voice enabled |
| --no-trace | Disable trace replay after recording | trace enabled |
Environment Variables
Required in .env:
OPENAI_API_KEY=sk-... # Required for LLM calls + Whisper transcription

Optional:
AI_PROVIDER=openai # openai | anthropic
AI_MODEL=gpt-4o # Model override
ANTHROPIC_API_KEY=sk-ant-... # Required if AI_PROVIDER=anthropic
BASE_URL=https://... # Your application URL
E2E_AI_API_URL=https://... # Remote API for push command
E2E_AI_API_KEY=key-... # API key for push command

AI Agents
Nine specialized agents live in agents/*.md, numbered by pipeline order. Each has a system prompt, input/output schemas, rules, and examples.
| # | File | Input | Output | Used by |
|---|------|-------|--------|---------|
| 0 | 0.init-agent | Codebase scan | .e2e-ai/context.md | init (AI chat) |
| 1.1 | 1_1.transcript-agent | codegen + transcript JSON | Structured narrative with intent mapping | scenario |
| 1.2 | 1_2.scenario-agent | narrative + issue context | YAML test scenario | scenario |
| 2 | 2.playwright-generator-agent | scenario + project context | .test.ts file | generate |
| 3 | 3.refactor-agent | test + project context | Improved test file | refine |
| 4 | 4.self-healing-agent | failing test + error output | Diagnosis + patched test | heal |
| 5 | 5.qa-testcase-agent | test + scenario + issue data | QA markdown + test case JSON | qa |
| 6.1 | 6_1.feature-analyzer-agent | AST scan result | Features, workflows, components JSON | analyze |
| 6.2 | 6_2.scenario-planner-agent | Features + workflows | Complete QA map with scenarios JSON | analyze |
Agents are loaded by bare name (e.g., loadAgent('scenario-agent')) — the numbered prefix is resolved automatically. You can customize agent behavior by editing the .md files in .e2e-ai/agents/.
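The bare-name resolution can be sketched like this (an illustrative function; the package's actual loading logic is internal):

```typescript
// Illustrative sketch: resolve a bare agent name against numbered files
// like "1_2.scenario-agent.md" by matching the part after the numeric prefix.
function resolveAgentFile(bareName: string, files: string[]): string | undefined {
  return files.find((f) => {
    const base = f.replace(/\.md$/, "");
    const dot = base.indexOf(".");
    const suffix = dot === -1 ? base : base.slice(dot + 1);
    return suffix === bareName;
  });
}
```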
Output Directory Structure
Default paths (configurable via .e2e-ai/config.ts):
e2e/
tests/<KEY>/ # .test.ts + .yaml scenario (+ optional Zephyr XML)
traces/ # trace.zip + results.json
qa/ # QA documentation .md files
.e2e-ai/
config.ts # project configuration
context.md # project context (generated by init-agent)
agents/ # agent prompt definitions (.md files)
<KEY>/ # per-issue working dir: codegen, recordings/, intermediate files
ast-scan.json # scan command output
qa-map.json # analyze command output
scan-cache/ # file-level parse cache (gitignored)

MCP Server
e2e-ai ships an MCP (Model Context Protocol) server that lets AI assistants interact with your project's test infrastructure directly. The server binary is e2e-ai-mcp.
Setup
Add to your MCP client configuration:
Claude Desktop (~/Library/Application Support/Claude/claude_desktop_config.json):
{
"mcpServers": {
"e2e-ai": {
"command": "npx",
"args": ["e2e-ai-mcp"],
"cwd": "/path/to/your/project"
}
}
}

Claude Code:
claude mcp add e2e-ai -- npx e2e-ai-mcp

Cursor (.cursor/mcp.json):
{
"mcpServers": {
"e2e-ai": {
"command": "npx",
"args": ["e2e-ai-mcp"]
}
}
}

If e2e-ai is installed globally or as a project dependency, you can use the binary path directly instead of npx:
{
"command": "node",
"args": ["node_modules/.bin/e2e-ai-mcp"]
}

Available Tools
Orchestration (workflow automation)
| Tool | Description | Input |
|------|-------------|-------|
| e2e_ai_plan_workflow | Plan an automation workflow — returns an ordered todo list of steps | goal, key?, from?, skip?, voice?, trace?, scanDir? |
| e2e_ai_execute_step | Execute a single pipeline step | step, key?, voice?, trace?, scanDir?, output?, extraArgs? |
| e2e_ai_get_workflow_guide | Get the full workflow guide explaining how the pipeline works | (none) |
Project setup
| Tool | Description | Input |
|------|-------------|-------|
| e2e_ai_scan_codebase | Scan project for test files, configs, fixtures, path aliases, and sample test content | projectRoot? (defaults to cwd) |
| e2e_ai_validate_context | Validate that a context markdown file has all required sections | content (markdown string) |
| e2e_ai_read_agent | Load an agent prompt by name — returns system prompt + config | agentName (e.g. scenario-agent) |
| e2e_ai_get_example | Get the example context markdown template | (none) |
How AI Orchestration Works
The MCP server includes built-in orchestration instructions that teach AI assistants (Claude Code, Cursor, etc.) how to run e2e-ai workflows autonomously. The protocol is:
- Plan — The AI calls e2e_ai_plan_workflow with your goal. It returns an ordered step list.
- Approve — The AI presents the plan to you for review. You can adjust steps before proceeding.
- Execute — The AI runs each step one at a time via e2e_ai_execute_step, reporting results between steps. If a step fails, it stops and asks you how to proceed.
Each step is executed as a separate job (ideally a subagent) to keep context clean. The AI never runs multiple pipeline steps at once.
Example interaction:
You: "Run the full test pipeline for PROJ-101"
AI: Calls e2e_ai_plan_workflow, then presents:
- record — Launch browser codegen + voice recording
- transcribe — Transcribe voice via Whisper
- scenario — Generate YAML test scenario
- generate — Generate Playwright test
- refine — Refactor test with AI
- test — Run Playwright test
- heal — Self-heal if failing (can skip if test passes)
- qa — Generate QA documentation

"Does this look right? Ready to start?"
You: "Skip voice, go ahead"
AI: Removes transcribe, executes each step sequentially
Library API
e2e-ai also exports types and config helpers for programmatic use:
import { defineConfig, loadConfig, getProjectRoot } from 'e2e-ai';
import type {
E2eAiConfig,
ResolvedConfig,
ASTScanResult,
QAMapV2Payload,
FeatureV2,
WorkflowV2,
ScenarioV2,
ComponentV2,
PushResult,
} from 'e2e-ai';

License
MIT
