e2e-test-agent
v0.0.2
🤖 E2E Test Agent
AI-powered natural language end-to-end testing framework.
e2e-test-agent is an AI-powered testing framework that uses LLM agents to execute natural language test cases. Write tests in plain English and let AI agents interact with your applications to verify behavior.
Instead of writing brittle selector-based tests, you describe what you want to test in natural language, and the AI agent figures out how to do it.
📖 Table of Contents
- 🌟 What is E2E Test Agent?
- 🎯 Why E2E Test Agent is Better
- 🚀 How It Works
- 📦 Installation
- 🎮 Usage
- 🔧 Configuration
- 📊 Test Results
- 🤝 Contributing
- 📝 License
- 🔗 Resources
🌟 What is E2E Test Agent?
E2E Test Agent is a revolutionary testing approach that combines:
- LLM-powered agents for intelligent test execution
- Natural language test cases written in plain English
- Automated browser interactions (via Playwright MCP server)
🎯 Why E2E Test Agent is Better
Traditional Testing
```typescript
// Brittle, breaks when UI changes
await page.goto("https://playwright.dev");
await page.locator("#main-content").scrollIntoViewIfNeeded();
await page.click('button[data-testid="get-started-btn"]');
await expect(page.locator(".sidebar-menu")).toBeVisible();
```

Problems:
- ❌ Breaks when CSS selectors change
- ❌ Requires constant maintenance
- ❌ No understanding of context or intent
- ❌ Fragile across UI updates
E2E Test Agent Approach
```
open playwright.dev
scroll all the way down,
click on "Get started",
check if the page side menu is visible.
```

Benefits:
- ✅ Intent-based: Describes what to do, not how
- ✅ Self-healing: AI adapts to UI changes automatically
- ✅ Readable: Anyone can write and understand tests (POs, stakeholders, BAs)
- ✅ Resilient: Survives refactors and redesigns
- ✅ Context-aware: AI understands page structure and user intent
- ✅ Low maintenance: Tests rarely need updates when the UI changes
🚀 How It Works
```
┌─────────────────┐
│   Test Files    │  Plain English test steps
│  (.test files)  │
└────────┬────────┘
         │
         ▼
┌─────────────────┐
│   TestAgent     │  Orchestrates test execution
│     Class       │
└────────┬────────┘
         │
         ▼
┌─────────────────┐
│   LLM Agent     │  Interprets tests & decides actions
│                 │
└────────┬────────┘
         │
         ▼
┌─────────────────┐
│   MCP Tools     │  Browser automation, web search, etc.
│  (Playwright)   │
└────────┬────────┘
         │
         ▼
┌─────────────────┐
│    Your App     │  Real interactions, real results
└─────────────────┘
```

1. Write tests in natural language (.test files)
2. TestAgent reads and enriches each test with context (date/time, output format)
3. The LLM agent interprets the test steps and plans actions
4. Results are collected and summarized
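The enrichment step can be sketched as follows. This is a hypothetical illustration of what "adding context" to a test might look like; the real TestAgent's prompt construction may differ.

```typescript
// Hypothetical sketch of the "enrich with context" step; the actual
// TestAgent internals may build the prompt differently.
function enrichTest(testContent: string, now: Date = new Date()): string {
  return [
    `Current date/time: ${now.toISOString()}`,
    'Report results as JSON: { "success", "steps_completed", "observations", "final_status" }',
    "",
    "Test steps:",
    testContent.trim(),
  ].join("\n");
}
```

The enriched prompt is what the LLM agent then interprets step by step.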
📦 Installation
```shell
npm install e2e-test-agent
```

Or with yarn:

```shell
yarn add e2e-test-agent
```

Setup
Create a .env file in your project root:
```
MODEL_NAME="gpt-4o"
API_KEY="your-openai-api-key"
BASE_URL="https://api.openai.com/v1"
TESTS_DIR="./tests"
```

💡 API Compatibility: This package works with OpenAI and any OpenAI-compatible API, including:
- OpenAI (GPT-4o, GPT-4, etc.)
- Anthropic Claude (via OpenAI-compatible endpoints)
- OpenRouter (access to multiple models)
- Local LLMs (Ollama, LM Studio, etc.)
- Any other service that implements the OpenAI API format
Simply configure your BASE_URL and API_KEY accordingly.
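For example, a .env for a local Ollama server (assuming Ollama's default OpenAI-compatible endpoint on port 11434; the model name is whichever model you have pulled locally):

```
MODEL_NAME="llama3.1"
API_KEY="ollama"                       # Ollama ignores the key, but it must be non-empty
BASE_URL="http://localhost:11434/v1"
TESTS_DIR="./tests"
```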
🎮 Usage
Quick Start
Create a test runner file (e.g., run-tests.ts):
```typescript
import { TestAgent } from "e2e-test-agent";
import "dotenv/config";

async function main() {
  const testAgent = new TestAgent({
    modelName: process.env.MODEL_NAME || "gpt-4o",
    apiKey: process.env.API_KEY!,
    baseURL: process.env.BASE_URL,
    testsDir: process.env.TESTS_DIR || "./tests",
    maxSteps: 20,
  });

  const results = await testAgent.runAllTests();
  testAgent.printSummary(results);
}

main().catch(console.error);
```

Run your tests:

```shell
# With tsx (recommended for development)
npx tsx run-tests.ts

# Or compile and run with Node
tsc run-tests.ts
node run-tests.js
```

Writing Tests
Create .test files in your tests/ directory:
Example: tests/1.test

```
open playwright.dev
scroll all the way down,
click on "Get started",
check if the page side menu is visible.
```

Example: tests/2.test

```
navigate to github.com
search for "typescript"
click on the first repository
verify the repository has a README file
```

Programmatic Usage
```typescript
import { TestAgent } from "e2e-test-agent";

const testAgent = new TestAgent({
  modelName: "gpt-4o",
  apiKey: process.env.API_KEY!,
  baseURL: process.env.BASE_URL,
  testsDir: "./tests",
  maxSteps: 20,
});

// Run all tests
const results = await testAgent.runAllTests();
testAgent.printSummary(results);

// Or run a specific test
const result = await testAgent.runSingleTest("./tests/1.test", 1);
```

🔧 Configuration
Environment Variables
| Variable | Description | Default |
| ------------ | ------------------------------- | -------------- |
| MODEL_NAME | LLM model to use | gpt-4o |
| API_KEY | OpenAI API key | Required |
| BASE_URL | API base URL | OpenAI default |
| TESTS_DIR | Directory containing test files | ./tests |
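As a sketch, resolving these variables with the documented defaults might look like this. The helper is illustrative, not part of the package API; only the variable names and defaults come from the table above.

```typescript
// Illustrative config resolution mirroring the table above.
function resolveConfig(env: Record<string, string | undefined>) {
  const apiKey = env.API_KEY;
  if (!apiKey) throw new Error("API_KEY is required");
  return {
    modelName: env.MODEL_NAME ?? "gpt-4o",
    apiKey,
    baseURL: env.BASE_URL, // undefined falls through to the OpenAI default
    testsDir: env.TESTS_DIR ?? "./tests",
  };
}
```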
Custom MCP Servers
```typescript
const testAgent = new TestAgent({
  apiKey: "...",
  mcpServers: {
    playwright: { command: "npx", args: ["@playwright/mcp@latest"] },
    filesystem: {
      command: "npx",
      args: ["@modelcontextprotocol/server-filesystem"],
    },
    // Add more MCP servers as needed
  },
});
```

📊 Test Results
Test Agent provides detailed results for each test:
```
============================================================
Running Test #1: 1.test
============================================================
Test Content:
open playwright.dev
scroll all the way down,
click on "Get started",
check if the page side menu is visible.

Result: {
  "success": true,
  "steps_completed": [
    "Opened playwright.dev",
    "Scrolled to bottom",
    "Clicked Get started button",
    "Verified sidebar visibility"
  ],
  "observations": "All steps completed successfully",
  "final_status": "passed"
}

============================================================
TEST SUMMARY
============================================================
✅ PASSED - Test #1: 1.test

Total: 1 | Passed: 1 | Failed: 0
```

🤝 Contributing
Contributions are welcome! This framework can be extended with:
- More MCP servers (database access, API testing, etc.)
- Custom test reporters
- Parallel test execution
- Test retry mechanisms
- Screenshot/video capture on failures
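A retry mechanism, for instance, could be a thin wrapper around a test run. This is a sketch of the idea, not part of the current API; the function names and shapes are hypothetical.

```typescript
// Illustrative retry wrapper: re-run a flaky test up to `attempts`
// times, returning the first passing result (or the last attempt).
async function withRetry<T>(
  run: () => Promise<T>,
  isPass: (result: T) => boolean,
  attempts = 3,
): Promise<T> {
  let last!: T;
  for (let i = 0; i < attempts; i++) {
    last = await run();
    if (isPass(last)) return last;
  }
  return last;
}
```

Such a wrapper could sit around a hypothetical single-test call, with `isPass` inspecting the result's status.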
📝 License
MIT
🔗 Resources
Built with ❤️ by Arman
