ficta
v1.1.10
Universal test data generator for Node.js, browsers, and CLI. Generate realistic test data in multiple formats (CSV, JSON, XML, Excel, TSV, SQL, YAML, TOML) using Faker.js.
Ficta
Ficta is a comprehensive, production-ready test data generator designed to eliminate the tedious manual creation of mock data across your entire development workflow. Whether you're a QA engineer seeding test databases, a frontend developer prototyping API responses, or a DevOps engineer preparing staging environments, Ficta delivers realistic, schema-compliant data in seconds—not hours.
Universal by design: Run it as a CLI command, integrate it into your Node.js build scripts, or generate data directly in the browser without any backend. From a single interface, produce industry-standard formats including CSV, JSON, Excel, SQL (with full DDL and foreign key support), XML, YAML, TOML, and Apache Parquet.
Built for real-world complexity: Ficta doesn't just generate random values—it understands your data structures. Import existing SQL schemas and automatically generate referentially-intact test data that respects foreign key constraints. Convert OpenAPI specifications or GraphQL schemas into test datasets. Infer column types from existing data files. Stream millions of rows without exhausting memory.
Trusted by teams who demand quality: With 100% test coverage across 948 test cases, Ficta ensures your test data generation is as reliable as your production code. Battle-tested in production environments, it handles edge cases like CSV escaping, SQL injection prevention, multi-dialect compatibility (PostgreSQL, MySQL, SQLite), and deterministic seeding for reproducible test scenarios.
Key Capabilities
🚀 Universal Runtime Support
- CLI: One-line commands for quick data generation (ficta -t users -r 1000 -o data.csv)
- Node.js: Full programmatic API for build scripts, test fixtures, and automation
- Browser: Client-side generation via CDN—no backend required, works offline
- TypeScript: Complete type definitions for IDE autocomplete and type safety
📊 Production-Ready Format Support
Generate data in 9 industry-standard formats with intelligent auto-detection:
- CSV/TSV: RFC 4180 compliant with proper escaping for data imports
- JSON: Pretty-printed or compact for API mocking and fixtures
- Excel (XLSX): Formatted worksheets with headers for business reports
- SQL: Full DDL + DML with dialect support (PostgreSQL, MySQL, SQLite, generic)
- XML: Well-formed output with configurable element names
- YAML/TOML: Human-readable formats perfect for configuration files
- Parquet: Apache Parquet columnar format for big data pipelines (Node.js)
🗄️ Advanced Database Features
For DevOps engineers, DBAs, and backend developers:
- DDL Import: Parse existing .sql schema files and generate FK-aware test data
- Foreign Key Support: Automatically maintains referential integrity across tables
- Multi-Dialect SQL: Native support for PostgreSQL, MySQL, SQLite with dialect-specific features
- SQL Modes: INSERT, UPSERT (ON CONFLICT/ON DUPLICATE KEY), DDL+INSERT, TRUNCATE+INSERT, batch inserts
- Schema Builder: Fluent code-first API for defining multi-table schemas with relationships
- Topological Ordering: Intelligent dependency resolution ensures parent rows exist before children
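The dependency resolution described above amounts to a topological sort over foreign-key references. A minimal sketch of the idea (illustrative only, not Ficta's internal code; the input shape is a hypothetical `table → referenced tables` map):

```javascript
// Illustrative sketch of FK-aware insert ordering (not Ficta's internal code).
// Each table lists the tables it references; parents must be emitted first.
function orderByFkDependencies(tables) {
  const ordered = [];
  const visited = new Set();

  function visit(name, stack = new Set()) {
    if (visited.has(name)) return;
    if (stack.has(name)) throw new Error(`Circular FK dependency at ${name}`);
    stack.add(name);
    for (const dep of tables[name] || []) visit(dep, stack);
    stack.delete(name);
    visited.add(name);
    ordered.push(name);
  }

  for (const name of Object.keys(tables)) visit(name);
  return ordered;
}

const order = orderByFkDependencies({
  order_items: ['orders', 'products'],
  orders: ['customers'],
  customers: [],
  products: []
});
console.log(order); // ['customers', 'orders', 'products', 'order_items']
```

Depth-first traversal guarantees every referenced table is pushed before the table that references it, so INSERT statements never hit a missing parent row.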
🧪 Testing & QA Workflow Integration
For QA engineers, testers, and automation developers:
- Reproducible Data: Seed Faker for deterministic output in CI/CD pipelines
- Locale Support: Generate region-specific data in 60+ languages (names, addresses, phones)
- Schema Inference: Auto-detect column types from existing CSV or JSON files
- Preview Mode: Inspect first 3 rows before generating large datasets
- Streaming API: Memory-efficient generation of millions of rows (CSV/NDJSON)
- Watch Mode: Auto-regenerate when schema files change during development
🔌 Schema Import & Conversion
For API developers and architects:
- OpenAPI Bridge: Convert OpenAPI 3.x / JSON Schema specs to Ficta schemas
- GraphQL Bridge: Transform GraphQL SDL definitions into test data configurations
- JSON Schema Files: Define complex multi-table schemas in ficta.schema.json format
🎯 Developer Experience
- 40+ Data Types: Realistic names, emails, addresses, UUIDs, dates, prices, and more via Faker.js
- Special Types: Auto-increment sequences, enums, numeric ranges, custom patterns with counters
- Plugin API: Register custom data types and reusable templates at runtime
- Predefined Templates: Battle-tested schemas for users, products, transactions, addresses, contacts
- Smart Detection: Format automatically inferred from file extensions
- Zero Config Browser: Single <script> tag or ES module import
- 100% Test Coverage: 921 tests across 13 suites ensure production reliability
Who Should Use Ficta?
👨💻 Backend Developers
- Database Seeding: Generate SQL INSERT statements with proper foreign key relationships for development databases
- Migration Testing: Create realistic data to test database migrations and schema changes
- Performance Testing: Stream millions of rows to test query optimization and indexing strategies
- Multi-Tenant Systems: Use locales and seeds to generate region-specific test datasets
🧪 QA Engineers & Test Automation Developers
- API Testing: Generate JSON payloads matching your OpenAPI specifications for automated integration tests
- Data-Driven Testing: Use seeded generation for reproducible test scenarios across CI/CD runs
- Edge Case Coverage: Create boundary conditions with ranges, enums, and custom patterns
- Test Data Management: Infer schemas from production samples, then generate sanitized test versions
🎨 Frontend Developers
- Rapid Prototyping: Generate mock API responses in-browser without backend dependencies
- Component Testing: Create realistic datasets for UI component development and visual regression tests
- Demo Applications: Build convincing demos with professional-looking data in seconds
- Offline Development: Work disconnected using browser-based generation
🔧 DevOps & Site Reliability Engineers
- Environment Provisioning: Populate staging/QA databases from CI/CD pipelines using CLI commands
- Load Testing: Generate large CSV/JSON datasets for stress testing data processing pipelines
- Disaster Recovery Drills: Create realistic datasets to test backup/restore procedures
- Monitoring: Use watch mode to continuously regenerate test data as schemas evolve
📊 Data Engineers & Analysts
- Pipeline Development: Generate Parquet files for testing ETL workflows and data transformations
- Schema Validation: Test data warehouse schemas with properly typed sample data
- Query Development: Create representative datasets for developing and optimizing analytics queries
- Format Conversion: Transform data between CSV, JSON, Parquet, and other formats
🏗️ Database Administrators
- Capacity Planning: Generate large datasets to test storage, indexing, and query performance
- Schema Design: Validate table relationships and constraints with FK-aware test data
- Dialect Migration: Test schema portability across PostgreSQL, MySQL, and SQLite
- Backup Strategy: Create known datasets for testing backup/restore reliability
Table of Contents
- Installation
- Quick Start
- Usage
- Supported Formats
- Data Types
- Special Types
- Templates
- Format-Specific Options
- API Reference
- Examples
- Development
- Testing
- License
Installation
For Node.js / CLI
npm install ficta
For Browser (No Installation)
Just include the script in your HTML:
<script src="https://unpkg.com/ficta/dist/ficta.browser.js"></script>
Or download the browser bundle and include it locally.
Browser / CDN
Use Ficta directly in any browser without a build step via jsDelivr or unpkg.
jsDelivr
<!-- IIFE (global window.Ficta) -->
<script src="https://cdn.jsdelivr.net/npm/ficta/dist/ficta.browser.min.js"></script>
<!-- ES Module -->
<script type="module">
import { generateData, toCSV } from 'https://cdn.jsdelivr.net/npm/ficta/dist/ficta.esm.js';
</script>
unpkg (fallback)
<script src="https://unpkg.com/ficta/dist/ficta.browser.min.js"></script>
Minimal browser example
<!DOCTYPE html>
<html>
<head><title>Ficta Demo</title></head>
<body>
<pre id="output"></pre>
<!-- Load the IIFE bundle (exposes window.Ficta) -->
<script src="https://cdn.jsdelivr.net/npm/ficta/dist/ficta.browser.min.js"></script>
<script>
// Generate 10 records with the 'users' template
const result = Ficta.generateData({ template: 'users', rows: 10 });
// Format as CSV and display it
const csv = Ficta.toCSV(result.records, result.columns);
document.getElementById('output').textContent = csv;
</script>
</body>
</html>
An interactive playground is available at examples/playground.html — open it directly in any browser, no build step required.
Quick Start
CLI
# CSV (default)
ficta -c "id:autoIncrement,name:fullName,email" -r 100 -o users.csv
# JSON
ficta -c "id:autoIncrement,name:fullName,email" -r 100 -o users.json
# Excel
ficta -t users -r 500 -o users.xlsx
# SQL schema with DDL (PostgreSQL)
ficta -t users -r 100 -o schema.sql --sql-mode ddl+insert --sql-dialect postgres
# SQL with custom table name (legacy INSERT mode)
ficta -c "id,name,email" -r 100 -o users.sql --table-name my_users
# Reproducible output (same data every time)
ficta -t users -r 100 -o users.csv --seed 42
# Localized data (French)
ficta -t contacts -r 50 -o contacts-fr.csv --locale fr
# Generate from existing SQL schema
ficta schema ./schema.sql --rows 20 --dialect postgres --mode ddl+insert -o seed.sql
Format is automatically detected from the file extension!
Node.js
import { generateAndSave } from 'ficta';
await generateAndSave({
columns: 'id:autoIncrement,name:fullName,email',
rows: 100,
output: 'users.csv'
});
Browser
<!DOCTYPE html>
<html>
<body>
<div id="app"></div>
<script src="https://unpkg.com/ficta/dist/ficta.browser.js"></script>
<script>
// Create built-in interactive UI
Ficta.createUI('#app');
// Or generate programmatically
const result = Ficta.generateData({
columns: 'id:autoIncrement,name:fullName,email',
rows: 50
});
// Convert to CSV and download the file
const csv = Ficta.toCSV(result.records, result.columns);
Ficta.downloadCSV(csv, 'data.csv');
</script>
</body>
</html>
Usage
CLI (Command Line)
The CLI provides a simple interface for generating test files from the terminal.
# Basic usage
ficta -c "id:autoIncrement,name:fullName,email" -r 100 -o users.csv
# Use a template
ficta -t users -r 500 -o users.csv
# Custom patterns with counter
ficta -c "email:pattern:user+{COUNTER}@test.com" -r 50 -o emails.csv
# Show preview before saving
ficta -t products -r 20 -o products.csv -p
# List available types
ficta --list-types
# List available templates
ficta --list-templates
CLI Options
CLI Subcommands
| Subcommand | Description |
|------------|-------------|
| ficta schema <file> | Generate test data from a SQL DDL schema file |
| ficta infer <file> | Infer Ficta column definitions from an existing .csv or .json file |
| ficta from-openapi <file> | Convert an OpenAPI 3.x or JSON Schema file to ficta.schema.json |
| ficta from-graphql <file> | Convert a GraphQL SDL file to ficta.schema.json |
Global CLI Options
| Option | Alias | Description | Default |
|--------|-------|-------------|---------|
| --output | -o | Output filename | test-data.csv |
| --format | -f | Output format (csv, json, xml, xlsx, tsv, sql, yaml, yml, toml) | Auto-detect from filename |
| --columns | -c | Column definitions (name:type,...) | - |
| --rows | -r | Number of rows to generate | 100 |
| --template | -t | Use predefined template | - |
| --preview | -p | Show preview of first 3 rows | false |
| --list-types | - | List all available data types | - |
| --list-templates | - | List all available templates | - |
| --table-name | - | SQL table name (for SQL format) | data_table |
| --sql-dialect | - | SQL dialect (postgres, mysql, sqlite, generic) | generic |
| --sql-mode | - | SQL mode (insert, ddl, ddl+insert, upsert, truncate+insert) | insert |
| --sql-batch | - | Use batch INSERT statements | false |
| --sheet-name | - | Excel worksheet name (for XLSX format) | Sheet1 |
| --pretty | - | Pretty-print JSON (for JSON format) | true |
| --seed | - | Integer seed for reproducible output | - |
| --locale | - | Faker.js locale for localized data (e.g. fr, de, ja, pt_BR) | - |
| --header | - | Include header row in CSV/TSV (--no-header to omit) | true |
| --header-format | - | Header casing: title (default) or raw (exact key names) | title |
| --schema-file | -s | Path to a ficta.schema.json file for structured multi-table generation | - |
| --help | -h | Show help | - |
ficta schema <file> — generate from DDL file:
ficta schema ./schema.sql --rows 20 --dialect postgres --mode ddl+insert -o seed.sql
# Watch for changes and regenerate automatically
ficta schema ./schema.sql --rows 10 -o seed.sql --watch
ficta infer <file> — infer column types from data:
# Print inferred column string
ficta infer ./users.csv
# Output: id:autoIncrement,email:email,name:fullName,...
# Output as JSON column list
ficta infer ./data.json --format json
# Write to file
ficta infer ./users.csv -o inferred-columns.txt
ficta from-openapi <file> — convert OpenAPI spec:
# Convert OpenAPI YAML to ficta.schema.json
ficta from-openapi ./openapi.yaml -o ficta.schema.json
# Target a specific component schema
ficta from-openapi ./openapi.json --schema User --rows 50 --dialect postgres
ficta from-graphql <file> — convert GraphQL SDL:
# Convert GraphQL schema to ficta.schema.json
ficta from-graphql ./schema.graphql -o ficta.schema.json
# Target a specific object type
ficta from-graphql ./schema.graphql --type Post --rows 100
Node.js (Programmatic)
Use as a library in your Node.js application.
import { generateAndSave, generateData, toCSV, templates } from 'ficta';
// Generate and save to file
await generateAndSave({
columns: 'id:autoIncrement,name:fullName,email',
rows: 1000,
output: 'users.csv',
preview: true // Show preview in console
});
// Generate JSON format
await generateAndSave({
columns: 'id:autoIncrement,name:fullName,email',
rows: 100,
output: 'users.json',
format: 'json'
});
// Generate with format-specific options
await generateAndSave({
columns: 'id:autoIncrement,name:fullName,email',
rows: 100,
output: 'users.xlsx',
format: 'xlsx',
formatOptions: {
sheetName: 'User Data'
}
});
// Generate data in memory (no file)
const result = generateData({
columns: templates.users.columns,
rows: 100
});
const csv = toCSV(result.records, result.columns);
console.log(csv); // CSV string
console.log(result.records); // Array of objects
Import from submodules:
// Import only what you need
import { generateData, parseColumns } from 'ficta/core';
import { templates } from 'ficta/node';
Browser (Script Tag)
Perfect for quick prototyping or standalone HTML files.
<!DOCTYPE html>
<html>
<head>
<title>Ficta</title>
</head>
<body>
<div id="csv-ui"></div>
<!-- Include the library -->
<script src="dist/ficta.browser.js"></script>
<script>
// Option 1: Use the built-in UI
Ficta.createUI('#csv-ui');
// Option 2: Generate programmatically
const result = Ficta.generateData({
columns: 'id:autoIncrement,email:pattern:user+{COUNTER}@test.com,name:fullName',
rows: 50
});
// Convert to CSV and download
const csv = Ficta.toCSV(result.records, result.columns);
Ficta.downloadCSV(csv, 'data.csv');
// Or use the records directly
console.log(result.records); // Array of objects
</script>
</body>
</html>
Try the examples: Open examples/simple.html in your browser!
Browser (ES Module)
For modern web apps with module support.
<script type="module">
import * as Ficta from './dist/ficta.esm.js';
// Generate and download
const result = Ficta.generateAndDownload({
columns: 'id:autoIncrement,name:fullName,email,phone',
rows: 100,
filename: 'users.csv'
});
console.log(`Generated ${result.rowCount} rows!`);
</script>
Try the example: Open examples/esmodule.html in your browser!
Supported Formats
Generate test data in 9 popular file formats:
| Format | Extension | Description | Node.js | Browser |
|--------|-----------|-------------|---------|---------|
| CSV | .csv | Comma-separated values with proper escaping | ✅ | ✅ |
| JSON | .json | JavaScript Object Notation (pretty or compact) | ✅ | ✅ |
| XML | .xml | Extensible Markup Language with configurable elements | ✅ | ✅ |
| Excel | .xlsx | Microsoft Excel workbook with formatting | ✅ | ❌ |
| TSV | .tsv | Tab-separated values | ✅ | ✅ |
| SQL | .sql | SQL DDL (CREATE TABLE) and DML (INSERT, UPSERT) statements | ✅ | ✅ |
| YAML | .yaml | YAML Ain't Markup Language - human-readable data format | ✅ | ✅ |
| YML | .yml | YAML format with shorter extension | ✅ | ✅ |
| TOML | .toml | Tom's Obvious, Minimal Language - config file format | ✅ | ✅ |
| Parquet | .parquet | Apache Parquet columnar storage format | ✅ | ❌ |
Format Auto-Detection
If you don't specify the -f format option, the generator automatically detects the format from the output filename extension:
# These automatically use the correct format
ficta -c "id,name" -r 100 -o users.json # → JSON
ficta -c "id,name" -r 100 -o users.xml # → XML
ficta -c "id,name" -r 100 -o users.xlsx # → Excel
ficta -c "id,name" -r 100 -o users.tsv # → TSV
ficta -c "id,name" -r 100 -o users.sql # → SQL
ficta -c "id,name" -r 100 -o users.yaml # → YAML
ficta -c "id,name" -r 100 -o users.yml # → YML
ficta -c "id,name" -r 100 -o users.toml # → TOML
Data Types
Column Definition Format
Columns are defined as name:type pairs separated by commas:
columnName:dataType,anotherColumn:dataType
If no type is specified, it defaults to word:
-c "name,email,phone" # All columns use the default 'word' type
Available Types (40+)
Person
firstName, lastName, fullName, jobTitle, prefix, suffix
Internet
email, username, password, url, ipv4, userAgent
Phone
phone
Address
street, city, state, country, zipCode, latitude, longitude
Company
company, department
Commerce
product, price, productDescription
Finance
amount, accountNumber, iban, creditCardNumber, currency
Date
pastDate, futureDate, recentDate, timestamp
Numbers
number, float
Text
word, words, sentence, paragraph
IDs
uuid, nanoid, autoIncrement
Other
boolean, color, emoji, json
Aliases
Convenient shorthand names for common types:
string (= word), text (= sentence), int / integer (= number), date (= recentDate)
List all types:
ficta --list-types
Special Types
Auto Increment
Sequential numbering starting from 1.
-c "id:autoIncrement"
Output: 1, 2, 3, 4, ...
Static Values
Use the same value for all rows.
-c "status:static:active,country:static:USA"
Output: Every row has status active and country USA
Enums
Random selection from a list of values.
-c "status:enum:active|inactive|pending"
Output: Randomly selects from: active, inactive, or pending
Number Ranges
Random number within a specified range.
-c "age:range:18-65,score:range:0-100"
Output: age between 18-65, score between 0-100
Patterns
Custom patterns with placeholders:
- {COUNTER} - Auto-incrementing number (1, 2, 3, ...)
- # - Random digit (0-9)
# Email with counter
-c "email:pattern:user+{COUNTER}@test.com"
# Output: user+1@test.com, user+2@test.com, ...
# Product SKU with random digits
-c "sku:pattern:PRD-######"
# Output: PRD-847291, PRD-192834, ...
# Order number combining counter and random
-c "orderNum:pattern:ORD-{COUNTER}-##"
# Output: ORD-1-42, ORD-2-17, ...
Templates
Predefined column sets for common data structures.
Available Templates
| Template | Description | Columns |
|----------|-------------|---------|
| users | User account data | id, firstName, lastName, email, phone, company, jobTitle, registeredDate (pastDate) |
| products | Product catalog | sku (autoIncrement), name (product), category (department), price, stock (number), description (productDescription) |
| transactions | Financial transactions | id (uuid), date (timestamp), customerId (number), amount, currency, status (word), paymentMethod (word) |
| addresses | Address data with coordinates | id, street, city, state, zipCode, country, lat (latitude), lng (longitude) |
| contacts | Contact information | id, fullName, email, phone, company, jobTitle, website (url) |
Usage
CLI:
ficta -t users -r 500 -o users.csv
ficta -t products -r 1000 -o products.json
Node.js:
import { templates, generateData, toCSV } from 'ficta';
const result = generateData({
columns: templates.users.columns,
rows: 100
});
const csv = toCSV(result.records, result.columns);
console.log(csv);
List all templates:
ficta --list-templates
CLI Subcommands
ficta schema <file>
Generate test data directly from a SQL DDL schema file.
# Generate 20 rows per table from a .sql schema, output to stdout
ficta schema ./schema.sql --rows 20 --dialect postgres --mode ddl+insert
# Write output to a file
ficta schema ./schema.sql --rows 10 --output ./seed.sql
| Option | Description | Default |
|--------|-------------|---------|
| --rows / -r | Rows per table | 10 |
| --dialect | SQL dialect | generic |
| --mode | SQL output mode | insert |
| --output / -o | Output file (optional) | stdout |
| --locale | Faker.js locale | - |
Format-Specific Options
Excel (XLSX)
# Specify worksheet name
ficta -t users -r 100 -o users.xlsx --sheet-name "User Data"
Features:
- Formatted headers (bold, gray background)
- Auto-fitted column widths
- Custom worksheet names
SQL
Generate SQL INSERT statements or complete database schemas with DDL.
Basic INSERT Statements (Legacy)
# Simple INSERT statements
ficta -c "id,name,email" -r 100 -o users.sql --table-name my_users
Advanced SQL Schema Generation
Generate PostgreSQL schema with DDL:
ficta -t users -r 100 -o schema.sql \
--sql-dialect postgres \
--sql-mode ddl+insert \
--table-name users
Generate MySQL upsert statements:
ficta -c "id:autoIncrement,username,email" -r 50 -o data.sql \
--sql-dialect mysql \
--sql-mode upsert \
--table-name users
Batch inserts for better performance:
ficta -t products -r 1000 -o products.sql \
--sql-batch \
--table-name products
SQL Dialect Options
| Dialect | Description | Special Features |
|---------|-------------|------------------|
| postgres | PostgreSQL | SERIAL, JSONB, ON CONFLICT |
| mysql | MySQL/MariaDB | AUTO_INCREMENT, ENUM types, ON DUPLICATE KEY |
| sqlite | SQLite | Simplified types, INTEGER PRIMARY KEY AUTOINCREMENT |
| generic | Generic SQL | Standard SQL (default) |
SQL Generation Modes
| Mode | Description | Output |
|------|-------------|--------|
| insert | INSERT statements only (default) | INSERT INTO table ... |
| ddl | CREATE TABLE statements only | CREATE TABLE ... |
| ddl+insert | Schema + data | CREATE TABLE ... + INSERT INTO ... |
| upsert | UPSERT statements | ON CONFLICT (Postgres) / ON DUPLICATE KEY (MySQL) |
| truncate+insert | Clear data first | TRUNCATE TABLE ... + INSERT INTO ... |
Node.js API:
import { generateAndSave } from 'ficta';
// Generate PostgreSQL schema with DDL
await generateAndSave({
template: 'users',
rows: 100,
output: 'users-postgres.sql',
formatOptions: {
dialect: 'postgres',
mode: 'ddl+insert',
tableName: 'users'
}
});
// Generate MySQL upserts
await generateAndSave({
columns: 'id:autoIncrement,sku:pattern:PRD-{COUNTER},name:product,price',
rows: 50,
output: 'products-upsert.sql',
formatOptions: {
dialect: 'mysql',
mode: 'upsert',
tableName: 'products'
}
});
// Batch inserts (more efficient)
await generateAndSave({
columns: 'id,name,email',
rows: 10000,
output: 'users-batch.sql',
formatOptions: {
tableName: 'users',
batch: true // Single INSERT with multiple VALUES
}
});
Advanced: Multi-Table Schema with Foreign Keys:
import { generateSchema } from 'ficta/src/sql-schema.js';
import { generateData } from 'ficta';
// Generate sample data
const customers = generateData({
columns: 'id:autoIncrement,firstName,lastName,email',
rows: 10
});
const orders = generateData({
columns: 'id:autoIncrement,customerId:range:1-10,amount:price,status:word',
rows: 30
});
// Create multi-table schema
const schema = {
schema: 'ecommerce',
dialect: 'postgres',
mode: 'ddl+insert',
insertOrder: 'auto', // Resolve FK dependencies automatically
tables: [
{
table: 'customers',
columns: [
{ name: 'id', type: 'autoIncrement', primaryKey: true },
{ name: 'firstName', type: 'firstName' },
{ name: 'lastName', type: 'lastName' },
{ name: 'email', type: 'email', unique: true, nullable: false }
],
records: customers.records
},
{
table: 'orders',
columns: [
{ name: 'id', type: 'autoIncrement', primaryKey: true },
{
name: 'customerId',
type: 'number',
references: { table: 'customers', column: 'id' },
onDelete: 'CASCADE'
},
{ name: 'amount', type: 'price' },
{ name: 'status', type: 'string' }
],
records: orders.records
}
]
};
const sql = generateSchema(schema);
// Outputs complete schema with proper table ordering
Features:
- DDL Generation: CREATE TABLE statements with proper column types
- Type Mapping: Automatic conversion from Faker types to SQL types
- Constraints: PRIMARY KEY, UNIQUE, NOT NULL, DEFAULT, FOREIGN KEY
- Multi-Dialect: PostgreSQL, MySQL, SQLite support
- Foreign Keys: Automatic dependency resolution and insert ordering
- Batch Mode: Multiple VALUES in single INSERT for performance
- Upserts: Dialect-aware UPSERT/MERGE statements
- Proper Escaping: Single quotes, NULL values, boolean conversion
See Examples:
- examples/node/sql-simple.js - Basic SQL generation
- examples/node/sql-schema-examples.js - Advanced multi-table schemas
- examples/sql-schema.html - Interactive browser demo
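The escaping behaviour listed under Features above (doubled single quotes, NULL for missing values, boolean conversion) can be sketched as follows. This is an illustrative sketch of the rules, not Ficta's internal code, and `toSqlLiteral` is a hypothetical helper name:

```javascript
// Illustrative SQL literal formatting (not Ficta's internal code).
function toSqlLiteral(value) {
  if (value === null || value === undefined) return 'NULL';
  if (typeof value === 'boolean') return value ? 'TRUE' : 'FALSE';
  if (typeof value === 'number') return String(value);
  // Double embedded single quotes so the value cannot break out of the literal
  return "'" + String(value).replace(/'/g, "''") + "'";
}

console.log(toSqlLiteral("O'Brien")); // 'O''Brien'
console.log(toSqlLiteral(null));      // NULL
console.log(toSqlLiteral(true));      // TRUE
console.log(toSqlLiteral(42));        // 42
```

Dialects differ in detail (e.g. SQLite commonly stores booleans as 1/0), which is why the generator takes a dialect option rather than one fixed rule.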
SQL from Existing Schema (DDL Import)
Import an existing .sql schema file and automatically generate realistic test data that respects your table structure, column types, and foreign key relationships.
Node.js programmatic API:
import { generateFromDDL } from 'ficta';
// Read schema.sql and generate test data
const sql = await generateFromDDL({
schemaFile: './schema.sql', // Path to your DDL file
rows: 20, // Rows per table
outputMode: 'ddl+insert', // 'insert' | 'upsert' | 'truncate+insert' | 'ddl+insert'
dialect: 'postgres', // 'postgres' | 'mysql' | 'sqlite' | 'generic'
output: './seed.sql' // Optional: write output to file
});
console.log(sql); // Complete SQL script ready to execute
Universal API (generateFromSchema) — works in browsers too:
import { generateFromSchema } from 'ficta/src/schema-generator.js';
// Pass a DDL string directly (browser-compatible)
const sql = generateFromSchema({
ddl: `
CREATE TABLE users (
id SERIAL PRIMARY KEY,
email VARCHAR(255) NOT NULL,
created_at TIMESTAMP
);
CREATE TABLE posts (
id SERIAL PRIMARY KEY,
user_id INT REFERENCES users(id),
title VARCHAR(255)
);
`,
rows: 10,
outputMode: 'ddl+insert',
dialect: 'postgres'
});
Low-level: Parse DDL then inspect or modify before generating:
import { parseDDL, orderByDependencies } from 'ficta/src/ddl-parser.js';
import { generateFromSchema } from 'ficta/src/schema-generator.js';
// Parse DDL into structured table definitions
const tables = parseDDL(rawDDLString);
// tables → [{ tableName, columns, primaryKey, foreignKeys }, ...]
// Sort tables in FK dependency order
const ordered = orderByDependencies(tables);
// Generate data from pre-parsed tables
const sql = generateFromSchema({ tables: ordered, rows: 5, dialect: 'mysql' });
What ddl-parser understands:
- AUTO_INCREMENT, SERIAL, IDENTITY auto-increment variants
- Inline and table-level FOREIGN KEY … REFERENCES syntax
- ENUM('a','b','c') column types → maps to enum:a|b|c
- NOT NULL, DEFAULT, UNIQUE, PRIMARY KEY modifiers
- SQL comments (-- single-line and /* */ block)
- Quoted identifiers (backtick and double-quote)
- Two-layer type resolution: column name hints first, SQL type fallback second
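The two-layer type resolution mentioned above — column-name hints first, SQL type fallback second — might look like this. The hint patterns and mapping tables here are illustrative assumptions, not the actual tables shipped in ddl-parser:

```javascript
// Illustrative two-layer type resolution (hint patterns are assumed, not Ficta's real tables).
// Layer 1: guess a Ficta type from the column name.
const NAME_HINTS = [
  [/email/i, 'email'],
  [/^(id|.*_id)$/i, 'number'],
  [/name/i, 'fullName'],
  [/phone/i, 'phone']
];
// Layer 2: fall back to the declared SQL type.
const SQL_FALLBACK = {
  INT: 'number', VARCHAR: 'word', TEXT: 'sentence',
  TIMESTAMP: 'timestamp', BOOLEAN: 'boolean'
};

function resolveType(columnName, sqlType) {
  for (const [pattern, fictaType] of NAME_HINTS) {
    if (pattern.test(columnName)) return fictaType;
  }
  const base = sqlType.replace(/\(.*\)/, '').toUpperCase();
  return SQL_FALLBACK[base] || 'word';
}

console.log(resolveType('contact_email', 'VARCHAR(255)')); // email (name hint wins)
console.log(resolveType('notes', 'TEXT'));                 // sentence (SQL type fallback)
```

Name hints win because a `VARCHAR(255)` column called `email` should get realistic email addresses, not random words.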
JSON
# Disable pretty-printing (compact JSON)
ficta -c "id,name" -r 100 -o users.json --pretty false
Features:
- Pretty-printed by default
- Compact mode available
CSV
Standard comma-separated values with:
- Proper escaping of commas, quotes, and newlines
- RFC 4180 compliant
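The RFC 4180 rule is simple: quote any field containing a comma, double quote, or line break, and double any embedded quotes. A minimal sketch of that rule (illustrative only, not Ficta's implementation; `escapeCsvField` is a hypothetical helper):

```javascript
// Minimal RFC 4180 field escaping (illustrative, not Ficta's internal code).
function escapeCsvField(value) {
  const s = String(value);
  // Quote the field if it contains a delimiter, a quote, or a line break
  if (/[",\r\n]/.test(s)) {
    return '"' + s.replace(/"/g, '""') + '"';
  }
  return s;
}

console.log(escapeCsvField('Acme, Inc.')); // "Acme, Inc."
console.log(escapeCsvField('say "hi"'));   // "say ""hi"""
console.log(escapeCsvField('plain'));      // plain
```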
TSV
Tab-separated values, suitable for importing into Excel or databases.
XML
Well-formed XML with:
- Configurable root and record element names
- Proper character escaping
YAML / YML
Human-readable data serialization format:
- Clean, indented structure
- Compatible with most YAML parsers
- Suitable for configuration files
# Generate YAML file
ficta -c "id,name,email" -r 100 -o config.yaml
TOML
Tom's Obvious, Minimal Language for configuration:
- Array of tables format ([[records]])
- Type-safe values
- Popular for config files (Cargo.toml, pyproject.toml)
# Generate TOML file
ficta -c "id,name,email" -r 100 -o data.toml
API Reference
Core Functions
generateData(options)
The universal entry point: generate records in memory without writing to disk.
import { generateData } from 'ficta';
const result = generateData({
columns: 'id:autoIncrement,name:fullName,email',
rows: 100
});
// Returns:
// {
// records: Array<Object>, // Array of row objects
// columns: Array, // Parsed column definitions
// rowCount: number, // Number of rows generated
// columnCount: number // Number of columns
// }
parseColumns(columnString)
Parse a column definition string into structured column objects.
import { parseColumns } from 'ficta';
const columns = parseColumns('id:autoIncrement,name:fullName,email');
// Returns: [
// { name: 'id', type: 'autoIncrement' },
// { name: 'name', type: 'fullName' },
// { name: 'email', type: 'email' }
// ]
setFaker(faker)
Set the Faker.js instance (required in browser when not using the self-contained bundle).
import { setFaker } from 'ficta';
import { faker } from '@faker-js/faker';
setFaker(faker);
seedFaker(seed)
Seed the Faker RNG for reproducible, deterministic output.
import { seedFaker } from 'ficta';
seedFaker(42);
setLocale(locale)
Set the Faker locale for localized data generation.
import { setLocale } from 'ficta';
setLocale('fr'); // French locale
setLocale('pt_BR'); // Brazilian Portuguese
registerType(name, generatorFn, options?)
Register a custom data type at runtime.
import { registerType, generateData } from 'ficta';
registerType('hashtag', () => '#' + Math.random().toString(36).slice(2, 8));
const result = generateData({ columns: 'tag:hashtag', rows: 5 });
unregisterType(name)
Remove a previously registered custom type. Built-in types cannot be removed.
registerTemplate(name, config, options?)
Register a custom column template.
import { registerTemplate, generateData } from 'ficta';
registerTemplate('employees', {
columns: 'id:autoIncrement,firstName,lastName,email,jobTitle,department',
rows: 50
});
const result = generateData({ template: 'employees', rows: 20 });
unregisterTemplate(name)
Remove a previously registered custom template. Built-in templates cannot be removed.
listTypes() / listTemplates()
Return an array of all registered type or template names.
Node.js Functions
generateAndSave(options)
Generate data and save to file. Format is auto-detected from the file extension if not specified.
import { generateAndSave } from 'ficta';
await generateAndSave({
columns: 'id:autoIncrement,name:fullName,email',
rows: 100,
output: 'users.csv',
format: 'csv', // Optional: auto-detected from filename
preview: true, // Optional: show preview in console
seed: 42, // Optional: reproducible output
locale: 'de', // Optional: German locale
formatOptions: { // Optional: format-specific options
tableName: 'users', // SQL
sheetName: 'Data', // XLSX
pretty: true, // JSON
header: true, // CSV/TSV header row
headerFormat: 'title' // 'title' | 'raw'
}
});
generateFromDDL(options)
Read a SQL DDL file and generate FK-aware test data.
import { generateFromDDL } from 'ficta';
const sql = await generateFromDDL({
schemaFile: './schema.sql',
rows: 20,
outputMode: 'ddl+insert', // 'insert' | 'upsert' | 'truncate+insert' | 'ddl+insert'
dialect: 'postgres',
output: './seed.sql',
locale: 'fr'
});
generateFromSchemaFile(options)
Read a ficta.schema.json file and generate SQL test data.
import { generateFromSchemaFile } from 'ficta';
const sql = await generateFromSchemaFile({
schemaFile: './ficta.schema.json',
rows: 50,
outputMode: 'ddl+insert',
output: './seed.sql'
});
generateStream(options)
Return a Node.js Readable stream for memory-efficient generation of large datasets.
import { generateStream } from 'ficta';
import { createWriteStream } from 'fs';
const stream = generateStream({
columns: 'id:autoIncrement,name:fullName,email',
rows: 1_000_000,
format: 'csv', // 'csv' | 'ndjson'
batchSize: 1000,
seed: 42
});
stream.pipe(createWriteStream('million-users.csv'));

inferSchemaFromFile(filePath)
Auto-detect Ficta column definitions from an existing CSV or JSON file.
import { inferSchemaFromFile } from 'ficta';
const { columns, columnList } = await inferSchemaFromFile('./users.csv');
console.log(columns);
// 'id:autoIncrement,email:email,first_name:firstName,...'

fromOpenAPIFile(filePath, options?)
Convert an OpenAPI 3.x YAML or JSON file to a ficta.schema.json-compatible object.
import { fromOpenAPIFile } from 'ficta';
const schema = await fromOpenAPIFile('./openapi.yaml', {
schemaName: 'User', // optional: target a specific component
rows: 100,
dialect: 'postgres'
});

fromGraphQLFile(filePath, options?)
Convert a GraphQL SDL file to a ficta.schema.json-compatible object.
import { fromGraphQLFile } from 'ficta';
const schema = await fromGraphQLFile('./schema.graphql', {
typeName: 'User', // optional: target a specific object type
rows: 100,
dialect: 'postgres'
});

watchAndGenerate(options)
Watch a DDL schema file and regenerate output automatically on changes.
import { watchAndGenerate } from 'ficta';
const watcher = watchAndGenerate({
schemaFile: './schema.sql',
rows: 10,
outputMode: 'ddl+insert',
dialect: 'postgres',
output: './seed.sql',
debounceMs: 300,
onSuccess: (outputPath, elapsedMs) => console.log(`Regenerated in ${elapsedMs}ms`),
onError: (err) => console.error(err.message)
});
// Stop watching
watcher.stop();

Browser Functions
downloadCSV(csv, filename)
Download a CSV string as a file in the browser.
import { downloadCSV } from 'ficta/browser';
downloadCSV(csvString, 'data.csv');

createUI(container, options)
Create an interactive generator UI inside the given container element.
import { createUI } from 'ficta/browser';
const ui = createUI('#container');
ui.setColumns('id:autoIncrement,name:fullName');
ui.setRows(50);

generateAndDownload(options)
Generate data and download it as a file directly in the browser.
import { generateAndDownload } from 'ficta/browser';
generateAndDownload({
columns: 'id:autoIncrement,name:fullName,email',
rows: 100,
filename: 'users.json',
format: 'json'
});

Examples
Generate User Data
ficta \
-o users.csv \
-c "id:autoIncrement,firstName,lastName,email,phone,company,registeredDate:pastDate" \
-r 1000

Generate Product Catalog
ficta \
-o products.csv \
-c "sku:pattern:PRD-######,name:product,price,category:department,inStock:boolean" \
-r 500

Generate Orders with Custom Status
ficta \
-o orders.csv \
-c "orderId:uuid,date:timestamp,total:amount,status:enum:pending|shipped|delivered,customerEmail:email" \
-r 2000

Generate Test Addresses with Preview
ficta -t addresses -r 300 -o addresses.csv -p

Email Counter Pattern
// Generate sequential email addresses
const result = generateData({
columns: 'email:pattern:testuser+{COUNTER}@example.com',
rows: 1000
});
// result.records[0].email follows the pattern, e.g. 'testuser+1@example.com'

Mixed Column Types
import { generateData } from 'ficta';
const result = generateData({
columns: 'id:autoIncrement,name:fullName,status:enum:active|inactive,score:range:0-100,code:pattern:XYZ-####',
rows: 50
});

Generate Multiple Files
import { generateAndSave } from 'ficta';
// Define your own map of column definitions per dataset
const templates = {
  users: { columns: 'id:autoIncrement,firstName,lastName,email' },
  products: { columns: 'sku:pattern:PRD-######,name:product,price' },
  transactions: { columns: 'orderId:uuid,total:amount,date:timestamp' }
};
const datasets = ['users', 'products', 'transactions'];
for (const template of datasets) {
await generateAndSave({
columns: templates[template].columns,
rows: 1000,
output: `${template}.csv`
});
}

Different Formats for Different Scenarios
# API testing (JSON)
ficta -t users -r 1000 -o api-test-data.json
# Database seeding (SQL)
ficta -t products -r 500 -o seed-products.sql --table-name products
# Excel reports (XLSX)
ficta -t transactions -r 10000 -o report.xlsx --sheet-name "Transactions 2024"
# Data import (CSV)
ficta -c "id:autoIncrement,email:pattern:user+{COUNTER}@test.com" -r 5000 -o import.csv
# Configuration files (XML)
ficta -c "key:word,value:word" -r 50 -o config.xmlDevelopment
Setup
# Clone repository
git clone https://github.com/izaccavalheiro/ficta.git
cd ficta
# Install dependencies
npm install
# Build browser bundles
npm run build

Project Structure
ficta/
├── src/
│ ├── core.js # Core generator (universal — no Node/browser deps)
│ ├── formatters.js # Format converters (Node.js)
│ ├── formatters.browser.js # Format converters (browser)
│ ├── formatters.shared.js # Shared pure format utilities (CSV/TSV/JSON)
│ ├── node.js # Node.js-specific exports + generateFromDDL/Stream/SchemaFile
│ ├── browser.js # Browser-specific exports
│ ├── sql-schema.js # SQL DDL/DML generator (universal)
│ ├── ddl-parser.js # SQL DDL → TableDef parser (universal)
│ ├── schema-generator.js # Multi-table FK-aware orchestrator (universal)
│ ├── schema-builder.js # Fluent table/schema builder API (universal)
│ ├── infer.js # Schema inference from sample rows (universal)
│ ├── openapi-bridge.js # OpenAPI/JSON Schema → Ficta columns (universal)
│ └── graphql-bridge.js # GraphQL SDL → Ficta columns (universal)
├── tests/
│ ├── core.test.js # Core generation tests
│ ├── formatters.test.js # Node.js formatter tests
│ ├── formatters.browser.test.js # Browser formatter tests
│ ├── node.test.js # Node.js API tests
│ ├── browser.test.js # Browser API tests
│ ├── cli.test.js # CLI interface tests
│ ├── sql-schema.test.js # SQL schema generator tests
│ ├── ddl-parser.test.js # DDL parser tests
│ ├── schema-builder.test.js # Fluent builder tests
│ ├── schema-generator.test.js # Schema orchestrator tests
│ ├── infer.test.js # Schema inference tests
│ ├── openapi-bridge.test.js # OpenAPI bridge tests
│ └── graphql-bridge.test.js # GraphQL bridge tests
├── examples/
│ ├── simple.html # Simple browser example
│ ├── esmodule.html # ES module browser example
│ ├── sql-schema.html # Interactive SQL schema browser demo
│ └── node/
│ ├── sql-simple.js # Basic SQL generation examples
│ └── sql-schema-examples.js # Advanced multi-table SQL examples
├── cli.js # CLI entry point
├── build.js # Build script for browser bundles
└── package.json

Build Scripts
# Build browser bundles
npm run build
# Run tests
npm test
# Run tests with coverage
npm run test:coverage

Testing
Test Coverage
| Metric     | Coverage |
|------------|----------|
| Statements | 100%     |
| Branches   | 100%     |
| Functions  | 100%     |
| Lines      | 100%     |
Test Statistics
- Total Tests: 948
- Test Suites: 13
Test Categories
- fakerTypes Tests - All Faker data type categories
- templates Tests - Predefined template validation
- parseColumns Tests - Column parsing with complex definitions
- generateRow Tests - Row generation with all special types
- Data Generation Tests - Full dataset generation (generateData)
- CLI Tests - Command-line interface and options
- Integration Tests - End-to-end workflows
- Format Tests - All output format converters
Run Tests
# Run all tests
npm test
# Run tests with coverage report
npm run test:coverage
# View coverage report
open coverage/lcov-report/index.html

Key Features Tested
✅ All Faker data types (40+ types)
✅ Special types: static, enum, range, pattern
✅ Pattern with {COUNTER} placeholder and # for random digits
✅ All predefined templates
✅ Column parsing with complex type definitions
✅ All file format generation (CSV, JSON, XML, XLSX, TSV, SQL, YAML, TOML)
✅ CLI argument parsing and error handling (incl. --seed, --locale, --header, schema subcommand)
✅ Preview mode and format auto-detection
✅ Browser and Node.js environments
✅ SQL DDL generation (CREATE TABLE, PK, FK, constraints, 4 dialects)
✅ SQL DML generation (INSERT, batch INSERT, UPSERT, TRUNCATE+INSERT)
✅ DDL parsing from raw SQL (parseDDL, orderByDependencies)
✅ Multi-table FK-aware data generation (schema-generator)
✅ Topological sort and circular dependency detection
✅ Plugin API: registerType, unregisterType, registerTemplate, unregisterTemplate
✅ Reproducible output via Faker seed
✅ Locale-aware data generation
✅ Streaming API (CSV/NDJSON)
✅ Fluent Schema Builder (table() / schema())
✅ JSON schema file input (generateFromSchemaFile)
✅ Schema inference from CSV/JSON files (inferSchemaFromFile)
✅ OpenAPI 3.x / JSON Schema → ficta.schema.json conversion
✅ GraphQL SDL → ficta.schema.json conversion
✅ Watch mode: auto-regenerate on DDL file changes (watchAndGenerate / --watch)
✅ Parquet format output (Node.js)
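The pattern rules exercised above ({COUNTER} expands to a sequential number, # to a random digit) can be sketched in a few lines of plain JavaScript. This is a hypothetical illustration of the expansion semantics, not ficta's actual implementation:

```javascript
// Hypothetical sketch of pattern expansion:
// '{COUNTER}' → the row's sequential counter, '#' → a random digit.
function expandPattern(pattern, counter) {
  return pattern
    .replace(/\{COUNTER\}/g, String(counter))
    .replace(/#/g, () => String(Math.floor(Math.random() * 10)));
}

console.log(expandPattern('testuser+{COUNTER}@example.com', 3)); // 'testuser+3@example.com'
console.log(expandPattern('PRD-######', 1)); // e.g. 'PRD-483920'
```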
Browser Compatibility
- Modern Browsers: Chrome, Firefox, Safari, Edge (ES2020+)
- Legacy Support: Use the minified bundle for broader compatibility
Node.js Compatibility
- Node.js: 18+ (ES Modules required)
Dependencies
Production
- @faker-js/faker - Generate realistic fake data
- exceljs - Excel file generation (Node.js)
- xml2js - XML building (Node.js)
- js-yaml - YAML serialization
- @iarna/toml - TOML serialization
- yargs - CLI argument parsing
- graphql - GraphQL SDL parsing (for from-graphql)
- parquetjs-lite - Parquet file generation (Node.js)
Development
- jest - Testing framework
- esbuild - Browser bundle builder
- csv-parse - CSV parsing for tests
- c8 - Code coverage
AI Integration & Development
This project includes comprehensive AI/Agent support documentation:
- AI_CONTEXT.md - Quick reference for AI assistants (start here!)
- AGENTS.md - Complete AI integration guide with patterns and workflows
- AI_WORKFLOWS.md - Step-by-step workflows for common tasks
- ARCHITECTURE.md - Deep technical architecture documentation
- .github/copilot-instructions.md - GitHub Copilot specific instructions
These documents empower AI assistants to understand, modify, and extend the codebase effectively.
License
ISC
Contributing
Contributions welcome! Please open an issue or PR.
For AI-assisted development, please review our AI integration documentation:
- AI_CONTEXT.md - Quick project overview
- AGENTS.md - Comprehensive AI development guide
- AI_WORKFLOWS.md - Common task workflows
Made with ❤️ using Faker.js
