JSON Restruct
A high-performance JSON mapping engine for transforming JSON data into structured, API-ready payloads using declarative schemas.
✨ Features
- ✅ Multiple wildcards (*) for array expansion
- ✅ Excel-style column names (['Applicant Name'])
- ✅ Multi-path fallback for flexible data extraction
- ✅ Nested output schemas for complex structures
- ✅ Built-in transforms (toNumber, toDateTime, regex, case conversions, etc.)
- ✅ Regex capture groups with group indexing
- ✅ Transform pipelines for chained transformations
- ✅ Conditional transforms (when/then) for dynamic mapping
- ✅ Default values for missing data
- ✅ Row-level filters for data filtering
- ✅ Array filtering with [key=value] syntax
- ✅ Stream-friendly & memory efficient
- ✅ Schema validation for type safety
- ✅ Zero dependencies (except dayjs for date handling)
📦 Installation
npm install json-restruct
🚀 Quick Start
const _json = require('json-restruct');
const sourceData = {
rows: [
{
'Application No': 230085,
'User Name': 'John Doe',
Gender: 'male',
Age: 27,
id: '230085_12'
}
]
};
const schema = {
application_no: {
path: "rows.*.['Application No']",
transform: "toNumber"
},
name: {
path: "rows.*.['User Name']"
},
age: {
path: "rows.*.Age",
transform: "toNumber",
default: 0
}
};
const result = _json.map(sourceData, schema);
console.log(result);
// Output: [{ application_no: 230085, name: 'John Doe', age: 27 }]
📚 Table of Contents
- Basic Usage
- Mapping Schema
- Paths & Wildcards
- Transforms
- Conditional Transforms
- Filters
- Nested Output Schemas
- Array Filtering
- Schema Validation
- API Reference
- Performance Considerations
- Examples
Basic Usage
Import the Module
// CommonJS
const _json = require('json-restruct');
// ES Modules (if supported)
import _json from 'json-restruct';
Simple Mapping
const source = {
users: [
{ name: 'Alice', age: '30' },
{ name: 'Bob', age: '25' }
]
};
const schema = {
name: { path: 'users.*.name' },
age: { path: 'users.*.age', transform: 'toNumber' }
};
const result = _json.map(source, schema);
// [
// { name: 'Alice', age: 30 },
// { name: 'Bob', age: 25 }
// ]
Mapping Schema
A mapping schema defines how source data should be transformed into output format. Each field in the schema can have the following properties:
| Property | Type | Description |
|----------|------|-------------|
| path | string or string[] | Source path(s), supports wildcards (*) |
| transform | string, string[], or object | Transform definition (see Transforms) |
| default | any | Default value used if resolved value is null or undefined |
| type | "object" | Indicates nested output object (requires properties) |
| properties | object | Nested schema properties (when type: "object") |
Example Schema
const schema = {
id: {
path: 'rows.*.id',
transform: 'toNumber'
},
name: {
path: ['rows.*.name', 'rows.*.fullName'], // Fallback paths
default: 'Unknown'
},
metadata: {
type: 'object',
properties: {
created: { path: 'rows.*.created' },
updated: { path: 'rows.*.updated' }
}
}
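  // Illustrative output shape (one object per row; values depend on the source):
  // [{ id: <number>, name: <name or 'Unknown'>, metadata: { created: ..., updated: ... } }]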
};
Paths & Wildcards
Single Wildcard
Expands arrays to create one output object per array item:
{
name: { path: 'rows.*.name' }
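  // e.g. { rows: [{ name: 'Alice' }, { name: 'Bob' }] } maps to [{ name: 'Alice' }, { name: 'Bob' }]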
}
Multiple Wildcards
Creates a cartesian product of all wildcard expansions:
{
test_code: { path: 'rows.*.tests.*.code' }
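  // e.g. 2 rows with 2 tests each produce 4 output objects, one per (row, test) pair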
}
Multi-Path Fallback
If the first path returns null, subsequent paths are tried:
{
policy_no: {
path: [
'rows.*.PolicyNo',
'rows.*.[\'Policy Number\']',
'rows.*.policy_number'
]
}
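  // e.g. a row that only has policy_number still resolves policy_no via the third path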
}
Bracket Notation
Use bracket notation for keys with special characters or spaces:
{
applicant_name: {
path: 'rows.*.[\'Applicant Name\']'
}
}
Transforms
Transforms modify values during mapping. They can be applied as strings, arrays (pipelines), or objects (conditionals).
Available Transforms
| Transform | Description | Example |
|-----------|-------------|---------|
| toNumber | Converts to number | "123" → 123 |
| toDateTime | Formats date/time | toDateTime:YYYY-MM-DD HH:mm:ss |
| toLowerCase | Converts to lowercase | "HELLO" → "hello" |
| toUpperCase | Converts to uppercase | "hello" → "HELLO" |
| toTitleCase | Converts to title case | "hello world" → "Hello World" |
| toCamelCase | Converts to camelCase | "hello_world" → "helloWorld" |
| toSnakeCase | Converts to snake_case | "HelloWorld" → "hello_world" |
| toKebabCase | Converts to kebab-case | "HelloWorld" → "hello-world" |
| toPascalCase | Converts to PascalCase | "hello_world" → "HelloWorld" |
| prefix | Adds prefix to value | prefix:ID- → "ID-123" |
| postfix | Adds postfix to value | postfix:-END → "123-END" |
| static | Returns static value | static:ACTIVE → "ACTIVE" (ignores input) |
| regex | Extracts using regex | regex:\\d+:0 |
| toJson | Parses JSON string | '{"key":"value"}' → {key: "value"} |
| toString | Converts to string | 123 → "123" |
| toBoolean | Converts to boolean | "true" → true |
Transform Syntax
Simple transform:
{
age: {
path: 'rows.*.age',
transform: 'toNumber'
}
}
Transform with parameters:
{
date: {
path: 'rows.*.timestamp',
transform: 'toDateTime:YYYY-MM-DD HH:mm:ss'
}
}
Regex with capture group:
{
id: {
path: 'rows.*.indexing',
transform: 'regex:(\\d+)_(\\d+):2' // Extracts second capture group
}
}
Prefix transform:
{
product_id: {
path: 'rows.*.id',
transform: 'prefix:PROD-' // Adds "PROD-" before the value
}
}
Postfix transform:
{
order_number: {
path: 'rows.*.order',
transform: 'postfix:-COMPLETE' // Adds "-COMPLETE" after the value
}
}
Combining prefix and postfix:
{
formatted_id: {
path: 'rows.*.id',
transform: ['prefix:ID-', 'postfix:-END'] // Pipeline: adds prefix then postfix
}
}
Static transform:
{
status: {
path: 'rows.*.any_field',
transform: 'static:ACTIVE' // Always returns "ACTIVE" regardless of input value
}
}
Static transform with different values:
{
type: {
path: 'rows.*.data',
transform: 'static:user' // Always returns "user"
},
category: {
path: 'rows.*.data',
transform: 'static:premium' // Always returns "premium"
}
}
Transform pipeline:
{
extracted_id: {
path: 'rows.*.indexing',
transform: [
'regex:(\\d+)_(\\d+):1',
'toNumber'
]
}
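  // e.g. '230085_12' → regex captures '230085' (group 1) → toNumber → 230085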
}
Conditional Transforms
Use conditional transforms to apply different transformations based on value conditions.
Syntax
{
transform: {
when: [
{ if: <condition>, then: <transform> }
],
default: <transform> // Optional
}
}
Supported Condition Operators
| Operator | Description | Example |
|----------|-------------|---------|
| eq | Equals | { eq: "male" } |
| ne | Not equals | { ne: "female" } |
| gt | Greater than | { gt: 18 } |
| lt | Less than | { lt: 100 } |
| gte | Greater than or equal | { gte: 18 } |
| lte | Less than or equal | { lte: 65 } |
| in | Value exists in array | { in: ["M", "m", "male"] } |
| regex | Regex test | { regex: "^\\d+_\\d+$" } |
| exists | Value exists (not null/undefined) | { exists: true } |
Examples
Gender Normalization:
{
gender: {
path: 'rows.*.Gender',
transform: {
when: [
{ if: { in: ['M', 'm', 'male'] }, then: 'toLowerCase' },
{ if: { in: ['F', 'f', 'female'] }, then: 'toLowerCase' }
],
default: 'unknown'
}
}
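  // e.g. Gender 'M' matches the first condition and is lower-cased to 'm'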
}
Conditional Regex:
{
test_id: {
path: 'rows.*.Indexing',
transform: {
when: [
{
if: { regex: '^\\d+_\\d+$' },
then: 'regex:(\\d+)_(\\d+):2'
}
],
default: null
}
}
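  // e.g. Indexing '230085_12' matches the pattern, so capture group 2 ('12') is extracted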
}
Filters
Filters allow you to skip rows that don't match certain criteria. Rows that fail filter conditions are excluded from the output.
Filter Syntax
{
// ... field mappings ...
filter: {
all: [
{ path: 'rows.*.Gender', eq: 'male' },
{ path: 'rows.*.Age', gt: 18 }
]
}
}
Filter Operators
Filters support the same operators as conditional transforms, plus logical operators:
- eq, ne, gt, lt, gte, lte, in, regex, exists
- and - All conditions must be true
- or - At least one condition must be true
- not - Negates a condition
Example
const schema = {
name: { path: 'rows.*.name' },
age: { path: 'rows.*.age', transform: 'toNumber' },
filter: {
all: [
{ path: 'rows.*.age', gte: 18 },
{ path: 'rows.*.status', eq: 'active' }
]
}
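  // e.g. a row with age 16, or with status 'inactive', is dropped from the output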
};
Nested Output Schemas
Create nested output structures using type: "object":
{
user: {
type: 'object',
properties: {
id: {
path: 'rows.*.[\'Application No\']',
transform: 'toNumber'
},
name: {
path: 'rows.*.[\'User Name\']'
},
age: {
path: 'rows.*.Age',
transform: 'toNumber',
default: 0
}
}
}
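  // Illustrative output (using the Quick Start data): [{ user: { id: 230085, name: 'John Doe', age: 27 } }]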
}
Array Filtering
Select specific objects from arrays using inline filter syntax: arrayField[key=value].property
Syntax
arrayField[key=value].property
Example
Input:
{
clinical: [
{ key: 'name', value: 'Kumar' },
{ key: 'age', value: 32 }
]
}
Schema:
{
patient_name: {
path: 'clinical[key=name].value'
}
}
Output:
[
{ patient_name: 'Kumar' }
]
With Wildcards
{
age: {
path: 'rows.*.clinical[key=age].value',
transform: 'toNumber'
}
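  // e.g. a row whose clinical array contains { key: 'age', value: 32 } maps to { age: 32 }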
}
Schema Validation
Validate schemas before mapping to catch errors early:
const _json = require('json-restruct');
try {
_json.validateSchema(schema);
const result = _json.map(source, schema);
} catch (error) {
console.error('Schema validation failed:', error.message);
}
API Reference
map(source, schema)
Maps source data to output format using a declarative schema.
Parameters:
source (Object): Source data object to transform
schema (Object): Mapping schema definition
Returns:
Array<Object>: Array of mapped objects
Throws:
Error: If source or schema is invalid, or mapping encounters errors
Example:
const result = _json.map(sourceData, mappingSchema);
validateSchema(schema)
Validates a mapping schema for correctness.
Parameters:
schema (Object): Mapping schema to validate
Throws:
Error: If the schema is invalid, with a descriptive error message
Example:
try {
_json.validateSchema(schema);
} catch (error) {
console.error(error.message);
}
Performance Considerations
The mapper is optimized for large datasets:
- ✅ Stream-friendly - Can process data in chunks
- ✅ Early filtering - Filters applied before object construction
- ✅ Pre-tokenized paths - Paths are parsed once
- ✅ Regex compiled once - Patterns are cached
- ✅ No recursion - Iterative algorithms
- ✅ Deterministic execution - Predictable performance
Performance Tips:
- Use filters early to reduce processing
- Avoid deeply nested wildcards when possible
- Use specific paths instead of multiple fallbacks when you know the structure
- Pre-validate schemas in development
Tested with:
- 100k+ rows efficiently
- 500k+ rows with proper memory management
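One way to keep memory flat on very large inputs is to map the rows array in slices rather than all at once. The helper below is a minimal sketch, not part of the json-restruct API: mapInChunks, the 10,000-row chunk size, and the assumption that each row maps independently are all illustrative.
// Minimal sketch (not part of the json-restruct API): process a large dataset in
// row chunks so only one slice of `rows` is mapped at a time.
// Assumes the schema maps each row independently, as in the examples above.
const _json = require('json-restruct');

function mapInChunks(source, schema, chunkSize = 10000) {
  const output = [];
  for (let i = 0; i < source.rows.length; i += chunkSize) {
    // Keep top-level fields (e.g. client_id) and swap in the current slice of rows
    const chunk = { ...source, rows: source.rows.slice(i, i + chunkSize) };
    for (const item of _json.map(chunk, schema)) {
      output.push(item);
    }
  }
  return output;
}
For row-wise schemas like the ones above, the concatenated result should match a single _json.map call over the full dataset.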
Examples
Complete Example
Input Data:
{
client_id: 'CLIENT_001',
rows: [
{
'Application No': 230085,
'User Name': 'John Doe',
Gender: 'male',
Age: 27,
id: '230085_12',
tests: [
{ code: 'CG', value: '10' }
]
}
]
}
Mapping Schema:
{
client_id: { path: 'client_id' },
user: {
type: 'object',
properties: {
application_no: {
path: 'rows.*.[\'Application No\']',
transform: 'toNumber'
},
name: { path: 'rows.*.[\'User Name\']' },
age: {
path: 'rows.*.Age',
transform: 'toNumber',
default: 0
}
}
},
test: {
type: 'object',
properties: {
code: { path: 'rows.*.tests.*.code' },
value: {
path: 'rows.*.tests.*.value',
transform: 'toNumber'
}
}
},
filter: {
all: [
{ path: 'rows.*.Age', gt: 18 }
]
}
}
Output:
[
{
client_id: 'CLIENT_001',
user: {
application_no: 230085,
name: 'John Doe',
age: 27
},
test: {
code: 'CG',
value: 10
}
}
]
Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
License
MIT License - see LICENSE file for details.
Support
For issues, questions, or contributions, please open an issue on the GitHub repository.
Made with ❤️ for efficient data transformation
