npm package discovery and stats viewer.

Discover Tips

  • General search

    [free text search, go nuts!]

  • Package details

    pkg:[package-name]

  • User packages

    @[username]

Sponsor

Optimize Toolset

I’ve always been into building performant and accessible sites, but lately I’ve been taking it extremely seriously. So much so that I’ve been building a tool to help me optimize and monitor the sites that I build to make sure that I’m making an attempt to offer the best experience to those who visit them. If you’re into performant, accessible and SEO friendly sites, you might like it too! You can check it out at Optimize Toolset.

About

Hi, 👋, I’m Ryan Hefner  and I built this site for me, and you! The goal of this site was to provide an easy way for me to check the stats on my npm packages, both for prioritizing issues and updates, and to give me a little kick in the pants to keep up on stuff.

As I was building it, I realized that I was actually using the tool to build the tool, and figured I might as well put this out there and hopefully others will find it to be a fast and useful way to search and browse npm packages as I have.

If you’re interested in other things I’m working on, follow me on Twitter or check out the open source projects I’ve been publishing on GitHub.

I am also working on a Twitter bot for this site to tweet the most popular, newest, random packages from npm. Please follow that account now and it will start sending out packages soon–ish.

Open Software & Tools

This site wouldn’t be possible without the immense generosity and tireless efforts from the people who make contributions to the world and share their work via open source initiatives. Thank you 🙏

© 2024 – Pkg Stats / Ryan Hefner

@codetailor/split-json

v1.0.3

Published

Large JSON file splitter

Downloads

20

Readme

Split JSON

NPM

npm npm Known Vulnerabilities license

A small Node.js module that splits large JSON files (with an array of objects) into smaller part files.

It does the following:

  • Opens a read stream to the large JSON file
  • Reads each top-level item from the array, one at a time
  • Writes each smaller list of items into a part file with sequencial numbering

For more information, please read the Usage section.

Warning

I'll be implementing changes as quickly as possible, if you have a bug report or feature request, please read the Feedback section.

I'll ensure that patch (0.0.x) updates won't break your code, but major (x.0.0) and minor (0.x.0) ones might.

Always check this README file before upgrading to the latest version.

Installation

To install the package, run this inside your project's folder.

$ npm i @codetailor/split-json

Usage

The module exports a Promise that must be handled on your side.

// Javascript imports
const split = require('split-json');
const path = require('path');

// Typescript imports
import split from 'split-json';
import * as path from 'path';

// Sample data, replace with your own
const inputFilePath = path.join('data', 'input', 'large-json-file.json'); // Path to the large JSON file
const outputFolder = path.join('data', 'output'); // Path to the folder for the part files
const outputPrefix = 'part-'; // Prefix for the part filenames
const maxItemsPerFile = 1; // (optional) Maximum number of items in each part file (default: 10000)
const minPartNumberLength = 3; // (optional) Minimum length of the part file number (ex: 4 -> 0001) (default: 4)

// Then/catch version
split(inputFilePath, outputFolder, outputPrefix, maxItemsPerFile, minPartNumberLength)
  .then(() => {
    // Insert your code here, part files have been created
  })
  .catch(console.error);

// Async/await version
(async () => {
  try {
    await split(inputFilePath, outputFolder, outputPrefix, maxItemsPerFile, minPartNumberLength);

    // Insert your code here, part files have been created
  }

  catch (error) {
    console.error(error);
  }
})();

With the following example input file in data/input/large-json-file.json:

[
  { "id": 1 },
  { "id": 2 },
  { "id": 3 },
  { "id": 4 }
]

The module should generate the following part files in the data/output folder:

part-001.json

[{"id":1}]

part-002.json

[{"id":2}]

part-003.json

[{"id":3}]

part-004.json

[{"id":4}]

Important notes:

  • The input file must contain an array of objects in valid JSON format
  • Each generated part file will contain an array of maxItemsPerFile items in minified JSON format

Errors

In case of error, the function will reject the promise with the respective error.

Custom error messages generated by the module are:

  • Input file not found when the module can't open or find the input JSON file

Feedback

All bug reports and feature requests are welcome, and should be submitted through one of the following channels:

All requests will be created as Github issues, if you don't use that channel.

License

MIT © 2023 Ricardo Nunes