npm package discovery and stats viewer.

Discover Tips

  • General search

    [free text search, go nuts!]

  • Package details

    pkg:[package-name]

  • User packages

    @[username]

Sponsor

Optimize Toolset

I’ve always been into building performant and accessible sites, but lately I’ve been taking it extremely seriously. So much so that I’ve been building a tool to help me optimize and monitor the sites that I build to make sure that I’m making an attempt to offer the best experience to those who visit them. If you’re into performant, accessible and SEO friendly sites, you might like it too! You can check it out at Optimize Toolset.

About

Hi, 👋, I’m Ryan Hefner  and I built this site for me, and you! The goal of this site was to provide an easy way for me to check the stats on my npm packages, both for prioritizing issues and updates, and to give me a little kick in the pants to keep up on stuff.

As I was building it, I realized that I was actually using the tool to build the tool, and figured I might as well put this out there and hopefully others will find it to be a fast and useful way to search and browse npm packages as I have.

If you’re interested in other things I’m working on, follow me on Twitter or check out the open source projects I’ve been publishing on GitHub.

I am also working on a Twitter bot for this site to tweet the most popular, newest, random packages from npm. Please follow that account now and it will start sending out packages soon–ish.

Open Software & Tools

This site wouldn’t be possible without the immense generosity and tireless efforts from the people who make contributions to the world and share their work via open source initiatives. Thank you 🙏

© 2026 – Pkg Stats / Ryan Hefner

dataset-translator

v1.0.4

Published

CLI tool for translating English datasets into Burmese using the Gemini API

Readme

Burmese Dataset Translator

A command-line tool for translating English text datasets into Burmese using the Gemini API.

This tool is designed for AI/ML dataset preparation and performs safe batch translation with automatic error handling and API limit protection.


Features

  • Batch translation to reduce API requests
  • Token-aware batching for reliable processing
  • Automatic retry for temporary API errors
  • Safe stop when API quota is exceeded
  • Saves translated and untranslated rows separately
  • Clean command-line output
  • Works with CSV datasets

Installation

You can run the tool without cloning the repository.

Install Globally

Install the CLI globally using npm:

npm install -g dataset-translator

After installation, run the tool from anywhere:

dataset-translator

Run Without Installing (npx)

You can also run the tool directly using npx:

npx dataset-translator

Requirements

  • Node.js 18 or later
  • Gemini API key

The translator uses the model:

  • Gemini 2.5 Flash

Dataset Format

Your CSV file must contain the following columns:

| Column | Description | | ------ | ----------------------------- | | text | English sentence to translate | | label | Classification label |

Example dataset:

text,label
I love programming.,2
This is very sad.,0
The movie was amazing.,1

Usage

Run the command:

dataset-translator

The CLI will ask for:

  1. CSV dataset path
  2. Gemini API key

Example:

Burmese Dataset Translator

Enter CSV dataset path: example_dataset.csv
Enter Gemini API Key: **********************

Output

The tool creates translated files in batches.

Example output:

burmese_dataset_0-457.csv
burmese_dataset_457-855.csv
untranslated_dataset_855-953.csv

File Types

Translated files

burmese_dataset_*.csv

Contain translated Burmese text.

Untranslated files

untranslated_dataset_*.csv

Contain rows that could not be translated due to API limits.


Error Handling

The translator includes automatic protection against API issues.

Temporary API Errors

The tool retries automatically.

API Quota Exceeded

If the Gemini API quota is reached:

  • Translation stops safely
  • Remaining rows are saved to untranslated_dataset
  • No data is lost

Typical Workflow

  1. Prepare dataset CSV file
  2. Run the translator
  3. Wait for batch translation
  4. Use translated dataset for training or analysis

Use Cases

This tool is useful for:

  • Sentiment analysis datasets
  • NLP dataset localization
  • Multilingual AI training
  • Dataset augmentation

License

MIT License