@qvac/translation-nmtcpp

v7.2.2

Published

5 days ago

translation addon for qvac

Downloads

36,079

0High
0Medium
0Low

mafintosh

prdn

tether addon marian qvac

Translation Addons

This library simplifies the process of running various translation models within QVAC runtime applications. It provides a seamless interface to load, execute, and manage translation addons, offering support for multiple data sources (called data loaders).

Translation Addons

Supported Platforms

| Platform | Architecture | Min Version | Status | |----------|-------------|-------------|--------| | macOS | arm64, x64 | 14.0+ | Tier 1 | | iOS | arm64 | 17.0+ | Tier 1 | | Linux | arm64, x64 | Ubuntu 22+ | Tier 1 | | Android | arm64 | 12+ | Tier 1 | | Windows | x64 | 10+ | Tier 1 |

Installation

Prerequisites

Ensure that the Bare Runtime is installed globally on your system. If it's not already installed, you can add it using:

npm i -g bare

Note: Bare version must be 1.19.0 or higher. Verify your version with:

bare -v

Installing the Package

Install the main translation package via npm:

# Main package - supports Bergamot and IndicTrans backends (all languages)
npm i @qvac/translation-nmtcpp

Usage

The library provides a straightforward and intuitive workflow for translating text. Irrespective of the chosen model, the workflow remains the same:

1. Obtain Model Files

Before creating a model instance, you need the model files on disk. There are two options:

Bergamot models: Use ensureBergamotModelFiles() from lib/bergamot-model-fetcher to auto-download from Firefox CDN, or provide a local path.
IndicTrans2 models: Provide a local path to the GGML model file.

See the Quickstart Example for a complete working example.

2. Create the constructor arguments

The constructor accepts a single object with files, params, config, and an optional logger. The structure varies slightly depending on which backend you're using:

IndicTrans2 Model

For Indic language translations (English ↔ Hindi, Bengali, Tamil, etc.):

const model = new TranslationNmtcpp({
  files: { model: './models/ggml-indictrans2-en-indic-dist-200M.bin' },
  params: {
    mode: 'full',
    srcLang: 'eng_Latn',   // Source language (ISO 15924 code)
    dstLang: 'hin_Deva'    // Target language (ISO 15924 code)
  },
  config: {
    modelType: TranslationNmtcpp.ModelTypes.IndicTrans
  }
})

Key Parameters: | Parameter | Description | Example | |-----------|-------------|---------| | files.model | Path to GGML model file | './models/ggml-indictrans2-en-indic-dist-200M.bin' | | params.srcLang | Source language (ISO 15924) | 'eng_Latn', 'hin_Deva', 'ben_Beng' | | params.dstLang | Target language (ISO 15924) | 'eng_Latn', 'hin_Deva', 'tam_Taml' | | config.modelType | Required: TranslationNmtcpp.ModelTypes.IndicTrans | - |

IndicTrans2 model naming pattern:

ggml-indictrans2-{direction}-{size}.bin for q0f32 quantization
ggml-indictrans2-{direction}-{size}-q0f16.bin for q0f16 quantization
ggml-indictrans2-{direction}-{size}-q4_0.bin for q4_0 quantization

Where direction is en-indic, indic-en, or indic-indic, and size is dist-200M, dist-320M, or 1B.

Bergamot Model

Bergamot models (Firefox Translations) are downloaded automatically or provided as local files. Use ensureBergamotModelFiles() to handle download from Firefox CDN:

const TranslationNmtcpp = require('@qvac/translation-nmtcpp')
const path = require('bare-path')

const { ensureBergamotModelFiles, getBergamotFileNames } = require('@qvac/translation-nmtcpp/lib/bergamot-model-fetcher')

const srcLang = 'en'
const dstLang = 'it'
const modelDir = await ensureBergamotModelFiles(srcLang, dstLang, './models/bergamot-en-it')
const fileNames = getBergamotFileNames(srcLang, dstLang)

const model = new TranslationNmtcpp({
  files: {
    model: path.join(modelDir, fileNames.modelName),
    srcVocab: path.join(modelDir, fileNames.srcVocabName),
    dstVocab: path.join(modelDir, fileNames.dstVocabName)
  },
  params: {
    mode: 'full',
    srcLang,   // Source language (ISO 639-1 code)
    dstLang    // Target language (ISO 639-1 code)
  },
  config: {
    modelType: TranslationNmtcpp.ModelTypes.Bergamot
  }
})

Bergamot Model Files by Language Pair:

| Language Pair | Model File | Vocab File(s) | |---------------|------------|---------------| | en→it | model.enit.intgemm.alphas.bin | vocab.enit.spm | | it→en | model.iten.intgemm.alphas.bin | vocab.iten.spm | | en→es | model.enes.intgemm.alphas.bin | vocab.enes.spm | | es→en | model.esen.intgemm.alphas.bin | vocab.esen.spm | | en→fr | model.enfr.intgemm.alphas.bin | vocab.enfr.spm | | fr→en | model.fren.intgemm.alphas.bin | (see registry) | | en→de | model.ende.intgemm.alphas.bin | vocab.ende.spm | | en→ru | model.enru.intgemm.alphas.bin | vocab.enru.spm | | ru→en | model.ruen.intgemm.alphas.bin | vocab.ruen.spm | | en→zh | model.enzh.intgemm.alphas.bin | srcvocab.enzh.spm, trgvocab.enzh.spm | | zh→en | model.zhen.intgemm.alphas.bin | vocab.zhen.spm | | en→ja | model.enja.intgemm.alphas.bin | srcvocab.enja.spm, trgvocab.enja.spm | | ja→en | model.jaen.intgemm.alphas.bin | vocab.jaen.spm |

Key Parameters: | Parameter | Description | Example | |-----------|-------------|---------| | params.srcLang | Source language (ISO 639-1) | 'en', 'es', 'de' | | params.dstLang | Target language (ISO 639-1) | 'it', 'fr', 'de' | | files.model | Path to model weights file | './models/bergamot-en-it/model.enit.intgemm.alphas.bin' | | files.srcVocab | Path to source vocab file | './models/bergamot-en-it/vocab.enit.spm' | | files.dstVocab | Path to target vocab file | './models/bergamot-en-it/vocab.enit.spm' | | config.modelType | Required: TranslationNmtcpp.ModelTypes.Bergamot | - |

Bergamot model file naming convention:

model.{srctgt}.intgemm.alphas.bin - Model weights (e.g., model.enit.intgemm.alphas.bin)
vocab.{srctgt}.spm - Shared vocabulary for most language pairs
srcvocab.{srctgt}.spm + trgvocab.{srctgt}.spm - Separate vocabs for CJK languages (zh, ja)

Note: The list of supported languages for the srcLang and dstLang parameters differ by model type. Please refer to the Supported Languages section for details.

3. Create the `config` object

The config object is passed inside the constructor arguments (see step 2). It contains:

Model-specific parameters (required for some backends)
Generation/decoding parameters (optional, controls output quality)

Model-Specific Parameters

| Parameter | IndicTrans2 | Bergamot | |-----------|-------------|----------| | config.modelType | Required | Required | | files.srcVocab | Not needed | Required (path to source vocab) | | files.dstVocab | Not needed | Required (path to target vocab) |

Generation/Decoding Parameters (IndicTrans Only)

These parameters control how the model generates output. Note: Full parameter support is only available for IndicTrans2 models. Bergamot has limited parameter support.

// Generation parameters for IndicTrans2
const generationParams = {
  beamsize: 4,            // Beam search width (>=1). 1 disables beam search
  lengthpenalty: 0.6,     // Length normalization strength (>=0)
  maxlength: 128,         // Maximum generated tokens (>0)
  repetitionpenalty: 1.2, // Penalize previously generated tokens (0..2)
  norepeatngramsize: 2,   // Disallow repeating n-grams of this size (0..10)
  temperature: 0.8,       // Sampling temperature [0..2]
  topk: 40,               // Keep top-K logits [0..vocab_size]
  topp: 0.9               // Nucleus sampling threshold (0 < p <= 1)
}

Backend & GPU Settings

These config keys control which compute backend the native addon picks at load time. They are read once at load(); changing them after load has no effect (call unload() + load() to re-select a backend).

const backendSettings = {
  use_gpu: true,              // Enable GPU inference (default: false → CPU-only)
  gpu_backend: 'vulkan',      // Optional: pick a specific backend by name substring
  gpu_device: 0,              // Optional: ordinal within matching devices (default: 0)
  backendsDir: './prebuilds', // Optional: dir where libqvac-ggml-*.so / *.dll live
  openclCacheDir: '/path/ok'  // Optional (Android only): OpenCL kernel-cache dir
}

The three GPU control keys each accept a camelCase alias alongside the snake_case primary. Snake_case mirrors the C-struct field names; camelCase matches the ocr-onnx sibling addon convention. When both forms are present in the same config object, snake_case takes precedence.

| Key | Alias | Type | Description | |-----|-------|------|-------------| | use_gpu | useGPU | boolean | Enable GPU inference. When false (default), only the CPU backend is used. Bergamot is CPU-only by design — this flag is effectively a no-op for Bergamot. | | gpu_backend | gpuBackend | string | Case-insensitive substring match against the ggml device name (e.g. "vulkan", "vulkan0", "opencl", "metal"). When set, the selector runs a single explicit pass and picks the first non-CPU device whose name contains the substring. When unset, the default gated selection runs (see Backends). Explicit "opencl" bypasses the build-time USE_OPENCL guard — an informed opt-in. | | gpu_device | gpuDevice | int | Ordinal within the matching devices. Defaults to 0 (first match). Example: {gpu_backend: "vulkan", gpu_device: 1} picks the second Vulkan adapter. | | backendsDir | — | string | Path to the directory containing the runtime backend shared libraries (libqvac-ggml-vulkan.so, etc.). Defaults to <package>/prebuilds when unset, which is where npm install places the shipped prebuilds. Must be an absolute path; paths with .. segments or unresolvable symlinks are rejected with a warning and fall back to the default prebuilds directory. | | openclCacheDir | — | string | Android only. Writable directory the OpenCL backend uses for its JIT kernel cache (forwarded via GGML_OPENCL_CACHE_DIR). Must be an absolute path; paths with .. segments are rejected. The OpenCL backend falls back to a non-writable relative path if this is unset, which ggml_abort()s during init inside the app sandbox — always provide an app-writable path when exercising OpenCL on Android. |

Tip: Use model.getActiveBackendName() after load() to confirm which backend actually took the request — see Additional Features. The GGML scheduler silently falls back to CPU when no usable GPU ICD is registered, and this is the only way to detect that.

4. Create Model Instance

Import TranslationNmtcpp and create an instance by combining args (from Step 2) with config parameters (from Step 3):

const TranslationNmtcpp = require('@qvac/translation-nmtcpp')

IndicTrans2

const model = new TranslationNmtcpp({
  files: { model: './models/ggml-indictrans2-en-indic-dist-200M.bin' },
  params: { mode: 'full', srcLang: 'eng_Latn', dstLang: 'hin_Deva' },
  config: {
    modelType: TranslationNmtcpp.ModelTypes.IndicTrans,
    ...generationParams,
    maxlength: 256
  }
})

Bergamot

const model = new TranslationNmtcpp({
  files: {
    model: './models/bergamot-en-it/model.enit.intgemm.alphas.bin',
    srcVocab: './models/bergamot-en-it/vocab.enit.spm',
    dstVocab: './models/bergamot-en-it/vocab.enit.spm'
  },
  params: { mode: 'full', srcLang: 'en', dstLang: 'it' },
  config: {
    modelType: TranslationNmtcpp.ModelTypes.Bergamot,
    beamsize: 4
  }
})

Available Model Types:

TranslationNmtcpp.ModelTypes = {
  IndicTrans: 'IndicTrans',
  Bergamot: 'Bergamot'
}

5. Load Model

try {
  // Basic usage
  await model.load()
} catch (error) {
  console.error('Failed to load model:', error)
}

6. Run the Model

We can perform inference on the input text using the run() method. This method returns a QVACResponse object.

try {
  // Execute translation on input text
  const response = await model.run('Hello world! Welcome to the internet of peers!')

  // Process streamed output using callback
  await response
    .onUpdate(outputChunk => {
      // Handle each new piece of translated text
      console.log(outputChunk)
    })
    .await() // Wait for translation to complete

  // Access performance statistics (if enabled with opts.stats)
  if (response.stats) {
    console.log('Translation completed in:', response.stats.totalTime, 's')
  }
} catch (error) {
  console.error('Translation failed:', error)
}

7. Batch Translation (Bergamot Only)

For translating multiple texts efficiently, use the runBatch() method instead of calling run() multiple times.

Important: runBatch() is only available with the Bergamot backend. IndicTrans2 models should use sequential run() calls.

// Array of texts to translate (English)
const textsToTranslate = [
  'Hello world!',
  'How are you today?',
  'Machine translation has revolutionized communication.'
]

try {
  // Batch translation - returns array of translated strings
  const translations = await model.runBatch(textsToTranslate)

  // Output each translation
  translations.forEach((translatedText, index) => {
    console.log(`Original: ${textsToTranslate[index]}`)
    console.log(`Translated: ${translatedText}\n`)
  })
} catch (error) {
  console.error('Batch translation failed:', error)
}

runBatch() vs run():

| Method | Input | Output | Backend Support | |--------|-------|--------|-----------------| | run(text) | Single string | QVACResponse with streaming | All (IndicTrans, Bergamot) | | runBatch(texts) | Array of strings | Array of strings | Bergamot only |

Note: runBatch() is significantly faster when translating multiple texts as it processes them in a single batch operation. See examples/batch.example.js for a complete example with Bergamot.

8. Unload the Model

// Always unload the model when finished to free memory
try {
  await model.unload()
} catch (error) {
  console.error('Failed to unload model:', error)
}

Additional Features

Cancel: Translation can be cancelled mid-inference
Progress Tracking: Monitor loading progress with a callback function
Performance Stats: Measure inference time with the stats option
Active Backend Inspection: model.getActiveBackendName() returns the name of the backend that load() actually selected for inference, or a sentinel when no GPU backend is active. Useful for confirming use_gpu / gpu_backend selection actually took effect (the GGML scheduler silently falls back to CPU when no usable GPU ICD is registered):
```
await model.load()
const backend = model.getActiveBackendName()
console.log(`inference backend: ${backend}`)
// → "Vulkan0" | "OpenCL" | "Metal" | "CPU" | "Bergamot-CPU" | "Unloaded"
```
Sentinels: "Unloaded" (not yet loaded), "Bergamot-CPU" (Bergamot backend — CPU-only by design, independent of use_gpu), "CPU" (GGML backend but only the CPU device registered).

For a complete working example that brings all these steps together, see the Quickstart Example below.

Quickstart Example

This quickstart demonstrates Bergamot model inference (English → Italian translation).

Other Model Types: For IndicTrans2 models, refer to Section 2: Create the args object for model-specific configuration.

Follow these steps to run the Quickstart Example:

1. Create a New Project

mkdir translation-example
cd translation-example
npm init -y

2. Install Required Dependencies

Note: Ensure you've completed the Prerequisites setup (Bare runtime installed).

npm i @qvac/translation-nmtcpp

3. Create `quickstart.js` and paste the following code into it

touch quickstart.js

// quickstart.js

'use strict'

const TranslationNmtcpp = require('@qvac/translation-nmtcpp')
const path = require('bare-path')

const text = 'Machine translation has revolutionized the way we communicate across language barriers in the modern digital world.'

async function main () {
  const { ensureBergamotModelFiles, getBergamotFileNames } = require('@qvac/translation-nmtcpp/lib/bergamot-model-fetcher')

  const srcLang = 'en'
  const dstLang = 'it'

  // 1. Ensure model files are present (downloads from Firefox CDN if needed)
  const modelDir = await ensureBergamotModelFiles(srcLang, dstLang, './model/bergamot/enit')
  const fileNames = getBergamotFileNames(srcLang, dstLang)

  // 2. Create Model Instance with resolved file paths
  const model = new TranslationNmtcpp({
    files: {
      model: path.join(modelDir, fileNames.modelName),
      srcVocab: path.join(modelDir, fileNames.srcVocabName),
      dstVocab: path.join(modelDir, fileNames.dstVocabName)
    },
    params: { mode: 'full', srcLang, dstLang },
    config: { modelType: TranslationNmtcpp.ModelTypes.Bergamot }
  })

  // 3. Load model
  await model.load()

  try {
    // 4. Run the Model
    const response = await model.run(text)

    await response
      .onUpdate(data => {
        console.log(data)
      })
      .await()

    console.log('translation finished!')
  } finally {
    // 5. Unload the model
    await model.unload()
  }
}

main().catch(console.error)

4. Run the Example

bare quickstart.js

You should see this output on successful execution

La traduzione automatica ha rivoluzionato il modo in cui comunichiamo attraverso le barriere linguistiche nel mondo digitale moderno.
translation finished!

Adapting for Other Model Types

To use IndicTrans2 models instead, modify the constructor arguments as shown in Section 2: Create the constructor arguments and Section 4: Create Model Instance.

Quick Reference:

| Model Type | Key Changes | |------------|-------------| | IndicTrans2 | Use ISO 15924 language codes (eng_Latn, hin_Deva), provide files.model path, set config.modelType: IndicTrans | | Bergamot | Use ensureBergamotModelFiles() or local paths, provide files.model/files.srcVocab/files.dstVocab, set config.modelType: Bergamot |

Other Examples

For more detailed examples covering different use cases, refer to the examples/ directory:

| Example | Description | Model Type | |---------|-------------|------------| | indictrans.js | English-to-Hindi translation with IndicTrans2 | IndicTrans2 | | batch.example.js | Batch translation with runBatch() method | Bergamot | | pivot.example.js | Pivot translation (e.g., es→en→it) via Bergamot | Bergamot | | quickstart.js | Bergamot backend quickstart | Bergamot |

Supported Languages

IndicTrans2 Language Pairs

IndicTrans2 supports translation between English and 22 Indic languages. The following directions are available:

| Direction | Registry Keys | Sizes | |-----------|-----------------|-------| | English → Indic | Yes | 200M, 1B | | Indic → English | Yes | 200M, 1B | | Indic → Indic | Yes | 320M, 1B |

Supported Indic Languages:

Bergamot Models (Firefox Translations)

Language pairs available in the registry:

| Language | Code | en→X | X→en | |----------|------|------|------| | Arabic | ar | Yes | Yes | | Czech | cs | Yes | Yes | | Spanish | es | Yes | Yes | | French | fr | Yes | Yes | | Italian | it | Yes | Yes | | Japanese | ja | Yes | Yes | | Portuguese | pt | Yes | Yes | | Russian | ru | Yes | Yes | | Chinese | zh | Yes | Yes |

The Bergamot backend supports all language pairs available in Firefox Translations. See the Firefox Translations models repository for the complete and up-to-date list of supported language pairs. Use ensureBergamotModelFiles() to auto-download models from the Firefox CDN.

ModelClasses and Packages

ModelClass

The main class exported by this library is TranslationNmtcpp, which supports multiple translation backends:

const TranslationNmtcpp = require('@qvac/translation-nmtcpp')

// Available model types
TranslationNmtcpp.ModelTypes = {
  IndicTrans: 'IndicTrans',
  Bergamot: 'Bergamot'
}

Available Packages

Main Package

| Package | Description | Backends | Languages | |---------|-------------|----------|-----------| | @qvac/translation-nmtcpp | Main translation package | Bergamot, IndicTrans | See Supported Languages |

The main package supports both backends and all their respective languages. See Supported Languages for the complete list.

Backends

This project supports multiple translation backends (Bergamot/Firefox and IndicTrans2) and, for IndicTrans2 at runtime, multiple compute backends (CPU, Vulkan, Metal, OpenCL).

Translation backend build flags

The Bergamot backend is included in the build by default. To build without Bergamot support (reduces build time and dependencies):

bare-make generate -D USE_BERGAMOT=OFF

Compute backend selection

At runtime, the addon picks a ggml compute device using the use_gpu, gpu_backend, and gpu_device config keys described in Backend & GPU Settings. When use_gpu is true and gpu_backend is not set, the selector falls back to a default gated pass:

If built with USE_OPENCL=ON, prefer an OpenCL device first.
Otherwise (and as a fallback in the ON case) pick any non-CPU device. When USE_OPENCL=OFF (the default), OpenCL-named devices are also excluded from the fallback — the OpenCL backend still loads as a shared library but is never selected automatically.

An explicit config.gpu_backend: 'opencl' always bypasses the USE_OPENCL guard and selects OpenCL directly. The flag gates automatic selection only; caller-explicit requests are honored.

Compute backend build flag

| Flag | Default | Purpose | |------|---------|---------| | USE_OPENCL | OFF | When ON, the runtime selector prefers OpenCL over other non-CPU devices when use_gpu is true and gpu_backend is unset. Off by default because the Adreno 830 q4_0 transpose path in ggml-opencl aborts on NMT tensors whose second dimension isn't a multiple of 4 (see QVAC-17790 root-cause analysis). Callers that still want OpenCL can opt in per model via config.gpu_backend: 'opencl' regardless of this flag. |

# Default build (OpenCL not auto-selected)
bare-make generate

# Restore OpenCL-preferred selection (once the upstream ggml fix ships)
bare-make generate -D USE_OPENCL=ON

Benchmarking

We conduct comprehensive benchmarking of our translation models to evaluate their performance across different language pairs and metrics. Our benchmarking suite measures translation quality using BLEU and COMET scores, as well as performance metrics including load times and inference speeds.

Benchmark Results

Benchmarks are run via CI for all supported language pairs and model configurations.

The benchmarking covers:

Translation Quality: BLEU, chrF++, and COMET scores for accuracy assessment
Performance Metrics: Inference speed measured in tokens per second, total load time, and total inference time
Language Pairs: All supported source-target language combinations
Model Variants: Different quantization levels and model sizes

Results are updated regularly as new model versions are released.

Logging

The library supports configurable logging for both JavaScript and C++ (native) components. By default, C++ logs are suppressed for cleaner output.

Enabling C++ Logs

To enable verbose C++ logging, pass a logger object in the args parameter:

// Enable C++ logging
const logger = {
  info: (msg) => console.log('[C++ INFO]', msg),
  warn: (msg) => console.warn('[C++ WARN]', msg),
  error: (msg) => console.error('[C++ ERROR]', msg),
  debug: (msg) => console.log('[C++ DEBUG]', msg)
}

const model = new TranslationNmtcpp({
  files: { model: './models/bergamot-en-it/model.enit.intgemm.alphas.bin', srcVocab: '...', dstVocab: '...' },
  params: { mode: 'full', srcLang: 'en', dstLang: 'it' },
  config: { modelType: TranslationNmtcpp.ModelTypes.Bergamot },
  logger
})

Disabling C++ Logs

To suppress all C++ logs, either omit the logger parameter or set it to null:

const model = new TranslationNmtcpp({
  files: { model: './models/bergamot-en-it/model.enit.intgemm.alphas.bin', srcVocab: '...', dstVocab: '...' },
  params: { mode: 'full', srcLang: 'en', dstLang: 'it' },
  config: { modelType: TranslationNmtcpp.ModelTypes.Bergamot }
})

Using Environment Variables (Recommended for Examples)

All examples support the VERBOSE environment variable:

# Run with C++ logging disabled (default)
bare examples/quickstart.js

# Run with C++ logging enabled
VERBOSE=1 bare examples/quickstart.js

Log Levels

The C++ backend supports these log levels (mapped from native priority):

| Priority | Level | Description | |----------|-------|-------------| | 0 | error | Critical errors | | 1 | warn | Warnings | | 2 | info | Informational messages | | 3 | debug | Debug/trace messages |

Testing

This project includes comprehensive testing capabilities for both JavaScript and C++ components.

JavaScript Tests

# Run all JavaScript tests
npm test                   # Unit + integration tests
npm run test:unit          # Unit tests only
npm run test:integration   # Integration tests only

C++ Tests

The project includes C++ tests using Google Test framework.

npm Commands (Recommended - Cross-Platform)

# Build and run C++ tests
npm run test:cpp:build     # Build C++ test suite (auto-detects platform)
npm run test:cpp:run       # Run all C++ unit tests
npm run test:cpp           # Build and run in one command

# C++ Code Coverage
npm run coverage:cpp:build # Build with coverage instrumentation  
npm run coverage:cpp:run   # Run tests and collect coverage data
npm run coverage:cpp:report # Generate HTML coverage report
npm run coverage:cpp       # Complete coverage workflow

# Combined Testing
npm run test:all           # Run both JavaScript and C++ tests

Glossary

Bare – Lightweight, modular JavaScript runtime for desktop and mobile. Docs
Registry – QVAC model registry for distributing AI model weights and configuration.
Hyperbee – Decentralized B-tree built on Hypercores, with a key-value API. Docs
Corestore – Factory for managing named collections of Hypercores. Docs
QVAC – Open-source SDK for building decentralized AI applications.
QVACResponse – The response object used by the QVAC API. GitHub
Model Files – Local model weight files passed directly to the constructor via files. Models can be obtained from the registry, Firefox CDN, or provided locally.

Resources

Pear Platform – Decentralized platform for deploying apps. pears.com
Bare Runtime Docs – For running QVAC apps in a lightweight environment. docs.pears.com/bare
IndicTrans2 Model – Pretrained multilingual translation models. AI4Bharat/IndicTrans2
Translation App Example – QVAC-based translation application. qvac-examples/translation-app

Contributing

We welcome contributions! Here's how to get started:

Building from Source

This project contains C++ native addons that must be built before running tests.

# 1. Clone the monorepo
git clone https://github.com/tetherto/qvac.git
cd qvac/packages/translation-nmtcpp

# 2. Install dependencies
npm install

# 3. Build the native addon
npm run build

Note: Building requires CMake, a C++ compiler (GCC/Clang), and vcpkg. See the build prerequisites in the CI workflow for full system requirements.

Development Workflow

Fork the monorepo
Clone your fork: git clone https://github.com/YOUR_USERNAME/qvac.git
Navigate: cd qvac/packages/translation-nmtcpp
Install and build: npm install && npm run build
Create a branch: git checkout -b feature/your-feature-name
Make changes and ensure tests pass: npm test
Commit with a descriptive message: git commit -m "feat: add your feature"
Push to your fork: git push origin feature/your-feature-name
Open a Pull Request against the main branch

Code Style

This project uses StandardJS for JavaScript linting:

npm run lint        # Check for lint errors
npm run lint:fix    # Auto-fix lint errors

Running Tests

npm test            # Run all tests (lint + unit + integration)
npm run test:unit   # Unit tests only
npm run test:cpp    # C++ tests only (requires build first)

License

This project is licensed under the Apache-2.0 License - see the LICENSE file for details. For any questions or issues, please open an issue on the GitHub repository.