@realtimex/piper-tts-web

v1.1.1

Published

9 months ago

Fork of @diffusion-studio/vits-web for easier built-in PiperTTS use.

0High
0Medium
0Low

vits webassembly emscripten audio browser tts text-to-speech speech synthesis web ai webnn piperTTS AnythingLLM

This is a fork of @Mintplex-Labs/piper-tts-web/ for use of PiperTTS modules inside of a browser/Electron for RealTimeX.ai. A big shout-out goes to Rhasspy Piper, who open-sourced all the currently available models (MIT License) and to @jozefchutka who came up with the wasm build steps.
This fork includes upgrades to ONNX Runtime 1.22.0 and enhanced offline support.

Run PiperTTS based text-to-speech in the browser powered by ONNX Runtime 1.22.0

Difference from the original

Caching for client

You can leverage TTSSessions for a faster inference. (see index.js for implementation) Credit to this PR for the starting point.

Local WASM/Loading

You can define local WASM paths for the ort wasm as well as the phenomizer wasm and data file for faster local loading since the client could be offline.

Note:

This is a frontend library and will not work with NodeJS.

Usage

First of all, you need to install the library:

yarn add @therealtimex/piper-tts-web

Then you're able to import the library like this (ES only)

import * as tts from '@realtimex/piper-tts-web';

Now you can start synthesizing speech!

const wav = await tts.predict({
  text: "Text to speech in the browser is amazing!",
  voiceId: 'en_US-hfc_female-medium',
});

const audio = new Audio();
audio.src = URL.createObjectURL(wav);
audio.play();

// as seen in /example with Web Worker

Advanced Configuration

TTS Session Options

For more control over the TTS session, you can use the TtsSession class directly:

import { TtsSession } from '@realtimex/piper-tts-web';

const session = new TtsSession({
  voiceId: 'en_US-hfc_female-medium',
  allowLocalModels: true, // Allow loading local models (default: true)
  fallbackStrategy: 'auto', // 'cdn', 'local', or 'auto' (default: 'cdn')
  wasmPaths: {
    onnxWasm: '/custom/path/to/onnx/',
    piperData: '/custom/path/to/piper_phonemize.data',
    piperWasm: '/custom/path/to/piper_phonemize.wasm'
  },
  progress: (progress) => {
    console.log(`Loading: ${Math.round(progress.loaded * 100 / progress.total)}%`);
  },
  logger: (message) => {
    console.log(`TTS: ${message}`);
  }
});

const wav = await session.predict('Hello, world!');

Configuration Options

allowLocalModels: Enable/disable local model loading (default: true)
fallbackStrategy: How to handle CDN failures:
- 'cdn': Only use CDN (original behavior)
- 'local': Only use local paths
- 'auto': Try CDN first, fallback to local on failure
wasmPaths: Custom paths for WASM files
progress: Callback for download progress
logger: Callback for debug messages

Offline Support

This version includes enhanced offline support:

Automatic fallback: When fallbackStrategy: 'auto' is used, the library will automatically fallback to local WASM files if CDN is unreachable
Retry logic: Failed downloads are automatically retried with exponential backoff
Better error messages: More descriptive error messages help diagnose issues

Migration from @mintplex-labs/piper-tts-web

Update your package.json:

{
  "dependencies": {
    "@therealtimex/piper-tts-web": "^1.1.0"
  }
}

Update imports:

// Old
import * as tts from '@mintplex-labs/piper-tts-web';

// New
import * as tts from '@realtimex/piper-tts-web';

Optionally configure new options:

const session = new TtsSession({
  voiceId: 'en_US-hfc_female-medium',
  allowLocalModels: true, // Now configurable!
  fallbackStrategy: 'auto' // Enhanced offline support
});

With the initial run of the predict function you will download the model which will then be stored in your Origin private file system. You can also do this manually in advance (recommended), as follows:

await tts.download('en_US-hfc_female-medium', (progress) => {
  console.log(`Downloading ${progress.url} - ${Math.round(progress.loaded * 100 / progress.total)}%`);
});

The predict function also accepts a download progress callback as the second argument (tts.predict(..., console.log)).

If you want to know which models have already been stored, do the following

console.log(await tts.stored());

// will log ['en_US-hfc_female-medium']

You can remove models from opfs by calling

await tts.remove('en_US-hfc_female-medium');

// alternatively delete all

await tts.flush();

And last but not least use this snippet if you would like to retrieve all available voices:

console.log(await tts.voices());

// Hint: the key can be used as voiceId

What's New in v1.1.0

✅ ONNX Runtime 1.22.0: Updated to the latest version
✅ Configurable allowLocalModels: No more hardcoded restrictions
✅ Enhanced offline support: Automatic CDN fallback strategies
✅ Better error handling: Retry logic and descriptive error messages
✅ Improved logging: Optional debug logging for troubleshooting

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme