tts-pipelines

v0.2.8

Published

5 months ago

TTS pipelines for text to speech using Kitten TTS and Piper TTS

Downloads

0High
0Medium
0Low

srahul0_0

onnx tts text-to-speech machine-learning

tts-pipelines

A simple and powerfull tts pipelines working totally local in browser or in node environment using ONNX Runtime and WebGPU.

The pipeline uses Kitten TTS and Piper TTS (very powerfull and lightweight tts models that can even be used in the browser) under the hood to generate audio from text.

Installation

The package requires onnxruntime to be installed.if using on the web use the onnxruntime-web package and if using on node use the onnxruntime-node package.

Current stable release (0.2)

npm i tts-pipelines

Example Usage (browser)

If you want to try it in the browser, you can use Stackblitz or use the following worker snippet

import { PiperTTS, TextSplitterStream } from "tts-pipelines";
// or KittenTTS:
// import { KittenTTS,TextSplitterStream } from "tts-pipelines";
// const tts = await KittenTTS.from_pretrained();
let tts: PiperTTS | null = null;

self.addEventListener("message", async (event) => {
  if (event.data.type === "init") {
    try {
      console.log("loading...");

      tts = await PiperTTS.from_pretrained();
      console.log("loaded");
      self.postMessage({
        type: "model:loaded",
        message: "pipelines ready",
      });
    } catch (error) {
      self.postMessage({
        type: "model:error",
        message: error,
      });
    }
  }

  if (event.data.type === "message") {
    if (tts == null) {
      self.postMessage({
        type: "error",
        message: "tts instance is not initalized",
      });
      return;
    }
    const streamer = new TextSplitterStream();

    streamer.push(event.data.message);
    streamer.close(); // Indicate we won't add more text

    const stream = tts.stream(streamer);

    try {
      for await (const { text, audio } of stream) {
        self.postMessage({
          type: "stream",
          chunk: {
            audio: audio.toBlob(),
            text,
          },
        });
      }
    } catch (error) {
      console.error("Error during streaming:", error);
    }
    const audio = tts.merge_audio();
    self.postMessage({
      type: "done",
      audio: audio?.toBlob(),
    });
  }
  if (event.data.type == "clear") {
    tts.clearAudio();
  }
});

Example usage (node 18.x and higher)

import { PiperTTS, saveAudio, TextSplitterStream } from "tts-pipelines";

const tts = await PiperTTS.from_pretrained();
// or KittenTTS:
// import { KittenTTS, saveAudio, TextSplitterStream } from "tts-pipelines";
// const tts = await KittenTTS.from_pretrained();

const streamer = new TextSplitterStream();

streamer.push("hello world");
streamer.close(); // Indicate we won't add more text

const stream = tts.stream(streamer);
const chunks = [];

console.log({ status: "streaming" });
try {
  for await (const { text, audio } of stream) {
    console.log({
      status: "stream",
      chunk: {
        audio: audio.toBlob(),
        text,
      },
    });
    chunks.push(audio);
  }
} catch (error) {
  console.error("Error during streaming:", error);
}

const audio = await tts.merge_audio();
console.log({ status: "complete", audio: audio.toBlob() });
await saveAudio(audio.toBlob(), "./speech.wav");

Note: Supported in node 18.x and higher.

Note: Currently not working in ios looking further in the issue

Note: In case the package stops working please report it and update the package to latest if available.

Documentations

Full API documentations of both classes can be found here 😭

Contributions

If you happen to see missing feature or a bug, feel free to open an issue.
Pull requests are welcomed too!

License

MIT

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

tts-pipelines

Installation

Example Usage (browser)

Example usage (node 18.x and higher)

Documentations

Contributions

License