live-translate-mcp

v0.1.6

Published

2 months ago

MCP server for local speech translation (EN ↔ 中文) via Whisper + Claude + Piper

0High
0Medium
0Low

jolucashornung

live-translate-mcp

Real-time English ↔ Mandarin speech translation for Claude — powered by Whisper, Claude AI, and Piper TTS.

Give Claude the ability to listen, translate, and speak. live-translate-mcp is a Model Context Protocol (MCP) server that adds speech translation as a native tool inside Claude Desktop and Claude Code. Hand it an audio file, and it transcribes, translates, synthesises, and plays the result — entirely on your machine, with Claude handling the translation.

What it does

| Step | Technology | Where it runs | |------|-----------|---------------| | Speech → Text | OpenAI Whisper (via @huggingface/transformers) | Local | | Text → Translation | Claude (Opus 4.8) | Anthropic API | | Translation → Speech | Piper TTS (ONNX) | Local |

Audio never leaves your machine except for the translated text sent to the Claude API. ASR and TTS run fully on-device.

Tools

`translate_file`

Translate a WAV audio file. Pass an absolute path — the server transcribes it, translates the text via Claude, synthesises speech, saves <name>_translated.wav next to the original, and plays it automatically.

Translate /Users/alice/meeting_clip.wav

Returns: original text, translation, and the path to the saved output file.

`translate_speech`

Translate raw audio passed as a base64-encoded WAV string. Returns the transcription, translation, and synthesised audio as base64 WAV — useful for programmatic workflows.

`health_check`

Verify that all dependencies (Whisper model cache, Piper voice files, espeak-ng) are present and ready before making a translation request.

Installation

No installation required. Run it directly with npx:

npx -y live-translate-mcp

Or install globally:

npm install -g live-translate-mcp

Prerequisites:

Node.js 18+
An Anthropic API key
espeak-ng — for TTS phonemisation (brew install espeak-ng on macOS, apt install espeak-ng on Linux)

The Whisper model (~150 MB) and Piper voice models (~200 MB) download automatically on first use and are cached in ~/.live-translate/.

Claude Desktop setup

Add the following to your Claude Desktop configuration file:

macOS: ~/Library/Application Support/Claude/claude_desktop_config.json Windows: %APPDATA%\Claude\claude_desktop_config.json

{
  "mcpServers": {
    "live-translate": {
      "command": "npx",
      "args": ["-y", "live-translate-mcp"],
      "env": {
        "ANTHROPIC_API_KEY": "your-api-key-here"
      }
    }
  }
}

Restart Claude Desktop. You'll see live-translate appear in the tools panel.

Claude Code setup

claude mcp add live-translate -- npx -y live-translate-mcp

Then set your API key in the environment before starting Claude Code, or pass it via the MCP env config.

Usage examples

Once configured, just ask Claude naturally:

"Translate this audio file for me: /Users/alice/recording.wav"

"Use translate_file on /tmp/interview.wav"

"Check if live-translate is ready"

Claude will call the appropriate tool automatically.

Supported languages

| Language | ASR | Translation | TTS | |----------|-----|-------------|-----| | English | ✓ | ✓ | ✓ | | Mandarin Chinese (普通话) | ✓ | ✓ | ✓ |

Language is detected automatically from the audio — no need to specify it.

How it compares

Environment variables

| Variable | Default | Description | |----------|---------|-------------| | ANTHROPIC_API_KEY | — | Required. Your Anthropic API key. | | PIPER_VOICE_DIR | ~/.live-translate/voices | Path to Piper .onnx voice files. |

live-translate — the standalone CLI for push-to-talk translation with no API key required, using local models end-to-end.
Model Context Protocol — the open standard this server implements.
Piper TTS — the local TTS engine powering speech synthesis.

Licence

MIT

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

live-translate-mcp

What it does

Tools

translate_file

translate_speech

health_check

Installation

Claude Desktop setup

Claude Code setup

Usage examples

Supported languages

How it compares

Environment variables

Related

Licence

`translate_file`

`translate_speech`

`health_check`