@drakulavich/parakeet-cli
v0.6.0
Published
Fast multilingual speech-to-text CLI powered by NVIDIA Parakeet ONNX models
Downloads
575
Maintainers
Readme
🦜 parakeet-cli
Fast local speech-to-text. 25 languages. ~18x faster than Whisper on Apple Silicon.
- CoreML on Apple Silicon — ~155x real-time via FluidAudio
- ONNX on CPU — cross-platform fallback, 3x faster than Whisper
- Any audio format — ffmpeg handles OGG, MP3, WAV, FLAC, M4A
- Zero Python — Bun + TypeScript, native Swift binary for CoreML
Quick Start
bun install -g @drakulavich/parakeet-cli
parakeet install # CoreML on macOS arm64, ONNX elsewhere
parakeet audio.ogg # → transcript to stdoutUsage
parakeet install # auto-detect backend
parakeet install --coreml # force CoreML (macOS arm64)
parakeet install --onnx # force ONNX (~3GB)
parakeet audio.ogg # transcribe (language auto-detected)
parakeet --versionStdout: transcript. Stderr: errors. Pipe-friendly.
Requirements
Supported Languages
:bulgaria: Bulgarian, :croatia: Croatian, :czech_republic: Czech, :denmark: Danish, :netherlands: Dutch, :gb: English, :estonia: Estonian, :finland: Finnish, :fr: French, :de: German, :greece: Greek, :hungary: Hungarian, :it: Italian, :latvia: Latvian, :lithuania: Lithuanian, :malta: Maltese, :poland: Polish, :portugal: Portuguese, :romania: Romanian, :ru: Russian, :slovakia: Slovak, :slovenia: Slovenian, :es: Spanish, :sweden: Swedish, :ukraine: Ukrainian
Benchmark
MacBook Pro M3 Pro — 10 Russian voice messages:
| | faster-whisper (CPU) | Parakeet (CoreML) | |---|---|---| | Total | 35.3s | 1.9s | | Speed | | ~18x faster |
Full results with transcripts: BENCHMARK.md
How It Works
parakeet audio.ogg
├── CoreML installed? → parakeet-coreml subprocess → stdout
└── ONNX installed? → ffmpeg → mel → encoder → decoder → stdout- CoreML: Swift binary wraps FluidAudio + CoreML model
- ONNX: NVIDIA Parakeet TDT 0.6B v3 via onnxruntime-node
OpenClaw Integration
Drop-in replacement for OpenClaw voice processing — no API keys, runs locally.
{
"tools": {
"media": {
"audio": {
"enabled": true,
"models": [{"type": "cli", "command": "parakeet", "args": ["{{MediaPath}}"], "timeoutSeconds": 120}],
"echoTranscript": false
}
}
}
}License
MIT
