voicecc
v1.0.11
Published
Voice mode plugin for Claude Code -- hands-free interaction via local STT/TTS/VAD
Readme
Voice CC
A Claude Code plugin for hands-free voice interaction with local speech-to-text, text-to-speech, and voice activity detection.
Installation
Prerequisites
- macOS with Apple Silicon (M1/M2/M3/M4)
- Node.js 18+
- Python 3.10+
- Homebrew
Install
# 1. Install system dependencies
xcode-select --install
brew install espeak-ng cloudflared
# 2. Install Voice CC
npm install -g voicecc
# 3. Start the dashboard
voiceccThe postinstall script handles sox, the Whisper model, Python venv, and TTS dependencies automatically.
How It Works
The voice loop runs locally with zero external API calls except to Claude:
- Mic capture: VPIO (macOS Voice Processing IO) records 16kHz mono PCM with echo cancellation
- Voice activity detection: Silero VAD v5 detects speech segments
- Speech-to-text: sherpa-onnx (Whisper ONNX model) transcribes audio locally
- Endpointing: VAD silence-based turn detection
- Claude inference: Transcript sent to Claude Code Agent SDK session with streaming response
- Narration: Claude's response stripped of markdown and split into sentences
- Text-to-speech: Kokoro-82M via mlx-audio on Apple Silicon GPU (~8x realtime)
- Speaker playback: Audio output through VPIO at 24kHz with echo cancellation
