voicecc
v1.3.3
Voice Agent Platform running on Claude Code -- create and deploy conversational voice agents with ElevenLabs STT/TTS and VAD
VoiceCC
A Voice Agent Platform running on Claude Code. Create, manage, and deploy conversational voice agents powered by Claude, with real-time speech-to-text, text-to-speech, and voice activity detection via ElevenLabs.
Project Structure
voice-server/ Python FastAPI: real-time audio pipeline (VAD, STT, TTS, Claude sessions)
server/ Node.js orchestration: boots dashboard + voice server, manages integrations
services/ Tunnel, Twilio, browser calls, agents, device pairing
index.ts Entry point (spawns voice-server + dashboard, auto-starts integrations)
dashboard/ Web UI (Vite + React) + API routes (Hono)
lander/ Static landing page
init/ Default prompt templates for new agents
bin/ CLI entry point (voicecc command)
Install
Prerequisites
- macOS or Linux
- Node.js 18+
- Python 3.11+ with venv
- An ElevenLabs API key
Terminal
Setup optionally configures a cloudflared Quick Tunnel, optionally protects the installation with a password, and configures your ElevenLabs API key.
# 1. Install VoiceCC
npm install -g voicecc
# 2. Start the platform
voicecc
How It Works
The platform runs two servers: a Node.js orchestrator (dashboard, integrations, CLI) and a Python voice server (real-time audio pipeline via Pipecat).
- Mic capture: Browser captures audio via WebRTC, connected to the Python voice server
- Voice activity detection: Silero VAD v5 detects speech segments
- Speech-to-text: ElevenLabs Scribe transcribes audio
- Endpointing: VAD silence-based turn detection
- Claude inference: Transcript sent to Claude Agent SDK session with streaming response
- Narration: Claude's response stripped of markdown and split into sentences
- Text-to-speech: ElevenLabs streaming TTS generates audio
- Speaker playback: Audio streamed back through WebRTC
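
The narration step above can be illustrated with a minimal Python sketch. This is not VoiceCC's actual implementation: the function names (`strip_markdown`, `split_sentences`, `narrate`) and the regular expressions are assumptions chosen for illustration, and they cover only the most common markdown constructs.

```python
import re


def strip_markdown(text: str) -> str:
    """Remove common markdown syntax so the text reads naturally when spoken.

    Hypothetical helper: a rough approximation, not a full markdown parser.
    """
    text = re.sub(r"```.*?```", " ", text, flags=re.DOTALL)  # drop fenced code blocks
    text = re.sub(r"`([^`]*)`", r"\1", text)  # inline code -> plain text
    text = re.sub(r"!?\[([^\]]*)\]\([^)]*\)", r"\1", text)  # links/images -> label only
    text = re.sub(r"^\s{0,3}#{1,6}\s+", "", text, flags=re.MULTILINE)  # heading markers
    text = re.sub(r"^\s*(?:[-*+]|\d+\.)\s+", "", text, flags=re.MULTILINE)  # list markers
    text = re.sub(r"(\*\*|__|\*)(.+?)\1", r"\2", text)  # bold/italic emphasis
    return re.sub(r"\s+", " ", text).strip()  # collapse whitespace and newlines


def split_sentences(text: str) -> list[str]:
    """Split on whitespace that follows sentence-ending punctuation."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text) if s]


def narrate(response: str) -> list[str]:
    """Turn a markdown response into a list of plain sentences for TTS."""
    return split_sentences(strip_markdown(response))


if __name__ == "__main__":
    reply = (
        "The build **passed**. See [the logs](https://example.com/logs) "
        "for details!\n- Run `npm test` next."
    )
    for sentence in narrate(reply):
        print(sentence)
```

Splitting into sentences lets each one be sent to a streaming TTS service as soon as it is complete, rather than waiting for the full response. A naive splitter like this one will also break on abbreviations such as "e.g.", so a real pipeline would likely need something more robust.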
