vibetalk
v0.1.0
Published
Voice-to-text dictation CLI — speak and it types wherever your cursor is. Works with any app.
Downloads
109
Maintainers
Readme
VibeTalk
Voice-to-text dictation CLI — speak and it types wherever your cursor is.
Works with any app: VS Code, Cursor, terminal, browser, Slack, or any text field.
Powered by Soniox real-time speech-to-text.
Quick Start
npx vibetalkOn first run, it will ask for your Soniox API key. Get one free at console.soniox.com/api-keys.
That's it. Press Enter to start recording, speak, and press Enter again to stop. Your speech is typed wherever your cursor is focused.
How It Works
- Run
npx vibetalkin a terminal - Press Enter to start recording
- Speak — your words are typed in real-time wherever your cursor is
- Press Enter to stop recording
- Repeat as needed. Press Ctrl+C to quit.
Installation
# Use directly with npx (no install needed)
npx vibetalk
# Or install globally
npm install -g vibetalk
vibetalkConfiguration
Your API key is saved automatically on first run. To manage configuration:
# Set API key manually
vibetalk config set api-key YOUR_SONIOX_KEY
# View all settings
vibetalk config list
# Change language (default: en)
vibetalk config set language es
# Reset to defaults
vibetalk config resetYou can also pass the API key as an environment variable:
export SONIOX_API_KEY=your_key_here
npx vibetalkOptions
vibetalk [start] Start dictation (default command)
--api-key <key> Soniox API key
--device <index> Audio input device index
--language <lang> Language (en, es, fr, de, etc.)
--typing-strategy <s> auto | robotjs | clipboard
--typing-delay <ms> Ms between keystrokes (default: 12)
--use-hotkey Use global hotkey instead of Enter
--verbose Debug logging
vibetalk config set <k> <v> Set config value
vibetalk config get <key> Get config value
vibetalk config list Show all settings
vibetalk config reset Reset to defaults
vibetalk devices List audio input devicesRequirements
- Node.js 18+
- macOS, Windows, or Linux
- A microphone
- Soniox API key (free tier available at soniox.com)
Permissions
On macOS, you may need to grant:
- Microphone access to your terminal app
- Accessibility access for keyboard simulation (System Settings > Privacy & Security)
How It Works (Technical)
- Captures microphone audio via PvRecorder (16kHz, 16-bit PCM)
- Streams audio over WebSocket to Soniox real-time transcription API
- Receives transcribed tokens in real-time
- Types confirmed text character-by-character using robotjs
- Falls back to clipboard paste if robotjs is unavailable
License
MIT
