tap2talk
v5.1.7
Voice transcription at your fingertips - Instantly convert speech to text with a simple keyboard shortcut
Tap2Talk
████████╗ █████╗ ██████╗ ██████╗ ████████╗ █████╗ ██╗ ██╗ ██╗
╚══██╔══╝ ██╔══██╗ ██╔══██╗ ╚════██╗ ╚══██╔══╝ ██╔══██╗ ██║ ██║ ██╔╝
██║ ███████║ ██████╔╝ █████╔╝ ██║ ███████║ ██║ █████╔╝
██║ ██╔══██║ ██╔═══╝ ██╔═══╝ ██║ ██╔══██║ ██║ ██╔═██╗
██║ ██║ ██║ ██║ ███████╗ ██║ ██║ ██║ ███████╗ ██║ ██╗
╚═╝ ╚═╝ ╚═╝ ╚═╝ ╚══════╝ ╚═╝ ╚═╝ ╚═╝ ╚══════╝ ╚═╝ ╚═╝
Talk in ANY app, type nothing. Works everywhere you type - VSCode, Terminal, Slack, Browser, anywhere. Press a key, speak, and watch your words appear instantly where you're working.
"I believe talking to computers is the future. With AI getting smarter, why are we still typing everything? I built Tap2Talk for myself because I wanted to speak naturally in ANY application without switching tools. Now I'm sharing it with you." - @unclecode
Built by @unclecode, creator of Crawl4AI (50K+ stars).
Works Everywhere
✅ Code Editors - VSCode, Sublime, Vim, IntelliJ, any IDE
✅ Terminal - Dictate complex commands without typing
✅ Chat Apps - Slack, Discord, Teams, WhatsApp Web
✅ Browsers - Gmail, ChatGPT, Google Docs, anywhere on the web
✅ Note Taking - Notion, Obsidian, Apple Notes, OneNote
✅ Literally Anywhere - If you can type there, you can talk there
Features
- One Shortcut for Everything: Press Ctrl+Space (double-tap Space) to start recording anywhere
- Instant Paste: Your words appear right where your cursor is - no copy/paste needed
- Powered by Groq Whisper: Fast and accurate transcription using state-of-the-art AI
- Customizable Shortcuts: Set your own recording and abort shortcuts
- Cross-Platform: Works on macOS, Windows, and Linux
- Background Service: Runs quietly in the background, always ready
- System Tray Integration: Clean interface with status indicators
Groq API Setup
Tap2Talk currently uses Groq's lightning-fast Whisper API for transcription. Support for OpenAI Whisper, local models, and other providers coming soon!
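For readers curious what a Groq Whisper request looks like, here is a minimal sketch. It is not Tap2Talk's actual code. It assumes Node.js 18+ (for the global fetch, FormData, and Blob), Groq's OpenAI-compatible transcription endpoint, and the whisper-large-v3 model. The function names buildTranscriptionRequest and transcribeFile are hypothetical.

```javascript
// Sketch only: not Tap2Talk's implementation. Assumes Node.js 18+.
const fs = require("fs");
const path = require("path");

const GROQ_TRANSCRIPTION_URL = "https://api.groq.com/openai/v1/audio/transcriptions";

// Build the request without sending it, so it can be inspected offline.
function buildTranscriptionRequest(apiKey, audioBytes, fileName) {
  const form = new FormData();
  form.append("file", new Blob([audioBytes]), fileName);
  form.append("model", "whisper-large-v3");
  return {
    url: GROQ_TRANSCRIPTION_URL,
    options: {
      method: "POST",
      headers: { Authorization: `Bearer ${apiKey}` },
      body: form,
    },
  };
}

// Send the audio file and return the transcribed text.
async function transcribeFile(apiKey, audioPath) {
  const audio = fs.readFileSync(audioPath);
  const { url, options } = buildTranscriptionRequest(apiKey, audio, path.basename(audioPath));
  const response = await fetch(url, options);
  if (!response.ok) {
    throw new Error(`Transcription failed with HTTP ${response.status}`);
  }
  const result = await response.json();
  return result.text;
}
```

A call might look like transcribeFile(process.env.GROQ_API_KEY, "clip.wav"), where both the environment variable name and the file name are placeholders chosen for this example.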
Why Groq?
- ✅ Free tier available - Perfect for personal use (30 requests/min, 14,400 requests/day)
- ✅ Blazing fast - Near-instant transcription
- ✅ High accuracy - Powered by Whisper Large v3
- ✅ No credit card required - Start immediately
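To put the free-tier numbers in perspective, the arithmetic below works out what the two limits allow. It assumes each dictation costs one request, which this README does not state.

```javascript
// Free-tier limits quoted in the list above.
const REQUESTS_PER_MINUTE = 30;
const REQUESTS_PER_DAY = 14400;
const MINUTES_PER_DAY = 24 * 60;

// Average rate the daily cap allows if usage is spread over a full day.
const sustainedPerMinute = REQUESTS_PER_DAY / MINUTES_PER_DAY;

// Hours of nonstop use at the per-minute cap before the daily cap is reached.
const hoursAtFullRate = REQUESTS_PER_DAY / REQUESTS_PER_MINUTE / 60;

console.log(`${sustainedPerMinute} requests/min sustained all day`);
console.log(`${hoursAtFullRate} hours at the per-minute limit`);
```

In other words, the daily cap only matters after about 8 hours of dictating at the maximum per-minute rate.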
Get Your API Key (2 minutes)
- Create Account: Visit console.groq.com and sign up with Google or GitHub
- Generate API Key: Navigate to "API Keys" → Click "Create API Key" → Name it "Tap2Talk"
- Copy Your Key: Copy the key (starts with gsk_) - Tap2Talk will ask for it on first run
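If you script your own setup, a quick sanity check on the pasted key can catch copy errors early. The helper below is hypothetical and not part of Tap2Talk. It relies only on the gsk_ prefix mentioned above and assumes nothing else about the key format.

```javascript
// Hypothetical helper: a cheap check before saving a pasted key.
function looksLikeGroqKey(value) {
  if (typeof value !== "string") return false;
  const key = value.trim();
  // Only the prefix is checked; length and character set are not assumed.
  return key.startsWith("gsk_") && key.length > "gsk_".length;
}
```

This only catches obvious mistakes such as pasting a key from another provider. A key that passes may still be rejected by Groq.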
The free tier includes 30 requests per minute and 14,400 requests per day - more than enough for regular use!
Coming Soon
- OpenAI Whisper API support
- Local Whisper models (no internet required)
- Other transcription providers
Installation
npm install -g tap2talk
Quick Start
Run interactively (recommended for first time):
tap2talk
Start as background service:
tap2talk start
Check status:
tap2talk status
Commands
- tap2talk - Interactive mode with menu
- tap2talk start - Start as background service
- tap2talk stop - Stop background service
- tap2talk status - Check service status
- tap2talk restart - Restart service
- tap2talk logs - Show recent logs
- tap2talk update - Check for updates
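The commands above can also be driven from a Node.js script, for example in a login or health-check script. This is a sketch under assumptions: runTap2talk is a hypothetical helper, and nothing is assumed about what each command prints.

```javascript
const { spawnSync } = require("child_process");

// Subcommands taken from the list above (interactive mode is left out).
const SUBCOMMANDS = ["start", "stop", "status", "restart", "logs", "update"];

// Hypothetical helper: run one tap2talk subcommand and capture its output.
function runTap2talk(subcommand) {
  if (!SUBCOMMANDS.includes(subcommand)) {
    throw new Error(`Unknown tap2talk subcommand: ${subcommand}`);
  }
  const result = spawnSync("tap2talk", [subcommand], {
    encoding: "utf8",
    // npm installs global commands as .cmd shims on Windows, which need a shell.
    shell: process.platform === "win32",
  });
  if (result.error) {
    // Typically ENOENT when tap2talk is not installed or not on PATH.
    throw result.error;
  }
  return { exitCode: result.status, output: result.stdout };
}
```

For example, runTap2talk("status") returns the exit code and whatever the command wrote to standard output.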
Setup
On first run, you'll need to:
- Get a free API key from Groq Console
- Enter it when prompted
- Grant accessibility permissions (macOS) for global shortcuts
Default Shortcuts
- Start Recording: Ctrl+Space (hold Ctrl, double-tap Space)
- Abort Recording: Double-tap Escape
Both shortcuts are fully customizable in Settings.
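Both default shortcuts rely on a double-tap. As an illustration of the idea, here is a minimal double-tap detector. It is a sketch, not Tap2Talk's code: createDoubleTapDetector is a hypothetical name, and the 400 ms window is an assumed value.

```javascript
// Hypothetical sketch: the 400 ms window is assumed, not Tap2Talk's setting.
function createDoubleTapDetector(windowMs = 400) {
  let lastTapAt = null;
  // Call on every key-down of the watched key.
  // Returns true when this tap completes a pair within the window.
  return function onTap(timestampMs) {
    const isDouble = lastTapAt !== null && timestampMs - lastTapAt <= windowMs;
    // Reset after a match so a third quick tap starts a new pair.
    lastTapAt = isDouble ? null : timestampMs;
    return isDouble;
  };
}
```

Resetting after a match means three quick taps trigger the action once rather than twice.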
How It Works
- Press your recording shortcut
- Speak naturally
- Recording stops automatically, or press the shortcut again to stop it
- Text is transcribed and pasted instantly
Status Indicators
The system tray shows real-time status:
- Tap - Ready
- [*] REC - Recording
- [~] PRO - Processing
- [OK] DON - Done
- [X] ERR - Error
- [_] ABT - Aborted
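One way to read these indicators is as a small state machine. The sketch below uses the tray labels listed above, but the allowed transitions are an assumption inferred from the How It Works steps, not taken from Tap2Talk's source.

```javascript
// Tray labels from the list above; the transitions are assumed.
const TRANSITIONS = new Map([
  ["Tap", ["REC"]],        // ready -> recording
  ["REC", ["PRO", "ABT"]], // recording -> processing, or aborted
  ["PRO", ["DON", "ERR"]], // processing -> done, or error
  ["DON", ["Tap"]],        // every end state returns to ready
  ["ERR", ["Tap"]],
  ["ABT", ["Tap"]],
]);

// Hypothetical helper: return the new status, or throw if the move is not allowed.
function nextStatus(current, requested) {
  const allowed = TRANSITIONS.get(current);
  if (!allowed) throw new Error(`Unknown status: ${current}`);
  if (!allowed.includes(requested)) {
    throw new Error(`Cannot go from ${current} to ${requested}`);
  }
  return requested;
}
```

A normal dictation would pass through Tap, REC, PRO, DON and back to Tap.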
Configuration
Settings are stored in ~/.tap2talk/config.json
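To inspect the settings from a script, the path can be resolved portably as sketched below. The helpers configPath and readConfig are hypothetical, and no setting names are assumed since this README does not document the file's contents.

```javascript
const fs = require("fs");
const os = require("os");
const path = require("path");

// ~/.tap2talk/config.json, with ~ expanded to the current user's home directory.
function configPath(homeDir = os.homedir()) {
  return path.join(homeDir, ".tap2talk", "config.json");
}

// Returns the parsed settings, or null if the file does not exist yet.
function readConfig(file = configPath()) {
  if (!fs.existsSync(file)) return null;
  return JSON.parse(fs.readFileSync(file, "utf8"));
}
```

Calling readConfig() before the first run returns null, since the file has not been created yet.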
Requirements
- Node.js 14 or higher
- Groq API key (free)
- Microphone access
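The Node.js requirement can be checked up front with a few lines. This is a hypothetical preflight helper, not something Tap2Talk ships.

```javascript
// Hypothetical preflight check for the "Node.js 14 or higher" requirement.
function meetsNodeRequirement(version = process.versions.node, minimumMajor = 14) {
  // process.versions.node looks like "18.19.0"; only the major part matters here.
  const major = Number.parseInt(version.split(".")[0], 10);
  return Number.isInteger(major) && major >= minimumMajor;
}
```

Running node --version in a terminal gives the same information without any code.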
Privacy
- Audio is processed via Groq API
- No audio is stored locally after transcription
- Settings and logs stay on your machine
Troubleshooting
Service won't start
tap2talk stop
tap2talk start
Permission issues (macOS)
Go to System Preferences → Security & Privacy → Privacy → Accessibility, then add and enable Terminal/iTerm.
Check logs
tap2talk logs
Contributing
Issues and PRs welcome at github.com/unclecode/tap2talk
License
MIT © unclecode
Built with inspiration from productivity needs and powered by Groq Whisper API.
