talktocursor

v1.0.0

Published

3 months ago

Hands-free voice interface for Cursor AI — text-to-speech, voice commands, and conversational coding using ElevenLabs TTS

0High
0Medium
0Low

gwpmike

mcp text-to-speech elevenlabs cursor tts voice hands-free dictation model-context-protocol ai-assistant

Cursor TTS MCP Server

A Model Context Protocol (MCP) server that adds text-to-speech capabilities to Cursor AI. The agent can speak progress updates, completions, and responses aloud using ElevenLabs TTS, enabling hands-free coding workflows.

Features

🔊 Text-to-Speech - Agent speaks aloud via ElevenLabs API
🎛️ Settings UI - Web interface to configure API key, voice, and speech parameters
⚡ Auto-Submit - Optional: automatically press Enter when dictation finishes (hands-free)
🎨 Voice Presets - Quick settings for fast, slow, expressive, stable, and dramatic speech
🔧 Configurable - Speed, stability, similarity boost, and style exaggeration controls

Installation

1. Clone or download this repository

git clone https://github.com/yourusername/cursor-tts-mcp.git
cd cursor-tts-mcp

Or download and extract the ZIP.

2. Install dependencies

npm install

3. Build the project

npm run build

4. Configure Cursor to use the MCP server

Edit (or create) ~/.cursor/mcp.json:

{
  "mcpServers": {
    "tts": {
      "command": "node",
      "args": ["/ABSOLUTE/PATH/TO/cursor-tts-mcp/build/index.js"]
    }
  }
}

Important: Replace /ABSOLUTE/PATH/TO/cursor-tts-mcp with the actual full path to where you cloned/downloaded this project.

For example:

macOS/Linux: /Users/yourname/cursor-tts-mcp/build/index.js
Windows: C:\\Users\\yourname\\cursor-tts-mcp\\build\\index.js

5. Get your ElevenLabs API key

Go to elevenlabs.io/app/settings/api-keys
Sign up or log in (free tier available)
Create a new API key and copy it

6. Configure the MCP server

Open the settings UI:

npm run settings

Then open http://localhost:3847 in your browser and:

Paste your ElevenLabs API key
Click "Test Key" to verify it works
Click "Save API Key"
(Optional) Choose a voice, model, and voice settings
(Optional) Enable Auto-Submit if you want hands-free dictation

7. Restart Cursor

Fully quit Cursor (Cmd+Q on Mac, or close completely on Windows/Linux) and reopen it.

8. Test it

Open a new Cursor chat (Cmd+L)
Check that the speak or user-tts-speak tool appears in "Available Tools"
Type: "Say hello using the speak tool"
You should hear the voice through your speakers!

Usage

Once installed, the Cursor AI agent will automatically speak at key moments:

When starting a task
When completing a task
When encountering errors or needing clarification
At major progress milestones

You can customize when the agent speaks by editing ~/.cursor/rules/voice-feedback.mdc.

Voice Settings

The settings UI lets you adjust:

Speed (0.7x - 1.2x) - How fast the speech is delivered
Stability (0-1) - More consistent vs. more expressive
Similarity Boost (0-1) - How closely it matches the original voice
Style Exaggeration (0-1) - Amplifies the speaker's style (V2+ models)

Quick Presets:

Default - Balanced settings
Fast - Quick and energetic
Slow - Clear and measured
Expressive - Dynamic and varied
Stable - Consistent tone
Dramatic - Maximum style

Auto-Submit (Optional)

For completely hands-free dictation:

Enable "Auto-Submit" in the settings UI
Adjust the silence delay (how long to wait after you stop speaking)
Save the settings
Run in a separate terminal:

npm run auto-submit

Requirements:

macOS only (uses Accessibility API)
Grant Accessibility permissions: System Settings > Privacy & Security > Accessibility > Add your terminal app

The script monitors the text field and automatically presses Enter when dictation finishes.

Configuration Files

config.json - Stores API key, voice settings, and auto-submit preferences
~/.cursor/mcp.json - Registers the MCP server with Cursor
~/.cursor/rules/voice-feedback.mdc - Controls when the agent speaks

Troubleshooting

Tool doesn't appear in Cursor?

Make sure you fully quit and restarted Cursor (Cmd+Q)
Check that ~/.cursor/mcp.json has the correct absolute path
Run npm run build to ensure the project is compiled

"API key not set" error?

Open the settings UI: npm run settings
Enter your ElevenLabs API key and click "Save API Key"
Restart Cursor

No audio?

Check system volume and speaker output
Verify mpv is installed: mpv --version (installed automatically by ElevenLabs SDK)
Test your API key in the settings UI

Auto-submit not working?

Ensure macOS Accessibility permissions are granted
Check that Cursor is the frontmost app when dictating
Adjust the "Min Text Length" if short dictations aren't triggering
Increase "Silence Delay" if prompts are being submitted too early

Scripts

npm run build - Compile TypeScript to JavaScript
npm run settings - Open the web settings UI
npm run auto-submit - Start the auto-submit script (macOS only)

License

MIT

Credits

ElevenLabs for TTS API
Model Context Protocol for MCP SDK