kie-mcp
v4.0.5
Published
MCP server for kie.ai — 45+ image models, 70+ video models, 20+ audio tools with deep model intelligence
Maintainers
Readme
kie-mcp
A comprehensive Model Context Protocol server for the kie.ai generation API. Gives Claude (and any MCP client) access to 45+ image models, 70+ video models, and 20+ audio tools with deep model intelligence built in.
Why this exists
Most MCPs are thin API wrappers. This one is different:
- Deep research embedded — Every major model has a
researchfield with verdicts, prompt techniques, weaknesses, cost-efficiency analysis, and competitor comparisons. Researched by Averiguare, our model intelligence agent. - Cost-aware — Every model has pricing in credits and USD. The MCP tells you the cheapest option for your use case.
- Smart filtering —
list_models filter="lip sync"orfilter="architecture"orfilter="cheapest video"— searches across capability tags, descriptions, AND research fields. - Dual-mode transport — stdio for local Claude Code, HTTP Streamable for remote Cowork/cloud usage.
What you can do with it
Just ask Claude things like:
- "Generate a brand presentation board for a perfume launch" — picks GPT Image 2 (best for text-heavy layouts)
- "Make a 10s video of fruit scarecrows defending against crows, Pixar style" — recommends Veo 3.1 or Wan 2.7
- "Generate music for a fantasy adventure game" — Suno V5
- "Lip-sync this audio to my character image" — Kling AI Avatar or Infinitalk
- "Upscale this video to 4K" — Veo 4K upscale or Topaz
- "Replace the wall color in this room photo" — Flux Kontext Pro (best for surgical edits)
Model coverage
Image (45+)
- OpenAI: GPT Image 2 (NEW), GPT-4o Image, GPT Image 1.5
- Google: Nano Banana 2 / Pro / Edit / Original, Imagen 4 (Fast/Standard/Ultra)
- Black Forest Labs: Flux Kontext Pro/Max, Flux 2 Pro/Flex
- ByteDance: Seedream 3.0 / 4.0 / 4.5 / 5.0 Lite
- Alibaba: Wan 2.7 Image / Image Pro
- Ideogram: v3, Character, Edit, Remix, Reframe
- Others: Qwen/Qwen2, Z-Image, Grok Imagine, Recraft, Topaz
Video (70+)
- Google Veo 3.1: Quality / Fast / Lite (T2V + I2V), Extend, 1080p/4K upscale
- Alibaba HappyHorse 1.0 (NEW): T2V, I2V, R2V, Video Edit — #1 on Artificial Analysis Arena
- ByteDance Seedance: 2.0 / 2.0 Fast / 1.5 Pro (T2V + I2V)
- OpenAI Sora 2: T2V/I2V, Pro, Characters, Storyboard, Watermark Remover
- Kuaishou Kling: 3.0, 2.6, V2.5 Turbo, V2.1 Master/Pro/Standard, AI Avatar
- Alibaba Wan: 2.7 (T2V/I2V/Edit/R2V), 2.6, 2.5, 2.2 Turbo, Animate
- MiniMax Hailuo: 2.3 Pro/Standard, 02 Pro/Standard
- xAI Grok Imagine: T2V, I2V, Upscale, Extend
- Runway: Aleph, Aleph Edit, Extend
- Others: ByteDance V1 Pro/Lite, Topaz upscale, Infinitalk
Audio (20+)
- Suno: Music Gen, Extend, Cover, Add Instrumental/Vocals, Replace Section, Lyrics, Sounds, Sound Effects, MIDI, Music Video, Cover Art, Mashup, Persona, Timestamped Lyrics, Boost Style, Vocal Separation, WAV
- ElevenLabs: TTS (Turbo 2.5 + Multilingual V2), Text-to-Dialogue V3, Audio Isolation, Speech-to-Text
Utility
- File upload (URL or base64)
- Veo Extend, 1080p Upscale, 4K Upscale
- Runway Extend
- Task status, credit check, raw asset listing
Installation
Prerequisites
- Node.js 18+
- A kie.ai API key from kie.ai/api-key
Setup
git clone https://github.com/YOUR_USERNAME/kie-mcp.git
cd kie-mcp
npm installRun as stdio MCP (Claude Code, Claude Desktop)
Add to your Claude config (~/.claude.json for Claude Code, or your MCP client's equivalent):
{
"mcpServers": {
"kie-art": {
"command": "node",
"args": ["/absolute/path/to/kie-mcp/server.mjs"],
"env": {
"KIE_API_KEY": "your-kie-ai-api-key",
"KIE_PROJECT_ROOT": "/optional/path/for/outputs"
}
}
}
}Or use the Claude Code CLI:
claude mcp add -s user kie-art /usr/bin/env -- KIE_API_KEY=your-key node /path/to/server.mjsRun as HTTP MCP (Cowork, remote clients)
KIE_API_KEY=your-key node server.mjs --http --port=3100Then expose via ngrok / Cloudflare Tunnel / VPS deployment:
ngrok http 3100Configure your MCP client to use the resulting URL:
{
"mcpServers": {
"kie-art": {
"type": "http",
"url": "https://your-tunnel.ngrok-free.dev/mcp"
}
}
}Environment variables
| Variable | Required | Purpose |
|---|---|---|
| KIE_API_KEY | yes | Your kie.ai API key |
| KIE_PROJECT_ROOT | no | Where generated files are saved (default: current working dir). Files go to $KIE_PROJECT_ROOT/kie/assets/raw/ |
| KIE_MCP_PORT | no | Port for HTTP mode (default: 3100) |
| KIE_CALLBACK_URL | no | Callback URL sent with Suno generation requests (kie.ai requires the field; results are fetched by polling regardless). Defaults to an inert placeholder — set this only if you want to receive the callbacks yourself |
Tools available
generate_image, generate_video, generate_music, generate_sfx,
generate_tts, generate_dialogue, generate_sounds, generate_lyrics,
generate_persona, generate_mashup, generate_cover_art,
generate_midi, create_music_video,
create_omni_voice, create_omni_character,
extend_music, cover_audio, upload_extend_audio,
add_instrumental, add_vocals, replace_section,
convert_to_wav, separate_vocals, boost_style,
get_timestamped_lyrics, audio_isolation, speech_to_text,
list_models, check_task, list_tasks, check_credits,
download_result, list_raw_assets, upload_file,
veo_extend, veo_upscale_1080p, veo_upscale_4k, runway_extendSmart model recommendations
Try these queries in any MCP client:
list_models filter="reasoning" # GPT-4o, Nano Banana, GPT Image 2
list_models filter="lip-sync" # Kling Avatar, Infinitalk, Wan Speech
list_models filter="multi-shot" # Kling 3.0, Sora Storyboard
list_models filter="cheapest video" # Wan Flash, Sora 2 standard
list_models filter="alibaba" # HappyHorse 1.0 family
list_models filter="best visual quality" # Veo Quality, Seedance 2.0
list_models filter="text rendering" # Ideogram v3, GPT Image 2
list_models filter="character" # Sora Characters, Ideogram CharacterArchitecture
server.mjs # The whole server (~4000 lines)
├── PRICING # Credit cost per model
├── MODEL_REGISTRY # Image models (45+)
├── VIDEO_MODEL_REGISTRY # Video models (70+)
├── AUDIO_TOOLS_REGISTRY # Audio tool metadata
├── createMcpServer() # Factory for stdio + HTTP modes
└── Tool handlers # generate_*, list_*, etc.Each model entry has:
name,description,capabilities(tags),pricing(credits)aspectRatios,options(with types and defaults)buildBody/buildInput(request builders)research(Averiguare verdicts, prompt techniques, weaknesses, comparisons, sources)
Credits
- Built with the MCP TypeScript SDK
- Powered by kie.ai — affordable unified API for 100+ AI models
- Model intelligence by Averiguare — "No sabes hasta que averiguas — y averiguo en todas partes."
License
MIT
