kie-mcp

v4.0.5

Published

3 days ago

MCP server for kie.ai — 45+ image models, 70+ video models, 20+ audio tools with deep model intelligence

mcp kie.ai claude ai-image-generation ai-video-generation veo sora kling seedance happyhorse flux nano-banana gpt-image suno elevenlabs

kie-mcp

A comprehensive Model Context Protocol server for the kie.ai generation API. Gives Claude (and any MCP client) access to 45+ image models, 70+ video models, and 20+ audio tools with deep model intelligence built in.

Why this exists

Most MCPs are thin API wrappers. This one is different:

Deep research embedded: every major model has a research field with verdicts, prompt techniques, weaknesses, cost-efficiency analysis, and competitor comparisons, researched by Averiguare, our model intelligence agent.
Cost-aware: every model has pricing in credits and USD, so the server can point you to the cheapest option for your use case.
Smart filtering: list_models takes a free-text filter such as filter="lip sync", filter="architecture", or filter="cheapest video", and matches it against capability tags, descriptions, and research fields.
Dual-mode transport: stdio for local Claude Code, Streamable HTTP for remote Cowork/cloud usage.
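
As a rough illustration of the cost-aware selection described above, the sketch below picks the lowest-credit model that carries a given capability tag. The registry shape, the model names, the credit values, and the cheapestFor helper are all hypothetical; the real data lives in server.mjs and may be organized differently.

```javascript
// Hypothetical registry entries; credit values are made up for illustration.
const sampleRegistry = [
  { name: "model-a", capabilities: ["text-to-video", "lip-sync"], credits: 40 },
  { name: "model-b", capabilities: ["text-to-video"], credits: 12 },
  { name: "model-c", capabilities: ["image-edit"], credits: 4 },
];

// Return the lowest-credit model that has the requested capability tag,
// or null when no model matches.
function cheapestFor(registry, capability) {
  const matches = registry.filter((m) => m.capabilities.includes(capability));
  if (matches.length === 0) return null;
  return matches.reduce((best, m) => (m.credits < best.credits ? m : best));
}

const pick = cheapestFor(sampleRegistry, "text-to-video");
console.log(pick ? pick.name : "no match");
```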

What you can do with it

Just ask Claude things like:

"Generate a brand presentation board for a perfume launch" — picks GPT Image 2 (best for text-heavy layouts)
"Make a 10s video of fruit scarecrows defending against crows, Pixar style" — recommends Veo 3.1 or Wan 2.7
"Generate music for a fantasy adventure game" — Suno V5
"Lip-sync this audio to my character image" — Kling AI Avatar or Infinitalk
"Upscale this video to 4K" — Veo 4K upscale or Topaz
"Replace the wall color in this room photo" — Flux Kontext Pro (best for surgical edits)

Model coverage

Image (45+)

OpenAI: GPT Image 2 (NEW), GPT-4o Image, GPT Image 1.5
Google: Nano Banana 2 / Pro / Edit / Original, Imagen 4 (Fast/Standard/Ultra)
Black Forest Labs: Flux Kontext Pro/Max, Flux 2 Pro/Flex
ByteDance: Seedream 3.0 / 4.0 / 4.5 / 5.0 Lite
Alibaba: Wan 2.7 Image / Image Pro
Ideogram: v3, Character, Edit, Remix, Reframe
Others: Qwen/Qwen2, Z-Image, Grok Imagine, Recraft, Topaz

Video (70+)

Google Veo 3.1: Quality / Fast / Lite (T2V + I2V), Extend, 1080p/4K upscale
Alibaba HappyHorse 1.0 (NEW): T2V, I2V, R2V, Video Edit — #1 on Artificial Analysis Arena
ByteDance Seedance: 2.0 / 2.0 Fast / 1.5 Pro (T2V + I2V)
OpenAI Sora 2: T2V/I2V, Pro, Characters, Storyboard, Watermark Remover
Kuaishou Kling: 3.0, 2.6, V2.5 Turbo, V2.1 Master/Pro/Standard, AI Avatar
Alibaba Wan: 2.7 (T2V/I2V/Edit/R2V), 2.6, 2.5, 2.2 Turbo, Animate
MiniMax Hailuo: 2.3 Pro/Standard, 02 Pro/Standard
xAI Grok Imagine: T2V, I2V, Upscale, Extend
Runway: Aleph, Aleph Edit, Extend
Others: ByteDance V1 Pro/Lite, Topaz upscale, Infinitalk

Audio (20+)

Suno: Music Gen, Extend, Cover, Add Instrumental/Vocals, Replace Section, Lyrics, Sounds, Sound Effects, MIDI, Music Video, Cover Art, Mashup, Persona, Timestamped Lyrics, Boost Style, Vocal Separation, WAV
ElevenLabs: TTS (Turbo 2.5 + Multilingual V2), Text-to-Dialogue V3, Audio Isolation, Speech-to-Text

Utility

File upload (URL or base64)
Veo Extend, 1080p Upscale, 4K Upscale
Runway Extend
Task status, credit check, raw asset listing

Installation

Prerequisites

Node.js 18+
A kie.ai API key from kie.ai/api-key

Setup

git clone https://github.com/YOUR_USERNAME/kie-mcp.git
cd kie-mcp
npm install

Run as stdio MCP (Claude Code, Claude Desktop)

Add to your Claude config (~/.claude.json for Claude Code, or your MCP client's equivalent):

{
  "mcpServers": {
    "kie-art": {
      "command": "node",
      "args": ["/absolute/path/to/kie-mcp/server.mjs"],
      "env": {
        "KIE_API_KEY": "your-kie-ai-api-key",
        "KIE_PROJECT_ROOT": "/optional/path/for/outputs"
      }
    }
  }
}

Or use the Claude Code CLI:

claude mcp add -s user kie-art /usr/bin/env -- KIE_API_KEY=your-key node /path/to/server.mjs

Run as HTTP MCP (Cowork, remote clients)

KIE_API_KEY=your-key node server.mjs --http --port=3100

Then expose the port through ngrok, Cloudflare Tunnel, or a VPS deployment. With ngrok:

ngrok http 3100

Configure your MCP client to use the resulting URL:

{
  "mcpServers": {
    "kie-art": {
      "type": "http",
      "url": "https://your-tunnel.ngrok-free.dev/mcp"
    }
  }
}
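
The --http and --port flags used to start HTTP mode could be handled along these lines. This is a minimal sketch, not the code in server.mjs: parseTransport is a hypothetical helper, and the port precedence (the --port flag first, then KIE_MCP_PORT, then 3100) is an assumption.

```javascript
// Decide the transport from command-line arguments and environment values.
// In Node.js you would call parseTransport(process.argv.slice(2), process.env).
function parseTransport(argv, env) {
  // Without --http the server is assumed to speak stdio.
  if (!argv.includes("--http")) return { mode: "stdio" };

  // Assumed precedence: --port=<n>, then KIE_MCP_PORT, then 3100.
  const portArg = argv.find((a) => a.startsWith("--port="));
  const rawPort = portArg ? portArg.slice("--port=".length) : env.KIE_MCP_PORT;
  const port = rawPort === undefined ? 3100 : Number(rawPort);

  if (!Number.isInteger(port) || port < 1 || port > 65535) {
    throw new Error(`Invalid port: ${rawPort}`);
  }
  return { mode: "http", port };
}

console.log(parseTransport(["--http", "--port=3100"], {}));
```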

Environment variables

| Variable | Required | Purpose |
|---|---|---|
| KIE_API_KEY | yes | Your kie.ai API key |
| KIE_PROJECT_ROOT | no | Where generated files are saved (default: current working dir). Files go to $KIE_PROJECT_ROOT/kie/assets/raw/ |
| KIE_MCP_PORT | no | Port for HTTP mode (default: 3100) |
| KIE_CALLBACK_URL | no | Callback URL sent with Suno generation requests (kie.ai requires the field; results are fetched by polling regardless). Defaults to an inert placeholder; set this only if you want to receive the callbacks yourself |
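
To make the defaults concrete, here is a small sketch of how the API key and output directory might be resolved from the environment. resolveConfig and its return shape are hypothetical, and the path handling assumes POSIX-style separators; the actual server may do this differently.

```javascript
// Resolve settings from an environment object (pass process.env and
// process.cwd() in Node.js). Hypothetical helper, POSIX paths assumed.
function resolveConfig(env, cwd) {
  if (!env.KIE_API_KEY) {
    throw new Error("KIE_API_KEY is required");
  }
  // KIE_PROJECT_ROOT falls back to the current working directory.
  // Trailing slashes are trimmed so the joined path has no "//".
  const root = (env.KIE_PROJECT_ROOT || cwd).replace(/\/+$/, "");
  return {
    apiKey: env.KIE_API_KEY,
    projectRoot: root,
    // Generated files go to $KIE_PROJECT_ROOT/kie/assets/raw/
    assetsDir: `${root}/kie/assets/raw`,
  };
}

console.log(resolveConfig({ KIE_API_KEY: "your-key" }, "/home/me/project").assetsDir);
```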

Tools available

generate_image, generate_video, generate_music, generate_sfx,
generate_tts, generate_dialogue, generate_sounds, generate_lyrics,
generate_persona, generate_mashup, generate_cover_art,
generate_midi, create_music_video,
create_omni_voice, create_omni_character,
extend_music, cover_audio, upload_extend_audio,
add_instrumental, add_vocals, replace_section,
convert_to_wav, separate_vocals, boost_style,
get_timestamped_lyrics, audio_isolation, speech_to_text,
list_models, check_task, list_tasks, check_credits,
download_result, list_raw_assets, upload_file,
veo_extend, veo_upscale_1080p, veo_upscale_4k, runway_extend
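
MCP clients invoke these tools with a JSON-RPC 2.0 tools/call request. Most clients build this message for you, so the sketch below is only meant to show what travels over the wire. buildToolCall is a hypothetical helper; the filter argument comes from the list_models examples in this README, and argument names for other tools would need to be checked against each tool's schema.

```javascript
// Build a JSON-RPC 2.0 "tools/call" request for an MCP server.
function buildToolCall(id, toolName, args) {
  return {
    jsonrpc: "2.0",
    id,
    method: "tools/call",
    params: { name: toolName, arguments: args },
  };
}

// Example: ask the server for models tagged for lip sync.
const request = buildToolCall(1, "list_models", { filter: "lip-sync" });
console.log(JSON.stringify(request, null, 2));
```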

Smart model recommendations

Try these queries in any MCP client:

list_models filter="reasoning"          # GPT-4o, Nano Banana, GPT Image 2
list_models filter="lip-sync"           # Kling Avatar, Infinitalk, Wan Speech
list_models filter="multi-shot"         # Kling 3.0, Sora Storyboard
list_models filter="cheapest video"     # Wan Flash, Sora 2 standard
list_models filter="alibaba"            # HappyHorse 1.0 family
list_models filter="best visual quality" # Veo Quality, Seedance 2.0
list_models filter="text rendering"     # Ideogram v3, GPT Image 2
list_models filter="character"          # Sora Characters, Ideogram Character
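
One way such a filter could work is to flatten each model's tags, description, and research notes into a single string and require every query word to appear in it. The sketch below takes that approach as an assumption; the sample entries and both helper names are hypothetical, and it does not model ranking queries such as "cheapest video", which presumably need pricing logic as well.

```javascript
// Hypothetical entries mirroring the fields named in this README.
const models = [
  {
    name: "avatar-x",
    description: "Talking-head video from audio",
    capabilities: ["lip-sync", "video"],
    research: { verdict: "Strong mouth motion" },
  },
  {
    name: "storyboard-y",
    description: "Multi-shot video sequences",
    capabilities: ["multi-shot", "video"],
    research: { verdict: "Best for scene continuity" },
  },
];

// Collapse name, tags, description, and research text into one lowercase string.
function searchableText(model) {
  return [
    model.name,
    model.description,
    model.capabilities.join(" "),
    JSON.stringify(model.research ?? {}),
  ].join(" ").toLowerCase();
}

// Keep models whose text contains every whitespace-separated query word.
function filterModels(list, query) {
  const words = query.toLowerCase().split(/\s+/).filter(Boolean);
  return list.filter((m) => {
    const text = searchableText(m);
    return words.every((w) => text.includes(w));
  });
}

console.log(filterModels(models, "lip-sync").map((m) => m.name));
```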

Architecture

server.mjs                      # The whole server (~4000 lines)
├── PRICING                     # Credit cost per model
├── MODEL_REGISTRY              # Image models (45+)
├── VIDEO_MODEL_REGISTRY        # Video models (70+)
├── AUDIO_TOOLS_REGISTRY        # Audio tool metadata
├── createMcpServer()           # Factory for stdio + HTTP modes
└── Tool handlers               # generate_*, list_*, etc.

Each model entry has:

name, description, capabilities (tags), pricing (credits)
aspectRatios, options (with types and defaults)
buildBody / buildInput (request builders)
research (Averiguare verdicts, prompt techniques, weaknesses, comparisons, sources)
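
For orientation, an entry with these fields might look roughly like the following. Every value here is a placeholder, and the request body shape returned by buildBody is an assumption for illustration, not the kie.ai API contract.

```javascript
// A hypothetical registry entry using the fields listed above.
const exampleEntry = {
  name: "Example Image Model",
  description: "Placeholder text-to-image model",
  capabilities: ["text-to-image"],
  pricing: { credits: 4 },
  aspectRatios: ["1:1", "16:9"],
  options: {
    aspectRatio: { type: "string", default: "1:1" },
  },
  // Turn user arguments into a request body, applying option defaults
  // and rejecting aspect ratios the model does not list.
  buildBody(args) {
    const ratio = args.aspectRatio ?? this.options.aspectRatio.default;
    if (!this.aspectRatios.includes(ratio)) {
      throw new Error(`Unsupported aspect ratio: ${ratio}`);
    }
    return { prompt: args.prompt, aspect_ratio: ratio };
  },
  research: { verdict: "", weaknesses: [], sources: [] },
};

console.log(exampleEntry.buildBody({ prompt: "a lighthouse at dusk" }));
```

Keeping the request builder next to the option definitions means a tool handler only needs to look up the entry and call buildBody, rather than branching on model names.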

Credits

Built with the MCP TypeScript SDK
Powered by kie.ai — affordable unified API for 100+ AI models
Model intelligence by Averiguare — "No sabes hasta que averiguas — y averiguo en todas partes."

License

MIT
