media-tools-mcp-server
v0.2.0
MCP server for media operations including image search, video search, media understanding, and image generation
media-tools-server MCP Server
A Model Context Protocol server for media search, download, understanding, and image generation using the Unsplash, YouTube, and Google Gemini APIs.
Features
Tools
image_search
- Search for high-quality images on Unsplash
- Returns image URLs, descriptions, and photographer credits
- Configurable result count (default: 5)
download_image
- Download images from URLs to local filesystem
- Supports any image URL
- Saves with custom filename and path
video_search
- Search for YouTube videos
- Returns video IDs, titles, descriptions, and channel info
- Configurable result count (default: 5)
video_understanding
- Extract transcripts from YouTube videos
- Returns timestamped transcript text
- Useful for content analysis and summarization
- Note: Only works with videos that have transcripts/captions enabled
image_understanding
- Analyze images using Google Gemini AI (gemini-2.5-flash)
- Accepts image URLs or local file paths
- Optional custom prompts for guided analysis
- Returns detailed image descriptions
- Powered by Google's latest multimodal AI
image_generation
- Generate actual images using Google Gemini 2.5 Flash Image model
- Creates high-quality images directly from text prompts
- Supports various image sizes (256px to 4K)
- Saves images to ~/Downloads or custom path
- Powered by Google's latest image generation AI
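As a rough illustration of how an MCP client invokes one of the tools above, the sketch below builds a JSON-RPC 2.0 `tools/call` request, which is the standard MCP method for calling a tool. The helper `buildToolCall` is a hypothetical name, and the argument names `query` and `count` are assumptions made for illustration; the actual input schema of each tool is what the server reports in response to `tools/list`.

```typescript
// Shape of an MCP "tools/call" request (JSON-RPC 2.0).
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: string; arguments: Record<string, unknown> };
}

// Hypothetical helper: wraps a tool name and its arguments in a request object.
function buildToolCall(
  id: number,
  name: string,
  args: Record<string, unknown>,
): ToolCallRequest {
  return {
    jsonrpc: "2.0",
    id,
    method: "tools/call",
    params: { name, arguments: args },
  };
}

// ASSUMPTION: "query" and "count" are illustrative argument names only;
// check the schema returned by tools/list for the real ones.
const request = buildToolCall(1, "image_search", {
  query: "mountain lake",
  count: 5,
});

console.log(JSON.stringify(request));
```

In practice an MCP client such as Claude Desktop constructs these requests itself, so a sketch like this is mainly useful when testing the server by hand.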
Development
Install dependencies:
npm install
Build the server:
npm run build
For development with auto-rebuild:
npm run watch
Installation
Quick Start (Recommended)
The easiest way to use this MCP server is with npx, which requires no separate installation. Add the server to your Claude Desktop configuration file:
On macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
On Windows: %APPDATA%/Claude/claude_desktop_config.json
{
  "mcpServers": {
    "media-tools": {
      "command": "npx",
      "args": ["-y", "media-tools-mcp-server"]
    }
  }
}
Note: This server requires a Google Gemini API key for image understanding features. Set it as an environment variable:
{
  "mcpServers": {
    "media-tools": {
      "command": "npx",
      "args": ["-y", "media-tools-mcp-server"],
      "env": {
        "GEMINI_API_KEY": "your-api-key-here"
      }
    }
  }
}
Alternative: Global Installation
If you prefer to install the package globally:
npm install -g media-tools-mcp-server
Then configure:
{
  "mcpServers": {
    "media-tools": {
      "command": "media-tools-mcp-server",
      "env": {
        "GEMINI_API_KEY": "your-api-key-here"
      }
    }
  }
}
Debugging
Since MCP servers communicate over stdio, debugging can be challenging. We recommend using the MCP Inspector, which is available as a package script:
npm run inspector
The Inspector will provide a URL to access debugging tools in your browser.
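To make the stdio point concrete: in the MCP stdio transport, each JSON-RPC message travels as a single line of JSON terminated by a newline, and the server's stdout is reserved for those messages. Stray print statements on stdout therefore corrupt the stream, which is part of why a dedicated inspector helps. The sketch below shows that framing in isolation; `encodeMessage` and `LineDecoder` are hypothetical names for illustration, not part of this package or of the MCP SDK.

```typescript
// Serialize one JSON-RPC message as a single newline-terminated line.
function encodeMessage(message: object): string {
  // JSON.stringify without an indent argument never emits raw newlines.
  return JSON.stringify(message) + "\n";
}

// Accumulates raw chunks and returns the complete messages found so far.
class LineDecoder {
  private buffer = "";

  push(chunk: string): unknown[] {
    this.buffer += chunk;
    const lines = this.buffer.split("\n");
    // The last element is an incomplete line (or "") and stays buffered.
    this.buffer = lines.pop() ?? "";
    return lines
      .filter((line) => line.trim() !== "")
      .map((line) => JSON.parse(line));
  }
}

const decoder = new LineDecoder();
const wire = encodeMessage({ jsonrpc: "2.0", id: 1, method: "tools/list" });

// Simulate the message arriving in two chunks, as can happen on a pipe.
const first = decoder.push(wire.slice(0, 10));
const second = decoder.push(wire.slice(10));

console.log(first.length, second.length); // prints "0 1"
```

Because stdout carries the protocol, any diagnostic logging added while debugging a server like this one should go to stderr instead.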
