@agentrix/gemini-image-mcp

v1.1.1

Published

18 days ago

MCP server for generating images using Gemini Banana API

0High
0Medium
0Low

xmz.ai

mcp gemini image-generation ai banana

Gemini Image MCP Server

A Model Context Protocol (MCP) server that provides image generation capabilities using Google's Gemini Banana API.

Features

Generate images from text prompts using Gemini API
Support for different aspect ratios (1:1, 16:9, 9:16, 4:3, 3:4)
Negative prompts to avoid unwanted elements
Returns base64 encoded images compatible with MCP clients
Configurable API endpoint, key, and model

Installation

From npm (when published)

npm install -g @agentrix/gemini-image-mcp

From source

git clone <repository-url>
cd gemini-image-mcp
npm install
npm run build

Configuration

The server requires environment variables for configuration:

| Variable | Required | Default | Description | |----------|----------|---------|-------------| | GEMINI_API_KEY | Yes | - | Your Gemini API key | | GEMINI_BASE_URL | No | https://generativelanguage.googleapis.com/v1beta | Gemini API base URL | | GEMINI_MODEL | No | gemini-2.5-flash-image | Model to use for image generation |

Usage

With Claude Desktop

Add to your Claude Desktop configuration (~/Library/Application Support/Claude/claude_desktop_config.json on macOS):

{
  "mcpServers": {
    "gemini-image": {
      "command": "npx",
      "args": ["-y", "@agentrix/gemini-image-mcp"],
      "env": {
        "GEMINI_API_KEY": "your-api-key-here",
        "GEMINI_BASE_URL": "https://generativelanguage.googleapis.com/v1beta",
        "GEMINI_MODEL": "gemini-2.5-flash-image"
      }
    }
  }
}

Or if installed globally:

{
  "mcpServers": {
    "gemini-image": {
      "command": "gemini-image-mcp",
      "env": {
        "GEMINI_API_KEY": "your-api-key-here"
      }
    }
  }
}

Standalone

export GEMINI_API_KEY="your-api-key-here"
export GEMINI_BASE_URL="https://generativelanguage.googleapis.com/v1beta"
export GEMINI_MODEL="gemini-2.5-flash-image"

gemini-image-mcp

Available Tools

generate_image

Generate an image from a text prompt. Can return base64 encoded images or save them to disk.

Parameters:

prompt (string, required): The text description of the image to generate
aspectRatio (string, optional): The aspect ratio of the image. Options: "1:1", "16:9", "9:16", "4:3", "3:4". Default: "1:1"
negativePrompt (string, optional): Description of elements to avoid in the image
path (string, optional): Directory path to save generated images. If provided, all images will be saved as PNG files instead of returning base64 data

Returns:

When path is NOT provided:

Text confirmation message
Base64 encoded image data with MIME type (for all generated images)

When path is provided:

Text message with list of saved file paths

Note: The API may generate multiple images per request. All images will be either returned as base64 data or saved to the specified directory.

Example 1: Return base64 data

{
  "name": "generate_image",
  "arguments": {
    "prompt": "A serene mountain landscape at sunset with a lake reflection",
    "aspectRatio": "16:9",
    "negativePrompt": "people, buildings, text"
  }
}

Example 2: Save to disk

{
  "name": "generate_image",
  "arguments": {
    "prompt": "A cute cat sitting on a sofa",
    "path": "/Users/username/images"
  }
}

Response:

Successfully generated 2 images for prompt: "A cute cat sitting on a sofa"

Images saved to:
- /Users/username/images/image_1737023456789_1.png
- /Users/username/images/image_1737023456789_2.png

Development

# Install dependencies
npm install

# Build
npm run build

# Watch mode for development
npm run dev

# Run locally
npm start

Publishing

To publish to npm:

Update version in package.json
Build the project: npm run build
Login to npm: npm login
Publish: npm publish --access public

Requirements

Node.js >= 18.0.0
Valid Gemini API key with image generation access

Error Handling

The server provides detailed error messages for common issues:

Missing API key
API request failures
Invalid responses
Network timeouts (60 second timeout per request)

License

MIT

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Support

For issues and questions, please open an issue on the GitHub repository.