npm package discovery and stats viewer.

Discover Tips

  • General search

    [free text search, go nuts!]

  • Package details

    pkg:[package-name]

  • User packages

    @[username]

Sponsor

Optimize Toolset

I’ve always been into building performant and accessible sites, but lately I’ve been taking it extremely seriously. So much so that I’ve been building a tool to help me optimize and monitor the sites that I build to make sure that I’m making an attempt to offer the best experience to those who visit them. If you’re into performant, accessible and SEO friendly sites, you might like it too! You can check it out at Optimize Toolset.

About

Hi, 👋, I’m Ryan Hefner  and I built this site for me, and you! The goal of this site was to provide an easy way for me to check the stats on my npm packages, both for prioritizing issues and updates, and to give me a little kick in the pants to keep up on stuff.

As I was building it, I realized that I was actually using the tool to build the tool, and figured I might as well put this out there and hopefully others will find it to be a fast and useful way to search and browse npm packages as I have.

If you’re interested in other things I’m working on, follow me on Twitter or check out the open source projects I’ve been publishing on GitHub.

I am also working on a Twitter bot for this site to tweet the most popular, newest, random packages from npm. Please follow that account now and it will start sending out packages soon–ish.

Open Software & Tools

This site wouldn’t be possible without the immense generosity and tireless efforts from the people who make contributions to the world and share their work via open source initiatives. Thank you 🙏

© 2026 – Pkg Stats / Ryan Hefner

@bioinfornatics/gpt-image-mcp

v0.0.2

Published

MCP server for AI image generation via OpenAI and Azure OpenAI gpt-image-* models

Readme

gpt-image-mcp

MCP server for AI image generation via OpenAI and Azure OpenAI gpt-image-* models.
Built with Bun + NestJS · Streamable HTTP + stdio transports · 305 tests, 92 % coverage


Features

| Capability | Detail | |-----------|--------| | 🖼️ image_generate | Text → image with all gpt-image-* models | | ✏️ image_edit | Inpainting with optional mask (up to 16 input images) | | 🔀 image_variation | dall-e-2 variations (dall-e-2 only) | | 🔍 provider_list / validate | Connectivity check without generating an image | | ⚡ MCP Elicitation | Interactive parameter refinement (quality, size) | | 🧠 MCP Sampling | Prompt enhancement via client LLM | | 💾 MCP Roots | Save images directly to your workspace | | 🔒 Security | Bearer-token auth, rate limiting, secret masking, path traversal prevention | | 📊 Observability | Prometheus /metrics, /health/live, /health/ready |


Quick Start

Prerequisites

  • Bun ≥ 1.1 or Node.js ≥ 18 or Docker
  • An OpenAI API key or Azure OpenAI resource

Option A — Zero-install with bun x (recommended)

# stdio transport (Claude Desktop, Goose, Cursor)
PROVIDER=openai OPENAI_API_KEY=sk-... bun x --bun gpt-image-mcp

# HTTP transport on port 3000
PROVIDER=openai OPENAI_API_KEY=sk-... MCP_TRANSPORT=http PORT=3000 bun x --bun gpt-image-mcp

# Azure OpenAI
PROVIDER=azure \
  AZURE_OPENAI_ENDPOINT=https://my-resource.openai.azure.com \
  AZURE_OPENAI_API_KEY=... \
  AZURE_OPENAI_DEPLOYMENT=gpt-image-2 \
  bun x --bun gpt-image-mcp

--bun flag forces Bun runtime instead of Node.js. Both work.

With npx (Node.js users):

PROVIDER=openai OPENAI_API_KEY=sk-... npx gpt-image-mcp

Option B — Install globally

bun add -g gpt-image-mcp
PROVIDER=openai OPENAI_API_KEY=sk-... gpt-image-mcp

Option C — Clone & run from source

git clone https://github.com/bioinfornatics/gpt-image-mcp
cd gpt-image-mcp
bun install
cp .env.example .env   # then edit with your keys

bun run start:http     # HTTP on :3000
bun run start:stdio    # stdio

Option D — Docker

docker build -t gpt-image-mcp .

docker run -p 3000:3000 \
  -e PROVIDER=openai \
  -e OPENAI_API_KEY=sk-... \
  gpt-image-mcp

Configuration

| Variable | Required | Default | Description | |----------|----------|---------|-------------| | PROVIDER | ✅ | — | openai, azure, together, or custom | | OPENAI_API_KEY | ✅ if openai | — | OpenAI API key (sk-...) | | OPENAI_BASE_URL | ❌ | https://api.openai.com/v1 | Override for proxy | | AZURE_OPENAI_ENDPOINT | ✅ if azure | — | https://myresource.openai.azure.com | | AZURE_OPENAI_API_KEY | ✅ if azure | — | Azure API key | | AZURE_OPENAI_DEPLOYMENT | ✅ if azure | — | Deployment name | | AZURE_OPENAI_API_VERSION | ❌ | 2025-04-01-preview | API version | | MCP_TRANSPORT | ❌ | http | http or stdio | | PORT | ❌ | 3000 | HTTP port | | MCP_API_KEY | ❌ | — | Bearer token to protect /mcp | | DEFAULT_MODEL | ❌ | gpt-image-2 | Default image model (override per-request via tool param) | | USE_ELICITATION | ❌ | true | Enable MCP Elicitation | | USE_SAMPLING | ❌ | true | Enable MCP Sampling | | MAX_REQUESTS_PER_MINUTE | ❌ | 60 | Rate limit per client | | LOG_LEVEL | ❌ | info | debug/info/warn/error |


MCP Client Setup

Claude Desktop

~/Library/Application Support/Claude/claude_desktop_config.json (macOS) — %APPDATA%\Claude\claude_desktop_config.json (Windows):

{
  "mcpServers": {
    "gpt-image-mcp": {
      "command": "bun",
      "args": ["x", "--bun", "gpt-image-mcp"],
      "env": {
        "PROVIDER": "openai",
        "OPENAI_API_KEY": "sk-...",
        "MCP_TRANSPORT": "stdio",
        "LOG_LEVEL": "error"
      }
    }
  }
}

Goose

extensions:
  - name: gpt-image-mcp
    type: stdio
    cmd: bun
    args: [x, --bun, gpt-image-mcp]
    env:
      PROVIDER: openai
      OPENAI_API_KEY: sk-...
      MCP_TRANSPORT: stdio

HTTP / Remote Client

POST http://localhost:3000/mcp
Accept: application/json, text/event-stream
Content-Type: application/json
Authorization: Bearer <MCP_API_KEY>   # only if MCP_API_KEY is set

{"jsonrpc":"2.0","id":1,"method":"tools/call","params":{"name":"image_generate","arguments":{"prompt":"a cat"}}}

Available Models

As of April 23, 2026 — model landscape has changed significantly. dall-e-3 was retired.

OpenAI (PROVIDER=openai)

| Model | Status | Sizes | Best for | |-------|--------|-------|----------| | gpt-image-2 | ✅ Default · Recommended | Arbitrary up to 4K (multiples of 16 px, ≤ 3:1 ratio) | Production — best quality, 4K, flexible resolution | | gpt-image-1.5 | ✅ Available | 1024×1024, 1024×1536, 1536×1024 | Compatibility / migration | | gpt-image-1-mini | ✅ Available | 1024×1024, 1024×1536, 1536×1024 | High-volume, cost-sensitive pipelines | | gpt-image-1 | ✅ Available | 1024×1024, 1024×1536, 1536×1024 | Legacy workflows | | dall-e-2 | ⚠️ Variations only | 256×256, 512×512, 1024×1024 | Image variations endpoint only | | ~~dall-e-3~~ | ⛔ Retired 2026-03-04 | — | No longer available |

Azure OpenAI (PROVIDER=azure)

| Model | Status | Notes | |-------|--------|-------| | gpt-image-2 | ✅ Public Preview | No access application needed. Arbitrary resolution up to 4K. | | gpt-image-1.5 | ⚠️ Limited Access | Apply at aka.ms/oai/gptimage1.5access | | gpt-image-1-mini | ⚠️ Limited Access | Apply at aka.ms/oai/gptimage1access | | gpt-image-1 | ⚠️ Limited Access | Apply at aka.ms/oai/gptimage1access | | ~~dall-e-3~~ | ⛔ Retired 2026-03-04 | Existing deployments are non-functional |

💡 Azure users: gpt-image-2 is the easiest to start with — Public Preview requires no prior approval.
A 403 for gpt-image-1.x means you need to register at the links above.


gpt-image-2 Highlights

gpt-image-2 (released April 21, 2026) is the flagship model and the recommended default:

| Feature | Detail | |---------|--------| | Resolution | Arbitrary — both edges must be multiples of 16 px, max edge 3840 px (4K), max ratio 3:1, pixel range 655 360 – 8 294 400 | | Quality | low · medium · high · auto | | Output formats | png (with transparency), webp (with transparency + compression), jpeg (compression) | | Background | transparent · opaque · auto | | Images per request | 1 – 10 (n parameter) | | Prompt length | Up to 32 000 characters | | Image editing inputs | Up to 16 images per edit request | | Streaming | ✅ stream: true, partial_images: 0–3 | | Variations | ❌ Not supported (use dall-e-2 for variations) | | Text rendering | ✅ Excellent — infographics, banners, labels | | Photorealism | ✅ Best-in-class — use keyword photorealistic in prompt |

Popular resolution examples for gpt-image-2:

| Use case | Resolution | |----------|-----------| | Square (general) | 1024×1024 | | Portrait | 1024×1536 | | Landscape | 1536×1024 | | Widescreen / slides | 1536×864 | | 2K / QHD (reliability limit) | 2560×1440 | | 4K / UHD (experimental) | 3840×2160 |


Tools Reference

image_generate

{
  "prompt": "A serene Japanese garden at dawn, photorealistic",
  "model": "gpt-image-2",
  "n": 1,
  "size": "1536×1024",
  "quality": "high",
  "background": "transparent",
  "output_format": "webp",
  "output_compression": 85,
  "save_to_workspace": false,
  "response_format": "markdown"
}

image_edit

{
  "image": "<base64-encoded-png>",
  "mask": "<base64-encoded-mask>",
  "prompt": "Add a red hat to the person",
  "model": "gpt-image-2"
}

image_variation

{
  "image": "<base64-encoded-square-png>",
  "n": 3,
  "size": "1024×1024"
}

⚠️ Variations use dall-e-2 exclusively. Not supported by Azure OpenAI or any gpt-image-* model.

provider_validate

{ "provider": "openai" }

Development

bun test                  # 305 tests
bun test --coverage       # with coverage report (≥ 92 %)
bun run lint              # ESLint
bun run type-check        # tsc --noEmit
bun run build             # compile to dist/

Project Structure

src/
├── config/          # Joi-validated env config + models.ts (LATEST_MODEL)
├── mcp/
│   ├── tools/       # 5 MCP tools + Zod schemas
│   ├── features/    # Elicitation, Sampling, Roots
│   └── transport/   # HTTP (stateless) + stdio
├── providers/       # OpenAI + Azure + Together + Custom adapters
├── security/        # Auth guard, rate limiter, sanitiser
└── health/          # /health/* + /metrics
test/
├── unit/            # 240+ unit tests (mocked deps)
└── integration/     # 65+ integration tests (nock + supertest)

Security

  • API keys are never logged — masked with *** in all output
  • Input prompts sanitised (null bytes stripped, bidi control chars removed, length enforced)
  • Path traversal prevented when saving to workspace roots
  • Rate limiting: configurable per-client sliding window
  • Optional bearer-token auth on the /mcp endpoint
  • Container runs as non-root user (mcpuser)
  • Trivy vulnerability scan on every CI build
  • bun audit dependency audit on every CI build

See docs/SECURITY.md for full threat model.


Architecture

MCP Client (Claude / Goose / Cursor)
      │  JSON-RPC 2.0
      ▼
  POST /mcp  (Streamable HTTP)
  or stdio
      │
  ┌───┴────────────────────────────────┐
  │  NestJS Application                │
  │  ┌──────────┐  ┌────────────────┐  │
  │  │ 5 Tools  │  │ MCP Features   │  │
  │  │ schemas  │  │ Elicitation    │  │
  │  │ Zod val. │  │ Sampling       │  │
  │  └────┬─────┘  │ Roots          │  │
  │       │        └────────────────┘  │
  │  ┌────▼──────────────────────────┐ │
  │  │  IImageProvider               │ │
  │  │  OpenAI · Azure · Together    │ │
  │  │  Custom OpenAI-compatible     │ │
  │  └────┬──────────────────────────┘ │
  └───────┼────────────────────────────┘
          │  HTTPS
          ▼
  OpenAI API / Azure OpenAI

See docs/ARCHITECTURE.md for C4 diagrams and ADRs.


Contributing

  1. Fork → branch → implement with TDD (Red → Green → Refactor)
  2. All PRs must have a failing-test commit before the implementation commit
  3. Coverage must not drop below 90 %
  4. Run bun run lint && bun run type-check && bun test before pushing

See docs/TDD_STRATEGY.md for the full TDD workflow.


License

CeCILL-2.1 — a French open-source licence compatible with GNU GPL, endorsed by CEA, CNRS, and Inria.
© 2026 PhD Jonathan MERCIER