npm package discovery and stats viewer.

Discover Tips

  • General search

    [free text search, go nuts!]

  • Package details

    pkg:[package-name]

  • User packages

    @[username]

Sponsor

Optimize Toolset

I’ve always been into building performant and accessible sites, but lately I’ve been taking it extremely seriously. So much so that I’ve been building a tool to help me optimize and monitor the sites that I build to make sure that I’m making an attempt to offer the best experience to those who visit them. If you’re into performant, accessible and SEO friendly sites, you might like it too! You can check it out at Optimize Toolset.

About

Hi, 👋, I’m Ryan Hefner  and I built this site for me, and you! The goal of this site was to provide an easy way for me to check the stats on my npm packages, both for prioritizing issues and updates, and to give me a little kick in the pants to keep up on stuff.

As I was building it, I realized that I was actually using the tool to build the tool, and figured I might as well put this out there and hopefully others will find it to be a fast and useful way to search and browse npm packages as I have.

If you’re interested in other things I’m working on, follow me on Twitter or check out the open source projects I’ve been publishing on GitHub.

I am also working on a Twitter bot for this site to tweet the most popular, newest, random packages from npm. Please follow that account now and it will start sending out packages soon–ish.

Open Software & Tools

This site wouldn’t be possible without the immense generosity and tireless efforts from the people who make contributions to the world and share their work via open source initiatives. Thank you 🙏

© 2026 – Pkg Stats / Ryan Hefner

@mseep/mcp-vision-relay

v1.0.0

Published

[English](README.md) | [中文说明](README.zh-CN.md)

Readme

MCP Vision Relay

English | 中文说明

One-line summary: MCP Vision Relay wraps local Gemini/Qwen multimodal CLIs into a unified MCP tool so text-only models can instantly gain vision skills.

MCP Vision Relay 是一个面向 Claude、Codex 等 MCP 客户端的视觉“中继站”,把本地安装的多模态 CLI(目前支持 Google Gemini CLI 与 Qwen CLI)统一成标准的 MCP 工具。

为什么需要它? Claude Code 如果对接 k2、DeepSeek、MiniMax M2 等文本模型时,本身不具备多模态能力。接入 MCP Vision Relay 后,可借助 Gemini/Qwen CLI 来补齐看图能力,在不更换底层模型的情况下实现“物美价廉”的体验。

功能亮点

  • 双提供方支持:Gemini 与 Qwen 两套 CLI 都能通过统一工具调用。
  • 三种输入形式:本地路径、HTTP(S) URL、base64 全覆盖,自动校验大小与扩展名。
  • 可配置执行:模型、输出格式、sandbox、超时、附加 flag 均可按需调整。
  • Provider 抽象层:新 CLI 只需实现 provider,就能无缝接入工具层。
  • 结构化结果:工具返回标准文本 + 元信息,方便在 IDE/CLI 中展示。

快速开始

前置条件

  1. Node.js 18 或更高版本
  2. 已在本地安装并配置好的 Gemini CLI、Qwen CLI(确认 gemini -p "hi"qwen -p "hi" 可正常执行)

安装与构建

npm install
npm run build

启动 MCP 服务

# 开发模式:直接执行 TypeScript
npm run dev

# 生产模式:使用 dist 产物
npm run start

启动后会通过 stdio 暴露 MCP 服务,可配合 claude mcp addcodex mcp add 使用。

与 MCP 客户端集成示例

⚠️ 注册 MCP 客户端时请直接调用入口。命令里使用 npm run dev 会输出 npm 横幅到 stdout,导致 MCP 握手失败。如需保留 npm 脚本,请改为 npm --silent run dev --

Claude CLI

claude mcp add mcp-vision-relay -- npx tsx /absolute/path/to/mcp-vision-relay/src/index.ts

Codex CLI

codex mcp add mcp-vision-relay -- node /absolute/path/to/mcp-vision-relay/dist/index.js

注册完成后,在对应会话中即可使用 mcp-vision-relay 的工具。

可用工具

gemini_analyze_image

  • 自动把 prompt 与 @绝对路径 组合传给 Gemini CLI。
  • 支持自定义模型(model)、输出格式(outputFormat)、sandbox、额外参数等。

qwen_analyze_image

  • 对本地文件自动转成 data URL,确保 Qwen CLI 可以读取。
  • 其他参数与 Gemini 版本保持一致。

工具返回 Markdown 文本,并带有 modelimageSourcedurationMs 等元信息,便于后续处理。

配置说明

复制 .env.example 即可自定义:

  • GEMINI_CLI_COMMAND / QWEN_CLI_COMMAND
  • GEMINI_DEFAULT_MODEL / QWEN_DEFAULT_MODEL
  • GEMINI_OUTPUT_FORMAT
  • MCP_COMMAND_TIMEOUT_MS
  • MCP_MAX_IMAGE_BYTES
  • MCP_ALLOWED_IMAGE_EXTENSIONS
  • MCP_IMAGE_TEMP_DIR

如果只想为特定调用覆盖 CLI,可在工具参数里传 cliPath

本地验证脚本

npx tsx scripts/dev/manual-gemini-test.ts
npx tsx scripts/dev/manual-qwen-test.ts

脚本将直接调用 runTool,便于排查 CLI 输出问题。

项目结构

src/
  index.ts            # 程序入口
  config/             # 配置与默认值
  providers/          # Gemini/Qwen CLI 适配层
  server/             # MCP server wiring
  tools/              # 工具注册与 schema 定义
  utils/              # 文件、命令执行等工具函数
scripts/
  dev/                # 手动测试脚本
test-assets/          # 示例图片

开发规划

  • [x] Gemini CLI 图像分析
  • [x] Qwen CLI 图像分析
  • [ ] 更多多模态 provider
  • [ ] 自动化测试、CI、格式化

许可协议

MIT License,欢迎提交 issue / PR。

致谢