npm package discovery and stats viewer.

Discover Tips

  • General search

    [free text search, go nuts!]

  • Package details

    pkg:[package-name]

  • User packages

    @[username]

Sponsor

Optimize Toolset

I’ve always been into building performant and accessible sites, but lately I’ve been taking it extremely seriously. So much so that I’ve been building a tool to help me optimize and monitor the sites that I build to make sure that I’m making an attempt to offer the best experience to those who visit them. If you’re into performant, accessible and SEO friendly sites, you might like it too! You can check it out at Optimize Toolset.

About

Hi, 👋, I’m Ryan Hefner  and I built this site for me, and you! The goal of this site was to provide an easy way for me to check the stats on my npm packages, both for prioritizing issues and updates, and to give me a little kick in the pants to keep up on stuff.

As I was building it, I realized that I was actually using the tool to build the tool, and figured I might as well put this out there and hopefully others will find it to be a fast and useful way to search and browse npm packages as I have.

If you’re interested in other things I’m working on, follow me on Twitter or check out the open source projects I’ve been publishing on GitHub.

I am also working on a Twitter bot for this site to tweet the most popular, newest, random packages from npm. Please follow that account now and it will start sending out packages soon–ish.

Open Software & Tools

This site wouldn’t be possible without the immense generosity and tireless efforts from the people who make contributions to the world and share their work via open source initiatives. Thank you 🙏

© 2026 – Pkg Stats / Ryan Hefner

mimo-mcp-server

v0.1.2

Published

MCP Server for MiMo multimodal capabilities (image, audio, video understanding, ASR, TTS)

Readme

MiMo MCP Server

  • 一个为 MiMo 多模态模型能力提供统一的 Model Context Protocol (MCP) 接口的服务器,支持图像理解、音频理解、视频理解、语音识别和语音合等功能成等功能。
  • 当你使用的模型不支持多模态时,可以使用该方式获得多模态能力。 例如 mimo-v2.5-pro 没有多模态能力,但是可以通过该 MCP Server 调用拥有多模态能力的 mimo-v2.5 模型。

功能特性

| 功能模块 | 特性说明 | 支持格式/场景 | 对话图例 | | ------------- | --------------------------------------------------------------------------------------- | ----------------------- | --------------------------------------------------------------------------------------------------------------------------------------------------- | | 🖼️ 图像理解 | 支持图像 URL、Base64 编码、本地文件路径等多种输入方式。支持单张或多张图像分析。支持图像描述、OCR 文字提取、场景识别等。 | JPEG、PNG、GIF、WebP、BMP | 图像理解示例 | | 🎵 音频理解 | 支持音频 URL、Base64 编码、本地文件路径等多种输入方式。支持音频内容分析、情感识别、语言检测等。 | MP3、WAV、FLAC、M4A、OGG | 音频理解示例 | | 🎬 视频理解 | 支持视频 URL、Base64 编码、本地文件路径等多种输入方式。支持视频内容分析、场景描述、动作识别等。 | MP4、MOV、AVI、WMV | 视频理解示例 | | 🎤 语音识别 (ASR) | 支持中文、英文及自动语言检测。支持方言识别(粤语、吴语、闽南语、四川话等)。 | 会议转写、歌词识别、嘈杂环境录音 | 语音识别示例 | | 🔊 语音合成 (TTS) | 预置音色:提供多种预置音色(冰糖、茉莉、苏打、白桦等)。音色设计:通过文本描述自定义音色特征。音色复刻:基于音频样本克隆音色。支持情感控制和风格指令。 | 多种预置音色 + 自定义音色设计 + 音频克隆 | 使用预置音色使用文本设计音色使用文本设计音色(智能润色)音色复刻 |

Tips:

快速开始

  • 请先确保你的环境中已安装 NodeJS,建议版本 >= 22

方式一:直接使用 NPM 上发行的包 mimo-mcp-server

  • MCP Server 配置
{
  "mcpServers": {
    "mimo-mcp-server": {
      "command": "npx",
      "args": [
        "mimo-mcp-server",
        "-y"
      ],
      "env": {
        "MIMO_API_KEY": "your-api-key-here"
      }
    }
  }
}

方式二:从 GitHub 下载项目文件

  • 下载、安装、构建项目
# 下载项目文件
git clone https://github.com/AlionSSS/mimo-mcp-server.git
cd mimo-mcp-server

# 安装依赖
npm install
# 构建
npm run build
  • MCP Server 配置
{
  "mcpServers": {
    "mimo-mcp-server": {
      "command": "node",
      "args": [
        "<PATH_TO>/mimo-mcp-server/dist/index.js"
      ],
      "env": {
        "MIMO_API_KEY": "your-api-key-here"
      }
    }
  }
}

可用的环境变量 env

# 必需:MiMo API Key
MIMO_API_KEY="your-api-key-here"

# 可选:自定义 API 端点(默认为小米官方端点)
MIMO_API_URL="https://token-plan-cn.xiaomimimo.com/v1/chat/completions"

# 可选:自定义模型 ID
MIMO_MODEL_MULTIMODAL="mimo-v2.5"
MIMO_MODEL_ASR="mimo-v2.5-asr"
MIMO_MODEL_TTS="mimo-v2.5-tts"
MIMO_MODEL_TTS_VOICE_DESIGN="mimo-v2.5-tts-voicedesign"
MIMO_MODEL_TTS_VOICE_CLONE="mimo-v2.5-tts-voiceclone"

项目结构

mimo-mcp-server/
├── src/
│   ├── services/
│   │   └── mimo-client.ts          # MiMo API 客户端
│   ├── tools/
│   │   ├── audio-understanding.ts   # 音频理解工具
│   │   ├── image-understanding.ts   # 图像理解工具
│   │   ├── speech-recognition.ts    # 语音识别工具
│   │   ├── speech-synthesis.ts      # 语音合成工具
│   │   └── video-understanding.ts   # 视频理解工具
│   ├── constants.ts                 # 常量定义
│   ├── index.ts                     # 主入口
│   └── types.ts                     # 类型定义
├── resources/                       # 示例资源
├── docs/                            # 文档
├── package.json
└── tsconfig.json

技术栈

  • 运行时: Node.js 22+
  • 语言: TypeScript
  • 构建工具: esbuild
  • MCP SDK: @modelcontextprotocol/sdk
  • HTTP 客户端: axios
  • 数据验证: zod

开发

  • 下载,克隆项目
git clone https://github.com/AlionSSS/mimo-mcp-server.git
cd mimo-mcp-server
  • 安装依赖 $ npm install
  • 开发模式 $ npm run dev
  • 构建 $ npm run build
  • 代码检查 $ npx tsc --noEmit

许可证

MIT License

相关链接