npm package discovery and stats viewer.

Discover Tips

  • General search

    [free text search, go nuts!]

  • Package details

    pkg:[package-name]

  • User packages

    @[username]

Sponsor

Optimize Toolset

I’ve always been into building performant and accessible sites, but lately I’ve been taking it extremely seriously. So much so that I’ve been building a tool to help me optimize and monitor the sites that I build to make sure that I’m making an attempt to offer the best experience to those who visit them. If you’re into performant, accessible and SEO friendly sites, you might like it too! You can check it out at Optimize Toolset.

About

Hi, 👋, I’m Ryan Hefner  and I built this site for me, and you! The goal of this site was to provide an easy way for me to check the stats on my npm packages, both for prioritizing issues and updates, and to give me a little kick in the pants to keep up on stuff.

As I was building it, I realized that I was actually using the tool to build the tool, and figured I might as well put this out there and hopefully others will find it to be a fast and useful way to search and browse npm packages as I have.

If you’re interested in other things I’m working on, follow me on Twitter or check out the open source projects I’ve been publishing on GitHub.

I am also working on a Twitter bot for this site to tweet the most popular, newest, random packages from npm. Please follow that account now and it will start sending out packages soon–ish.

Open Software & Tools

This site wouldn’t be possible without the immense generosity and tireless efforts from the people who make contributions to the world and share their work via open source initiatives. Thank you 🙏

© 2026 – Pkg Stats / Ryan Hefner

@tuannvm/vision-mcp-server

v0.0.4

Published

Local-only OCR MCP server using Apple Vision Framework. Fully offline, privacy-focused text extraction from images.

Readme

Vision MCP Server

npm version license

Local-only OCR MCP server using Apple Vision Framework. Fully offline, privacy-focused text extraction from images directly in Claude Code.

graph LR
    A[Claude Code] --> B[Vision MCP Server]
    B --> C[Apple Vision Framework]
    C --> D[Local Text Extraction]

    style A fill:#FF6B35
    style B fill:#4A90E2
    style C fill:#00D4AA
    style D fill:#FFA500

Quick Start

1. Install the Server

claude mcp add local-ocr -- npx -y @tuannvm/vision-mcp-server

2. Start Using

Extract the text from this image
Extract Chinese text using fast recognition

One-Click Install

VS Code VS Code Insiders Cursor

Tools

| Tool | Description | |------|-------------| | ocr_extract_text | Extract text from images using Apple Vision Framework |

Examples

The tool automatically detects the input format:

Pasted Images (Base64):

Extract the text from this image

When you paste an image in Claude Code, it's automatically converted to base64 format.

Local File Paths:

Read text from /Users/username/Desktop/screenshot.png
Extract text from ~/Downloads/receipt.jpg

Remote URLs:

Extract text from https://example.com/screenshot.jpg
OCR the image at https://example.org/photo.png

Multi-language extraction:

Extract Chinese and Japanese text from this screenshot

Fast recognition mode:

Extract text using fast recognition mode

Advanced options:

Extract text with recognition level "fast" and language correction disabled

Requirements

  • macOS 13.0+ — Apple Vision Framework is built into macOS
  • Node.js 18+ — Required for MCP server runtime
  • Apple Silicon or Intel — Both arm64 and x64 are supported

Supported Languages

en-US, zh-Hans, zh-Hant, ja, ko, es, fr, de, it, pt-BR, ru, ar, th, vi, nl, pl, tr

Full language list

Documentation

Development

npm install         # Install dependencies
npm run build       # Build Swift binary
npm run build:release  # Build optimized release binary
npm test            # Run tests

Related Projects

  • codex-mcp-server — MCP server for OpenAI Codex CLI with AI-powered code analysis and review
  • gemini-mcp-server — MCP server for Gemini CLI with 1M+ token context, web search, and media analysis

License

MIT