@tuannvm/vision-mcp-server
v0.0.4
Published
Local-only OCR MCP server using Apple Vision Framework. Fully offline, privacy-focused text extraction from images.
Maintainers
Readme
Vision MCP Server
Local-only OCR MCP server using Apple Vision Framework. Fully offline, privacy-focused text extraction from images directly in Claude Code.
graph LR
A[Claude Code] --> B[Vision MCP Server]
B --> C[Apple Vision Framework]
C --> D[Local Text Extraction]
style A fill:#FF6B35
style B fill:#4A90E2
style C fill:#00D4AA
style D fill:#FFA500Quick Start
1. Install the Server
claude mcp add local-ocr -- npx -y @tuannvm/vision-mcp-server2. Start Using
Extract the text from this image
Extract Chinese text using fast recognitionOne-Click Install
Tools
| Tool | Description |
|------|-------------|
| ocr_extract_text | Extract text from images using Apple Vision Framework |
Examples
The tool automatically detects the input format:
Pasted Images (Base64):
Extract the text from this imageWhen you paste an image in Claude Code, it's automatically converted to base64 format.
Local File Paths:
Read text from /Users/username/Desktop/screenshot.png
Extract text from ~/Downloads/receipt.jpgRemote URLs:
Extract text from https://example.com/screenshot.jpg
OCR the image at https://example.org/photo.pngMulti-language extraction:
Extract Chinese and Japanese text from this screenshotFast recognition mode:
Extract text using fast recognition modeAdvanced options:
Extract text with recognition level "fast" and language correction disabledRequirements
- macOS 13.0+ — Apple Vision Framework is built into macOS
- Node.js 18+ — Required for MCP server runtime
- Apple Silicon or Intel — Both arm64 and x64 are supported
Supported Languages
en-US, zh-Hans, zh-Hant, ja, ko, es, fr, de, it, pt-BR, ru, ar, th, vi, nl, pl, tr
Documentation
- API Reference — Full tool parameters and response formats
- Architecture — Technical design details
Development
npm install # Install dependencies
npm run build # Build Swift binary
npm run build:release # Build optimized release binary
npm test # Run testsRelated Projects
- codex-mcp-server — MCP server for OpenAI Codex CLI with AI-powered code analysis and review
- gemini-mcp-server — MCP server for Gemini CLI with 1M+ token context, web search, and media analysis
License
MIT
