agent-vision
v0.0.1
Published
Analyze images using AI vision (powered by Puter + Kimi K2.5)
Maintainers
Readme
Agent Vision CLI
A CLI tool for image analysis using AI vision, powered by Puter and Kimi K2.5.
Note: Kimi K2.5 is currently FREE on Puter for a limited time!
Installation
npm install -g agent-visionQuick Start
# First, authenticate
agent-vision login
# Analyze an image
agent-vision analyze screenshot.png
agent-vision analyze photo.jpg "What objects are in this image?"Commands
agent-vision login
Opens your browser to authenticate with Puter. Your token is stored securely in your config directory.
agent-vision logout
Removes your stored authentication token.
agent-vision analyze <image> [prompt]
Analyzes an image using AI vision.
Arguments:
image- Path to image file (png, jpg, jpeg, gif, webp)prompt- Optional custom prompt (default: "Describe what you see in this image.")
Options:
-d, --debug- Enable debug output-m, --model <model>- AI model to use (default: kimi-k2.5)
Environment Variables
PUTER_TOKEN- Set token directly for CI/CD or automation (skips stored token)
Examples
# Basic usage
agent-vision analyze screenshot.png
# Custom prompt
agent-vision analyze diagram.png "Explain this technical diagram"
agent-vision analyze photo.jpg "What objects are visible?"
# Debug mode
agent-vision analyze --debug photo.jpg
# CI/CD usage (no stored token needed)
PUTER_TOKEN=xxx agent-vision analyze image.png "Describe this"For Agents & Automation
This tool outputs clean text responses suitable for programmatic consumption.
Error handling for automation:
When no token is configured, the CLI fails with a clear error message:
Error: No authentication token found.
To authenticate, run:
agent-vision login
For CI/CD or automation, set the PUTER_TOKEN environment variable:
PUTER_TOKEN=xxx agent-vision analyze image.png
Get your token from: https://puter.comToken Storage
Tokens are stored in platform-appropriate config directories:
- Linux:
~/.config/agent-vision/token(respects$XDG_CONFIG_HOME) - macOS:
~/Library/Application Support/agent-vision/token - Windows:
%APPDATA%/agent-vision/token
Backend
Powered by:
- Puter (https://puter.com) - Cloud platform providing the API
- Kimi K2.5 - 1T parameter multimodal AI with MoonViT vision encoder
License
MIT
