@lokiyou/modelscope-vision
v0.1.3
Simple image understanding for Pi Coding Agent. Install it, set your API key, and ask normal questions about images.
Install it, set your ModelScope API key, choose a model, and ask normal questions about images. The extension is designed to feel straightforward for everyday use.
What it does
It adds image understanding to Pi.
After installation, you can ask Pi to:
- describe an image
- answer questions about an image
- read visible text in an image
- inspect either a public image URL or a local image file
Most users do not need to call the tools manually.
Installation
pi install npm:@lokiyou/modelscope-vision
/reload
Configuration
The extension stores its configuration at:
~/.pi/agent/extensions/modelscope-vision/config.json
In most cases, you only need to do two things:
- set your API key
- choose the model you want to use
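If you want to check from a script whether the configuration file exists, a minimal sketch might look like the following. It only assumes the path given above; the field names inside config.json are not documented here, so the result is treated as an untyped record and only its keys are printed. The helper names `configPath` and `readConfig` are illustrative and not part of the extension.

```typescript
import * as fs from "node:fs";
import * as os from "node:os";
import * as path from "node:path";

// Builds the config path from the README, expanding "~" to the home directory.
function configPath(home: string = os.homedir()): string {
  return path.join(home, ".pi", "agent", "extensions", "modelscope-vision", "config.json");
}

// Returns the parsed file, or null when it has not been created yet.
// The schema is an unknown here, so no specific fields are assumed.
function readConfig(file: string = configPath()): Record<string, unknown> | null {
  if (!fs.existsSync(file)) return null;
  return JSON.parse(fs.readFileSync(file, "utf8")) as Record<string, unknown>;
}

const config = readConfig();
// Print only the key names so that a stored token never ends up in the output.
console.log(config === null ? "no config file yet" : Object.keys(config));
```

The /modelscope-vision commands remain the intended way to change settings; this sketch only reads the file.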
Set the API key
/modelscope-vision key
Enter your ModelScope access token when prompted.
Set the model
/modelscope-vision model Qwen/Qwen3-VL-32B-Instruct
Optional: set a custom base URL
/modelscope-vision base-url https://api-inference.modelscope.cn/v1
View the current configuration
/modelscope-vision config
After changing configuration, run /reload if needed.
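To illustrate how the three settings fit together, here is a rough sketch of a vision request. It assumes the base URL exposes an OpenAI-compatible chat completions endpoint; the README does not describe the request the extension actually sends, so this is not its implementation. The function `buildVisionRequest`, the placeholder token, and the example image URL are all hypothetical.

```typescript
interface VisionRequest {
  url: string;
  headers: Record<string, string>;
  body: string;
}

// Assembles (but does not send) a chat completions request with one image.
// Assumption: the endpoint accepts OpenAI-style "image_url" content parts.
function buildVisionRequest(
  baseUrl: string,
  apiKey: string,
  model: string,
  question: string,
  imageUrl: string,
): VisionRequest {
  return {
    // Strip trailing slashes so the joined URL has exactly one separator.
    url: `${baseUrl.replace(/\/+$/, "")}/chat/completions`,
    headers: {
      "Content-Type": "application/json",
      Authorization: `Bearer ${apiKey}`,
    },
    body: JSON.stringify({
      model,
      messages: [
        {
          role: "user",
          content: [
            { type: "text", text: question },
            { type: "image_url", image_url: { url: imageUrl } },
          ],
        },
      ],
    }),
  };
}

const req = buildVisionRequest(
  "https://api-inference.modelscope.cn/v1",
  "<your-token>", // placeholder, not a real token
  "Qwen/Qwen3-VL-32B-Instruct",
  "What does the text in this image say?",
  "https://example.com/screenshot.png", // hypothetical image
);
console.log(req.url); // prints "https://api-inference.modelscope.cn/v1/chat/completions"
```

Under the same assumption, the request could be sent with `fetch(req.url, { method: "POST", headers: req.headers, body: req.body })`.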
How to use it
After installation, use normal prompts such as:
- "Describe this image in detail."
- "How many people are in this photo?"
- "What does the text in this image say?"
- "What error message is shown in this screenshot?"
The extension supports either of the following inputs:
- image_url for a public image URL
- image_path for an absolute local file path
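The two inputs can be checked with a small sketch like the one below. The names `image_url` and `image_path` come from the list above; the function `checkImageInput`, the rule that exactly one of them must be set, and the restriction to http(s) URLs are assumptions made for illustration, not documented behavior of the extension.

```typescript
import * as path from "node:path";

type ImageInput = { image_url?: string; image_path?: string };

// Reports which kind of input was given, or throws if the input is unusable.
function checkImageInput(input: ImageInput): "url" | "path" {
  const hasUrl = typeof input.image_url === "string" && input.image_url.length > 0;
  const hasPath = typeof input.image_path === "string" && input.image_path.length > 0;

  // Assumption: exactly one of the two fields should be provided.
  if (hasUrl === hasPath) {
    throw new Error("Provide exactly one of image_url or image_path");
  }

  if (hasUrl) {
    // new URL() throws on malformed input; only http(s) counts as public here.
    const protocol = new URL(input.image_url as string).protocol;
    if (protocol !== "http:" && protocol !== "https:") {
      throw new Error("image_url must be an http(s) URL");
    }
    return "url";
  }

  // The README asks for an absolute local path.
  if (!path.isAbsolute(input.image_path as string)) {
    throw new Error("image_path must be an absolute path");
  }
  return "path";
}

console.log(checkImageInput({ image_url: "https://example.com/cat.png" }));
console.log(checkImageInput({ image_path: "/tmp/cat.png" }));
```

In normal use Pi picks the input for you, so a check like this matters mainly when calling the tools manually.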
License
MIT
