# hubot-ollama
Hubot script for integrating with Ollama - run local or cloud LLMs in your chat.

## Quick Start
- Install Ollama and pull a model:

  ```sh
  # Install Ollama from https://ollama.com
  ollama pull llama3.2
  # Ollama server starts automatically on macOS/Windows.
  # On Linux, start manually (or enable the systemd service):
  # ollama serve
  # sudo systemctl enable --now ollama
  ```

- Add this package to your Hubot:

  ```sh
  npm install hubot-ollama --save
  ```

  Then add to `external-scripts.json`:

  ```json
  ["hubot-ollama"]
  ```

- Start chatting:

  ```
  hubot ask what is an LLM?
  hubot ollama explain async/await
  hubot llm write a haiku about databases
  ```
## Commands
| Pattern | Example | Notes |
|---------|---------|-------|
| `hubot ask <prompt>` | `hubot ask what is caching?` | Primary documented command |
| `hubot ollama <prompt>` | `hubot ollama summarize HTTP` | Alias |
| `hubot llm <prompt>` | `hubot llm list json benefits` | Alias |
Prompts are always sanitized, and truncated when they exceed the configured limit.
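As a rough illustration of what that means (the function and regex here are a sketch, not the package's actual code):

```js
// Sketch of prompt sanitization: strip control characters, then truncate
// to HUBOT_OLLAMA_MAX_PROMPT_CHARS.
const MAX_CHARS = parseInt(process.env.HUBOT_OLLAMA_MAX_PROMPT_CHARS || '2000', 10);

function sanitizePrompt(raw) {
  const clean = raw.replace(/[\x00-\x1F\x7F]/g, ' ').trim(); // drop control chars
  return clean.length > MAX_CHARS ? clean.slice(0, MAX_CHARS) : clean;
}
```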
## Configuration
| Variable | Required | Default | Purpose |
|----------|----------|---------|---------|
| `HUBOT_OLLAMA_MODEL` | Optional | `llama3.2` | Model name (validated: `[A-Za-z0-9._:-]+`) |
| `HUBOT_OLLAMA_HOST` | Optional | `http://127.0.0.1:11434` | Ollama server URL |
| `HUBOT_OLLAMA_API_KEY` | Optional | (unset) | API key for Ollama cloud access |
| `HUBOT_OLLAMA_SYSTEM_PROMPT` | Optional | Built-in concise chat prompt | Override system instructions |
| `HUBOT_OLLAMA_MAX_PROMPT_CHARS` | Optional | 2000 | Truncate overly long user prompts |
| `HUBOT_OLLAMA_TIMEOUT_MS` | Optional | 60000 (60 sec) | Abort request after this duration |
| `HUBOT_OLLAMA_CONTEXT_TTL_MS` | Optional | 600000 (10 min) | Time to maintain conversation history; 0 to disable |
| `HUBOT_OLLAMA_CONTEXT_TURNS` | Optional | 5 | Maximum number of conversation turns to remember |
| `HUBOT_OLLAMA_CONTEXT_SCOPE` | Optional | `room-user` | Context isolation: `room-user`, `room`, or `thread` |
| `HUBOT_OLLAMA_WEB_ENABLED` | Optional | `false` | Enable web-assisted workflow that can search/fetch context |
| `HUBOT_OLLAMA_WEB_MAX_RESULTS` | Optional | 5 | Max search results to use (capped at 10) |
| `HUBOT_OLLAMA_WEB_FETCH_CONCURRENCY` | Optional | 3 | Parallel fetch concurrency |
| `HUBOT_OLLAMA_WEB_MAX_BYTES` | Optional | 120000 | Max bytes per fetched page used in context |
| `HUBOT_OLLAMA_WEB_TIMEOUT_MS` | Optional | 45000 | Timeout for the web phase per fetch |
Change model:

```sh
export HUBOT_OLLAMA_MODEL=mistral
```

Connect to remote Ollama server:

```sh
export HUBOT_OLLAMA_HOST=http://my-ollama-server:11434
```

Use Ollama cloud (requires API key):

```sh
export HUBOT_OLLAMA_HOST=https://ollama.com
export HUBOT_OLLAMA_API_KEY=your_api_key
export HUBOT_OLLAMA_MODEL=gpt-oss:120b  # Use a cloud model
# Note: The host should match what `ollama signin` configures.
# Some environments may use a region-specific or API-prefixed host.
# Check `ollama signin --verbose` if unsure.
```

Custom system prompt:

```sh
export HUBOT_OLLAMA_SYSTEM_PROMPT="You are terse; answer in <=200 chars."
```

Adjust conversation memory:

```sh
# Keep 10 turns for 30 minutes, shared across the room
export HUBOT_OLLAMA_CONTEXT_TURNS=10
export HUBOT_OLLAMA_CONTEXT_TTL_MS=1800000
export HUBOT_OLLAMA_CONTEXT_SCOPE=room
```

## Examples
```
hubot ask explain vector embeddings
hubot llm generate a short motivational quote
hubot ollama compare sql vs nosql
```

## Web-Enabled Workflow
When `HUBOT_OLLAMA_WEB_ENABLED=true` and the connected Ollama host supports web tools, the bot registers `hubot_ollama_web_search` and the LLM can invoke it directly. The flow is:
- Phase 1: The model chooses whether to call `hubot_ollama_web_search`.
- Phase 2: The tool performs `webSearch`, fetches top results in parallel, builds a compact context block, and returns it.
- Phase 3: The model incorporates the returned context into its final reply.
- The bot sends a status message when the search is running and skips duplicate web searches in the same interaction.
Enable:
```sh
export HUBOT_OLLAMA_WEB_ENABLED=true
export HUBOT_OLLAMA_WEB_MAX_RESULTS=5
export HUBOT_OLLAMA_WEB_FETCH_CONCURRENCY=3
export HUBOT_OLLAMA_WEB_MAX_BYTES=120000
export HUBOT_OLLAMA_WEB_TIMEOUT_MS=45000
```

## Tool Integration
The bot uses a two-call LLM workflow to enable tools when supported by the model:
- Phase 1: Model decides if a tool is needed to answer the question
- Phase 2: Tool is executed (if selected) and results are captured
- Phase 3: Model incorporates tool results into a natural conversational response
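As a rough sketch of the idea using the official ollama JavaScript client (simplified, and not this package's actual implementation; `runTool` stands in for a hypothetical tool dispatcher):

```js
// Simplified sketch of the two-call tool workflow.
const { Ollama } = require('ollama');

async function askWithTools(client, model, messages, tools, runTool) {
  // Call 1 (Phase 1): the model decides whether it needs a tool.
  const first = await client.chat({ model, messages, tools });
  const calls = first.message.tool_calls || [];
  if (calls.length === 0) return first.message.content;

  // Phase 2: execute each requested tool and append its result.
  messages.push(first.message);
  for (const call of calls) {
    const output = await runTool(call.function.name, call.function.arguments);
    messages.push({ role: 'tool', content: JSON.stringify(output) });
  }

  // Call 2 (Phase 3): the model folds tool output into a conversational reply.
  const second = await client.chat({ model, messages });
  return second.message.content;
}
```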
Built-in Tools:

- `hubot_ollama_get_current_time` - Returns the current UTC timestamp (always available)
Configuration:
| Variable | Required | Default | Purpose |
|----------|----------|---------|---------|
| `HUBOT_OLLAMA_TOOLS_ENABLED` | Optional | `true` | Enable tool support (`true`/`1` or `false`/`0`) |
Enable tool support (default):

```sh
export HUBOT_OLLAMA_TOOLS_ENABLED=true
```

Disable tool support (useful for models without tool capability):

```sh
export HUBOT_OLLAMA_TOOLS_ENABLED=false
```

How It Works:
- The bot automatically detects whether your selected model supports tools via `ollama show`.
- If tools are enabled AND the model supports them, the two-call workflow activates.
- If the model doesn't support tools or tools are disabled, the bot falls back to a single-call workflow.
- When a tool is invoked, the model can request data (like current time) to enhance its response.
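For reference, a minimal sketch of that detection with the official ollama client might look like this (the `capabilities` field appears in the show response on recent Ollama versions; the package's actual check may differ):

```js
// Sketch: ask the Ollama server whether a model supports tools.
const { Ollama } = require('ollama');

async function modelSupportsTools(host, model) {
  const client = new Ollama({ host });
  const info = await client.show({ model });
  // Recent Ollama versions report capabilities such as 'completion' and 'tools'.
  return Array.isArray(info.capabilities) && info.capabilities.includes('tools');
}
```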
Example Tool Interaction:
```
user> hubot ask what time is it in UTC?
hubot> (Phase 1: Model decides current time is needed)
       (Phase 2: Tool returns: 2025-12-05T14:30:45.123Z)
       (Phase 3: Model incorporates and responds)
       The current UTC time is 2:30:45 PM on December 5, 2025.
```

Registering Custom Tools: Use the tool registry to add your own tools:
```js
const registry = require('hubot-ollama/src/tool-registry');

registry.registerTool('my_tool', {
  name: 'my_tool',
  description: 'A brief description of what this tool does',
  parameters: {
    type: 'object',
    properties: {
      param1: { type: 'string', description: 'The first parameter' }
    }
  },
  handler: async (args, robot, msg) => {
    // args: parsed arguments from the LLM
    // robot: Hubot robot instance
    // msg: current message object; use msg.send() to post progress while the tool runs
    return { result: 'Tool output here' };
  }
});
```
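For instance, a hypothetical dice-rolling tool (the name and behavior here are illustrative, not shipped with the package) could be registered from one of your own Hubot scripts:

```js
// Hypothetical custom tool: roll an n-sided die. Not part of hubot-ollama.
const registry = require('hubot-ollama/src/tool-registry');

registry.registerTool('roll_dice', {
  name: 'roll_dice',
  description: 'Roll a die with the given number of sides and return the result',
  parameters: {
    type: 'object',
    properties: {
      sides: { type: 'number', description: 'Number of sides on the die (default 6)' }
    }
  },
  handler: async (args) => {
    const sides = Math.max(2, Number(args.sides) || 6);
    return { result: 1 + Math.floor(Math.random() * sides) };
  }
});
```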
## Conversation Context

Hubot remembers recent exchanges within the configured scope, allowing natural follow-up questions:

```
alice> hubot ask what are the planets in our solar system?
hubot> Mercury, Venus, Earth, Mars, Jupiter, Saturn, Uranus, Neptune.
alice> hubot ask which is the largest?
hubot> Jupiter is the largest planet in our solar system.
```

Context Scopes:
- `room-user` (default): Each user has separate conversation history per room
- `room`: All users in a room share the same conversation history
- `thread`: Separate history per thread (for Slack-style threading)
Context automatically expires after the configured TTL (default 10 minutes). Set `HUBOT_OLLAMA_CONTEXT_TTL_MS=0` to disable conversation memory entirely.
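For intuition, here is a hypothetical sketch of how per-scope context keys could be derived (not the package's actual key format; `thread_ts` is specific to Slack-style adapters):

```js
// Hypothetical sketch of per-scope context keys; hubot-ollama's real
// key format may differ.
function contextKey(scope, msg) {
  const room = msg.message.room;
  const user = msg.message.user.id;
  const thread = msg.message.thread_ts; // Slack-style threads; adapter-specific
  switch (scope) {
    case 'room':   return `ollama:ctx:${room}`;
    case 'thread': return `ollama:ctx:${room}:${thread || 'main'}`;
    default:       return `ollama:ctx:${room}:${user}`; // room-user (default)
  }
}
```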
## Ollama Cloud
This package supports Ollama's cloud service, which allows you to run larger models that wouldn't fit on your local machine. Cloud models are accessed via the same API but run on Ollama's infrastructure.
### Setup
- Create an account at ollama.com
- Generate an API key
- Run `ollama signin` to register the host with Ollama.com
- Configure your environment:

  ```sh
  export HUBOT_OLLAMA_HOST=https://ollama.com
  export HUBOT_OLLAMA_API_KEY=your_api_key
  export HUBOT_OLLAMA_MODEL=gpt-oss:120b-cloud
  ```
### Available Models
Hubot-Ollama works with any model supported by Ollama, whether running locally or in the cloud. You can switch models on the fly using `HUBOT_OLLAMA_MODEL`, making it easy to choose between speed, size, and capability.
Cloud model availability changes over time. Check the model catalog at Ollama.com for the latest list.
Both local and cloud models share the same API, making the integration seamless regardless of where your model runs.
Note: Cloud models require network connectivity and count against your cloud usage. Local models remain free and private.
## Error Handling
| Situation | User Message |
|-----------|--------------|
| Ollama server unreachable | "Cannot connect to Ollama server" message |
| Model missing | Suggests `ollama pull <model>` |
| Empty response | Specific empty-response notice |
| Timeout | Indicates the configured timeout elapsed |
| API error | Surfaces the error message |
## Security & Safety
- Uses the official Ollama JavaScript library with proper API communication.
- Model name validation & prompt sanitization (control characters stripped).
- When `HUBOT_OLLAMA_WEB_ENABLED=true`, web search results are fetched from external sites. Only the page fetches, not your private prompts, go over the network.
## Troubleshooting
| Symptom | Check |
|---------|-------|
| No response | Check Hubot logs for errors; verify the Ollama server is accessible |
| Connection refused | Ensure the Ollama server is running (`ollama serve` or the daemon) |
| Model not found | Run `ollama list` to see available models, then `ollama pull <model>` |
| Wrong server | Set `HUBOT_OLLAMA_HOST=http://your-server:11434` |
| Long delays | Try increasing `HUBOT_OLLAMA_TIMEOUT_MS` or use a faster model |
| Web tools not running | The connected Ollama host must support `webSearch`/`webFetch`; the feature auto-skips when unavailable |
| No search performed | The model decided a web search was unnecessary; ask for one explicitly or disable the web workflow |
| Error: unauthorized | If using a cloud model, run `ollama signin` to register the host |
| Other cloud auth issues | Verify your `HUBOT_OLLAMA_API_KEY` is valid at ollama.com/settings/keys |
## Development
Run tests & lint:

```sh
npm test
npm run lint
```

## License
MIT
