# @llmist/cli
Command-line interface for llmist - run LLM agents from the terminal.
## Installation
```bash
npm install -g @llmist/cli
# or run directly with npx:
npx @llmist/cli
```

## Quick Start
```bash
# Set your API key
export OPENAI_API_KEY="sk-..."

# Quick completion
llmist complete "Explain TypeScript generics in one paragraph"

# Run an agent with gadgets
llmist agent "Search for files" --gadgets ./my-gadgets/

# Interactive chat
llmist chat
```

## Commands
| Command | Description |
|---------|-------------|
| `complete <prompt>` | One-shot LLM completion |
| `agent <prompt>` | Run agent with gadgets |
| `chat` | Interactive chat session |
| `tui` | Launch terminal UI |
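
Because `complete` is a one-shot command, it slots naturally into shell scripts and pipelines. A minimal sketch (the file names are placeholders, and writing the completion to stdout is an assumption based on typical CLI behavior):

```bash
# Summarize a local file in one shot; notes.txt and summary.txt are illustrative
llmist complete "Summarize the following notes: $(cat notes.txt)" > summary.txt
```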
## Using Gadgets
Load gadgets from various sources:
```bash
# Local directory
llmist agent "Do something" --gadgets ./gadgets/

# npm package
llmist agent "Search the web" --gadgets dhalsim/BrowseWeb

# Git URL
llmist agent "Process files" --gadgets github:user/repo
```

## Configuration
Create a `llmist.toml` file for reusable configurations:

```toml
[agent]
model = "sonnet"
system = "You are a helpful assistant"

[gadgets]
paths = ["./gadgets"]
external = ["dhalsim/BrowseWeb"]

[display]
markdown = true
colors = true
```

Use with:

```bash
llmist agent "Do something" --config ./llmist.toml
```

## Rate Limiting
The llmist CLI enables conservative rate limiting by default to keep you under provider API limits and prevent agents from crashing mid-run.
### Default Behavior
Rate limits are automatically configured based on your model's provider:
| Provider  | RPM | TPM       | Daily Tokens |
|-----------|-----|-----------|--------------|
| Anthropic | 50  | 40,000    | -            |
| OpenAI    | 3   | 40,000    | -            |
| Gemini    | 15  | 1,000,000 | 1,500,000    |
These defaults are conservative (they protect free-tier users); paid-tier users should configure higher limits.
### Configuration
**TOML config** (`~/.llmist/cli.toml` or a project-level `llmist.toml`):
```toml
# Global rate limits (applies to all commands)
[rate-limits]
enabled = true
requests-per-minute = 100
tokens-per-minute = 200_000
safety-margin = 0.8  # Throttle at 80% of limit
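# With the values above, throttling should therefore kick in near
# 80 requests and 160,000 tokens per minute (0.8 x each limit).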

# Profile-specific overrides
[profile-gemini]
model = "gemini:flash"

[profile-gemini.rate-limits]
requests-per-minute = 15
tokens-per-day = 1_500_000

# Disable rate limiting for a profile
[profile-fast]
model = "gpt4o"

[profile-fast.rate-limits]
enabled = false
```

**CLI flags** (override all config):
```bash
# Override limits
llmist agent --rate-limit-rpm 100 --rate-limit-tpm 200000 "your prompt"

# Disable rate limiting
llmist agent --no-rate-limit "your prompt"

# Configure retry behavior
llmist agent --max-retries 5 --retry-min-timeout 2000 "your prompt"

# Disable retry
llmist agent --no-retry "your prompt"
```

### TUI Feedback
The Terminal UI provides real-time feedback when rate limiting is active:
- **Status Bar**: Shows `⏸ Throttled Xs` when waiting for rate limits
- **Status Bar**: Shows `🔄 Retry 2/3` during retry attempts
- **Conversation Log**: Persistent entries like `⏸ Rate limit approaching (45 RPM, 85K TPM), waiting 5s...` and `🔄 Request failed (attempt 1/3), retrying...`
### Finding Your Tier Limits
To configure optimal limits for your API tier:
- Anthropic: Rate Limits Documentation
- OpenAI: Rate Limits Guide
- Gemini: Quota Documentation
Check your provider dashboard for current tier limits, then update your llmist.toml accordingly.
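
For example, if your dashboard shows paid-tier limits, you might raise the global defaults along these lines (the numbers below are placeholders, not recommendations; substitute your own tier's values):

```toml
[rate-limits]
enabled = true
requests-per-minute = 1000   # placeholder; use your tier's RPM
tokens-per-minute = 400_000  # placeholder; use your tier's TPM
safety-margin = 0.9
```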
## Terminal UI
The TUI provides an interactive interface to browse execution history, inspect raw payloads, and debug agent runs:
```bash
llmist tui
```

## Documentation
Full documentation at llmist.dev/cli
## Related Packages
- `llmist` - Core library
- `@llmist/testing` - Testing utilities
## License
MIT
