@devang0907/agent-dev

v0.4.8

Published

37 minutes ago

Minimal terminal coding agent

0High
0Medium
0Low

devang0907

ai agent cli opentui coding-agent

agent-dev

A minimal terminal coding agent with an OpenTUI + SolidJS terminal UI. Chat with an AI that can read and edit code, search the web, run git/shell commands (with approval), use MCP servers, load skills, schedule Telegram reminders and daily tasks, and optionally delegate work through a boss orchestrator that coordinates specialized worker agents.

Quick start

Requires Bun 1.2+.

From source:

git clone https://github.com/Devang0907/agent-dev.git
cd agent-dev
bun install
bun run dev

Global install (npm):

npm install -g @devang0907/agent-dev
agent

Set at least one API key:

export OPENROUTER_API_KEY=sk-or-...   # Free models (default)
export OPENAI_API_KEY=sk-...          # ChatGPT
export ANTHROPIC_API_KEY=sk-ant-...   # Claude
export GROQ_API_KEY=gsk_...           # Groq
export GEMINI_API_KEY=...             # Google Gemini

Requires Bun 1.2+ (Bun must be on your PATH for the agent command).

Providers

| Provider | Env var | Example models | |----------|---------|----------------| | OpenAI (ChatGPT) | OPENAI_API_KEY | gpt-4o, gpt-4.1, o4-mini | | Anthropic (Claude) | ANTHROPIC_API_KEY | claude-sonnet-4-6, claude-opus-4-8, claude-haiku-4-5-20251001 | | Groq | GROQ_API_KEY | llama-3.3-70b-versatile, llama-3.1-8b-instant | | Google Gemini | GEMINI_API_KEY or GOOGLE_API_KEY | gemini-2.5-pro, gemini-2.5-flash | | Free (OpenRouter) | OPENROUTER_API_KEY | See free models below |

Model refs use provider/model-id, e.g. anthropic/claude-sonnet-4-6, openai/gpt-4.1, or free/openrouter/free.

Free models (OpenRouter)

Free models use the :free suffix on OpenRouter. Availability changes often — agent-dev migrates retired slugs and retries fallbacks automatically. If a model fails, use /model openrouter/free (auto-picks a working free model).

| Model | Slug | |-------|------| | Free auto (recommended) | openrouter/free | | Llama 3.3 70B (free) | meta-llama/llama-3.3-70b-instruct:free | | Qwen3 Coder (free) | qwen/qwen3-coder:free | | Qwen3 Next 80B (free) | qwen/qwen3-next-80b-a3b-instruct:free | | GPT-OSS 120B (free) | openai/gpt-oss-120b:free | | Gemma 4 26B (free) | google/gemma-4-26b-a4b-it:free |

Retired slugs like deepseek/deepseek-r1:free and qwen/qwen3-235b-a22b:free are remapped automatically on load.

Default provider/model: free/meta-llama/llama-3.3-70b-instruct:free.

CLI

bun run dev                                    # Interactive TUI
bun run dev -- -p "List files in src"          # Print mode (no TUI)
bun run dev -- --boss                          # Start in boss orchestrator mode
bun run dev -- --boss -p "refactor auth module" # Boss mode, print and exit
bun run dev -- -c                              # Continue last session
bun run dev -- --model groq/llama-3.3-70b-versatile "hello"
bun run build && bun start                     # Production build
bun run test                                   # Run test suite
bun run test:watch                             # Watch mode
bun run test:coverage                          # Coverage report

| Flag | Description | |------|-------------| | -p, --print | Print response and exit | | -c, --continue | Resume the most recent session | | --boss | Enable boss orchestrator mode | | --model <ref> | Provider/model (e.g. openai/gpt-4o) | | -h, --help | Show help |

Skills subcommand:

agent skills add vercel-labs/agent-skills
agent skills add vercel-labs/agent-skills -g    # global
agent skills find react
agent skills list

Telegram gateway

Chat with agent-dev from your phone via Telegram. The gateway runs on your PC, uses long-polling (no public URL or port forwarding), and forwards DMs to the agent. Shell/git/exec approvals arrive as Approve / Deny inline buttons. You can also set reminders and daily tasks (e.g. morning news) that fire while the gateway is running.

On first connect, send /start (or any message) to receive a welcome guide with available commands and capabilities.

Setup

Create a bot — open @BotFather in Telegram, run /newbot, save the token.
Configure — use /connect in the TUI (recommended), or add to ~/.agent-dev/settings.json manually:

{
  "telegram": {
    "botToken": "123456789:ABCdef...",
    "allowedUserIds": [987654321],
    "workdir": "D:/projects/MyRepo"
  }
}

Or use environment variables:

export TELEGRAM_BOT_TOKEN=123456789:ABCdef...
export TELEGRAM_ALLOWED_USER_IDS=987654321

TUI setup (/connect): in the interactive agent, run /connect to open the gateway setup screen. Choose Telegram, paste your botToken, enter allowedUserIds as comma-separated numbers (e.g. 123456789), then save. Values are written to telegram.botToken and telegram.allowedUserIds in ~/.agent-dev/settings.json. More gateways can be added here later.

Find your Telegram user ID — start the gateway, DM your bot, send /whoami. If the allowlist is empty, check the gateway console logs for Rejected message from user <id>.
Add your ID to allowedUserIds, then restart the gateway.

Run

bun run dev -- telegram --workdir D:/projects/MyRepo
# or after build:
agent telegram --workdir D:/projects/MyRepo
agent telegram --boss --verbose

Keep the process running while you use the bot. While the gateway runs, the terminal shows a live log of Telegram activity:

[user] (123456789) list files in src
[agent] Here are the files in src/...
[tool] Running: grep pattern
[approval] npm test
[approval] Approved

On Windows, run it in a dedicated terminal, or use Task Scheduler / pm2 to keep it alive on login.

Telegram commands

| Command | Description | |---------|-------------| | /start | Welcome guide — commands, tools, project path (first connect) | | /help | Show the welcome guide again | | /whoami | Show your numeric Telegram user ID | | /new | Start a new agent session | | /status | Model, workdir, session id, busy/idle | | /stop | Abort the current turn | | /build | Switch to Build mode (edit files, run shell) | | /plan | Switch to Plan mode (read-only exploration) | | /boss | Toggle boss orchestrator mode | | /boss on / /boss off | Enable or disable boss mode | | /mode | Show current mode and available options | | /model | List available models | | /model <provider/id> | Switch model (e.g. /model groq/llama-3.3-70b-versatile) | | /schedules | List active reminders and daily tasks for this chat | | /compact | Summarize older messages to free context |

Reminders and scheduled tasks

While the Telegram gateway is running, you can ask the agent to remind you later or run recurring tasks (e.g. morning news). The agent uses the built-in schedule tool; a background scheduler checks every 30 seconds and delivers due items to your chat.

One-shot reminders — simple notifications at a relative or absolute time:

Remind me to drink water in 5 minutes
Remind me to call mom in 2 hours

Daily recurring tasks — the agent runs your instruction each day (uses web_search for news, etc.):

Send me the top news headlines every morning at 8:00
Give me a daily weather summary at 7:30

| Schedule kind | When it fires | |---------------|---------------| | reminder | Sends ⏰ Reminder: … in Telegram | | task | Runs the agent with your message (good for news, reports) |

Timing options the agent can set: in_minutes (e.g. 5), daily_at in 24h local time (e.g. 08:00), or at (ISO datetime). Schedules are stored in ~/.agent-dev/schedules.json. Ask the agent to cancel a schedule by id, or use /schedules to list them.

Requirements: the gateway process must stay running (see Run above). If a task fires while the agent is busy, it retries in about a minute. Reminders always send immediately.

While the agent is busy, Telegram accepts one queued follow-up message per chat; additional messages get a busy reply until the queue drains.

Security

Only users in allowedUserIds can chat with the agent (except /whoami for setup).
The bot can run shell commands on your PC after you approve them — treat the bot token like a password.
Do not commit settings.json with tokens to git.

Interactive commands

| Command | Description | |---------|-------------| | /model, /m [search] | Open model selector (grouped by provider) | | /build | Switch to Build mode (full tool access) | | /plan | Switch to Plan mode (read-only exploration) | | /boss | Toggle boss orchestrator mode | | /tasks | Show the active task plan | | /compact [instructions] | Summarize older messages to free context (optional focus) | | /rules | List loaded project rule files (AGENTS.md, etc.) | | /permissions | Show merged permission presets for this project | | /trace | Show path to the latest worker trace log | | /sessions | Browse and load saved chat sessions | | /settings | Thinking level and API key status | | /connect | Configure gateway connection (Telegram bot token and allowed user IDs) | | /skills | Browse and install skills (Vercel CLI) | | /skill <name> | Load a skill for the current turn | | /new | Start a new session | | /quit | Exit |

Keyboard shortcuts:

Tab / Shift+Tab — cycle Build ↔ Plan when input is empty
Esc — abort a running turn
Ctrl+G — scroll chat to latest
Ctrl+U / Ctrl+D — scroll chat up/down

Context compaction

Long sessions can exceed model context limits. agent-dev tracks approximate token usage (footer shows ctx 42k/128k) and compacts older history when needed.

| Trigger | Behavior | |---------|----------| | Auto | Before each turn when tokens > contextWindow - reserveTokens (default reserve 16k) | | Overflow | On provider context errors — compact once and retry the turn | | Manual | /compact or /compact focus on auth changes |

Compaction is lossy for the model but non-destructive on disk: full chat history stays in sessions/*.jsonl; a compaction entry records the summary and which messages the LLM still sees.

Toggle auto-compaction in /settings or settings.json:

{
  "compaction": {
    "enabled": true,
    "reserveTokens": 16384,
    "keepRecentTokens": 20000
  }
}

Telegram: /compact (same as TUI).

Project rules

Project rules are markdown instructions injected into the system prompt automatically from AGENTS.md and related files.

Discovery order (root → specific; all matching files are concatenated):

| Priority | Path | |----------|------| | 1 | ~/.agent-dev/AGENTS.md | | 2 | AGENTS.md or CLAUDE.md from git root down to cwd | | 3 | <workdir>/.agent-dev/AGENTS.md | | 4 | <workdir>/.agent-dev/rules/*.md (sorted by filename) |

Use /rules in the TUI to see which files were loaded. Total injected size is capped (default 32k characters).

Disable with AGENT_NO_PROJECT_RULES=1 or in settings.json:

{
  "projectRules": {
    "enabled": false,
    "maxChars": 32768
  }
}

Boss worker sessions do not inherit project rules in v1 (main agent loop only).

Permission presets

Gated tools (shell, verify, git writes, DB mutations, MCP call_tool, destructive browser actions) normally prompt for approval. Optional files rules gate write/edit when configured. Permission presets let you allow, ask, or deny by pattern.

Config (project patterns append to global; last matching rule wins):

| Scope | Path | |-------|------| | Global | ~/.agent-dev/settings.json → permissions | | Project | <workdir>/.agent-dev/permissions.json |

Example permissions.json:

{
  "bash": {
    "*": "ask",
    "npm test": "allow",
    "npm test *": "allow",
    "rm *": "deny"
  },
  "git": {
    "commit": "ask",
    "push *": "deny"
  },
  "database": { "*": "ask", "SELECT *": "allow" },
  "mcp": { "call_tool": "ask" },
  "browser": { "*": "ask" },
  "files": { "*": "ask", ".agent-dev/plans/*": "allow" }
}

verify uses the same bash permission rules (e.g. npm test: allow applies to both bash and verify).

The files category is optional — without files rules, write and edit are allowed without prompting. Add files rules for paranoid setups.

Shorthand: "bash": "ask" expands to { "*": "ask" }. Put "*": "ask" first, then more specific patterns after.

Use /permissions to inspect merged rules. /settings shows a rule count summary.

Read-only git commands and SELECT queries stay allowed regardless of rules.

Agent modes

Build and Plan

Switch between Build and Plan mode from the prompt footer, slash commands, or Tab:

| Mode | Toggle | Behavior | |------|--------|----------| | Build (default) | Tab / /build | Full tool access — edit files, run shell, verify | | Plan | Tab / /plan | Read-only — explore code, research, write plans to .agent-dev/plans/*.md |

Current mode is shown in the prompt footer (Build or Plan)
Plan mode blocks write/edit/bash/verify/database/MCP; git write actions are denied
Switch Plan → Build before implementing; the agent gets a reminder to execute the plan

Boss orchestrator (opt-in)

Boss mode adds a hierarchical orchestrator on top of the normal agent loop. The boss plans work, delegates to specialized workers sequentially, monitors their progress, and synthesizes a final answer.

Enable with --boss or /boss. The footer shows BOSS in purple when active.

User → Boss (plan + delegate)
         ├─ explore worker   (read-only research)
         ├─ implement worker (code changes)
         ├─ shell worker     (commands & tests)
         └─ plan worker      (architecture docs)

| Worker | Role | Tools | |--------|------|-------| | explore | Read-only codebase research | read, grep, git, docs, browser | | implement | Focused code changes | read, write, edit, diff, grep, verify | | shell | Commands and tests | bash, exec, verify | | plan | Planning documents | plan, read, grep |

The boss only has plan and delegate tools — it does not edit files or run shell commands directly. Worker activity is streamed in the TUI and logged to ~/.agent-dev/traces/<sessionId>/<runId>.jsonl. Use /trace to find the latest log.

Boss mode uses the same model you select in /model. Workers run in isolated contexts with scoped task briefs.

Tools

The agent has 20 built-in tools (delegate is boss-only; 19 are available in normal mode):

| Tool | Description | |------|-------------| | read | Read a file in the project directory | | list_dir | List files and directories (use instead of shell ls) | | write | Create or overwrite a file | | edit | Replace text in a file | | diff | Preview unified diff before applying changes | | grep | Search codebase (ripgrep; on Windows uses findstr, then PowerShell) | | git | Git status, diff, log, commit, etc. (writes need approval) | | bash | Run a shell command — requires approval | | exec | Structured shell command (cmd array) — requires approval | | web_search | Search the internet (DuckDuckGo / Google News) | | docs | Look up npm READMEs, MDN, or fetch a docs URL | | memory | Store/recall long-term facts in ~/.agent-dev/memory.json | | plan | Create and track multi-step task plans (supports assignee, parent task, run id) | | delegate | Boss only — spawn a worker agent for a focused subtask | | database | Run SQL on SQLite files (mutations need approval) | | verify | Auto-run tests/build from package.json scripts | | mcp | Call tools from MCP servers (see below) | | skill | Load a skill by name from available_skills | | schedule | Schedule Telegram reminders and daily recurring tasks (gateway must be running) | | browser | Control a real Chromium browser (navigate, click, extract, screenshot) |

File operations are restricted to the current working directory. Shell commands, git writes, SQL mutations, MCP tool calls, and destructive browser actions prompt for approval (y / n). In boss mode, approval prompts show which worker requested the action.

Browser automation

Requires a one-time browser install:

bunx playwright install chromium

The browser runs visible by default so you can watch the agent. Session state (tabs, cookies) persists across tool calls within a chat session.

| Action | Description | |--------|-------------| | open | Launch browser and open URL in a new tab | | goto | Navigate active tab to URL | | click / type / select / check | Interact with page elements (CSS selectors) | | waitFor | Wait for selector to appear | | getPageContent | Inspect page — URL, text, interactive elements | | extract | Read text/value from a selector | | screenshot | Save PNG to ~/.agent-dev/screenshots/ | | listTabs / switchTab / newTab / closeTab | Multi-tab control | | waitForUser | Pause for CAPTCHA, OTP, or manual payment | | close | Close browser |

Optional settings in ~/.agent-dev/settings.json:

{
  "browser": {
    "headless": false,
    "actionTimeoutMs": 30000
  }
}

Safety: Purchases, booking confirmations, and payment steps require approval or manual user action. The agent pauses automatically on CAPTCHA/OTP/payment fields. Load the browser-automation skill for multi-step workflows.

Plan mode: Read-only browser actions only (goto, extract, getPageContent, screenshot). Form interactions require Build mode.

MCP configuration

Add servers to ~/.agent-dev/mcp.json:

{
  "servers": {
    "filesystem": {
      "command": "npx",
      "args": ["-y", "@modelcontextprotocol/server-filesystem", "D:/projects"]
    }
  }
}

Use the mcp tool with list_servers, list_tools, and call_tool actions.

Skills

Skills use the Vercel Agent Skills format (compatible with the Vercel skills CLI and Cursor).

Install skills:

agent skills add vercel-labs/agent-skills
agent skills add vercel-labs/agent-skills -g          # global
agent skills find react
agent skills list

In the TUI, run /skills and press a to install from a repo (press o to open the catalog in your browser).

Browse what's available:

skills.sh — searchable directory of community + Vercel skills
Vercel Agent Skills docs — official curated list
agent skills find <query> — search from the terminal

Discovery paths (later entries override same name):

| Scope | Path | |-------|------| | Global (Vercel CLI) | ~/.config/agents/skills/ | | Global (compat) | ~/.agents/skills/ | | Project | .agents/skills/ (walk up to git root) | | Agent config | ~/.agent-dev/skills/ | | Custom | skills.paths in settings.json |

The agent sees an <available_skills> catalog in its system prompt and loads full instructions with the skill tool (or /skill <name> in chat).

Filter skills in ~/.agent-dev/settings.json:

{
  "skills": {
    "enabled": ["vercel-react-best-practices"],
    "disabled": ["canvas"],
    "paths": ["~/team-skills"]
  }
}

Configuration

All config lives under ~/.agent-dev/ (override with AGENT_DEV_DIR):

| Path | Purpose | |------|---------| | settings.json | Default provider/model, thinking level, agent mode, orchestrator mode, API keys, skills, permissions, project rules | | sessions/*.jsonl | Chat history (one file per session) | | last-session.json | Pointer to resume with -c | | memory.json | Cross-session memory | | plan.json | Active task plan | | mcp.json | MCP server definitions | | traces/<sessionId>/ | Boss worker trace logs (JSONL) | | telegram-sessions.json | Telegram chat → session id mapping | | schedules.json | Active reminders and daily scheduled tasks |

Example settings.json:

{
  "defaultProvider": "free",
  "defaultModel": "meta-llama/llama-3.3-70b-instruct:free",
  "thinkingLevel": "off",
  "agentMode": "build",
  "orchestratorMode": "off",
  "compaction": {
    "enabled": true,
    "reserveTokens": 16384,
    "keepRecentTokens": 20000
  },
  "permissions": {
    "bash": { "*": "ask", "npm test *": "allow" }
  },
  "projectRules": {
    "enabled": true
  }
}

Set orchestratorMode to "boss" to enable boss mode by default.

Thinking level (thinkingLevel in settings or /settings) enables extended reasoning on supported models: Claude Sonnet/Opus 4+, OpenAI o3/o4-mini, and Gemini 2.5+. Other providers ignore it.

Environment variables

| Variable | Description | |----------|-------------| | OPENROUTER_API_KEY | OpenRouter (free tier models) | | OPENAI_API_KEY | OpenAI | | ANTHROPIC_API_KEY | Anthropic (Claude) | | GROQ_API_KEY | Groq | | GEMINI_API_KEY / GOOGLE_API_KEY | Google Gemini | | AGENT_DEV_DIR | Config directory (default ~/.agent-dev) | | AGENT_MAX_TOOL_ROUNDS | Max tool-call rounds per turn (default 50) | | AGENT_MAX_DELEGATIONS | Max worker delegations per boss turn (default 10) | | AGENT_COMPACTION_ENABLED | 0/false to disable auto-compaction | | AGENT_COMPACTION_RESERVE_TOKENS | Tokens reserved for model response (default 16384) | | AGENT_NO_PROJECT_RULES | 1 to disable project rules injection | | TELEGRAM_BOT_TOKEN | Telegram bot token (overrides settings) | | TELEGRAM_ALLOWED_USER_IDS | Comma-separated Telegram user IDs |

Architecture

agent-dev is a single-agent ReAct loop — no LangGraph or external agent framework. The core loop streams an LLM response, executes tool calls, and repeats until done.

Boss mode nests additional runAgentLoop instances inside the delegate tool, following the orchestrator-workers pattern: the boss dynamically decomposes tasks and delegates to workers with isolated context.

src/
├── agent/
│   ├── loop.ts              # Core ReAct loop
│   ├── session.ts           # Session state, events, boss routing
│   ├── orchestrator/        # Boss prompt, workers, traces
│   └── tools/               # Built-in tool implementations
├── gateway/
│   ├── telegram/            # Telegram bot daemon (grammY)
│   └── scheduler.ts         # Fires due reminders and daily tasks
├── providers/               # OpenAI, Groq, Gemini, OpenRouter
├── tui/                     # OpenTUI + SolidJS terminal UI
└── modes/print-mode.ts      # Headless / CI output

License

MIT