@praveencs/agent

v0.9.21

Published

9 hours ago

CLI agent runtime with Skill Hub, Plan Files, and permissioned tools

Downloads

4,014

0High
0Medium
0Low

praveencs

agent cli automation skills mcp llm

🤖 Agent Runtime

An autonomous, goal-oriented AI agent runtime with an interactive CLI, plugin ecosystem, and self-improvement capabilities.

$ agent

  ╭────────────────────────────────────────────────╮
  │  🤖 Agent Runtime v0.9.8                       │
  │    Project: my-app                              │
  │    Model: gpt-4o │ 3 skills │ 2 commands        │
  ╰────────────────────────────────────────────────╯

  Type a goal, a /command, or /help for help.

  > Refactor the auth module to use JWT
  ⠋ Thinking...
  ⚡ fs.read(src/auth/handler.ts) ✓
  ⚡ fs.write(src/auth/jwt.ts) ✓
  ⚡ cmd.run(npm test) ✓

  ✓ Done (12.3s)

  > /deploy-staging
  Running command: deploy-staging...

✨ Features

| Category | Capabilities | |----------|-------------| | 🤖 Interactive CLI | Conversational REPL with multi-turn context, slash commands, and tab completion | | 🧠 Goal Decomposition | LLM-powered breakdown of complex objectives into dependency-aware task graphs | | ⚡ Autonomous Execution | Background daemon processes tasks with retries, rollback, and verification | | 🛠️ Extensible Skills | Markdown-based skill definitions—install from a hub or write your own | | ⚡ Lightweight Commands | Quick goal templates as markdown files—no boilerplate needed | | 📜 Scripts System | Repeatable local tasks defined in script.yaml for direct execution | | 🪝 Lifecycle Hooks | Intercept execution at 10 event points (before:tool, after:plan, etc.) | | 🔌 Plugin System | Bundle skills, commands, scripts, and hooks into distributable packages | | 🔧 Multi-CLI Orchestration | Delegate tasks to Cursor, Codex, Gemini, or Claude CLIs | | 💾 Persistent Memory | SQLite + FTS5 semantic memory across sessions | | ❤️ Self-Improvement | Monitors skill metrics and auto-patches failing skills | | 📊 Reporting | Daily standup reports and AI-generated executive summaries | | 🔒 Policy Engine | Permission-gated tool execution with human-in-the-loop approval |

📦 Installation

npm install -g @praveencs/agent

Quick Start

# Initialize project configuration
agent init

# Launch interactive mode (recommended)
agent

# Or run a one-off goal
agent run "Add input validation to the signup form"

Configuration

After agent init, a .agent/ directory is created in your project with configuration, skills, commands, and hooks. Set your LLM provider API keys:

# Set via environment variables
export OPENAI_API_KEY=sk-...
export ANTHROPIC_API_KEY=sk-ant-...

# Or configure directly
agent config --init

📖 Usage Guide

1. Interactive Mode (Recommended)

Type agent with no arguments to enter the Interactive REPL:

agent

You get a bordered welcome banner showing your project, model, and loaded extensions. Then just type naturally:

  > Add rate limiting to the /api/auth endpoint
  > Now write tests for it
  > /deploy-staging

The agent remembers context across turns—no need to repeat yourself.

Slash Commands

| Command | Action | |---------|--------| | /help | Show all available commands | | /skills | List installed skills with status | | /commands | List available lightweight commands | | /scripts | List available local scripts | | /hooks | Show registered lifecycle hooks | | /model | Display current model and provider info | | /compact | Summarize conversation and free context | | /clear | Clear the terminal screen | | /exit | Exit interactive mode |

Custom commands from .agent/commands/ are also available as slash commands (e.g., /deploy-staging).

Tab completion works on all slash commands—press Tab after /.

2. One-Shot Mode

Run a single goal without entering the REPL:

agent run "Refactor the database module to use connection pooling"
agent run "Fix all TypeScript errors in the project"
agent run deploy-staging          # Runs a named Command or Skill

3. Skills

Skills are reusable capabilities defined by markdown prompts and a skill.json manifest.

# List installed skills
agent skills list

# Search the skill hub
agent skills search "docker"

# Install a skill
agent skills install <skill-name>

# Create a custom skill
agent skills create my-new-skill
# → Creates .agent/skills/my-new-skill/prompt.md

# Self-healing
agent skills stats               # View success rates
agent skills doctor my-skill     # Diagnose failures
agent skills fix my-skill        # Auto-repair with LLM

4. Lightweight Commands

Commands are quick goal templates—just a markdown file. No skill.json needed.

Create .agent/commands/deploy-staging.md:

---
name: deploy-staging
description: Deploy current branch to staging
tools: [cmd.run, git.status, git.diff]
---
# Deploy to Staging

Steps:
1. Run `npm test` to verify all tests pass
2. Run `npm run build` to create the production bundle
3. Run `git push origin HEAD:staging` to trigger deployment

Now use it:

agent run deploy-staging     # From CLI
# or
> /deploy-staging            # From interactive mode

The command's markdown body becomes the LLM prompt, and only the whitelisted tools are available.

agent commands list          # See all available commands

5. Scripts

Scripts are repeatable, scriptable tasks (shell/Node Python) defined via a script.yaml manifest. They differ from Skills and Commands because they execute directly without LLM involvement, making them perfect for CI/CD tasks, builds, or deployments.

Create .agent/scripts/deploy/script.yaml:

name: deploy-staging
description: Build and deploy current branch to staging
entrypoint: deploy.sh
confirm: true
args:
  branch:
    description: Branch to deploy
    default: main

Create exactly the script you need (deploy.sh or deploy.ts):

#!/bin/bash
echo "Deploying branch ${SCRIPT_ARG_BRANCH:-main}..."
npm run build && git push origin HEAD:staging

Now execute it directly from the CLI or REPL:

agent scripts run deploy-staging --branch develop
# or interactively
> /scripts
> agent scripts run deploy-staging

Scripts are automatically provided as context to the LLM, so if you ask the agent to "deploy to staging", it knows it can use your exact shell script to do it.

6. Lifecycle Hooks

Hooks intercept agent execution at every point. Define them in .agent/hooks/hooks.json:

{
  "hooks": {
    "after:tool": [
      {
        "match": "fs.write",
        "command": "npx prettier --write {{path}}",
        "blocking": false
      }
    ],
    "before:plan": [
      {
        "command": "./scripts/validate-env.sh",
        "blocking": true
      }
    ]
  }
}

Available Events

| Event | When | |-------|------| | before:tool / after:tool | Before/after any tool executes | | before:plan / after:plan | Before/after a plan runs | | after:step | After each plan step | | before:skill / after:skill | Around skill execution | | after:decompose | After goal decomposition | | session:start / session:end | At session boundaries |

agent hooks list             # Show registered hooks
agent hooks add after:tool "npx eslint --fix {{path}}" --match fs.write
agent hooks events           # Show all available events

6. Plugins

Bundle skills, commands, and hooks into a distributable package:

my-plugin/
├── plugin.json
├── skills/
│   └── security-scan/
│       ├── skill.json
│       └── prompt.md
├── commands/
│   └── audit.md
└── hooks/
    └── hooks.json

plugin.json:

{
  "name": "enterprise-security",
  "version": "1.0.0",
  "description": "Security scanning and compliance",
  "skills": ["skills/"],
  "commands": ["commands/"],
  "scripts": ["scripts/"],
  "hooks": "hooks/hooks.json"
}

agent plugins install ./my-plugin    # Install from local path
agent plugins list                   # Show installed plugins
agent plugins remove my-plugin       # Uninstall

7. Multi-CLI Orchestration

The agent can delegate tasks to external AI CLIs when they're the right tool for the job:

| Tool | CLI | Best For | |------|-----|----------| | cli.cursor | Cursor | Multi-file refactoring with codebase context | | cli.codex | OpenAI Codex | Code generation with sandbox execution | | cli.gemini | Gemini | Large-context analysis and reasoning | | cli.claude | Claude | Careful code review and generation |

Configure in .agent/config.json:

{
  "cliTools": {
    "cursor": { "binary": "cursor", "available": true },
    "claude": { "binary": "claude", "available": true }
  }
}

The LLM orchestrator automatically selects the right CLI based on the task.

8. Goal Management & Daemon

For long-running, multi-step projects:

# Create a goal
agent goal add "Build authentication with OAuth2" --priority 1

# AI decomposes into tasks
agent goal decompose 1

# Run tasks autonomously
agent daemon start

# Monitor progress
agent goal list               # See goal status
agent goal status 1           # Detailed task view
agent daemon status           # Daemon health
agent daemon logs             # Recent execution logs

# Get reports
agent report generate --summary

9. Plans

Create and run structured execution plans:

agent plan propose "Migrate database from MySQL to PostgreSQL"
agent plan list
agent plan run <plan-file>

10. Memory

The agent stores facts, learnings, and project context persistently:

agent memory search "database credentials"
agent memory add "Staging server is at 10.0.0.5" --category fact

11. Agent Studio

The web-based management console for all agent subsystems:

agent studio
# → Agent Studio running at http://localhost:3333

Agent Studio Dashboard

Studio provides:

Dashboard — View all active agent instances (REPL + daemon)
Console — Real-time terminal with command relay and approval flow
Goals & Plans — Create goals, view tasks, track progress
Skills Manager — CRUD for skill definitions (skill.json + prompt.md)
Commands — Create/delete lightweight command templates
Scripts — Create/delete local automation scripts
Plugins — View and remove installed plugin bundles
Daemon Control — Start/stop background worker, view logs
Memory Explorer — Search, add, and browse persistent memories

Agent Console

12. Auto-Update

The CLI automatically checks for updates on startup:

# Manual update
agent update

On every launch, the agent fetches the latest npm version, compares semver, and installs updates in the background without blocking your session.

🤖 Full CLI Reference

Core

| Command | Description | |---------|-------------| | agent | Launch interactive REPL (no subcommand) | | agent run "<goal>" | One-shot goal execution | | agent init | Initialize project configuration | | agent config --init | Set up global config | | agent doctor | System health check | | agent update | Update CLI to latest version | | agent studio | Launch web-based management console |

Skills

| Command | Description | |---------|-------------| | agent skills list | List installed skills | | agent skills search <query> | Search the skill hub | | agent skills install <name> | Install a skill | | agent skills create <name> | Create a custom skill | | agent skills stats | View performance metrics | | agent skills doctor <name> | Diagnose a failing skill | | agent skills fix <name> | Auto-repair with LLM |

Commands

| Command | Description | |---------|-------------| | agent commands list | List available commands |

Scripts

| Command | Description | |---------|-------------| | agent scripts list | List available scripts | | agent scripts run <name> | Execute a script directly | | agent scripts show <name> | Show script arguments and details |

Hooks

| Command | Description | |---------|-------------| | agent hooks list | Show registered hooks | | agent hooks add <event> <cmd> | Add a new hook | | agent hooks events | Show all hook events |

Plugins

| Command | Description | |---------|-------------| | agent plugins list | List installed plugins | | agent plugins install <path> | Install from local path | | agent plugins remove <name> | Remove a plugin |

Goals & Daemon

| Command | Description | |---------|-------------| | agent goal add "<title>" | Create a goal | | agent goal list | List goals | | agent goal decompose <id> | AI breakdown | | agent goal status <id> | Task-level progress | | agent daemon start | Start background worker | | agent daemon stop | Stop background worker | | agent daemon status | Health & uptime |

Plans, Memory & Reports

| Command | Description | |---------|-------------| | agent plan propose "<desc>" | AI-generate a plan | | agent plan run <file> | Execute a plan | | agent memory search <query> | Search agent memory | | agent memory add "<fact>" | Store a fact | | agent report generate | Activity report |

🏗️ Architecture

┌─────────────────────────────────────────────────────┐
│                    CLI / REPL                        │
│  agent run │ agent (REPL) │ /slash-commands │ MCP   │
├─────────────────────────────────────────────────────┤
│                  LLM Router                          │
│  OpenAI │ Anthropic │ Azure │ Ollama (fallback)     │
├──────────┬──────────┬──────────┬────────────────────┤
│  Skills  │ Commands │  Hooks   │    Plugins          │
│  .md     │  .md     │  .json   │    bundles          │
├──────────┴──────────┴──────────┴────────────────────┤
│              Tool Registry & Policy Engine           │
│  fs.* │ cmd.run │ git.* │ cli.* │ project.detect    │
├─────────────────────────────────────────────────────┤
│  Planner  │ Executor │ Memory  │ Daemon │ Reporter  │
└─────────────────────────────────────────────────────┘

Key Components:

CLI / REPL: Entry point—interactive or subcommand-based
LLM Router: Multi-provider with offline-first support and fallback chains
Skills: Markdown prompt-based capabilities
Commands: Lightweight goal templates (YAML frontmatter + prompt)
Scripts: Direct executable automation with argument injection
Hooks: Event-driven lifecycle interception
Plugins: Distributable bundles of skills + commands + scripts + hooks
Tool Registry: Sandboxed tool execution with permission gates
Policy Engine: Human-in-the-loop approval for sensitive operations
Multi-CLI Tools: Cursor, Codex, Gemini, Claude wrappers

📚 Learning Series

Understand the agent architecture with our 12-part deep-dive:

Vision & Architecture — The high-level design
The Brain (Planner) — Goal decomposition
The Body (Executor) — Secure skill execution
Memory & Context — SQLite & semantic search
Self-Improvement — Metrics & the Auto-Fixer
Plugin Ecosystem — Hooks, commands, multi-CLI
Interactive CLI — The conversational experience
Scripts System — Non-LLM deterministic automation
Agent Studio — Web management console
LLM Providers — Provider configuration & routing
Policy & Approvals — Permission-gated execution
Daemon & Automation — Background task processing

Reference

REST API Reference — Complete Studio API docs
Configuration Reference — All config options

Comparisons

vs OpenClaw — How we differ from AI OS projects

🔮 Roadmap

Check out our detailed ROADMAP.md to see what's next:

✅ Phase 5: Plugin Ecosystem & Extensibility
✅ Phase 6: Interactive CLI Experience
🔜 Phase 1: Sandboxed Execution & Secrets Management
🔜 Phase 2: Multi-Agent Collaboration (The Swarm)
🔜 Phase 3: Voice & Vision Interfaces
🔜 Phase 4: The Agent Cloud (Skill Hub, Remote Execution, Dashboard)

🤝 Contributing

We welcome contributions! Please see CONTRIBUTING.md for details.

Key areas where we need help:

Writing new Skills
Improving Planner prompt engineering
Building the Web Dashboard
Creating community Plugins

License

MIT