ai-desk

v0.3.5

Published

a month ago

Security-First, Token-Efficient AI Gateway

Ai_DESK

A security-first gateway for running personal AI agents in a hardened, controlled environment. Authentication, sandboxing, and threat detection are mandatory and always-on — not optional add-ons.

What is Ai_DESK?

Ai_DESK is a self-hosted AI agent gateway that sits between you and the AI providers (Anthropic, Google, OpenRouter). It manages:

Multiple AI agents with per-agent model, tools, budget, and sandbox configuration
Messaging platform bots (Telegram, Discord) that route to your agents
Token and cost budgets with hard caps and automatic failover to cheaper models
Tool execution in isolated sandboxes with an explicit allowlist policy
Autonomous skill creation — agents learn from their own sessions and synthesize reusable skills automatically
A web dashboard for real-time monitoring, credentials management, and configuration

The guiding principle: every secret is encrypted, every tool call is checked against policy, every user input is scanned for threats.

Features

Security

Multi-mode authentication (token / password / certificate) with brute-force lockout
AES-256-GCM encryption for all secrets, sessions, tokens, and memory at rest
Real-time threat detection — prompt injection, jailbreaks, command injection, social engineering
Deny-all-by-default tool policy with explicit allowlisting per agent
Mandatory process sandbox for all tool execution (timeout + memory limits)
Tamper-evident audit log with SHA-256 hash chain validation
Human-in-the-loop approval flow for sensitive tool operations

Token & Cost Efficiency

Response cache (SQLite-backed, configurable TTL) — cache hits return in under 1 ms
Context compaction — auto-summarises old conversation history before hitting token limits
Per-agent daily and monthly budgets (tokens + dollar cost) with hard stops
Smart model router: primary → failover chain; sub-agents forced to cheaper models (e.g. Gemini Flash)
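
As a rough illustration of the TTL-based response cache, the sketch below uses an in-memory `Map` as a stand-in for the SQLite backend. The class name, the key derivation (a SHA-256 over agent, model, and prompt), and the lazy purge are assumptions made for the example, not the project's actual implementation.

```typescript
import { createHash } from "node:crypto";

interface CacheEntry {
  response: string;
  expiresAt: number; // epoch milliseconds
}

// Hypothetical in-memory stand-in for the SQLite-backed cache.
class ResponseCache {
  private entries = new Map<string, CacheEntry>();
  private ttlMs: number;

  constructor(ttlSeconds: number) {
    this.ttlMs = ttlSeconds * 1000;
  }

  // Key on everything that affects the answer: agent, model, and prompt.
  static keyFor(agentId: string, model: string, prompt: string): string {
    return createHash("sha256").update(JSON.stringify([agentId, model, prompt])).digest("hex");
  }

  get(key: string, now: number = Date.now()): string | undefined {
    const entry = this.entries.get(key);
    if (!entry) return undefined;
    if (entry.expiresAt <= now) {
      this.entries.delete(key); // lazily purge the expired entry
      return undefined;
    }
    return entry.response;
  }

  set(key: string, response: string, now: number = Date.now()): void {
    this.entries.set(key, { response, expiresAt: now + this.ttlMs });
  }
}
```

Passing `now` explicitly keeps expiry logic easy to test; a real cache would also need to decide which request parameters belong in the key.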

Multi-Agent Orchestration

Unlimited named agents with independent configuration
Sub-agent spawning with configurable depth and concurrency limits
Task orchestration (DAG fan-out across multiple agents)
Team coordination with role-based delegation
Session persistence and replay
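
One way to picture DAG fan-out is to group tasks into waves, where every task in a wave depends only on tasks from earlier waves and can therefore be dispatched concurrently. The sketch below is an assumption-laden illustration: the `dependsOn` field and the `planWaves` helper are hypothetical and do not appear in the CLI examples in this README.

```typescript
// Hypothetical task shape; `dependsOn` is an assumed field name.
interface Task {
  id: string;
  agentId: string;
  dependsOn: string[];
}

// Returns task ids grouped into waves that can each be fanned out in parallel.
function planWaves(tasks: Task[]): string[][] {
  const done = new Set<string>();
  const waves: string[][] = [];
  let remaining = tasks.slice();
  while (remaining.length > 0) {
    const ready = remaining.filter((t) => t.dependsOn.every((d) => done.has(d)));
    if (ready.length === 0) {
      throw new Error("cycle or missing dependency in task graph");
    }
    waves.push(ready.map((t) => t.id));
    ready.forEach((t) => done.add(t.id));
    remaining = remaining.filter((t) => !done.has(t.id));
  }
  return waves;
}
```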

Autonomous Skill Creation

Auto-synthesis — agent sessions with enough tool calls are automatically synthesized into reusable skills (fire-and-forget, never blocks the hot path)
Skill registry — skills persist as *.skill.json files; disabled by default until explicitly approved
Approval flow — generated skills require human review before activation (skill review + skill approve)
Anti-skills (kind: "avoid") — synthesized from failure traces; injected as a cautionary "AVOID patterns" block in the system prompt
Self-improvement — underperforming skills (high failure rate) are automatically revised; sandbox gate rejects regressions
Lifecycle management — TTL expiry, negative-ROI archiving, LRU pruning when enabled count exceeds per-agent cap
Conflict detection — Jaccard topic-overlap and imperative contradiction checks prevent conflicting skills from being active together
Skill merger — combine two compatible skills into a single consolidated skill
Multi-agent scope — skills can be scoped global, project, or agent-level with an explicit allowlist
Evaluation harness — golden task *.eval.json files scored by an LLM judge per skill
Export / Import — portable bundle format with SHA-256 checksum; PII scrubbed on export

Integrations

Telegram and Discord bots (per-bot agent routing, typing indicator, concurrency locking)
Skills — modular capability bundles that inject tools, system-prompt fragments, and MCP servers
MCP (Model Context Protocol) server support for external tool providers
Cron-based scheduled agent runs
HTTP webhooks that trigger agent invocations from external systems

Quick Start

Prerequisites

Node.js 22+
An Anthropic, Google, or OpenRouter API key

Install

npm install -g ai-desk@latest

First Run

ai-desk onboard        # interactive setup wizard

On first run without a config file the gateway opens a setup wizard at http://127.0.0.1:18789/setup. Complete it to generate ai-desk.json and your first auth token.


Dashboard: http://127.0.0.1:18789/dashboard
Login: http://127.0.0.1:18789/login

Background Daemon

ai-desk gateway --background        # start in background
ai-desk daemon install              # install as system service (systemd / launchd / Task Scheduler)
ai-desk daemon start
ai-desk daemon stop
ai-desk daemon restart

Configuration

Environment Variables (`.env`)

| Variable | Required | Description |
|---|---|---|
| AI_DESK_MASTER_KEY | Yes | Encryption passphrase for all at-rest secrets. If lost, encrypted data is unrecoverable, so back it up. |
| AI_DESK_DATA_DIR | No | Runtime data directory (default: ./.ai-desk-data) |
| ANTHROPIC_API_KEY | No | Anthropic API key (can also be stored via dashboard) |
| GOOGLE_AI_API_KEY | No | Google Gemini API key |
| OPENROUTER_API_KEY | No | OpenRouter API key |
| TELEGRAM_BOT_TOKEN | No | Telegram bot token |
| DISCORD_BOT_TOKEN | No | Discord bot token |
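
To make the role of the master passphrase concrete, here is a minimal sketch of passphrase-based AES-256-GCM encryption using Node's built-in `crypto` module. The key derivation (scrypt), the salt and nonce sizes, and the `salt | iv | tag | ciphertext` layout are assumptions for illustration; the README does not document how Ai_DESK derives keys or lays out ciphertext.

```typescript
import { createCipheriv, createDecipheriv, randomBytes, scryptSync } from "node:crypto";

// Derive a 32-byte key from the passphrase (scrypt is an assumed choice).
function deriveKey(passphrase: string, salt: Buffer): Buffer {
  return scryptSync(passphrase, salt, 32);
}

function encryptSecret(plaintext: string, passphrase: string): string {
  const salt = randomBytes(16);
  const iv = randomBytes(12); // 96-bit nonce, the usual size for GCM
  const cipher = createCipheriv("aes-256-gcm", deriveKey(passphrase, salt), iv);
  const ciphertext = Buffer.concat([cipher.update(plaintext, "utf8"), cipher.final()]);
  const tag = cipher.getAuthTag(); // 16-byte authentication tag
  return Buffer.concat([salt, iv, tag, ciphertext]).toString("base64");
}

function decryptSecret(encoded: string, passphrase: string): string {
  const raw = Buffer.from(encoded, "base64");
  const salt = raw.subarray(0, 16);
  const iv = raw.subarray(16, 28);
  const tag = raw.subarray(28, 44);
  const ciphertext = raw.subarray(44);
  const decipher = createDecipheriv("aes-256-gcm", deriveKey(passphrase, salt), iv);
  decipher.setAuthTag(tag);
  // final() throws if the passphrase is wrong or the data was tampered with.
  return Buffer.concat([decipher.update(ciphertext), decipher.final()]).toString("utf8");
}
```

Because GCM is authenticated, decrypting with the wrong passphrase fails loudly instead of returning garbage, which is why a lost master key leaves the data unrecoverable.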

Config File (`ai-desk.json`)

{
  "gateway": {
    "bind": "127.0.0.1",   // bind address; never bind to 0.0.0.0 without a reverse proxy in front
    "port": 18789,
    "auth": {
      "mode": "token",     // "token" | "password" | "certificate"
      "maxFailedAttempts": 5,
      "lockoutDurationMs": 300000
    },
    "rateLimit": {
      "maxPerSecond": 10,
      "maxConnections": 50
    }
  },

  "agents": {
    "defaults": {
      "model": {
        "primary": "anthropic/claude-sonnet-4-6",
        "failover": ["anthropic/claude-haiku-4-5-20251001", "google/gemini-2.5-flash"]
      },
      "sandbox": {
        "mode": "all",          // "all" | "untrusted"
        "timeoutMs": 30000,
        "maxMemoryMb": 512
      },
      "tools": {
        "profile": "deny-all", // "deny-all" | "readonly" | "messaging" | "full"
        "allow": [],
        "deny": []
      },
      "budget": {
        "daily":   { "tokens": 500000, "cost": 5.0,  "action": "pause" },
        "monthly": { "tokens": 5000000, "cost": 50.0, "action": "warn" }
      },
      "subagents": {
        "model": "google/gemini-2.5-flash",
        "maxDepth": 3,
        "maxConcurrent": 5
      }
    },
    "list": [
      {
        "id": "main",
        "default": true,
        "workspace": ".",
        "skills": ["security-code-review"]  // skills active for this agent
      }
    ]
  },

  "skillSynthesis": {
    "model": "anthropic/claude-sonnet-4-6",
    "improvementModel": "anthropic/claude-sonnet-4-6",
    "scrubModel": "anthropic/claude-haiku-4-5",
    "fallbackToHaikuUnderBudget": true,
    "maxPerDay": 5,                    // max auto-syntheses per day
    "minGapMinutes": 15,               // min time between syntheses
    "autoTriggerMinToolCalls": 8,      // synthesize after sessions with >= N tool calls
    "failureRateThreshold": 0.4,       // improve skills with failure rate above this
    "minUsesBeforeImprovement": 30,    // minimum uses before improvement is considered
    "ttlDays": 60,                     // archive generated skills unused for this long
    "maxEnabledPerAgent": 15,          // LRU-prune when enabled count exceeds this
    "maxGeneratedTotal": 50,
    "deprecateAfterNegativeUses": 10   // archive if avg token savings is negative
  },

  "messaging": {
    "telegram": {
      "enabled": true,
      "agentId": "main",
      "allowedChatIds": []
    },
    "discord": {
      "enabled": true,
      "agentId": "main",
      "prefix": "!ask",
      "allowedGuildIds": [],
      "allowedChannelIds": []
    }
  },

  "cache": {
    "enabled": true,
    "backend": "sqlite",
    "ttlSeconds": 3600
  }
}

CLI Reference

Gateway

ai-desk gateway                          # start gateway (interactive)
ai-desk gateway --background             # start in background
ai-desk gateway --config path/to/config  # use custom config path

Agents

ai-desk agent test "your prompt"         # run a one-shot agent call from CLI
ai-desk agent list                       # list configured agents

Auth Tokens

ai-desk token create                     # generate a new dashboard token
ai-desk token list
ai-desk token revoke <tokenId>

Budget

ai-desk budget show                      # show usage summary
ai-desk budget resume <agentId>          # resume a paused agent

Cache

ai-desk cache stats
ai-desk cache clear
ai-desk cache purge                      # purge expired entries

Skills

# Discovery & status
ai-desk skill list                       # list all loaded skills
ai-desk skill list-generated             # list generated skills and approval status
ai-desk skill info <name>                # full details for a skill
ai-desk skill review <name>              # review a generated skill (diff vs parent)

# Enable / disable
ai-desk skill enable <name>
ai-desk skill disable <name>

# Approval flow (generated skills)
ai-desk skill approve <name>
ai-desk skill reject <name>
ai-desk skill archive <name>

# Synthesis
ai-desk skill synthesize --from-session <id>           # synthesize from a session trace
ai-desk skill synthesize --from-session <id> --negative  # synthesize an anti-skill (kind=avoid)
ai-desk skill synthesize --from-session <id> --dry-run   # preview without writing

# Self-improvement
ai-desk skill improve                    # improve all qualifying skills
ai-desk skill improve --name <name>      # improve a specific skill
ai-desk skill improve --dry-run

# Merge
ai-desk skill merge-candidates           # list pairs recommended for merging
ai-desk skill merge <nameA> <nameB>      # merge two skills into one
ai-desk skill merge <nameA> <nameB> --name my-merged --archive-sources

# Scope
ai-desk skill scope <name> --set project                        # project-wide (default)
ai-desk skill scope <name> --set agent --allow-agent <agentId>  # restrict to specific agent
ai-desk skill scope <name> --set global                         # all agents

# Evaluation
ai-desk skill eval <name>                # run golden task evals for a skill
ai-desk skill eval --all                 # evaluate all enabled skills
ai-desk skill eval --tag security        # filter evals by tag

# Export / Import
ai-desk skill export <name>              # export as portable bundle JSON
ai-desk skill export <name> --out my-skill.bundle.json
ai-desk skill import <bundle-path>       # import and register for approval

MCP

ai-desk mcp list
ai-desk mcp test <serverName>

Multi-Agent

ai-desk team list
ai-desk team run <teamId> "goal"
ai-desk role list
ai-desk orchestrate run '[{"prompt":"...", "agentId":"main"}]'

Security

ai-desk security audit                   # comprehensive security recommendations

Autonomous Skill Creation

Ai_DESK can observe its own agent sessions and synthesize reusable skills: JSON bundles that extend future agents with learned instructions and tool permissions.

How it works

Agent session (≥ 8 tool calls, successful)
  → Auto-trigger (fire-and-forget, non-blocking)
  → PII scrub
  → LLM synthesis (skill-synthesis.v1.md prompt)
  → Schema validation (no mcpServer, provenance=generated)
  → Dedup check (Jaccard similarity vs existing skills)
  → Written to skills/generated/<name>.skill.json
  → Registered as pendingApproval=true
  → Human reviews and approves via CLI or dashboard
  → Skill enabled → injected into agent system prompt
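
The dedup step in the pipeline above relies on Jaccard similarity. The following sketch shows how such a check could work over word sets; the tokenizer, the choice of the description as the compared field, and the 0.8 threshold are assumptions, since the README does not specify them.

```typescript
// Lowercase word set; the tokenization rule is an assumption.
function tokenize(text: string): Set<string> {
  return new Set(text.toLowerCase().split(/[^a-z0-9]+/).filter((w) => w.length > 0));
}

// Jaccard similarity: |A ∩ B| / |A ∪ B|.
function jaccard(a: Set<string>, b: Set<string>): number {
  if (a.size === 0 && b.size === 0) return 0;
  let shared = 0;
  a.forEach((item) => {
    if (b.has(item)) shared += 1;
  });
  return shared / (a.size + b.size - shared);
}

// Hypothetical helper: flag a candidate whose description overlaps an existing one.
function isDuplicate(candidate: string, existing: string[], threshold: number = 0.8): boolean {
  const candidateTokens = tokenize(candidate);
  return existing.some((text) => jaccard(candidateTokens, tokenize(text)) >= threshold);
}
```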

Skill kinds

| Kind | Behaviour |
|------|-----------|
| positive | Capability addition: appended to the main system prompt |
| avoid | Cautionary anti-skill: injected as a separate "AVOID patterns" block; synthesized from failure traces using --negative |

Lifecycle

| Check | Trigger | Action |
|-------|---------|--------|
| Negative ROI | avgTokensSaved < 0 for ≥ N uses | Archive |
| TTL expiry | Not used within ttlDays | Archive |
| LRU prune | Enabled count > maxEnabledPerAgent | Disable oldest |
| Self-improvement | Failure rate > threshold | LLM revision → sandbox gate → pending approval |
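
The first three lifecycle checks can be sketched as a single sweep over skill statistics. The config field names mirror the `skillSynthesis` keys shown earlier, but the `SkillStats` shape, the reading of "oldest" as least recently used, and the order in which checks are applied are assumptions.

```typescript
// Hypothetical per-skill statistics.
interface SkillStats {
  name: string;
  enabled: boolean;
  uses: number;
  avgTokensSaved: number;
  lastUsedAt: number; // epoch milliseconds
}

interface LifecycleConfig {
  ttlDays: number;
  deprecateAfterNegativeUses: number;
  maxEnabledPerAgent: number;
}

interface Decision {
  name: string;
  action: "archive" | "disable";
  reason: string;
}

function lifecycleSweep(skills: SkillStats[], cfg: LifecycleConfig, now: number): Decision[] {
  const decisions: Decision[] = [];
  const stillEnabled: SkillStats[] = [];
  const ttlMs = cfg.ttlDays * 24 * 60 * 60 * 1000;
  for (const s of skills) {
    if (s.avgTokensSaved < 0 && s.uses >= cfg.deprecateAfterNegativeUses) {
      decisions.push({ name: s.name, action: "archive", reason: "negative ROI" });
    } else if (now - s.lastUsedAt > ttlMs) {
      decisions.push({ name: s.name, action: "archive", reason: "TTL expiry" });
    } else if (s.enabled) {
      stillEnabled.push(s);
    }
  }
  // LRU prune: keep the most recently used skills up to the cap, disable the rest.
  stillEnabled.sort((a, b) => b.lastUsedAt - a.lastUsedAt);
  for (const s of stillEnabled.slice(cfg.maxEnabledPerAgent)) {
    decisions.push({ name: s.name, action: "disable", reason: "LRU prune" });
  }
  return decisions;
}
```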

Skill bundle format

{
  "name": "sql-query-optimizer",
  "version": "1.0.0",
  "description": "Guides the agent to write efficient SQL with proper indexing hints.",
  "tags": ["sql", "database", "performance"],
  "systemPromptAddition": "When writing SQL queries:\n1. ...",
  "toolAllowlist": ["read_file", "bash"],
  "provenance": "generated",
  "revision": 1,
  "kind": "positive",
  "scope": "project",
  "sourceSessionId": "sess-abc123",
  "traceHash": "a1b2c3d4e5f6a1b2",
  "createdAt": 1746000000000
}

Golden task evaluations

Place *.eval.json files in evals/golden/ to score skills against expected outcomes:

{
  "id": "security-review-001",
  "description": "Agent should identify SQL injection",
  "prompt": "Review this code: SELECT * FROM users WHERE id = ${userId}",
  "expectedOutcome": "Agent identifies SQL injection and suggests parameterized queries.",
  "tags": ["security"],
  "skill": "security-code-review"
}

Run with: ai-desk skill eval security-code-review

AI Providers & Models

| Provider | Models | Auth |
|---|---|---|
| Anthropic | claude-opus-4-7, claude-sonnet-4-6, claude-haiku-4-5-20251001 | API key or Claude Code OAuth (auto-detected from ~/.claude/) |
| Google | gemini-2.5-flash, gemini-2.0-pro, gemini-1.5-pro | GOOGLE_AI_API_KEY |
| OpenRouter | 200+ models (openrouter/anthropic/..., openrouter/openai/gpt-4o, …) | OPENROUTER_API_KEY |

Model routing: each agent has a primary model and an ordered failover chain. Sub-agents default to the configured sub-agent model (typically Gemini Flash) to keep costs low. The compaction model (used for summarising history) defaults to Haiku. Skill synthesis falls back to Haiku automatically when budget is running low.
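
The routing rule can be illustrated with a small sketch: build an ordered candidate list, then try each model until one succeeds. The helper names are hypothetical, the `call` parameter stands in for a real provider request, and the assumption that sub-agents fall through the same failover chain after their own model is not confirmed by this README. The sketch is synchronous for brevity; a real router would be asynchronous and streaming.

```typescript
interface ModelConfig {
  primary: string;
  failover: string[];
}

// Ordered candidates for a run, with duplicates removed.
function candidateModels(model: ModelConfig, subagentModel: string, isSubAgent: boolean): string[] {
  const first = isSubAgent ? subagentModel : model.primary;
  const ordered = [first].concat(model.failover);
  return ordered.filter((m, i) => ordered.indexOf(m) === i);
}

// Try each candidate in order; `call` is a hypothetical provider invocation that throws on failure.
function callWithFailover<T>(candidates: string[], call: (model: string) => T): { model: string; result: T } {
  const errors: string[] = [];
  for (const model of candidates) {
    try {
      return { model, result: call(model) };
    } catch (err) {
      errors.push(model + ": " + (err instanceof Error ? err.message : String(err)));
    }
  }
  throw new Error("all models failed: " + errors.join("; "));
}
```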

Messaging Platforms

Telegram

Bot polling with per-chat allowlisting
Typing indicator refreshed every 4.5 s while the agent works
Per-chat concurrency lock — queues new messages instead of dropping them
Hot-connect a bot token from the dashboard without restarting

Discord

Guild and channel allowlisting
Mention-based or prefix-based triggering (!ask)
Per-channel concurrency lock with queuing

Both adapters normalise incoming messages to a common IncomingMessage format and route through the same agent runtime, threat scanner, and budget tracker.
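
A sketch of what that normalisation might look like is shown below. The input interfaces are small subsets of the Telegram Bot API `Update` object and the Discord message object; the `IncomingMessage` fields and both function names are assumptions, since the actual type is not shown in this README.

```typescript
// Hypothetical common shape; the real IncomingMessage fields may differ.
interface IncomingMessage {
  platform: "telegram" | "discord";
  chatId: string;
  userId: string;
  text: string;
}

// Subsets of the platform payloads, limited to the fields used here.
interface TelegramUpdate {
  message?: { chat: { id: number }; from?: { id: number }; text?: string };
}
interface DiscordMessage {
  channel_id: string;
  author: { id: string };
  content: string;
}

function fromTelegram(update: TelegramUpdate): IncomingMessage | null {
  const msg = update.message;
  if (!msg || !msg.from || msg.text === undefined) return null; // ignore non-text updates
  return { platform: "telegram", chatId: String(msg.chat.id), userId: String(msg.from.id), text: msg.text };
}

// Handles prefix-based triggering only; mention-based triggering is omitted.
function fromDiscord(message: DiscordMessage, prefix: string): IncomingMessage | null {
  if (!message.content.startsWith(prefix)) return null;
  const text = message.content.slice(prefix.length).trim();
  return { platform: "discord", chatId: message.channel_id, userId: message.author.id, text };
}
```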

Dashboard

Access at http://127.0.0.1:18789/dashboard (requires auth token).

| Tab | What it shows |
|---|---|
| Status | Live system snapshot: agents, budget, MCP servers, event log |
| Agents | Agent list, status, session count; edit config with hot-reload |
| Teams | Team definitions and role assignments |
| Roles | Role registry with system-prompt fragments and delegation rules |
| Skills | Enable/disable skills; view pending approvals, conflicts, metrics, and tool allowlists |
| MCP Servers | Connected external tool servers and their status |
| Messaging | Telegram/Discord connections; add per-agent bots |
| Chat | Live WebSocket chat with any configured agent |
| History | Session replay: browse and search past conversations |
| Schedule | Cron job management: create, enable/disable, run now |
| Webhooks | HTTP trigger endpoints: create, view invocation history |
| Audit | Tamper-evident audit log: filter by time, agent, event type |
| Credentials | Encrypted key storage for Anthropic, Google, OpenRouter |

Real-time updates are delivered via Server-Sent Events (SSE). The dashboard reconnects automatically with exponential backoff.
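
For readers unfamiliar with exponential backoff, the reconnect delay typically doubles after each failed attempt up to a cap. The function below is a minimal sketch; the 1 s base and 30 s cap are assumed values, not the dashboard's documented settings.

```typescript
// Delay before reconnect attempt number `attempt` (0-based).
// baseMs and maxMs are illustrative defaults, not taken from Ai_DESK.
function backoffDelayMs(attempt: number, baseMs: number = 1000, maxMs: number = 30000): number {
  return Math.min(maxMs, baseMs * Math.pow(2, attempt));
}

// The first six delays under the assumed defaults.
const schedule = [0, 1, 2, 3, 4, 5].map((attempt) => backoffDelayMs(attempt));
```

Production clients often add random jitter to each delay so that many dashboards do not reconnect at the same instant after a restart.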

Skills dashboard API

GET  /dashboard/api/skills                  list all skills
GET  /dashboard/api/skills/pending          skills awaiting approval
GET  /dashboard/api/skills/conflicts        conflict audit across pending skills
GET  /dashboard/api/skills/:name            single skill detail
POST /dashboard/api/skills/:name/approve
POST /dashboard/api/skills/:name/reject
POST /dashboard/api/skills/:name/archive

Security Architecture

Authentication

Token, password, or certificate mode — one must be chosen; no "none" mode exists
Failed-attempt counter per IP with configurable lockout duration
Dashboard tokens stored hashed; raw token shown only once at creation time
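
The per-IP lockout can be sketched as a small tracker keyed by address. The config field names match `gateway.auth` in `ai-desk.json`; the class itself, and the choice to reset the counter once a lockout starts, are assumptions for illustration.

```typescript
interface LockoutConfig {
  maxFailedAttempts: number;
  lockoutDurationMs: number;
}

interface AttemptState {
  failures: number;
  lockedUntil: number; // epoch milliseconds, 0 when not locked
}

// Hypothetical tracker; not the project's Auth Manager.
class LockoutTracker {
  private byIp = new Map<string, AttemptState>();
  private cfg: LockoutConfig;

  constructor(cfg: LockoutConfig) {
    this.cfg = cfg;
  }

  isLocked(ip: string, now: number): boolean {
    const state = this.byIp.get(ip);
    return state !== undefined && state.lockedUntil > now;
  }

  recordFailure(ip: string, now: number): void {
    const state = this.byIp.get(ip) || { failures: 0, lockedUntil: 0 };
    state.failures += 1;
    if (state.failures >= this.cfg.maxFailedAttempts) {
      state.lockedUntil = now + this.cfg.lockoutDurationMs;
      state.failures = 0; // start a fresh count once the lockout expires
    }
    this.byIp.set(ip, state);
  }

  recordSuccess(ip: string): void {
    this.byIp.delete(ip);
  }
}
```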

Threat Detection

Every incoming message is scanned before reaching the agent:

Prompt injection patterns ("ignore previous instructions", fake system prompts)
Jailbreak patterns ("DAN mode", "do anything now", roleplay bypass)
Data exfiltration attempts
Command injection
Social engineering

Detected threats are blocked and logged; a short canned reply is sent to the user.
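
A pattern-based scanner of this kind could look roughly like the sketch below. The three regular expressions are illustrative only; the real rule set is not published in this README, and a production scanner would need far broader coverage.

```typescript
type ThreatCategory = "prompt-injection" | "jailbreak" | "command-injection";

// Illustrative patterns, not the project's actual rules.
const PATTERNS: { category: ThreatCategory; pattern: RegExp }[] = [
  { category: "prompt-injection", pattern: /ignore (all )?(previous|prior) instructions/i },
  { category: "jailbreak", pattern: /\bDAN mode\b|\bdo anything now\b/i },
  { category: "command-injection", pattern: /;\s*rm\s+-rf\b|\$\([^)]*\)/ },
];

// Returns every category whose pattern matches; an empty array means the message passes.
function scanMessage(text: string): ThreatCategory[] {
  return PATTERNS.filter((p) => p.pattern.test(text)).map((p) => p.category);
}
```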

Tool Policy

Profile sets the baseline: deny-all, readonly, messaging, or full
Allow list explicitly permits additional tools (wildcard support: read*)
Deny list blocks specific tools even if the profile would permit them
Approval flow — high-risk tools (e.g. write_file, execute_command) can require live human consent via WebSocket before execution
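
The interaction between profile, allow list, and deny list can be sketched as a single check. The per-profile baselines below are hypothetical placeholders, since this README does not list which tools each profile grants; only the precedence (deny beats profile and allow) and the `*` wildcard come from the text above.

```typescript
type Profile = "deny-all" | "readonly" | "messaging" | "full";

interface ToolPolicy {
  profile: Profile;
  allow: string[];
  deny: string[];
}

// Hypothetical baselines; the real tool names per profile are not documented here.
const PROFILE_BASELINE: Record<Profile, string[]> = {
  "deny-all": [],
  readonly: ["read*"],
  messaging: ["send_message"],
  full: ["*"],
};

// Glob-style match where `*` stands for any run of characters.
function matchesPattern(pattern: string, tool: string): boolean {
  const escaped = pattern.replace(/[.+?^${}()|[\]\\]/g, "\\$&").replace(/\*/g, ".*");
  return new RegExp("^" + escaped + "$").test(tool);
}

function isToolAllowed(policy: ToolPolicy, tool: string): boolean {
  if (policy.deny.some((p) => matchesPattern(p, tool))) return false; // deny always wins
  const permitted = PROFILE_BASELINE[policy.profile].concat(policy.allow);
  return permitted.some((p) => matchesPattern(p, tool));
}
```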

Sandbox

All tool execution runs in isolated child processes:

Configurable timeout (default 30 s)
Memory cap (default 512 MB)
Environment sanitised — secrets stripped before spawning
Output limited to 1 MB per call
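
Three of these limits map directly onto options of Node's `child_process.execFile`. The sketch below shows environment sanitisation, the timeout, and the output cap; the secret-name pattern and both function names are assumptions, and the memory cap is deliberately left out because it depends on the platform and on the kind of process being spawned.

```typescript
import { execFile } from "node:child_process";

// Assumption: variables whose names look like credentials are stripped.
const SECRET_NAME = /(KEY|TOKEN|SECRET|PASSWORD)/i;

function sanitizeEnv(env: Record<string, string | undefined>): Record<string, string> {
  const clean: Record<string, string> = {};
  for (const name of Object.keys(env)) {
    const value = env[name];
    if (value !== undefined && !SECRET_NAME.test(name)) clean[name] = value;
  }
  return clean;
}

// Runs a command in a child process with a timeout and a 1 MB output cap.
function runSandboxed(command: string, args: string[], timeoutMs: number = 30000): Promise<string> {
  return new Promise((resolve, reject) => {
    execFile(
      command,
      args,
      { env: sanitizeEnv(process.env), timeout: timeoutMs, maxBuffer: 1024 * 1024 },
      (error, stdout) => {
        if (error) reject(error);
        else resolve(stdout);
      },
    );
  });
}
```

With these options, Node kills the child when the timeout elapses or when the output exceeds `maxBuffer`, and the promise rejects in both cases.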

Skill Security Invariants

Generated skills enforce strict security guarantees that cannot be bypassed:

mcpServer is never allowed in generated skills — only builtin/user skills may spawn external processes
toolAllowlist must be a subset of the tools the source session was permitted to use
All generated skills start with enabled: false and pendingApproval: true, so explicit human approval is required before activation
PII is scrubbed from session traces before any data reaches the LLM
Synthesis is rate-limited per agent (configurable maxPerDay + minGapMinutes)
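
The first three invariants lend themselves to a simple validation pass. The sketch below reuses field names from the skill bundle format; the `validateGeneratedSkill` and `registerPending` helpers are hypothetical, and where `enabled` and `pendingApproval` are actually stored (bundle or registry) is an assumption.

```typescript
interface GeneratedSkill {
  name: string;
  provenance: string;
  toolAllowlist: string[];
  enabled?: boolean;
  pendingApproval?: boolean;
  mcpServer?: unknown;
}

// Returns a list of violations; an empty list means the skill passes.
function validateGeneratedSkill(skill: GeneratedSkill, sessionTools: string[]): string[] {
  const errors: string[] = [];
  if ("mcpServer" in skill) errors.push("mcpServer is not allowed in generated skills");
  if (skill.provenance !== "generated") errors.push("provenance must be generated");
  const extra = skill.toolAllowlist.filter((tool) => sessionTools.indexOf(tool) === -1);
  if (extra.length > 0) errors.push("toolAllowlist exceeds session tools: " + extra.join(", "));
  return errors;
}

// New skills are always registered disabled and awaiting approval.
function registerPending(skill: GeneratedSkill): GeneratedSkill {
  return { ...skill, enabled: false, pendingApproval: true };
}
```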

Audit Log

Every authentication event, tool call, budget violation, skill synthesis, and threat detection is recorded
Hash chain: each entry includes the SHA-256 of the previous entry
Integrity can be verified at any time from the dashboard or CLI
Stored in .ai-desk-data/audit.db
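
A hash chain makes tampering detectable because changing any entry invalidates every hash after it. The following sketch keeps the log in an array instead of a database; the entry fields, the all-zero genesis hash, and the JSON serialisation used as hash input are assumptions.

```typescript
import { createHash } from "node:crypto";

// Hypothetical entry shape.
interface AuditEntry {
  timestamp: number;
  event: string;
  detail: string;
  prevHash: string;
  hash: string;
}

const GENESIS_HASH = "0".repeat(64);

function computeHash(timestamp: number, event: string, detail: string, prevHash: string): string {
  return createHash("sha256").update(JSON.stringify([timestamp, event, detail, prevHash])).digest("hex");
}

function appendEntry(log: AuditEntry[], timestamp: number, event: string, detail: string): void {
  const last = log[log.length - 1];
  const prevHash = last ? last.hash : GENESIS_HASH;
  log.push({ timestamp, event, detail, prevHash, hash: computeHash(timestamp, event, detail, prevHash) });
}

// Recomputes every hash and checks each link to the previous entry.
function verifyChain(log: AuditEntry[]): boolean {
  let expectedPrev = GENESIS_HASH;
  for (const entry of log) {
    if (entry.prevHash !== expectedPrev) return false;
    if (entry.hash !== computeHash(entry.timestamp, entry.event, entry.detail, entry.prevHash)) return false;
    expectedPrev = entry.hash;
  }
  return true;
}
```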

Budget Controls

Daily and monthly caps in tokens and dollar cost
Actions on breach: warn (log + notify), pause (block new calls), block (immediate stop)
Per-run maximums prevent single runaway calls from consuming the entire budget
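
To show how caps and actions combine, the sketch below evaluates daily and monthly usage against the limits from `ai-desk.json` and returns the most severe triggered action. The severity ordering (warn < pause < block) and the rule that reaching a cap counts as a breach are assumptions; per-run maximums are not modelled.

```typescript
type BudgetAction = "warn" | "pause" | "block";

interface BudgetLimit {
  tokens: number;
  cost: number;
  action: BudgetAction;
}

interface Usage {
  tokens: number;
  cost: number;
}

interface BudgetConfig {
  daily: BudgetLimit;
  monthly: BudgetLimit;
}

const SEVERITY: Record<BudgetAction, number> = { warn: 1, pause: 2, block: 3 };

function isBreached(usage: Usage, limit: BudgetLimit): boolean {
  return usage.tokens >= limit.tokens || usage.cost >= limit.cost;
}

// Returns the most severe action among breached limits, or null if within budget.
function evaluateBudget(daily: Usage, monthly: Usage, budget: BudgetConfig): BudgetAction | null {
  const triggered: BudgetAction[] = [];
  if (isBreached(daily, budget.daily)) triggered.push(budget.daily.action);
  if (isBreached(monthly, budget.monthly)) triggered.push(budget.monthly.action);
  if (triggered.length === 0) return null;
  return triggered.reduce((worst, action) => (SEVERITY[action] > SEVERITY[worst] ? action : worst));
}
```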

Architecture Overview

┌─────────────────────────────────────────────────────────┐
│  Client (Browser / Telegram / Discord / CLI)            │
└───────────────────────┬─────────────────────────────────┘
                        │  WebSocket / HTTP
┌───────────────────────▼─────────────────────────────────┐
│  Gateway Server                                         │
│  ├── Auth Manager       (token validation, lockout)     │
│  ├── Rate Limiter       (per-IP, per-connection)        │
│  ├── Dashboard Server   (HTTP + SSE + Skill API)        │
│  └── WebSocket Handler  (streaming chat)                │
└───────────────────────┬─────────────────────────────────┘
                        │
┌───────────────────────▼─────────────────────────────────┐
│  Agent Runtime                                          │
│  ├── Threat Scanner     (input validation)              │
│  ├── Response Cache     (SQLite, TTL-based)             │
│  ├── Budget Tracker     (daily / monthly caps)          │
│  ├── Context Compactor  (auto-summarise history)        │
│  ├── Model Router       (primary + failover chain)      │
│  ├── Tool Executor      (policy check → sandbox)        │
│  ├── Sub-agent Spawner  (recursive, depth-limited)      │
│  ├── Session Store      (encrypted persistence)         │
│  └── Skill Auto-Trigger (fire-and-forget synthesis)     │
└───────────┬─────────────────────┬───────────────────────┘
            │                     │
┌───────────▼──────────┐ ┌───────▼──────────────────────┐
│  AI Providers        │ │  Tool Providers               │
│  ├── Anthropic       │ │  ├── Built-in tools           │
│  ├── Google Gemini   │ │  ├── MCP servers              │
│  └── OpenRouter      │ │  └── Skills                   │
└──────────────────────┘ └───────────────────────────────┘
                                  │
          ┌───────────────────────▼──────────────────────┐
          │  Autonomous Skill System                      │
          │  ├── Skill Trace Store   (SQLite FTS5)        │
          │  ├── Skill Synthesizer   (positive + avoid)   │
          │  ├── Skill Improver      (self-revision)      │
          │  ├── Skill Merger        (combine skills)     │
          │  ├── Skill Evaluator     (golden task evals)  │
          │  ├── Skill Lifecycle Mgr (TTL / LRU / ROI)   │
          │  ├── Conflict Detector   (Jaccard + imperatives)
          │  ├── Skill Sandbox       (LLM judge gate)     │
          │  ├── Skill Registry      (state persistence)  │
          │  └── Export / Import     (bundle + checksum)  │
          └───────────────────────────────────────────────┘

Request path (happy path):

User message
  → Threat scanner
  → Cache lookup          ← hit: return immediately
  → Budget check
  → Skill system prompt   ← positive skills + AVOID block injected
  → Model router → API call (streaming)
  → Tool calls → Policy check → Approval? → Sandbox → recurse
  → Session save (encrypted) + trace recorded
  → Budget deduct + skill metrics updated
  → Auto-trigger synthesis? (fire-and-forget if ≥ N tool calls)
  → Response streamed back
