@qingfeng346/sbot

v0.1.2

Published

15 days ago

Self-hosted AI agent server. Open-source alternative to OpenAI, Claude, and Gemini interfaces. Run LLM agents locally with MCP tool support, memory, multi-channel integrations (Lark/Feishu), and a built-in web UI.

0High
0Medium
0Low

qingfeng346

sbot — Self-hosted AI Agent Server

English | 中文

Open-source, self-hosted AI agent framework. Modular by design: models, memory, tools, and channels are independent building blocks you mix and match to assemble agents — run on your own server with multi-channel integrations, MCP tool support, and a built-in web UI, no vendor lock-in.

Quick Start

npm

npm install -g @qingfeng346/sbot
sbot
# Open http://localhost:5500

If port 5500 is already in use, change it with:

sbot port 3000        # save port and exit
sbot --port 3000      # save port and start

Docker

docker pull qingfeng346/sbot
docker run -d \
  -p 5500:5500 \
  -v ~/.sbot:/root/.sbot \
  --name sbot \
  qingfeng346/sbot
# Open http://localhost:5500

Configuration and data are persisted in ~/.sbot on the host.

Features

Modular composition — Models, memory, tools, channels, and skills are independent building blocks you mix and match to assemble agents
One-command deployment — npm install -g or docker run, native cross-platform with no extra system dependencies
Full Web UI management — All configuration done in the browser, no manual file editing required
Multiple LLM providers — OpenAI, Anthropic Claude, Google Gemini, Ollama, and any OpenAI-compatible API (Azure OpenAI, Groq, Mistral, DeepSeek, etc.)
Multi-agent orchestration — ReAct recursive task decomposition + Generative multimodal, agents can be nested and composed
Knowledge base — Built-in wiki system with automatic document extraction and semantic search, referenced by agents during conversations
Long-term memory — Full extract → compress pipeline with vector-embedding semantic search
MCP tools — Standard MCP protocol (stdio/HTTP), connect to any MCP tool ecosystem
Multiple channels — Web UI, CLI, Lark/Feishu, Slack, WeCom, WeChat, REST API, WebSocket
Built-in tools — Shell execution, file system, archive operations, media file read, Python/PowerShell inline execution, cron scheduler, todo tasks
Skills — Installable prompt modules, remote install from skills.sh / Clawhub
Token usage tracking — Built-in consumption statistics with real-time visibility
Flexible config — Global and per-session overrides from a single settings.json

Usage Guide

Open http://localhost:5500 after starting sbot. Follow these steps:

1. Add a Model — sidebar → Models → New

Fill in provider, API key, base URL, and model name. Supported providers: OpenAI, Anthropic, Google Gemini, Ollama, and any OpenAI-compatible endpoint (Azure OpenAI, Groq, Mistral, DeepSeek, etc.).

2. Create a Saver — sidebar → Savers → New

Choose a backend for storing conversation history:

| Backend | Description | |---|---| | File | JSON files per conversation thread (recommended) | | SQLite | Local SQLite database | | Memory | Cleared after conversation ends, no persistence, suitable for one-off conversations or Q&A assistants |

3. Create an Agent — sidebar → Agents → New

Choose a mode:

Single — select a model, write a system prompt, optionally attach MCP tools and skills
ReAct — select a think model, then add sub-agents (each with an id and description for task dispatch). The think model decomposes tasks and dispatches sub-tasks recursively; each sub-agent has read-only access to shared memory
Generative — select a multimodal model for mixed text and image content generation

→ MCP Tools · Skills

4. Start chatting — choose your entry point

Session — sidebar → Chat → New Session, select agent + saver + memory; optionally configure a working directory
Channel (IM) — sidebar → Channels → New → Channel Setup; optionally configure a working directory

5. (Optional) Enable Wiki Knowledge Base — sidebar → Wiki → New

Built-in knowledge base. Create pages manually (title + content + tags) or let the agent auto-extract knowledge from conversations. Auto-extraction requires an extractor model. Assign wikis to a session or channel; agents can search, read, and create pages via built-in tools.

| Field | Description | |-------|-------------| | Extractor | LLM model for auto-extracting knowledge from conversations | | Auto-extract | When enabled, the agent extracts valuable knowledge after each reply | | Shared | Off = per-thread wiki; On = shared across all threads |

6. (Optional) Enable Memory — sidebar → Memories → New

Requires an embedding model first (sidebar → Embeddings → New). Then assign the memory to a session or channel.

| Field | Description | |-------|-------------| | Mode | read_only / human_only (user messages) / human_and_ai (full conversation) | | Max age (days) | Auto-expire memories after N days | | Embedding | Embedding model for semantic search (OpenAI, Azure, Ollama) | | Extractor | Extracts key facts from conversation | | Compressor | Merges similar memories to reduce redundancy | | Shared | Off = per-thread memory; On = shared across all threads |

Add MCP Tools

Sidebar → MCP → New

Add a server:

stdio — command + args (e.g. npx -y some-mcp-package)
http — remote URL + optional headers

Supports global servers shared across all agents and per-agent overrides. Servers auto-restart on failure. Then open an agent → MCP tab to attach the servers you want it to use.

Manage Skills

Sidebar → Skills

Skill files (Markdown) are stored in ~/.sbot/skills/. Search and install from remote hubs (Clawhub, skills.sh) on the Skills page, or drop files manually. In an agent → Skills tab, select specific skills to load, or leave empty to load all.

Customize Prompts

Sidebar → Prompts

View and edit any built-in prompt (system prompts, agent prompts, tool descriptions, etc.). Saved overrides are stored in ~/.sbot/prompts/ and take precedence over the defaults, effective immediately without restart. Supports {varName} placeholders substituted at runtime.

Channel Setup

In Channels → New, select the type and fill in the credentials, then assign agent + saver + memory. Every user/group chat is isolated automatically.

| Type | Required fields | |------|----------------| | Lark / Feishu | App ID, App Secret | | Slack | Bot Token (xoxb-...), App Token (xapp-...) | | WeCom | Bot ID, Secret | | WeChat | QR code login (credentials auto-populated) |

Setting up Lark / Feishu:

Create a bot app in the Feishu Developer Console (or Lark Developer Console for international)
Enable Bot capability
Grant the following permissions under Permissions & Scopes (or use Batch Import with the JSON below):

| Permission | Description | |------------|-------------| | im:message:send_as_bot | Send messages as bot | | im:message.p2p_msg:readonly | Receive direct messages | | im:message.group_at_msg:readonly | Receive group @bot messages | | im:message.group_msg | Receive all group messages | | im:message:readonly | Read message content | | im:chat:readonly | Read chat/group info | | im:resource | Read files and images in messages | | contact:user.base:readonly | Read basic user info | | contact:contact.base:readonly | Read basic contact info |

{
  "scopes": {
    "tenant": [
      "contact:contact.base:readonly",
      "contact:user.base:readonly",
      "im:chat:readonly",
      "im:message.group_at_msg:readonly",
      "im:message.group_msg",
      "im:message.p2p_msg:readonly",
      "im:message:readonly",
      "im:message:send_as_bot",
      "im:resource"
    ],
    "user": []
  }
}

Under Events & Callbacks, set the subscription mode to Long Connection
In Web UI → Channels, create a Lark channel and fill in App ID and App Secret

Supports event deduplication, interactive cards, per-user context isolation, and file/image send and receive.

Setting up WeCom:

Create an AI app in the WeCom Admin Console and obtain the Bot ID and Secret
In Web UI → Channels, create a WeCom channel and fill in Bot ID and Secret

Connects via WebSocket for real-time messaging, with file and image support.

Setting up WeChat:

In Web UI → Channels, create a WeChat channel
Click QR login and scan the code with WeChat to authenticate
Credentials are saved automatically once authenticated, and the channel goes live immediately

WeChat integration connects via the iLink Bot API, with file and image support.

Built-in Tools

Command Execution

Shell commands and scripts
Inline Python and PowerShell execution
File-referenced script execution
Configurable timeout per command

File System

Read, write, edit files
Search with regex (grep)
Find files by pattern (glob)
Directory listing, create, remove, move, copy
Read media files (images, etc.)

Archive

Compress and extract archive files
List archive contents
Read files directly from within archives

Scheduler

Standard 6-field cron expressions (second minute hour day month weekday), persisted across restarts
Tasks can target a channel user, a web session, or a working directory
Optional max run count with auto-cleanup
Manageable via the Web UI or by asking the agent directly

Todo

Agents can create, complete, and query todo tasks
Web UI provides a Todo management page

Ask

Agents can pause mid-task and ask the user structured questions
Supported question types: single-select, multi-select, text input
Works across Web UI and Lark; the agent resumes automatically after the user responds

Keywords

ai agent self-hosted llm server open source mcp model context protocol multi-agent react agent openai claude anthropic ollama chatbot lark feishu long-term memory vector search typescript node.js

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

sbot — Self-hosted AI Agent Server

Quick Start

npm

Docker

Features

Usage Guide

Add MCP Tools

Manage Skills

Customize Prompts

Channel Setup

Built-in Tools

Keywords