# hajimi-claw

v0.1.4

Single-user Telegram/Feishu-first ops agent in Rust.
## Current scope

- Telegram channel -> gateway -> runtime command flow
- Feishu webhook channel -> gateway -> runtime command flow
- Telegram long-polling command surface
- Single active task gate
- Structured tools for file access, Docker, and systemd
- Guarded local command execution with approval mode, short-lived elevated mode, and full elevated mode
- SQLite audit/task/session persistence
- Windows-safe execution mode with allowlist checks and Job Object cleanup
- Channel-aware onboarding for Telegram or Feishu plus provider/model setup
- Configurable multi-agent orchestration with coordinator/worker/integrator flow
- Executable skills registered as first-class tools with shared approval, audit, and persistence
- Capability inventory and MCP server status surfaced in chat via `/capabilities`, `/skills`, and `/mcp`
## Running

- Run `cargo run -- onboard` or `hajimi onboard`. `hajimi onboard` lets you choose `telegram`, `feishu`, or `skip`, verifies the channel credentials when configured, and then interactively guides provider/model selection.
- Start the daemon with `cargo run` or `hajimi`.
- In Telegram, use `/menu` for the main quick-action panel, or just send plain text and let hajimi reply in natural language.
- In Feishu mode, configure the app's event subscription URL to point at `http(s)://<your-host><event_path>` for the local webhook listener.

Set `HAJIMI_CLAW_CONFIG` if you want to load a different config path.

For background use without keeping a terminal open:

```
hajimi launch
hajimi status
hajimi stop
```
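As an example of overriding the config path before launching the daemon (the path shown is illustrative, not a default):

```shell
# Point hajimi-claw at a non-default config file.
# The path below is an example only; use your own config.toml location.
export HAJIMI_CLAW_CONFIG="$HOME/ops/hajimi/config.toml"
# hajimi launch   # would now load the config from HAJIMI_CLAW_CONFIG
echo "$HAJIMI_CLAW_CONFIG"
```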
## Install

### Windows

- Build and install to a user directory in `PATH`:

  ```
  powershell -ExecutionPolicy Bypass -File .\scripts\install.ps1
  ```

- System-wide install:

  ```
  powershell -ExecutionPolicy Bypass -File .\scripts\install.ps1 -System
  ```

The installer copies `hajimi-claw.exe`, creates the `hajimi.exe` alias, copies `config.example.toml`, and updates `PATH`.
### Linux

- User install: `sh ./scripts/install.sh`
- System-wide install: `sudo sh ./scripts/install.sh --system`
- System-wide install with systemd unit: `sudo sh ./scripts/install.sh --system --install-service`

The Linux installer copies the binary to `PREFIX/bin/hajimi-claw`, creates the `PREFIX/bin/hajimi` alias, and copies `config.example.toml` to `PREFIX/share/hajimi-claw`.
### npm

- Global install from the registry: `npm install -g hajimi-claw`
- Global install from the repo: `npm install -g .`

The npm package exposes both `hajimi` and `hajimi-claw`, but it no longer compiles Rust during install. Instead:

- The root package is a small launcher package
- Platform-specific binaries are published as optional npm packages
- `npm install -g hajimi-claw` installs the matching prebuilt binary package when one is available

Supported prebuilt package names:

- `hajimi-claw-win32-x64-msvc`
- `hajimi-claw-win32-arm64-msvc`
- `hajimi-claw-linux-x64-gnu`
- `hajimi-claw-linux-arm64-gnu`
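A rough sketch of how such a launcher manifest might wire this up via `optionalDependencies` — the `bin` file names and the exact versions here are illustrative, not the published `package.json`:

```json
{
  "name": "hajimi-claw",
  "bin": {
    "hajimi": "bin/launch.js",
    "hajimi-claw": "bin/launch.js"
  },
  "optionalDependencies": {
    "hajimi-claw-win32-x64-msvc": "0.1.4",
    "hajimi-claw-win32-arm64-msvc": "0.1.4",
    "hajimi-claw-linux-x64-gnu": "0.1.4",
    "hajimi-claw-linux-arm64-gnu": "0.1.4"
  }
}
```

npm skips optional dependencies whose `os`/`cpu` fields do not match the host, which is how only the matching prebuilt binary package gets installed.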
For local testing from the repo, stage a binary into the local platform package first:

```
cargo build --release
npm run stage:binary
npm install -g .
```

For CI or release staging, you can target a specific package explicitly:

```
node scripts/stage-prebuilt.js win32-x64-msvc path/to/hajimi-claw.exe
node scripts/stage-prebuilt.js linux-x64-gnu path/to/hajimi-claw
```

Recommended publish order:

- Build each platform binary in CI
- Stage it into the matching `npm/platforms/<target>/bin/`
- Publish each platform package
- Publish the root `hajimi-claw` package last
Current GitHub Actions automation is wired for:

- `hajimi-claw-win32-x64-msvc`
- `hajimi-claw-linux-x64-gnu`
The arm64 package skeletons are present in the repo, but they are not yet part of the automated publish matrix.
## npm Versioning

For npm releases, the root `package.json` is the single version source.
When you change `package.json.version`, run:

```
npm run sync:npm-versions
```

That script updates:

- the root package `optionalDependencies`
- every platform package version under `npm/platforms/*/package.json`
The release workflow also runs this sync step automatically, so you do not need to edit platform package versions by hand.
## Channels

- `telegram`
- `feishu`

`hajimi onboard` now lets you pick the primary channel:

- `telegram`: verifies the bot token and can auto-pair the admin user/chat
- `feishu`: asks for `app_id` and `app_secret`, verifies them by requesting a tenant access token, and supports both `webhook` and `long-connection` modes
- `skip`: leaves channel setup empty for now so you can finish provider/model setup first

Current Feishu limitations:

- In `webhook` mode, you still need to configure the Feishu event subscription URL manually in the Feishu developer console
- In `long-connection` mode, message receive events can come through the WebSocket connection, but card button callbacks still need the lightweight HTTP callback path
- Feishu card support currently targets `menu`/`provider`/`model` style button flows; it is not a full-parity implementation of Telegram inline interactions
## CLI

- `hajimi`
- `hajimi ask <prompt>`
- `hajimi tasks`
- `hajimi approvals`
- `hajimi approve <request-id>`
- `hajimi shell open [name]`
- `hajimi shell status <session-id>`
- `hajimi shell exec <session-id> <command>`
- `hajimi shell close <session-id>`
- `hajimi profile show`
- `hajimi profile use <ops-safe|dev-agent|computer-use>`
- `hajimi daemon`
- `hajimi launch`
- `hajimi stop`
- `hajimi status`
- `hajimi onboard`
- `hajimi providers`
- `hajimi provider current`
- `hajimi provider use <provider-id>`
- `hajimi provider models [provider-id]`
- `hajimi provider set-model <provider-id> <model>`
- `hajimi model current`
- `hajimi model use [model]`
- `hajimi models [provider-id]`
- `hajimi restart`
- `hajimi help`

`hajimi ask` now records task state and tool invocations in SQLite. If a guarded command blocks on approval, `hajimi approve <request-id>` resumes the blocked task instead of asking you to rerun it.
## Skills and MCP

Hajimi now treats native tools, executable skills, and MCP-discovered tools as one capability surface.

- Native tools keep their existing names such as `read_file` or `exec_once`
- When `[telegram].bot_token` is configured, a native `telegram_api` tool is also registered
- Executable skills are configured in `[skills]` and are registered as tool names like `skill.deploy`
- MCP tools will be exposed as namespaced tool names like `mcp.<server>.<tool>`
- All three flow through the same runtime path for approval, audit logging, task persistence, and model tool-calling

`skills.md` remains prompt guidance only. Use it for playbooks, habits, and routing hints. It is not the executable source of truth for runnable skills.
## Config

`config.example.toml` now includes these sections:

```toml
[skills]
enabled = true
directory = "./skills"
manifest_paths = []
entries = []

[mcp]
enabled = true
servers = []
```

Skill manifests and inline entries deserialize into `ExecutableSkillConfig`, including `name`, `description`, `command`, `args`, `cwd`, `env_allowlist`, `requires_approval`, `timeout_secs`, `max_output_bytes`, and `input_schema`.

Relative paths in skill manifests, `skills.directory`, `skills.manifest_paths`, and MCP server `cwd` values resolve relative to the config file.
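As an illustration, an inline skill entry using those `ExecutableSkillConfig` fields might look like this — the skill itself, its script path, and the array-of-tables shape are hypothetical, not taken from `config.example.toml`:

```toml
# Hypothetical inline skill entry; field names follow ExecutableSkillConfig
[[skills.entries]]
name = "deploy"                    # registered as tool name `skill.deploy`
description = "Deploy the staging stack"
command = "./scripts/deploy.sh"    # relative path, resolved against the config file
args = ["--env", "staging"]
cwd = "."
env_allowlist = ["PATH", "HOME"]
requires_approval = true           # goes through the same approval gate as exec
timeout_secs = 120
max_output_bytes = 65536
```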
## Telegram capability commands

- `/capabilities` — list the effective native tools, executable skills, and MCP tools
- `/skills` — list executable skills only
- `/skill run <name> <json-or-text>` — explicitly invoke one configured skill through the runtime
- `/mcp` — show configured MCP server status
- `/mcp tools [server]` — list discovered MCP tools, optionally filtered by server

Natural-language requests stay primary. Once a skill or MCP tool is registered in the runtime, the model can choose it during normal `/ask` or plain-text requests just like built-in tools.

`telegram_api` calls the Telegram Bot API directly from the native tool layer, using the configured bot token. A typical invocation looks like:

```json
{
  "method": "sendMessage",
  "params": {
    "text": "deploy finished"
  },
  "use_default_chat_id": true
}
```

## Multi-Agent
hajimi can split one natural-language request into multiple sub-agents. This is configured in `config.toml` under `[multi_agent]`:

```toml
[multi_agent]
enabled = true
auto_delegate = false
default_workers = 3
max_workers = 8
worker_timeout_secs = 90
max_context_chars_per_worker = 24000
```

Natural-language triggers:

- `use 4 agents to analyze this`
- `开 6 个 agent 帮我排查` ("spin up 6 agents to help me troubleshoot")
- `please use sub agents for this`
Behavior:

- If the prompt explicitly asks for `N agents` or `N workers`, hajimi uses that count up to `max_workers`
- If the prompt says `sub agent`/`multi agent`, hajimi uses `default_workers`
- `auto_delegate = true` enables light automatic delegation for obviously parallel analysis tasks
- Worker agents only perform parallel reasoning in v1; local command execution still stays behind the existing single runtime/policy path
- Session-level controls are available in chat: `/agents on`, `/agents off`, `/agents auto`, `/agents status`
- When a natural-language task is about to run in multi-agent mode, Telegram now shows a temporary progress card and Feishu sends a status card before the final reply
## Persona Files

`hajimi onboard` seeds persona files in `~/.hajimi/`:

- `identity.md`
- `soul.md`
- `agents.md`
- `tools.md`
- `skills.md`
- `heartbeat.md`

hajimi reloads persona prompt files on each request, so edits take effect on the next `/ask`.

Layered prompt order:

- base system prompt
- `identity.md`
- `soul.md`
- `agents.md`/`AGENTS.md`/`tools.md`/`skills.md`
- runtime overlays such as shell-session metadata and multi-agent role instructions

Auto-discovery behavior:

- Auto-discovered files: `identity.md`, `soul.md`, `agents.md`, `AGENTS.md`, `tools.md`, `skills.md`
- Search roots and precedence: `persona.directory` -> config directory -> current working directory
- `identity.md` and `soul.md` support optional front matter, but plain markdown still works
- Structured `identity.md`/`soul.md` fields override by precedence while freeform notes accumulate
- Extension files stay additive in precedence order
- `heartbeat.md` is runtime config only and is never appended into the prompt
- Optional explicit list: set `[persona].prompt_files` in `config.toml`

Use these files for:

- `identity.md`: user profile, owned systems, environments, durable preferences, and hard constraints
- `soul.md`: Hajimi's stable role, tone, style, and behavioral stance
- `agents.md` or `AGENTS.md`: repo or operator instructions
- `tools.md`: tool-use policy and operational preferences
- `skills.md`: extra habits, playbooks, and skill-selection hints for the prompt layer only
- `heartbeat.md`: daemon heartbeat runtime config
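If you prefer an explicit list over auto-discovery, a `[persona]` section along these lines should work — `directory` and `prompt_files` are the keys referenced above, while the exact values are examples only:

```toml
# Illustrative [persona] fragment; values are examples, not defaults
[persona]
directory = "~/.hajimi"
prompt_files = ["identity.md", "soul.md", "agents.md", "tools.md", "skills.md"]
```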
## Telegram commands

- `/onboard`
- `/onboard cancel`
- `/provider list`
- `/provider add`
- `/provider current`
- `/provider use <id>`
- `/provider bind <id>`
- `/provider test [id]`
- `/provider models [id]`
- `/provider set-model <provider-id> <model>`
- `/model current`
- `/model use [model]`
- `/persona list`
- `/persona guide`
- `/persona read <identity|heartbeat|soul|agents|tools|skills>`
- `/persona write <file> <content>`
- `/persona append <file> <content>`
- `/ask <text>`
- `/capabilities`
- `/skills`
- `/skill run <name> <json-or-text>`
- `/mcp`
- `/mcp tools [server]`
- `/shell open [name]`
- `/shell exec <cmd>`
- `/shell close`
- `/status`
- `/menu`
- `/elevated on`
- `/elevated off`
- `/elevated ask`
- `/elevated full`

Plain Telegram text now defaults to a natural-language task, so `/ask` is optional for normal requests. hajimi sends a Telegram typing action, posts a short placeholder, and then edits that message with the final answer.

The bot also registers Telegram slash commands with `setMyCommands` on startup and exposes an inline quick-action menu via `/menu` and `/help`. `/provider current` and `/model current` now return inline buttons so you can switch provider/model directly from Telegram without typing full commands.
## Provider And Model Switching

- Add a new provider: `hajimi onboard` or Telegram `/provider add`
- List configured providers: `hajimi providers` or Telegram `/provider list`
- Switch the default provider: `hajimi provider use <provider-id>` or Telegram `/provider use <id>`
- See the current model: `hajimi model current` or Telegram `/model current`
- Switch the current model on the active provider: `hajimi model use [model]` or Telegram `/model use [model]`
- Open the model picker with inline buttons: `hajimi model use` or Telegram `/model use`
- Switch a specific provider to a specific model: `hajimi provider set-model <provider-id> <model>` or Telegram `/provider set-model <provider-id> <model>`

`hajimi onboard` now supports optional fallback models per provider; if the primary model fails, hajimi retries the provider using the configured fallback models in order.
