uctm

v1.5.4

Published

4 months ago

Universal Claude Task Manager — SDD-based task pipeline subagent system for Claude Code CLI

0High
0Medium
0Low

uc_jung

claude claude-code subagent task-manager sdd ai-agent pipeline specification-driven-development

uc-taskmanager

Requirements Analysis & Development 6-Agent Full Pipeline + DAG-Based Orchestration + Sliding Window Context Management

Universal Claude Task Manager — A WORK-PIPELINE Agent that executes SDD (Specification-Driven Development) for Claude Code. It formalizes user requirements into specifications, builds execution plans (WORK) from those specs, decomposes them into small tasks (TASK) and analyzes dependency graphs (DAG), then automatically executes TASKs sequentially or in parallel based on dependencies.

Available as an npm CLI (uctm). Install once, use []-tagged requests to trigger automated multi-agent pipelines.

한국어 문서 (Korean)

Quick Start

npm CLI

npm install -g uctm
cd your-project
uctm init --lang en   # English agents
uctm init --lang ko   # 한국어 에이전트 (Korean — npm only)
uctm init             # Interactive language selection

uctm init automatically configures Bash permissions for agents in settings.local.json and installs .claude-plugin and skills/ resources. No prompts required — the pipeline runs immediately after init.

Start Using

Once installed, start Claude Code and use pipeline tags:

claude
> [new-feature] Add a hello world feature

Since uctm init automatically sets up permissions, the pipeline runs without additional prompts. If you need to bypass permissions in a CI/isolated environment:

claude --dangerously-skip-permissions

Warning: Only use bypass mode in isolated environments or when you trust the pipeline fully. See Claude Code Permissions for details.

The agents analyze your request, plan the work, and execute through isolated subagent pipelines.

What Makes This Pipeline Different

1. Procedures Must Be Executed Properly and Recorded

Rather than focusing on writing better code (like TDD or DDD), this agent focuses on executing development procedures properly.
It systematizes the full pipeline: Requirement (user) → Specification → WORK Plan → Per-TASK Execution Plan → Per-TASK Execution/Verification/Completion → Per-TASK Result Storage (WORK PIPELINE)
The result: end-to-end records from requirement to delivery, providing full traceability.

How to work with this AI Agent:

Start: Give a prompt starting with [] to trigger the WORK-PIPELINE

[game-dev] Build a brick breaker game in HTML

Requirement Analysis: The agent analyzes your requirement and asks for approval. Review works/WORK-NN/Requirement.md and type "approve" to proceed.

{Requirement specification content}
If you approve Requirement.md, I will call the Planner to create PLAN.md + TASK decomposition.
Let me know if you want to modify anything.

WORK Execution Plan: The agent builds an execution plan and asks again. Review works/WORK-NN/PLAN.md and TASK-NN.md, then type "approve" to proceed.

WORK-31 Development Approval Request

  Project folder structure reorganization ~~~~~~~ / ########

  ┌─────────┬─────────────────────────┐
  │  Item   │        Details          │
  ├─────────┼─────────────────────────┤
  │ Mode    │ full                    │
  ├─────────┼─────────────────────────┤
  │ TASKs   │ 6 (TASK-00 ~ TASK-05)  │
  └─────────┴─────────────────────────┘

  DAG Structure

  TASK-00 (move agents/ en files → en/ subdirectory)
     ├─→ TASK-01 (~~~~~~~~~~~~) ─→ TASK-03 (#########)
     ├─→ TASK-02 (create plugin/) ─→ TASK-04 (????????)
     └─────────────────────────────────→ TASK-05 ($$$$$$$$$)

  - TASK-01/02 parallel, TASK-03/04 parallel, TASK-05 final integration
  - If approved, scheduler → builder → verifier → committer pipeline will execute.

  Proceed?

Per-TASK: build → verify → commit repeats automatically for each TASK.

● TASK-05 committed. Updating PROGRESS.md and finalizing WORK-31.

When TASKs complete, verify via works/WORK-NN/TASK-NN_result.md and actual testing.

  push, merge

Want to rollback? Type WORK-NN rollback. Commit hashes are stored in the files, so only that WORK's changes are reverted.

Too much ceremony for a simple button rename?

[WORK start] Change the submit button label to "Send" — auto

Add "auto" to skip all approval steps and run the entire process automatically.

2. Token Economy

I'm cost-conscious (honestly). So this agent applies four token-saving strategies:

(1) Serena MCP for codebase analysis. The agent prioritizes Serena MCP for code exploration — reading symbols instead of entire files. (Huge thanks to the Serena team.)

(2) Three execution modes + spawn consolidation to minimize subagent overhead. The WORK-PIPELINE has 6 agent stages running sequentially. For a single-TASK WORK, that's 6 subagent sessions — each consuming tokens just to boot up. Wasteful. So the specifier agent decides the execution mode based on complexity: direct mode uses only 3 agent calls (specifier → builder → verifier+committer), skipping planner and scheduler. On top of this, related agents are combined into a single spawn: specifier+planner run together, and verifier+committer run together — reducing total spawns by ~30% (6 TASKs: 20 → 14 spawns). See Three Execution Modes.

(3) Structured XML communication. Subagents can't nest — Main Claude orchestrates everything.

When one agent finishes and the next agent starts, Main Claude sits in between, causing data to be transmitted twice. This communication is a blob of text —
The receiving side has to parse it again. So we standardized the communication format as XML.
Every bit helps.
(This also made agent log monitoring much easier.) See Structured Agent Communication.

(4) Sliding Window Context Transfer. Agent A finishes and tells B what it did. B finishes and tells C what A did plus what B did. But does C really need A's full details? So B passes its own work in full to C, and summarizes A's work. One degree of separation = just a name and phone number — you don't need to know their personality. If curious, just ask them directly. Testing shows ~20-30% token savings. See Sliding Window Context Transfer.

"Why not skip agents entirely and do everything in one session?" See Context Isolation. In long sessions, AI gradually loses coherence — like sudden memory loss mid-conversation. Strict context isolation prevents this and directly impacts output quality.

3. Dependency-Aware Parallel Execution

TASKs within a WORK have dependency management via DAG. Parallel execution only happens when TASKs have no mutual dependencies — meaning no source code conflicts from concurrent edits.

I've also built a requirement management system that integrates with this pipeline. It manages requirements per project. Queue up requirements before bed, and by morning they're all developed — your to-do list just shifts to reviewing instead of coding. WORKs execute in parallel across projects too (cross-project dependencies don't exist).

What's Next

Currently designing a RAG-based system to store accumulated artifacts and query similar past requirements during specification analysis — for faster and more accurate requirement decomposition. (If enough data accumulates, who knows — maybe a fine-tuned LLM behind an MCP someday.)

Tip for prompting AI agents: Think of it like SQL WHERE clause ordering (developers only). The first condition should narrow the dataset the most — and if it hits an index, even better. That's why I maintain a glossary with terms and source code entry points, and have the agent reference it. My tokens are precious.

Six subagents work across any project and any language, automatically handling request routing → task decomposition → dependency management → code implementation → verification → commit.

"[new-feature] Build a user authentication feature"
→ specifier decides WORK, planner creates WORK-01 with 5 TASKs, pipeline executes

Usage

Trivial Fix (direct mode)

> [bugfix] Fix typo in login error message

Main Claude calls specifier, which selects execution-mode: direct and returns a dispatch XML. Main Claude then calls builder (implements the change) and committer (commits). Creates WORK-NN directory + PLAN + result.md + commit.

Quick Task (pipeline mode)

> [bugfix] Fix the login button not responding on mobile

Main Claude calls specifier, which selects execution-mode: pipeline and creates PLAN. Then Main Claude calls builder → verifier → committer in sequence.

Complex Feature (WORK)

1. Create WORK (Planning)

> [new-feature] Build a user authentication feature. Plan it.

The planner analyzes the project and creates WORK-01:

WORK-01: User Authentication

  WORK-01: TASK-00: Project initialization        ← no dependencies
  WORK-01: TASK-01: DB schema design              ← TASK-00
  WORK-01: TASK-02: JWT auth API                  ← TASK-01
  WORK-01: TASK-03: User CRUD                     ← TASK-02
  WORK-01: TASK-04: Tests + documentation         ← TASK-03

  Do you approve this plan?

2. Execute WORK

> Run WORK-01 pipeline

The scheduler executes WORK-01's TASKs in dependency order.

3. Add to Existing WORK

If WORK-01 is IN_PROGRESS, the specifier asks:

"WORK-01 (User Authentication) is in progress. Add as a new TASK or create a new WORK?"

4. Check Status

> WORK list

WORK Status
   WORK-01: User Authentication    ✅ 5/5 completed
   WORK-02: Payment Integration    🔄 2/4 in progress
   WORK-03: Admin Dashboard        ⬜ 0/6 pending

5. Auto Mode / Resume

> Run WORK-02 automatically
> Resume WORK-02

6. Run a Specific TASK

Skip to a specific TASK within a WORK (e.g., retry after a failure):

> Run WORK-02: TASK-02

The scheduler returns the next TASK, then Main Claude calls builder → verifier → committer in sequence.

7. Force WORK Creation (Skip Complexity Check)

Use the [new-work] tag to always create a new WORK regardless of complexity:

> [new-work] Refactor the auth module

8. Handle Failure / Retry

If a TASK fails during the pipeline, the scheduler retries up to 3 times automatically. If it still fails, you can inspect the result file and retry manually:

> WORK-02: TASK-01 failed. Retry it.

Or fix the issue and re-run:

> Fix the issue in src/auth.ts, then retry WORK-02: TASK-01

9. Add a TASK to an In-Progress WORK

> [enhancement] Add rate limiting to the auth API

If WORK-02 is IN_PROGRESS, the specifier asks:

"WORK-02 (Auth Module) is in progress. Add as a new TASK, or create a new WORK?"

10. Check Individual TASK Status

> Show WORK-02 progress
> What's the status of WORK-03: TASK-02?

The scheduler reads PROGRESS.md and result.md files to report current state.

The `[]` Tag System

Prefix your request with a [] tag to trigger the pipeline:

| Tag | Meaning | |-----|---------| | [new-feature] | New feature | | [enhancement] | Enhancement | | [bugfix] | Bug fix | | [new-work] | Always create new WORK (skip complexity check) |

No [] tag = handled directly without pipeline.

To register this rule in your project, add the following to your CLAUDE.md:

## Agent 호출 규칙

`[]` 태그로 시작하는 요청 → specifier 에이전트 호출 (WORK 파이프라인 시작)

This ensures Claude automatically delegates []-tagged requests to the specifier agent without manual invocation.

Installation

npm CLI

npm install -g uctm

# Per-project (copies agents + config + updates CLAUDE.md)
cd your-project
uctm init --lang en          # English agents
uctm init --lang ko          # 한국어 에이전트
uctm init                    # Interactive language selection

# Global (copies agents to ~/.claude/agents/)
uctm init --global --lang en

# Update agents after upgrading uctm (--lang required)
uctm update --lang en

Manual

git clone https://github.com/UCJung/uc-taskmanager-claude-agent.git /tmp/uc-tm
mkdir -p .claude/agents
cp /tmp/uc-tm/npm/agents/*.md .claude/agents/
rm -rf /tmp/uc-tm
git add .claude/agents/ && git commit -m "chore: add uc-taskmanager agents"

Local Plugin Test

# Test plugin locally
claude --plugin-dir ./

Verify

claude
> /agents
# specifier, planner, scheduler, builder, verifier, committer → confirm all 6

Concept: Three Execution Modes

Main Claude detects the [] tag and calls the specifier subagent, which selects one of three execution-mode values:

User Request → Main Claude (orchestrator)
                    │
                    ▼
              ┌───────────┐
              │ specifier │ (called by Main Claude)
              └─────┬─────┘
                    │
              execution-mode returned
                    │
      ├─ direct  (no build/test required)
      │   → specifier returns dispatch XML → Main Claude calls builder → [verifier+committer]
      │
      ├─ pipeline  (build/test required, single domain, sequential)
      │   → Main Claude calls: [specifier+planner] → builder → [verifier+committer]
      │
      └─ full  (multi-domain / complex DAG / new module / 5+ tasks)
          → Main Claude calls: [specifier+planner] → scheduler → [builder → verifier+committer] × N

Sub-agent Spawn Count by Mode:

| Mode | Spawns | Agents | |------|--------|--------| | direct | 3 | specifier + builder + verifier+committer | | pipeline | 3 | specifier+planner + builder + verifier+committer | | full (N TASKs) | 2+2N | specifier+planner + scheduler + [builder + verifier+committer] × N |

Compared to the previous approach (separate spawns): 6 TASKs: 20 → 14 spawns (-30%).

All three modes output to works/WORK-NN/ and guarantee result.md + COMMITTER DONE callback.

WORK (Multi-Task, full mode)

A two-level hierarchy for complex features:

WORK (unit of work)       A single goal. The unit requested by the user.
└── TASK (unit of task)   An individual execution unit to achieve the WORK.
    └── result            Completion proof. Auto-generated after verification.

pipeline mode (Single Task, Delegated)

Subagent-delegated path for moderate single tasks. specifier+planner run in a single spawn, then builder, then verifier+committer.

Main Claude → [specifier+planner](opus) → builder(sonnet) → [verifier+committer](haiku)
              (each group called as a single spawn by Main Claude)

direct mode (Trivial)

Main Claude calls specifier, which determines direct mode and returns a dispatch XML. Main Claude then calls builder (implements the change) and verifier+committer (verifies and commits) as a single spawn.

Main Claude → specifier: Analyze → return dispatch XML → [back to Main Claude]
Main Claude → builder: Implement → Self-check → [back to Main Claude]
Main Claude → [verifier+committer]: Verify → Commit → result.md

Pipeline

WORK Pipeline (Complex)

Subagents cannot nest — Main Claude (CLI terminal) orchestrates every call.

                               Main Claude (orchestrator)
                    ┌─────────────┼──────────────────────┐
                    │             │                       │
  specifier        planner          scheduler         builder          verifier         committer
 ┌──────────┐    ┌─────────┐     ┌──────────┐     ┌──────────┐     ┌──────────┐     ┌──────────┐
 │Request   │────▶│Create   │────▶│Dependency│────▶│Code      │────▶│Build/Test│────▶│Result    │
 │Analysis  │     │WORK/TASK│     │DAG+Order │     │Implement │     │Verify    │     │→ git     │
 └──────────┘    └─────────┘     └──────────┘     └────┬─────┘     └────┬─────┘     └────┬─────┘
                                                        │                │                │
                                                        └── Retry on fail┘                │
                                                           (max 3 times)                  │
                                                                          Next TASK loop ◀┘

pipeline mode (Simple → Delegated)

  specifier+planner    builder       verifier+committer
 ┌──────────────┐     ┌──────────┐     ┌────────────────┐
 │PLAN          │────▶│Code      │────▶│Build/Test      │
 │+TASK         │     │Implement │     │Verify→ git     │
 └──────────────┘     └──────────┘     └────────────────┘
   (opus, 1 spawn)     (sonnet)         (haiku, 1 spawn)
              ← each group called by Main Claude →

direct mode (Trivial)

  specifier        builder                    verifier+committer
 ┌──────────┐     ┌──────────────────────┐     ┌──────────────┐
 │ Analyze  │────▶│ Implement → Self-chk │────▶│Verify→Commit │
 │ dispatch │     └──────────────────────┘     │→ result      │
 └──────────┘      (no build/test required)    └──────────────┘

Agents

Six agents work together in a clean, isolated pipeline:

| Agent | Role | Model | Permission | MCP | Spawn | |-------|------|-------|------------|-----|-------| | specifier | [] tag detection, execution-mode selection (direct/pipeline/full), PLAN creation, WORK-LIST management, returns dispatch XML for all modes | opus | read + dispatch | Serena (codebase exploration), sequential-thinking (complexity check) | combined with planner (pipeline/full) | | planner | Create WORK + decompose TASKs + generate PLAN.md (full mode) + pre-create progress templates | opus | read-only | Serena (codebase exploration), sequential-thinking (task decomposition) | combined with specifier | | scheduler | Manage DAG for a specific WORK + run pipeline with sliding window context | haiku | read + dispatch | — | standalone (full mode only) | | builder | Code implementation + progress.md checkpoint recording | sonnet | full access | Serena (symbol-level explore/edit) | standalone | | verifier | Progress gate (Status=COMPLETED) → build/lint/test verification (read-only) | haiku | read + execute | — | combined with committer | | committer | Gate check (progress.md) → write result.md → git commit → COMMITTER DONE callback | haiku | read + write + git | — | combined with verifier |

Support Files (included in Plugin)

In addition to the 6 pipeline agents, the plugin includes 6 support files that agents reference at startup. These are located in plugin/skills/sdd-pipeline/references/ (synced from develop/references/):

| File | Purpose | |------|---------| | agent-flow.md | Pipeline orchestration rules — how Main Claude calls each agent in sequence | | file-content-schema.md | Single source of truth for all file formats (PLAN.md, TASK.md, progress.md, result.md) | | shared-prompt-sections.md | Shared prompt sections with cache_control — reduces repeated token cost up to 90% | | context-policy.md | Sliding window context transfer rules between agents | | work-activity-log.md | Activity log format for builder stage tracking | | xml-schema.md | XML communication format for dispatch and task-result messages |

File Structure

works/
├── WORK-LIST.md                      ← Master list of all WORKs (managed by specifier)
├── WORK-01/                          ← "User Authentication"
│   ├── PLAN.md                       ← Plan + dependency graph
│   ├── PROGRESS.md                   ← Progress tracking (auto-updated)
│   ├── TASK-00.md                    ← Task specification
│   ├── TASK-00_progress.md           ← Real-time checkpoint (builder writes)
│   ├── TASK-00_result.md             ← Completion report (committer writes)
│   ├── TASK-01.md
│   └── ...
└── WORK-02/
    └── ...

File Naming Convention

| File | Naming Rule | |------|-------------| | Task spec | TASK-NN.md (no prefix) | | Progress checkpoint | TASK-NN_progress.md (underscore separator) | | Completion report | TASK-NN_result.md | | Plan | PLAN.md | | Work progress | PROGRESS.md |

WORK-LIST.md

The specifier maintains works/WORK-LIST.md as the master index:

| WORK ID | Title | Status | Created | |---------|-------|--------|---------| | WORK-01 | User Authentication | DONE | 2026-03-01 | 2026-03-01 | | WORK-02 | Payment Integration | IN_PROGRESS | 2026-03-05 | |

| Status | Meaning | |--------|---------| | IN_PROGRESS | WORK created, TASKs in progress | | DONE | All TASKs committed — set automatically by committer on last TASK | | COMPLETED | Archived to _COMPLETED/ — set during push procedure |

IN_PROGRESS: specifier checks this before creating new WORKs
DONE: committer automatically changes IN_PROGRESS → DONE when the last TASK completes
COMPLETED: during push, DONE WORKs are batch-processed — rows removed from WORK-LIST and folders moved to works/_COMPLETED/

git push Procedure

When you ask Claude to push ("push this", "git push"), Claude handles the full sequence automatically:

1. Agent sync — copy develop/ source to npm/ and plugin/
2. DONE WORK batch completion — remove DONE rows from WORK-LIST, move folders to _COMPLETED/
3. Check README.md — update if changes are missing
4. git push

DONE is set by committer when the last TASK completes. COMPLETED happens at push time when DONE WORKs are archived to _COMPLETED/.

Tips

Keep CLAUDE.md Up to Date

The language setting and project context live in CLAUDE.md. Agents read this on every invocation — keeping it accurate reduces back-and-forth.

Use `[]` Tags Consistently

Requests without [] tags are handled directly by Claude without routing. If you want guaranteed pipeline behavior, always use a tag.

Parallel TASKs

The planner creates dependency-aware TASK graphs. Independent TASKs (same blockedBy set) can be dispatched concurrently by the scheduler — mention it when approving:

> Approve. Run independent tasks in parallel.

Resume After Context Reset

If Claude loses context mid-pipeline, you can always resume:

> Resume WORK-02 from where it stopped

The scheduler reads PROGRESS.md to determine the last completed TASK and continues.

Example Session

User: [new-feature] Build a comment feature for the blog system.

Claude: [specifier → WORK path]
  Complexity: 4+ files, DB schema change, multiple modules
  → Creating new WORK

Claude: [planner]
  Project analysis
     Tech Stack: Next.js + Prisma + PostgreSQL
     Existing code: Post CRUD done, Auth done

  WORK-03: Comment Feature

  WORK-03: TASK-00: Comment model + migration          ← no dependencies
  WORK-03: TASK-01: Comment CRUD API                   ← TASK-00
  WORK-03: TASK-02: Nested comments (self-relation)    ← TASK-01
  WORK-03: TASK-03: Frontend comment component         ← TASK-00 (parallelizable)
  WORK-03: TASK-04: Integration + notifications        ← TASK-02, TASK-03

  Do you approve?

User: Approve. Run automatically.

Claude: [scheduler → auto mode]
  WORK-03: TASK-00 → builder → verifier ✅ → committer [a1b2c3d]
  WORK-03: TASK-01 → builder → verifier ✅ → committer [d4e5f6g]
  WORK-03: TASK-02 → builder → verifier ✅ → committer [h7i8j9k]
  WORK-03: TASK-03 → builder → verifier ✅ → committer [l0m1n2o]
  WORK-03: TASK-04 → builder → verifier ✅ → committer [p3q4r5s]

  🎉 WORK-03 completed! 5 tasks, 5 commits

Why This Approach?

Agent File Design

All agent files (agents/*.md) are written with a single principle: core content only, no decoration. Descriptions, emphasis markers, and redundant examples have been removed. The result is ~1,600 lines total across all agents — less than half the original size — while covering the same functional scope.

Agent prompts avoid pipe (|) commands in Bash sequences for cross-platform compatibility (Windows, Linux, macOS). Each command runs as a single call without shell pipelines.

Each agent file follows a consistent four-section structure:

## 1. 역할 (Role)
   Agent's purpose and responsibility declaration.
   Single paragraph stating what the agent is and what it owns.

## 2. 수행업무 (Responsibilities)
   Flat table of owned tasks.
   | 업무 (Task) | 설명 (Description) |

## 3. 업무수행단계 및 내용 (Execution Steps)
   Step-by-step procedure for each task listed in § 2.
   Always starts with a STARTUP block listing required files to read on boot.
   References file formats via file-content-schema.md (single source of truth).
   References inter-agent communication via xml-schema.md.

## 4. 제약사항 및 금지사항 (Constraints and Prohibitions)
   Immutable rules the agent must always follow.
   Written as a flat prohibition/constraint list.

file-content-schema.md is the single authoritative definition for all file formats (PLAN.md, TASK.md, progress.md, result.md). Agents reference it instead of embedding format specs inline — eliminating duplication across 6 agent files.

ref-cache: Reference File Caching

Each agent reads 4-5 shared reference files (shared-prompt-sections.md, file-content-schema.md, xml-schema.md, etc.) on startup — totaling ~26 file reads across a full pipeline. ref-cache eliminates this redundancy:

First agent reads reference files normally and returns <ref-cache> in its result XML
Main Claude copies ref-cache into the next agent's dispatch
Subsequent agents use cached content instead of reading files from disk

Phase 2 (selective delivery) further reduces token usage by passing only the sections each agent needs — not the full file contents. The section mapping per agent is defined in agent-flow.md.

Measured impact (direct mode, 3 agents):

File reads: 14 → 5 (-64%)
Token usage: ~85K → ~72K (-15%)

WORK ID Assignment Strategy

WORK IDs are assigned based on a filesystem-first approach:

Filesystem Source: The planner scans works/ directory to find existing WORK directories and determines the next WORK ID based on the latest directory found.
MEMORY.md NOT used: Project memory (MEMORY.md) is never referenced for WORK numbering. Only the filesystem is the authoritative source.
Consistency Check: The specifier validates WORK ID consistency by checking both the filesystem and WORK-LIST.md before dispatching to the planner.

This ensures:

No duplicate WORK IDs even if MEMORY.md is stale or corrupted
Reliable resumption across sessions
Clear traceability: WORK-NN directly corresponds to works/WORK-NN/

Context Isolation

Each subagent runs in an independent context. Even if the builder creates 50 files using 20,000 tokens, the scheduler only receives a 3-line summary.

scheduler's context after 5 TASKs:

  PLAN.md (loaded once)                              ~500 tokens
  WORK-01: TASK-00 result: "20 files, PASS"           ~200 tokens
  WORK-01: TASK-01 result: "15 files, PASS"           ~200 tokens
  WORK-01: TASK-02 result: "8 files, PASS"            ~200 tokens
  WORK-01: TASK-03 result: "12 files, PASS"           ~200 tokens
  WORK-01: TASK-04 result: "5 files, PASS"            ~200 tokens
  ────────────────────────────────────────
  Total: ~1,500 tokens (stays flat)

Single Session vs uc-taskmanager

| | Single Session | uc-taskmanager | |---|---|---| | Context per TASK | All code + logs stacked | Summary only (~200 tokens) | | After 10 TASKs | 50K~100K tokens, quality degrades | ~3K tokens, quality stable | | Failure recovery | Start over | Resume from last result file | | Tracking | Scroll chat history | File-based (PLAN.md, result.md) | | Verification | Manual | Automated (build/lint/test) |

Router Rule Config (`.agent/router_rule_config.json`)

The specifier reads .agent/router_rule_config.json from the project root to determine routing criteria. If the file is absent, the specifier uses its built-in defaults.

File location:

{project-root}/.agent/router_rule_config.json

JSON structure:

{
  "$schema": "http://uc-taskmanager.local/schemas/specifier-rules/v1.0.json",
  "version": "1.1.0",
  "description": "Specifier execution-mode decision criteria. Customize per project.",
  "decision_flow": [
    "1. build_test_required? → false → direct",
    "2. single_domain + sequential DAG → pipeline",
    "3. any full_conditions met → full"
  ],
  "rules": {
    "direct": {
      "criteria": {
        "build_test_required": false,
        "note": "File/line count irrelevant. If no build/test needed → direct (text edits, config changes, simple substitutions)"
      }
    },
    "pipeline": {
      "criteria": {
        "build_test_required": true,
        "single_domain_only": true,
        "max_tasks": 5,
        "dag_complexity": "sequential"
      }
    },
    "full": {
      "criteria": {
        "any_of": [
          "task_count > 5",
          "dag_complexity == complex (2+ dependency levels)",
          "multi_domain == true (BE + FE simultaneously)",
          "new_module == true (design → implement → verify multi-phase)",
          "partial_rollback_needed == true"
        ]
      }
    }
  },
  "customization_guide": {
    "doc-heavy projects (md edits)": "Widen direct scope. Most build_test_required=false cases → direct",
    "code-heavy projects": "Center on pipeline/full. Simple bug fixes → pipeline, multi-domain → full",
    "max_tasks tuning": "Adjust pipeline max_tasks between 3–7 based on team size or context limits"
  }
}

Key fields: | Field | Description | |-------|-------------| | rules.direct.criteria.build_test_required | false → specifier handles implementation, then committer commits | | rules.pipeline.criteria.max_tasks | Max task count before escalating to full (default: 5) | | rules.pipeline.criteria.dag_complexity | sequential only; complex DAG → escalates to full | | rules.full.criteria.any_of | List of conditions — any match triggers full mode |

Fallback behavior: If .agent/router_rule_config.json is absent or malformed, the specifier falls back to its built-in defaults (equivalent to the structure above).

Per-project customization example:

For a documentation-heavy project where most changes are text edits:

{
  "rules": {
    "direct": {
      "criteria": { "build_test_required": false }
    },
    "pipeline": {
      "criteria": { "max_tasks": 3, "single_domain_only": true, "dag_complexity": "sequential" }
    }
  }
}

For a monorepo with strict build requirements:

{
  "rules": {
    "pipeline": {
      "criteria": { "max_tasks": 7 }
    },
    "full": {
      "criteria": {
        "any_of": ["task_count > 7", "multi_domain == true"]
      }
    }
  }
}

Three Execution Modes

The specifier matches effort to complexity via execution-mode:

direct: 1-line typo fix — Main Claude calls specifier, which returns a dispatch XML. Main Claude then calls builder (implements) + [verifier+committer] (verifies and commits). 3 spawns total.
pipeline: Moderate fix — Main Claude calls [specifier+planner] → builder → [verifier+committer] in sequence. 3 spawns total.
full: Complex features — [specifier+planner] → scheduler → [builder + verifier+committer] × N. 2+2N spawns total.

Agent pairs are combined into a single spawn to eliminate redundant context loading. This reduces total spawns by ~30% across all modes.

All three modes output to works/WORK-NN/ with identical artifact structure (PLAN.md + result.md + COMMITTER DONE callback), ensuring Runner integration works regardless of mode.

Structured Agent Communication

Instead of ambiguous natural language prompts, agents communicate using structured XML format:

Dispatch Format (Caller → Receiver):

<dispatch to="builder" work="WORK-03" task="TASK-00" execution-mode="pipeline">
  <context>
    <project>uc-taskmanager</project>
    <language>ko</language>
    <plan-file>works/WORK-03/PLAN.md</plan-file>
  </context>
  <task-spec>
    <file>works/WORK-03/TASK-00.md</file>
    <title>공통 시스템 프롬프트 섹션 식별 및 XML 스키마 설계</title>
    <action>implement</action>
  </task-spec>
  <cache-hint sections="output-language-rule,build-commands"/>
</dispatch>

Result Format (Receiver → Caller):

<task-result work="WORK-03" task="TASK-00" agent="builder" status="PASS">
  <summary>Created shared-prompt-sections.md and xml-schema.md</summary>
  <files-changed>
    <file action="created" path="agents/shared-prompt-sections.md">Common sections with cache_control</file>
  </files-changed>
  <verification>
    <check name="file_existence" status="PASS">Both files created</check>
  </verification>
</task-result>

Benefits:

Clarity: Explicit XML structure eliminates ambiguous natural language ("Pass the context" ← confusing vs. <context> ← explicit)
Lower Output Tokens: Agents don't generate clarification questions; receivers parse XML directly
Prompt Caching: Common sections (Output Language Rule, Build Commands) are marked with Anthropic API cache_control, saving up to 90% on repeated tokens
Scalability: Cache hit rates improve with WORK count (5 TASKs at ~0.03 tokens/token vs 2K+ tokens without cache)

See agents/xml-schema.md for complete format, and agents/shared-prompt-sections.md for cacheable sections.

Sliding Window Context Transfer

Each subagent starts with an empty context — the cost of isolation. The sliding window system minimizes token waste when passing context between agents and across dependent TASKs.

Rule: the further back, the less detail:

| Distance | Detail Level | Content | |----------|-------------|---------| | Immediate predecessor | FULL | what + why + caution + incomplete | | 2 steps back | SUMMARY | what only (1–3 lines) | | 3+ steps back | DROP | not transmitted |

Each agent outputs a context-handoff — a structured reasoning document, not just a result log:

<context-handoff from="builder" detail-level="FULL">
  <what>auth.ts modified — added JWT silent refresh logic</what>
  <why>Previous code returned 401 immediately on expiry. Silent refresh improves UX.</why>
  <caution>Coupled to session.ts setSession(). Changes there may cause side effects.</caution>
  <incomplete>Unit tests not written. Verifier should check.</incomplete>
</context-handoff>

Result responsibility shift: builder focuses on implementation only, writing a progress.md checkpoint. The committer synthesizes builder + verifier context-handoffs into the final result.md. This prevents result files from being skipped when builder is context-pressured.

Estimated token savings: ~48% on a 3-TASK dependency chain vs. the naive approach of passing full results forward.

See docs/spec_sliding-window-context.md for full design details.

External System Callback (Optional)

uc-taskmanager is generic by default. To integrate with an external system (e.g., a CI/CD backend), add callback URLs to CLAUDE.md:

## Task Callbacks
TaskCallback: http://localhost:3000/api/v1/runner/{{executionId}}/task-result
ProgressCallback: http://localhost:3000/api/v1/runner/{{executionId}}/task-progress
CallbackToken: <your-token>

No config → works as-is, no external calls made
TaskCallback → committer POSTs result after each TASK commit
ProgressCallback → builder POSTs checkpoint after each progress.md update
Callback failures are non-fatal — a warning is printed and the pipeline continues

See docs/spec_callback-integration.md for payload schema and implementation guide.

Output Language

Output language is resolved from CLAUDE.md in your project. No manual configuration needed after first setup.

1. Check CLAUDE.md for "Language: xx"
   ├─ Found → use that language
   └─ Not found ↓

2. Ask: "Would you like to set the output language? (e.g., ko, en, ja)"
   ├─ User specifies → write to CLAUDE.md + use it
   └─ User declines ↓

3. Auto-detect system locale → write to CLAUDE.md as default

Once set, stored in CLAUDE.md and never asked again. Priority: PLAN.md > CLAUDE.md > en

By default, all output including git commit messages and code comments uses the configured language:

| Item | Default | Override | |------|---------|----------| | PLAN.md / TASK descriptions | Language | — | | Result reports | Language | — | | Git commit messages (title/body) | Language | CommitLanguage: en | | Code comments | Language | CommentLanguage: en | | Commit type prefix (feat, fix...) | Always English | — | | File names, paths, commands | Always English | — |

Per-Category Override

Add to CLAUDE.md to override specific categories:

## Language
ko
CommitLanguage: en
CommentLanguage: en

This gives you ko for plans/reports but en for commits and code comments — useful for open-source projects or global teams.

Customization

Place a file with the same name in .claude/agents/ to override.

| What | File | Section | |------|------|---------| | Routing criteria | specifier.md | 3-2. Execution-Mode 결정 | | Approval policy | scheduler.md | Phase 1: User Approval | | Commit message format | committer.md | Step 3: Stage + Commit | | Verification steps | verifier.md | Verification Pipeline | | Task granularity | planner.md | Task Decomposition Rules | | Build/lint commands | builder.md + verifier.md | Self-Check / Step 1-2 | | Output language | planner.md | Output Language Rule |

Supported Stacks

Auto-detected from project files. No configuration needed.

| File | Stack | |------|-------| | package.json | Node.js / TypeScript / React / NestJS / Next.js | | pyproject.toml / setup.py | Python / FastAPI / Django | | Cargo.toml | Rust | | go.mod | Go | | build.gradle / pom.xml | Java / Kotlin | | Gemfile | Ruby | | Makefile | Generic |

Repository Structure

uc-taskmanager/
├── develop/                 ← Source of truth (edit here)
│   ├── agents/              ← 6 agent prompts (language-agnostic)
│   │   ├── specifier.md     ← [] tag detection + execution-mode routing
│   │   ├── planner.md       ← WORK creation + TASK decomposition
│   │   ├── scheduler.md     ← DAG management + pipeline orchestration
│   │   ├── builder.md       ← Code implementation
│   │   ├── verifier.md      ← Build/lint/test verification
│   │   └── committer.md     ← git commit + result.md
│   ├── references/          ← 6 support files (shared across agents)
│   │   ├── agent-flow.md    ← Pipeline orchestration rules
│   │   ├── file-content-schema.md  ← File format definitions
│   │   ├── shared-prompt-sections.md  ← Cacheable shared sections
│   │   ├── context-policy.md    ← Sliding window context rules
│   │   ├── work-activity-log.md ← Activity log format
│   │   └── xml-schema.md    ← XML communication format
│   ├── skills/              ← Skill definitions
│   └── hooks/               ← Hook scripts
├── npm/                     ← npm package (published as `uctm`)
│   ├── agents/              ← Synced from develop/agents/ + develop/references/
│   ├── bin/cli.mjs          ← CLI entry point (uctm init/update)
│   ├── lib/                 ← CLI implementation (constants.mjs, init.mjs, update.mjs)
│   ├── .agent/              ← Default router config bundled with npm
│   │   └── router_rule_config.json
│   ├── package.json         ← npm package config
│   ├── .npmignore
│   └── LICENSE
├── plugin/                  ← Claude Plugin (Marketplace)
│   ├── agents/              ← Synced from develop/agents/
│   ├── references/          ← Synced from develop/references/
│   ├── skills/              ← Plugin skills
│   │   ├── sdd-pipeline/
│   │   │   ├── SKILL.md     ← Skill manifest
│   │   │   └── references/  ← Synced from develop/references/
│   │   ├── work-pipeline/
│   │   │   └── SKILL.md
│   │   ├── work-status/
│   │   │   └── SKILL.md
│   │   └── init/
│   │       └── SKILL.md     ← /uctm-init (setup works/, CLAUDE.md, permissions)
│   ├── .claude-plugin/
│   │   └── plugin.json      ← Plugin manifest (name, version, agents array)
│   └── README.md
├── .claude/                 ← Local Claude settings (not committed)
│   └── settings.local.json
├── README.md                ← English (default, this file)
├── README_KO.md             ← Korean
├── CLAUDE.md                ← Project-level Claude instructions (push procedure, language, agent call rules)
├── LICENSE
├── docs/                    ← Design specifications
│   ├── spec_pipeline-architecture_v1.3.md  ← Pipeline architecture v1.3 (ref-cache, Specifier-based)
│   ├── spec_sliding-window-context.md      ← Sliding window context design
│   ├── spec_callback-integration.md        ← External system callback integration
│   ├── spec_SDD_with_ucagent_requirement.md ← SDD v1.5 requirement management system design
│   ├── pipeline-architecture-v1.3-visual.html ← Interactive pipeline visualization (with ref-cache tab)
│   ├── SDD-requirement-visual.html         ← Interactive SDD visualization (with ref-cache tab)
│   ├── callback-integration-visual.html    ← Interactive callback visualization
│   ├── sliding-window-context-visual.html  ← Interactive sliding window visualization
│   └── _archive/                           ← Legacy docs (Router-based)
└── works/                   ← WORK directories (auto-generated)
    ├── WORK-LIST.md          ← Master index
    ├── WORK-01/              ← all modes output here (direct/pipeline/full)
    └── ...

Requirements

Claude Code CLI
Git initialized (git init)
No other dependencies.

Optional: MCP Configuration

Serena MCP — Symbol-Level Code Navigation

Special thanks to the Oraios team for building and open-sourcing Serena. Their symbol-level code navigation tools make a real difference in reducing token usage and improving edit precision for AI agents.

The builder agent integrates with Serena MCP for symbol-level code exploration. When Serena is available, builder follows this exploration hierarchy instead of reading entire files:

| Step | Tool | Purpose | |------|------|---------| | 1 | list_dir | Directory structure (replaces find) | | 2 | get_symbols_overview | File symbol map before any file read | | 3 | find_symbol(depth=1) | Class/module method list | | 4 | find_symbol(include_body=true) | Precise body read for target symbol only | | 5 | find_referencing_symbols | Impact analysis before editing | | 6 | Read | Last resort when above tools are insufficient |

This reduces read tokens by 30–50% on large codebases by reading only the symbols needed, not entire files.

Disable Auto Browser Launch

Serena opens a web dashboard in your browser on every startup. To disable this, add --open-web-dashboard False to your ~/.claude.json:

{
  "mcpServers": {
    "serena": {
      "command": "uvx",
      "args": [
        "--from", "git+https://github.com/oraios/serena",
        "serena", "start-mcp-server",
        "--context", "ide-assistant",
        "--project", ".",
        "--open-web-dashboard", "False"
      ]
    }
  }
}

The dashboard is still available at http://localhost:PORT — it just won't auto-open on startup.

The Bigger Picture

This agent is designed to work with an SDD-based requirement management and automated development system — a server application that links requirement management → automated development → plans and artifacts. The full system architecture is documented in docs/spec_SDD_with_ucagent_requirement.md. Use it as a reference to build your own system tailored to your needs.

License

GPL-3.0

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

uc-taskmanager

Quick Start

npm CLI

Start Using

What Makes This Pipeline Different

1. Procedures Must Be Executed Properly and Recorded

2. Token Economy

3. Dependency-Aware Parallel Execution

What's Next

Usage

Trivial Fix (direct mode)

Quick Task (pipeline mode)

Complex Feature (WORK)

1. Create WORK (Planning)

2. Execute WORK

3. Add to Existing WORK

4. Check Status

5. Auto Mode / Resume

6. Run a Specific TASK

7. Force WORK Creation (Skip Complexity Check)

8. Handle Failure / Retry

9. Add a TASK to an In-Progress WORK

10. Check Individual TASK Status

The [] Tag System

Installation

npm CLI

Manual

Local Plugin Test

Verify

Concept: Three Execution Modes

WORK (Multi-Task, full mode)

pipeline mode (Single Task, Delegated)

direct mode (Trivial)

Pipeline

WORK Pipeline (Complex)

pipeline mode (Simple → Delegated)

direct mode (Trivial)

Agents

Support Files (included in Plugin)

File Structure

File Naming Convention

WORK-LIST.md

git push Procedure

Tips

Keep CLAUDE.md Up to Date

Use [] Tags Consistently

Parallel TASKs

Resume After Context Reset

Example Session

Why This Approach?

Agent File Design

ref-cache: Reference File Caching

WORK ID Assignment Strategy

Context Isolation

Single Session vs uc-taskmanager

Router Rule Config (.agent/router_rule_config.json)

Three Execution Modes

Structured Agent Communication

Sliding Window Context Transfer

External System Callback (Optional)

Output Language

Per-Category Override

Customization

Supported Stacks

Repository Structure

Requirements

Optional: MCP Configuration

Serena MCP — Symbol-Level Code Navigation

Disable Auto Browser Launch

The Bigger Picture

License

The `[]` Tag System

Use `[]` Tags Consistently

Router Rule Config (`.agent/router_rule_config.json`)