claude-virtual-company

v1.4.0

Published

4 months ago

Claude Virtual Company

A skill framework for AI coding assistants (Claude Code & Gemini CLI) that simulates a hierarchical software development company. You act as the CEO, delegating work through a structured engineering organization with proper governance, quality gates, and dynamic specialist hiring.

Supported Providers

| Provider | Status | Task Management | Parallel Execution |
|----------|--------|-----------------|--------------------|
| Claude Code | ✅ Full Support | Native | ✅ Native |
| Gemini CLI | ✅ Full Support | MCP Server | Sequential |

Both providers share the same .company/ state directory, so you can switch between them seamlessly.

Features

Multi-Provider Support: Works with both Claude Code and Gemini CLI
Hierarchical Role System: CTO, Architect, Tech Lead, Senior Dev, Developer, QA
Dynamic Specialist Hiring: Automatically creates specialists based on project needs
Proposal-Based Governance: Cross-role actions require approval
Quality Gates: Mandatory testing, code review, and acceptance criteria
Design Pattern Enforcement: Architect selects patterns, roles follow consistently
Git Flow Integration: Built-in branching strategy and PR workflows
Task Dependency Tracking: Manage complex work with dependencies (MCP server for Gemini)
Fresh Context Windows: Each role operates in isolation with explicit handoffs
GSD-Inspired Project Management: Phase-based workflow with discuss→plan→execute→verify cycles
State Persistence: Pause and resume work across sessions with full context
Automatic Context Management: Tiered document loading, context decay, and archival to prevent bloat
Shared State: Switch between providers seamlessly with persistent workflow state
Interactive Playgrounds: Optional HTML-based visual decision-making for discussions, reviews, and verification

Quick Start

Installation

# Install for both Claude Code and Gemini CLI (default)
npx claude-virtual-company init

# Install for Claude Code only
npx claude-virtual-company init --provider claude

# Install for Gemini CLI only
npx claude-virtual-company init --provider gemini

# Install globally
npx claude-virtual-company init --global

Start a Project

Claude Code:

claude
/company "Build a user authentication system with email/password login"

Gemini CLI:

gemini
/company "Build a user authentication system with email/password login"

The orchestrator will:
- Evaluate expertise needs and hire specialists
- Create a feature branch
- Guide work through the hierarchy
- Ensure quality at each phase

Check Status

/company-status

Merge When Complete

/company-merge

How It Works

The Hierarchy

┌─────────────────────────────────────────────────────────────────┐
│                         CEO (You)                                │
│                    Provides Vision/Goal                          │
└─────────────────────────────────────────────────────────────────┘
                              │
                              ▼
┌─────────────────────────────────────────────────────────────────┐
│                    /company Orchestrator                         │
│              (Manages workflow, tracks state)                    │
└─────────────────────────────────────────────────────────────────┘
        │           │           │           │           │
        ▼           ▼           ▼           ▼           ▼
    ┌───────┐   ┌───────┐   ┌───────┐   ┌───────┐   ┌───────┐
    │  CTO  │──▶│ Arch  │──▶│ Lead  │──▶│ Dev   │──▶│  QA   │
    └───────┘   └───────┘   └───────┘   └───────┘   └───────┘
        │           │           │           │           │
        ▼           ▼           ▼           ▼           ▼
    [Strategy] [Design]    [Plan]     [Code]    [Verified]

Workflow Phases

Expertise Assessment: Hiring manager evaluates what specialists are needed
Architecture (CTO): Technical strategy and technology decisions
Design (Architect): Component design, API contracts, data models
Planning (Tech Lead): Feature breakdown, task creation, dependency mapping
Implementation (Developer): Code implementation with tests
Review (Code Reviewer): Quality, security, and standards check
Verification (QA): Comprehensive testing and validation
Merge: PR creation and merge to main
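
The phases above run strictly in order, which a short sketch can make concrete. The phase identifiers and the `nextPhase` helper are illustrative names chosen for this example, not part of the framework's API; the framework keeps its real workflow state in `.company/state.json`.

```javascript
// Hypothetical phase identifiers, listed in the order described above.
const PHASES = [
  'expertise-assessment',
  'architecture',
  'design',
  'planning',
  'implementation',
  'review',
  'verification',
  'merge',
];

// Return the phase that follows `current`, or null once the workflow is done.
function nextPhase(current) {
  const index = PHASES.indexOf(current);
  if (index === -1) throw new Error(`Unknown phase: ${current}`);
  return index + 1 < PHASES.length ? PHASES[index + 1] : null;
}

console.log(nextPhase('design')); // planning
```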

Quality Gates

Each phase transition requires:

Completed artifacts in .company/artifacts/[role]/
Handoff document with acceptance criteria
Passing verification commands
No blocking issues

Project Manager Workflow (GSD-Inspired)

For larger projects, use the full PM workflow:

/company-new-project "Build a task management app"

This initiates a structured cycle:

Discuss - Capture implementation preferences and resolve gray areas
Plan - Create atomic tasks (at most 2-3 per plan) in XML format
Execute - Parallel wave execution with atomic commits
Verify - Automated checks + User Acceptance Testing

Each phase produces artifacts in .planning/phase-{N}/:

CONTEXT.md - Decisions from discuss phase
{N}-PLAN.md - Executable task plans
{N}-SUMMARY.md - Completion records
VERIFICATION.md - Test results
UAT.md - User acceptance confirmation

Use /company-progress to see current state and recommended next action.

Use /company-quick "task" for ad-hoc work without full ceremony.

Commands

Core Commands

| Command | Description |
|---------|-------------|
| /company [goal] | Start a new project |
| /company-status | Check workflow state |
| /company-reply [message] | Route feedback/questions through the framework |
| /company-settings [path] [value] | View/modify configuration |
| /company-merge [branch] | Merge to main with validation |
| /company-roster | View specialists |
| /company-hire [domain] | Request new specialist |
| /company-propose [type] | Submit a proposal |

Project Manager Commands (GSD-Inspired)

| Command | Description |
|---------|-------------|
| /company-new-project | Start new project with roadmap |
| /company-progress | Check progress, route to next action |
| /company-discuss [N] | Capture phase requirements |
| /company-plan-phase [N] | Create executable plans |
| /company-execute [N] | Execute plans with parallel waves |
| /company-verify [N] | Verify phase completion + UAT |
| /company-quick [task] | Quick mode for ad-hoc tasks |
| /company-pause | Create context handoff |
| /company-resume | Resume from previous session |
| /company-milestone | Complete and archive milestone |

Configuration

Configuration is stored in .company/config.json. Key settings:

Model Settings

Configure which Claude model each role uses. By default, the more capable model (opus) is assigned to the strategic roles and QA, while faster models (sonnet/haiku) handle implementation, code review, and hiring.

{
  "company": {
    "models": {
      "cto": "opus",
      "architect": "opus",
      "tech-lead": "opus",
      "developer": "sonnet",
      "senior-dev": "sonnet",
      "code-reviewer": "sonnet",
      "qa": "opus",
      "hiring-manager": "haiku"
    }
  }
}

Available models: opus, sonnet, haiku

Modify model for a role:

/company-settings company.models.developer haiku
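
The dotted path in that command maps onto the nested JSON in `.company/config.json`. The sketch below shows one way such a path could be applied to a config object; `setByPath` is a hypothetical helper written for this example and is not the command's actual implementation. The list of allowed models is taken from the text above.

```javascript
const ALLOWED_MODELS = ['opus', 'sonnet', 'haiku'];

// Set a nested value using a dotted path such as "company.models.developer".
// Returns an updated copy; the input object is left unchanged.
function setByPath(config, dottedPath, value) {
  if (dottedPath.startsWith('company.models.') && !ALLOWED_MODELS.includes(value)) {
    throw new Error(`Unknown model "${value}"; expected one of ${ALLOWED_MODELS.join(', ')}`);
  }
  const keys = dottedPath.split('.');
  const result = structuredClone(config);
  let node = result;
  // Walk (and create, if missing) every intermediate object on the path.
  for (const key of keys.slice(0, -1)) {
    if (typeof node[key] !== 'object' || node[key] === null) node[key] = {};
    node = node[key];
  }
  node[keys.at(-1)] = value;
  return result;
}

const before = { company: { models: { developer: 'sonnet' } } };
const after = setByPath(before, 'company.models.developer', 'haiku');
console.log(after.company.models.developer); // haiku
```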

Quality Settings

{
  "quality": {
    "test_coverage_minimum": 80,
    "require_tests": {
      "unit": "required",
      "integration": "required",
      "e2e": "required_for_user_flows",
      "ui": "required_for_frontend"
    },
    "require_code_review": true
  }
}

Design Patterns

The framework encourages consistent, maintainable code through design pattern enforcement:

Role Responsibilities:

Architect selects patterns and documents in component-design.md
Tech Lead references patterns in feature specs
Developer implements following specified patterns

Common Patterns Used:

| Pattern | Purpose |
|---------|---------|
| Repository | Abstract data access |
| Service Layer | Encapsulate business logic |
| Controller + DTO | Clean HTTP handling |
| Middleware | Cross-cutting concerns |
| Factory | Complex object creation |

File Organization (typical):

src/
├── controllers/    # HTTP handlers
├── services/       # Business logic
├── repositories/   # Data access
├── models/         # Domain entities
└── middleware/     # Auth, logging, etc.

This prevents "god files" and spaghetti code by enforcing separation of concerns.
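
To make the separation concrete, here is a minimal sketch of the Repository, Service Layer, and Controller patterns working together. The class and function names are invented for the example, and the in-memory `Map` stands in for a real database; in a project laid out as above, each piece would live in its own directory.

```javascript
// repositories/: hides how users are stored (here, an in-memory Map).
class UserRepository {
  constructor() {
    this.users = new Map();
  }
  findByEmail(email) {
    return this.users.get(email) ?? null;
  }
  save(user) {
    this.users.set(user.email, user);
    return user;
  }
}

// services/: business rules only, with no storage or HTTP details.
class UserService {
  constructor(repository) {
    this.repository = repository;
  }
  register(email, name) {
    if (this.repository.findByEmail(email)) {
      throw new Error(`Email already registered: ${email}`);
    }
    return this.repository.save({ email, name });
  }
}

// controllers/: translates a request-like object into a service call.
function registerController(service, request) {
  try {
    const user = service.register(request.body.email, request.body.name);
    return { status: 201, body: user };
  } catch (err) {
    return { status: 409, body: { error: err.message } };
  }
}
```

Because the service receives its repository through the constructor, tests can pass in a fake repository without touching the service or controller code.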

Git Flow Settings

{
  "git_flow": {
    "strategy": "gitflow",
    "require_pr": true,
    "squash_on_merge": true
  }
}

Hiring Settings

{
  "hiring": {
    "auto_hire": true,
    "require_ceo_approval_for_new_roles": false,
    "expertise_evaluation": {
      "on_project_init": true,
      "on_escalation": true,
      "self_evaluation_enabled": true
    }
  }
}

Modify settings:

/company-settings quality.test_coverage_minimum 90

Specialists

Default Specialists

Git Flow: Branching strategy, commit conventions, PR workflows
Code Reviewer: Code quality, security, best practices
Test Architect: Testing strategy, unit/integration/E2E

Available Domains

The hiring manager can create specialists for:

Frontend: React, Vue, Angular, Svelte, CSS, Accessibility
Backend: Node.js, Python, Go, Rust, Java, .NET
Database: PostgreSQL, MongoDB, Redis
Infrastructure: Docker, Kubernetes, AWS, GCP
Testing: Unit, Integration, E2E, Visual
Security: Application security, Authentication

Manual Hiring

/company-hire frontend-react

Governance

Proposal System

Cross-role actions require proposals:

| Action | Auto-Approve | Needs Review | Needs CEO |
|--------|--------------|--------------|-----------|
| Create own subtask | ✅ | | |
| Developer → QA task | ✅ | | |
| Cross-role task | | ✅ | |
| Reject handoff | | ✅ | |
| Scope change | | | ✅ |
| Block release | | | ✅ |
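
The approval matrix above can be read as a simple lookup from action to approval level. The sketch below encodes it that way; the action keys and the `approvalLevel` function are hypothetical names for illustration, not identifiers the framework defines.

```javascript
// Approval level per action, transcribed from the matrix above.
// Keys are hypothetical slugs chosen for this example.
const APPROVAL_LEVELS = Object.freeze({
  'create-own-subtask': 'auto',
  'developer-to-qa-task': 'auto',
  'cross-role-task': 'review',
  'reject-handoff': 'review',
  'scope-change': 'ceo',
  'block-release': 'ceo',
});

// Look up who must approve an action; unknown actions are rejected outright.
function approvalLevel(action) {
  if (!Object.hasOwn(APPROVAL_LEVELS, action)) {
    throw new Error(`Unknown proposal action: ${action}`);
  }
  return APPROVAL_LEVELS[action];
}

console.log(approvalLevel('scope-change')); // ceo
```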

Escalation

Issues are escalated based on severity:

Low: Orchestrator resolves
Medium: Senior role consulted
High: CEO notified
Blocking: Immediate CEO decision

Project Structure

After installation:

# Claude Code skills
.claude/
└── skills/
    ├── company/              # Main orchestrator
    ├── company-protocols/    # Shared standards
    ├── company-git-flow/     # Git expertise
    ├── company-[role]/       # Role skills
    └── company-specialists/  # Dynamic specialists

# Gemini CLI configuration
.gemini/
├── context/                 # Role context files (transpiled from SKILL.md)
│   ├── company.md
│   ├── company-cto.md
│   └── ...
├── commands/company/        # TOML command definitions
│   ├── company.toml
│   └── ...
└── settings.json            # MCP server configuration

GEMINI.md                    # Project context for Gemini (root level)

# Shared state (used by both providers)
.company/
├── config.json              # Configuration
├── roster.json              # Specialists roster
├── state.json               # Workflow state
├── tasks/                   # Task storage (MCP server)
│   ├── index.json
│   └── task-*.json
├── proposals/               # Pending/approved/rejected
├── artifacts/               # Role outputs
│   ├── playground/          # Interactive HTML playgrounds
│   └── ...                  # Per-role artifacts
└── inboxes/                 # Role communication

.planning/                   # Project Manager (GSD-inspired)
├── config.json              # PM configuration
├── PROJECT.md               # Vision and objectives
├── REQUIREMENTS.md          # Scoped requirements
├── ROADMAP.md               # Phase breakdown
├── STATE.md                 # Session state and decisions
├── phase-{N}/               # Phase artifacts
│   ├── CONTEXT.md           # Phase decisions
│   ├── {N}-PLAN.md          # Executable plans
│   ├── {N}-SUMMARY.md       # Completion summaries
│   └── VERIFICATION.md      # Verification results
└── quick/                   # Ad-hoc task tracking

Provider Differences

Claude Code vs Gemini CLI

| Feature | Claude Code | Gemini CLI |
|---------|-------------|------------|
| Context isolation | Native (context: fork) | Sequential with file handoffs |
| Parallel execution | Native (background tasks) | Sequential only |
| Task management | Native tools | MCP server |
| Tool restrictions | Enforced (allowed-tools) | Trust-based guidance |
| Per-role model selection | ✅ Supported | ❌ Not supported |
| Hooks | Native support | Not supported |
| Dynamic context | Backtick syntax | Pre-loaded context files |

Note: The model configuration in .company/config.json (opus/sonnet/haiku per role) only applies to Claude Code. Gemini CLI uses your globally configured model for all roles.

MCP Task Server

For Gemini CLI, task management is provided via an MCP (Model Context Protocol) server that exposes these tools:

cvc_task_create - Create a new task
cvc_task_list - List all tasks
cvc_task_get - Get task details
cvc_task_update - Update task status

Tasks are stored in .company/tasks/ and work with both providers.
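
For intuition about how a file-backed task store of this kind can work, here is a small sketch covering the same four operations (create, list, get, update). It is not the MCP server's code: the `TaskStore` class and the JSON shapes (`nextId`, `ids`, `status`, `dependsOn`) are assumptions made for the example, and only the `index.json` and `task-*.json` file names come from the project layout.

```javascript
const fs = require('node:fs');
const path = require('node:path');

// Minimal file-backed task store. The JSON shapes are assumed for
// illustration; the real server's schema may differ.
class TaskStore {
  constructor(dir) {
    this.dir = dir;
    this.indexPath = path.join(dir, 'index.json');
    fs.mkdirSync(dir, { recursive: true });
    if (!fs.existsSync(this.indexPath)) this.writeIndex({ nextId: 1, ids: [] });
  }
  readIndex() {
    return JSON.parse(fs.readFileSync(this.indexPath, 'utf8'));
  }
  writeIndex(index) {
    fs.writeFileSync(this.indexPath, JSON.stringify(index, null, 2));
  }
  taskPath(id) {
    return path.join(this.dir, `task-${id}.json`);
  }
  create(title, dependsOn = []) {
    const index = this.readIndex();
    const task = { id: index.nextId, title, status: 'pending', dependsOn };
    fs.writeFileSync(this.taskPath(task.id), JSON.stringify(task, null, 2));
    this.writeIndex({ nextId: index.nextId + 1, ids: [...index.ids, task.id] });
    return task;
  }
  get(id) {
    return JSON.parse(fs.readFileSync(this.taskPath(id), 'utf8'));
  }
  list() {
    return this.readIndex().ids.map((id) => this.get(id));
  }
  update(id, changes) {
    const task = { ...this.get(id), ...changes, id }; // the id itself is immutable
    fs.writeFileSync(this.taskPath(id), JSON.stringify(task, null, 2));
    return task;
  }
}
```

Keeping one file per task plus a small index means either provider can read or update a single task without rewriting the whole collection.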

Switching Providers

Both providers share the same .company/ state directory. You can:

Start work in Claude Code
Continue in Gemini CLI
Switch back anytime

The workflow state, artifacts, and tasks persist across provider switches.

For detailed information, see:

docs/PROVIDER-COMPARISON.md - Feature comparison matrix
docs/GEMINI-SETUP.md - Gemini CLI setup guide

Best Practices

Staying in the Framework

When you need to respond to agent output (report bugs, ask questions, give approvals), use /company-reply instead of responding directly. This keeps your communication routed through the proper channels:

/company-reply "The login button doesn't work on mobile"
/company-reply "Why did we choose PostgreSQL over MongoDB?"
/company-reply "Looks good, proceed with implementation"

The command automatically:

Classifies your message (bug, question, approval, feature request, blocker)
Routes to the appropriate role (Developer, Architect, QA, etc.)
Maintains context from state files and artifacts
Creates an audit trail of interactions

For Best Results

Clear Goals: Provide specific, well-defined project goals
Let It Work: Allow the workflow to progress through phases
Review Escalations: Respond to CEO-level decisions promptly
Check Status: Use /company-status to monitor progress
Use /company-reply: Keep feedback within the framework

Customization

Adjust Quality: Set appropriate coverage and test requirements
Configure Git: Match your team's branching strategy
Manage Specialists: Add domain-specific expertise as needed

Context Management

The framework includes automatic context management to keep Claude's context fresh and prevent bloat during long projects.

Tiered Document Loading

Handoffs and artifacts use tier markers for progressive loading:

<!-- TIER:SUMMARY -->
TL;DR in ~50 words - always loaded
<!-- /TIER:SUMMARY -->

<!-- TIER:DECISIONS -->
Acceptance criteria, verification commands, key constraints - loaded by default
<!-- /TIER:DECISIONS -->

<!-- TIER:FULL -->
Full rationale, alternatives considered - loaded only when blocked
<!-- /TIER:FULL -->

Each role skill automatically loads the appropriate tier (usually SUMMARY + DECISIONS) from upstream artifacts. If you need full context while working, run:

cat .company/artifacts/[role]/[file].md
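
The tier markers are plain HTML comments, so a tier can be pulled out with a small regular expression. The sketch below is an independent illustration of that idea; `extractTier` and `loadDefaultTiers` are hypothetical names and make no claim about how the framework's own loader is written.

```javascript
const TIERS = ['SUMMARY', 'DECISIONS', 'FULL'];

// Return the trimmed body of one tier, or null if the tier is absent.
function extractTier(markdown, tier) {
  const name = tier.toUpperCase();
  if (!TIERS.includes(name)) throw new Error(`Unknown tier: ${tier}`);
  const pattern = new RegExp(`<!-- TIER:${name} -->([\\s\\S]*?)<!-- /TIER:${name} -->`);
  const match = markdown.match(pattern);
  return match ? match[1].trim() : null;
}

// The usual default described above: SUMMARY plus DECISIONS, skipping FULL.
function loadDefaultTiers(markdown) {
  return ['SUMMARY', 'DECISIONS']
    .map((tier) => extractTier(markdown, tier))
    .filter((body) => body !== null)
    .join('\n\n');
}

const sampleDoc = [
  '<!-- TIER:SUMMARY -->',
  'Use JWT sessions.',
  '<!-- /TIER:SUMMARY -->',
  '',
  '<!-- TIER:DECISIONS -->',
  'Verify with: npm test',
  '<!-- /TIER:DECISIONS -->',
  '',
  '<!-- TIER:FULL -->',
  'Server-side sessions were considered and rejected.',
  '<!-- /TIER:FULL -->',
].join('\n');

console.log(extractTier(sampleDoc, 'summary')); // Use JWT sessions.
```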

Automatic Context Decay

Session Log Trimming: When STATE.md exceeds 500 lines, old session entries are archived to .planning/archive/sessions/ and only the 10 most recent entries are kept
Milestone Archival: When completing a milestone, all phase directories move to .planning/archive/v{version}/
Quick Task Cleanup: Quick tasks older than 7 days are automatically archived
Proposal Archival: Approved/rejected proposals older than 30 days are archived
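
Both kinds of decay rule above (keep the N most recent entries, archive anything older than D days) reduce to splitting a list in two. The following sketch shows each split; the function names and the `createdAt` field are assumptions for the example, not the framework's data model.

```javascript
const DAY_MS = 24 * 60 * 60 * 1000;

// Count-based rule: keep the last `keepCount` entries, archive the rest.
// Assumes `entries` is in chronological order (oldest first).
function trimToRecent(entries, keepCount) {
  const splitAt = Math.max(0, entries.length - keepCount);
  return { archived: entries.slice(0, splitAt), kept: entries.slice(splitAt) };
}

// Age-based rule: archive items created more than `maxAgeDays` before `now`.
function partitionByAge(items, maxAgeDays, now = new Date()) {
  const cutoff = now.getTime() - maxAgeDays * DAY_MS;
  const kept = [];
  const archived = [];
  for (const item of items) {
    const created = new Date(item.createdAt).getTime();
    (created < cutoff ? archived : kept).push(item);
  }
  return { kept, archived };
}
```

Passing `now` as a parameter keeps the age check deterministic, which makes the 7-day and 30-day thresholds easy to test.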

Configuration

Context management settings in templates/pm-config.json:

{
  "context_management": {
    "session_log_max_entries": 25,
    "summarize_after_entries": 20,
    "archive_completed_phases": true,
    "handoff_max_lines": 100,
    "default_tier": "decisions",
    "quick_task_retention_days": 7
  }
}

Platform Utilities

For programmatic context management, src/platform.js provides:

const { readTier, trimSessionLog, archiveAndResetState } = require('./src/platform');

// Read specific tier from a tiered document
const decisions = readTier('.company/artifacts/architect/handoff.md', 'decisions');

// Trim session log keeping only recent entries
trimSessionLog('.planning/STATE.md', 10);

// Archive STATE.md and create fresh one for new milestone
archiveAndResetState('.planning/STATE.md', '.planning/archive/v1.0/');

Mermaid Diagrams

Architect and Tech Lead roles use Mermaid diagrams to convey relationships efficiently. Claude reads the Mermaid source as structured text, making it effective for communicating:

| Diagram Type | Use Case |
|--------------|----------|
| graph TD/LR | Component relationships, service boundaries |
| sequenceDiagram | API flows, request/response patterns |
| erDiagram | Data model relationships |
| Dependency graphs | Task waves, parallel execution |

Example component diagram:

graph TD
    API[API Gateway] --> Auth[AuthService]
    API --> User[UserService]
    Auth --> DB[(PostgreSQL)]
    Auth --> Cache[(Redis)]

Guidelines:

Keep diagrams small (5-10 nodes max)
Place in DECISIONS tier for implementation context
Use consistent naming across documents

Writing Tiered Documents

When creating handoffs or key artifacts, structure them with tiers:

SUMMARY tier: One-liner decisions, tech choices, key constraints
DECISIONS tier: Acceptance criteria, verification commands, Mermaid diagrams
FULL tier: Rationale, alternatives considered, detailed context

This ensures downstream roles get exactly the context they need without loading historical rationale they don't.
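
If you generate handoffs from a script, the three tiers can be assembled mechanically. The sketch below is one possible helper, with `renderTieredDoc` as a hypothetical name; it also flags a SUMMARY that runs past the roughly 50-word guidance, treating that figure as a soft limit rather than a rule the framework enforces.

```javascript
const SUMMARY_WORD_TARGET = 50; // soft limit taken from the "~50 words" guidance

// Wrap each body in its tier markers and report any style warnings.
function renderTieredDoc({ summary, decisions, full }) {
  const section = (name, body) =>
    `<!-- TIER:${name} -->\n${body.trim()}\n<!-- /TIER:${name} -->`;

  const warnings = [];
  const words = summary.trim().split(/\s+/).filter(Boolean).length;
  if (words > SUMMARY_WORD_TARGET) {
    warnings.push(`SUMMARY has ${words} words; aim for about ${SUMMARY_WORD_TARGET}`);
  }

  const text =
    [
      section('SUMMARY', summary),
      section('DECISIONS', decisions),
      section('FULL', full),
    ].join('\n\n') + '\n';

  return { text, warnings };
}
```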

Interactive Playgrounds (Optional)

The framework supports interactive HTML playgrounds for visual decision-making during discussions, code reviews, and verification. Playgrounds open in your browser as self-contained HTML files with no external dependencies.

What Playgrounds Provide

| Playground Type | Used By | Purpose |
|-----------------|---------|---------|
| Design Playground | Discussion, UI Designer | Configure layout, colors, spacing visually |
| Concept Map | Discussion (Architecture) | Visualize service topology and connections |
| Document Critique | Verification | Review findings interactively (approve/reject/discuss) |
| Diff Review | Code Reviewer | Respond to review comments with visual diff context |
| Architecture Map | Architect | Adjust component topology and patterns |

How It Works

Skills generate self-contained HTML files in .company/artifacts/playground/
Files open automatically in your default browser
You interact with controls, configure preferences, review findings
Click "Copy Prompt" to copy structured decisions
Paste back into the conversation

Session Preference

The first time a playground is available, you'll be asked to opt in:

"Would you like to use interactive HTML playgrounds for visual decision-making?"

Your preference is stored in .company/state.json under playground_preference
Say "enable playgrounds" or "disable playgrounds" anytime to change
All skills fall back gracefully to AskUserQuestion when playgrounds are disabled

Skipping Playgrounds

Playgrounds are automatically skipped when:

Fewer than 3 gray areas in discussions
0 findings in verification
Diff under 20 lines in code review
Running on Gemini CLI (no browser-open capability)
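
The skip conditions above amount to a small decision function. Here is a sketch of that logic; the `shouldSkipPlayground` name, the `kind` values, and the parameter names are all hypothetical, and only the thresholds come from the list above.

```javascript
// Decide whether a playground should be skipped, using the thresholds above.
function shouldSkipPlayground({ kind, provider, grayAreas = 0, findings = 0, diffLines = 0 }) {
  if (provider === 'gemini') return true; // no browser-open capability
  switch (kind) {
    case 'discussion':
      return grayAreas < 3;
    case 'verification':
      return findings === 0;
    case 'code-review':
      return diffLines < 20;
    default:
      return false;
  }
}

console.log(shouldSkipPlayground({ kind: 'discussion', provider: 'claude', grayAreas: 2 })); // true
```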

Installation

During cvc init, you'll be prompted to install playground support. To skip the prompt:

npx claude-virtual-company init --no-playground

Enhanced Context Memory (Optional)

For projects with extended testing/debugging cycles, the framework supports claude-mem for persistent cross-session memory.

What claude-mem Provides

| Feature | Benefit |
|---------|---------|
| Automatic capture | All tool observations saved automatically |
| Semantic search | Query "what issues did we have with X?" |
| Cross-session memory | Testing feedback persists across sessions |
| Pattern detection | Identify recurring issues |

Why Use It

The framework's handoff system works well for planned context transitions between roles. However, emergent context during testing and debugging often falls through the cracks:

Bug reports you mention while testing
UI feedback like "these elements should align"
Patterns you notice but don't formally document

Claude-mem captures these observations automatically and makes them searchable.

Installation

During npx claude-virtual-company init, you'll be prompted to install claude-mem. You can also install it manually:

# In Claude Code
claude
/install-plugin thedotmack/claude-mem

After installation, restart Claude Code. The worker service starts automatically.

Dashboard (optional): While Claude Code is running, visit http://localhost:37777 to view the memory dashboard. The dashboard is only available during active Claude Code sessions.

To skip the claude-mem prompt during installation:

npx claude-virtual-company init --no-claude-mem

How the Framework Uses It

When claude-mem is installed, these commands are enhanced:

| Command | Enhancement |
|---------|-------------|
| /company-progress | Searches for recent issues, testing feedback |
| /company-resume | Recovers context not in formal handoffs |
| /company-reply | Checks for similar past issues, detects patterns |

Graceful Fallback: All commands work fully without claude-mem. It's purely an enhancement for projects where testing feedback gets lost.

Claude-Mem MCP Tools

The framework uses these MCP tools when available:

search - Semantic search across observations (~50-100 tokens)
timeline - Chronological context around results
get_observations - Full details for specific observation IDs

This 3-layer retrieval pattern aligns with the framework's tiered document loading, providing ~10x token savings.

Licensing Note

Claude-mem is a separate project licensed under AGPL-3.0. The framework uses it as an optional external plugin; no claude-mem code is bundled or modified. See the claude-mem repository for full license details.

Windows Compatibility

The skill files (.claude/skills/company*/SKILL.md) contain bash commands that Claude executes. While the Node.js CLI (cvc) works on all platforms, the skill commands assume a Unix-like shell environment.

Recommended Environments

For full compatibility on Windows, use one of these terminals:

Git Bash (included with Git for Windows) - Recommended
WSL/WSL2 (Windows Subsystem for Linux)
PowerShell with Unix tools (via chocolatey or scoop)

Commands That May Need Alternatives

Some bash commands used in skill files have Windows equivalents:

| Bash Command | Purpose | Windows Alternative |
|--------------|---------|---------------------|
| date -Iseconds | ISO timestamp | Use src/platform.js:getISOTimestamp() |
| tr \| sed \| cut | String slugify | Use src/platform.js:slugify() |
| cat file \|\| echo '{}' | Safe JSON read | Use src/platform.js:readJsonSafe() |
| mkdir -p | Create nested dirs | Works in PowerShell 5+ |
| find / ls | File listing | Works in Git Bash |

Cross-Platform Utilities

For programmatic use, the src/platform.js module provides cross-platform Node.js alternatives:

const { getISOTimestamp, slugify, readJsonSafe } = require('./src/platform');

// Instead of: date -Iseconds
const timestamp = getISOTimestamp();

// Instead of: echo "$str" | tr ... | sed ... | cut ...
const branchName = slugify('My Feature Name', 40);

// Instead of: cat file.json 2>/dev/null || echo '{}'
const config = readJsonSafe('.company/config.json', {});
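
If you want the same behavior in a script that cannot import `src/platform.js`, stand-ins are short to write. The versions below are an independent sketch: the `Sketch` suffix marks them as illustrations, and they make no claim about how the module actually implements its helpers (its slug rules and timestamp format may differ).

```javascript
const fs = require('node:fs');

// UTC timestamp without milliseconds, e.g. 2024-01-01T12:00:00Z.
// Note: `date -Iseconds` prints local time with a numeric offset instead.
function isoTimestampSketch(date = new Date()) {
  return date.toISOString().replace(/\.\d{3}Z$/, 'Z');
}

// Lowercase, collapse non-alphanumeric runs to "-", trim dashes, truncate.
function slugifySketch(text, maxLength = 40) {
  return text
    .toLowerCase()
    .replace(/[^a-z0-9]+/g, '-')
    .replace(/^-+|-+$/g, '')
    .slice(0, maxLength)
    .replace(/-+$/, ''); // truncation may leave a trailing dash
}

// Parse a JSON file, returning `fallback` if it is missing or invalid.
function readJsonSafeSketch(filePath, fallback = {}) {
  try {
    return JSON.parse(fs.readFileSync(filePath, 'utf8'));
  } catch {
    return fallback;
  }
}

console.log(slugifySketch('My Feature Name', 40)); // my-feature-name
```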

Troubleshooting

Workflow Stuck

Check /company-status for current state
Look for pending proposals in .company/proposals/pending/
Check role inboxes for blocked messages
Reset state if needed: echo '{"phase":"idle"}' > .company/state.json

Tests Failing

Ensure test frameworks are installed
Check test configuration in .company/config.json
Review test output in QA artifacts

Specialist Not Found

Check roster with /company-roster
Manually hire with /company-hire [domain]
Verify skill files exist in .claude/skills/company-specialists/

Gemini CLI: MCP Server Not Working

Verify the server is configured in .gemini/settings.json
Check the path to the server is correct
Ensure Node.js 18+ is available
Test manually: node node_modules/claude-virtual-company/mcp/task-server/index.js

Gemini CLI: Commands Not Found

Verify .gemini/commands/company/ contains TOML files
Check TOML syntax is valid
Restart Gemini CLI after installation

Check Installation Status

cvc status

This shows the installation status for both Claude Code and Gemini CLI.

Contributing

Contributions are welcome! Please:

Fork the repository
Create a feature branch
Make your changes
Submit a pull request

License

MIT License - see LICENSE for details.

Acknowledgments

Built for Claude Code by Anthropic and Gemini CLI by Google.

Uses:

Claude Code Agent Skills framework
Model Context Protocol (MCP) for cross-provider task management
Gemini CLI TOML Commands for Gemini integration
