ai-development-framework

v0.6.3

Published

19 days ago

The system around the AI that makes the AI reliable. Structured AI-assisted development with the PIV+E loop.

0High
0Medium
0Low

roby248

ai claude claude-code development-framework agentic piv-loop

AIDevelopmentFramework

The system around the AI that makes the AI reliable.

An open-source framework for structured AI-assisted software development. Built on the PIV+E loop (Plan, Implement, Validate, Evolve) — a methodology where the human owns planning and validation, the AI owns implementation, and the system gets smarter with every cycle.

Why This Exists

AI coding assistants are powerful but unpredictable. They lose context, repeat mistakes, and produce code that looks right but isn't. The solution isn't a better AI — it's a better system around the AI: plans that survive session boundaries, rules that prevent known mistakes, agents that maintain architectural knowledge, and a self-improving feedback loop.

Quick Start

Install via npm

# In your project directory
npx ai-development-framework init

This downloads the latest framework and sets up .claude/, CLAUDE.md, and docs/ in your project. Existing files are backed up as .backup — run /start to merge your project configuration with the new framework.

Update to latest version

npx ai-development-framework update

Updates framework files while preserving your project-specific configurations (knowledge bases, agent data, custom rules).

First session

# 1. Open Claude Code in your project
# 2. Check dependencies
/setup
# 3. Start working
/start

/start detects where you are and routes you to the right pipeline:

New project? Brainstorm, create knowledge base, generate issues
New feature? Plan and decompose into issues
Have an issue? Load context and start building
Found a bug? Debug systematically
Building a design artifact (prototype, deck, mockup, motion piece)? Asks you 3 clarifying questions, then routes through /brand-extract → huashu-design → 5D Visual Critique

The PIV+E Loop

Plan ──> Implement ──> Validate ──> Evolve
  ^                                    │
  └────────────────────────────────────┘

Plan — Brainstorm, write specs, decompose into issues Implement — TDD, execute plans, use specialist agents Validate — Automated checks, visual testing, code review Evolve — Update rules and knowledge from what was learned

Pipeline Commands

| Command | Phase | Purpose | |---------|-------|---------| | /start | Router | Detects what you're doing, routes to the right pipeline | | /prime | Plan | Loads codebase + knowledge base context into session | | /create-prd | Plan | Brainstorms and generates a PRD, seeds knowledge base | | /plan-project | Plan | Decomposes PRD into GitHub milestones, issues, and feature notes | | /plan-feature | Plan | Creates detailed implementation plan | | /execute | Implement | Executes plan with TDD, reads related knowledge | | /validate | Validate | Runs lint, tests, visual testing, code review | | /ship | Validate | Updates knowledge base, optional Codex review, commits, pushes, creates PR | | /evolve | Evolve | Updates rules and knowledge base | | /setup | Utility | Checks framework health and dependencies | | /merge-configs | Utility | Reconciles existing user config with framework files after init/update |

Scope Levels

| Level | You say... | Pipeline | |-------|-----------|---------| | L0 (Project) | "I have an idea" | Brainstorm > PRD > Issues > Knowledge base > per-issue L2 | | L1 (Feature) | "I need a feature" | Plan feature > Create issues > per-issue L2 | | L2 (Issue) | "I have issue #42" | Prime > Plan > Execute > Validate > Ship | | L3 (Bug) | "There's a bug" | Prime > Debug > Fix > Validate > Ship |

Production Code vs. Design Artifacts

Two distinct paths through the pipeline. The framework asks before guessing:

| Path | What it produces | Validation | |---|---|---| | Production code | Real shipping React/Next.js/Vue/Svelte components, routes, features | Implementer + spec-reviewer subagent pairing, lint, types, QA E2E, security review | | Design artifacts | Clickable HTML prototypes, slide decks (editable PPTX), motion (MP4/GIF), infographics, mockups | 5D Visual Critique (Hierarchy + Functionality blocking; Philosophy / Execution / Innovation advisory) | | Hybrid | Artifact first, then production implementation | Both, in sequence; the artifact's handoff bundle becomes the spec for the code phase |

When /start or /plan-feature detects ambiguous UI intent on an in-development project, it runs a 3-question clarifying script (see .claude/references/design-clarifying-script.md) instead of keyword-classifying. Your answers persist in plan frontmatter so /execute reads them rather than re-classifying.

The design-artifact path uses huashu-design (external skill, install with npx skills add alchaincyf/huashu-design) and the framework-local /brand-extract skill that writes a canonical .design-system/brand-spec.md once per project so brand assets aren't asked for every run. Project-state-aware: fresh projects route to a Direction Advisor (5 schools × 20 design philosophies); ad-hoc projects (no Tailwind / no tokens) require an explicit opt-in before "codifying" their existing patterns. License acknowledgement is required at first dispatch per repo (huashu-design is non-commercial-by-default).

What's Inside

.claude/
├── commands/       15 pipeline commands (incl. 4 /kb commands)
├── agents/         4 specialist agents + template
├── skills/         3 framework-local skills (brand-extract, e2e-test, playwright-cli)
├── rules/          9 auto-loading domain rules + template (incl. frontend-antislop)
├── references/     References + templates (PRD, plan, issue, patterns, KB,
│                   anti-slop catalogue, design clarifying script)
├── hooks/          8 guardrails (branch protection, spec-reviewer marker enforcement,
│                   evolve/plan reminders, output compaction)
└── .versions.json  Pinned external-skill SHAs + brand-spec schema contract
cli/
└── kb-search.js    TF-IDF search tool for knowledge base

Knowledge Base (Optional)

A unified LLM knowledge base inspired by Karpathy's LLM Knowledge Bases workflow. External research and project knowledge live together as a flat wiki. The LLM ingests raw sources, compiles them into wiki articles, auto-searches during task work, and grows the wiki from every coding session.

.obsidian/           # or any custom path
├── raw/             # Ingested source material (articles, papers, docs, repos, sessions)
│   └── _manifest.md # Index of all raw sources with status
├── wiki/            # Unified wiki — all knowledge as flat .md files with frontmatter
│   ├── _index.md    # Master index grouped by type
│   └── _tags.md     # Tag registry with article counts
└── _search/
    ├── index.json       # TF-IDF search index (auto-generated)
    └── lean-index.json  # Lean (metadata-only) index for cheap context loading

KB Commands:

/kb ingest <source> — ingest URL, file, repo, or session learnings
/kb compile — expand stubs, cross-link, extract concepts, health check
/kb search <query> — TF-IDF search across wiki
/kb ask <question> — Q&A against wiki, answer filed back as new article

Enable it by adding to your project's CLAUDE.md:

## Knowledge Base

Path: .obsidian/

Works with Obsidian for a navigable UI, but Obsidian is not required — it's just markdown.

Review Layers

The framework supports three complementary review layers:

| Layer | When | What it checks | |-------|------|---------------| | Superpowers Code Review | /validate Phase 5 | Implementation defects, plan adherence, security, edge cases | | 5D Visual Critique | /validate Phase 2.5 (design artifacts only) | Visual hierarchy + functionality (blocking ≥7/10), philosophy / execution / innovation (advisory). Backported from huashu-design with the self-grading bias fixed via an independent scoring subagent. | | Codex Adversarial Review | /ship (optional) | Design choices, tradeoffs, assumptions, alternative approaches |

The adversarial review requires an OpenAI subscription and the Codex plugin. It questions whether the approach is right; the code review checks whether the implementation is correct; the visual critique scores whether design artifacts ship-ready quality across the dimensions that LLMs can judge reliably (hierarchy and functionality), with the more taste-driven dimensions surfaced as advisory rather than gating.

Specialist Agents

| Agent | Purpose | |-------|---------| | architect-agent | Codebase knowledge base. Call before structural changes. | | tester-agent | Web browser testing via playwright-cli. | | mobile-tester-agent | Mobile app testing via mobile-mcp. | | ui-ux-analyzer | Design audits with screenshots and reports. |

CLI Reference

npx ai-development-framework init             Install framework (backs up existing files)
npx ai-development-framework update           Update framework (backs up, run /start to merge)
npx ai-development-framework lean-index       Rebuild the lean (metadata-only) KB index
npx ai-development-framework merge-settings   Deep-merge user .claude/settings.local.json with framework version
npx ai-development-framework --version        Show version
npx ai-development-framework --help           Show help

Documentation

Methodology — The PIV+E loop explained
Getting Started — Setup and walkthrough
Command Reference — All commands
Customization — Adding agents, rules, skills, knowledge base
Plugin Install Guide — Dependencies
Changelog — v0.4 (PIV+E hardening), v0.5 (design-artifact pipeline), v0.6 (context budgets + /merge-configs decomposition), v0.6.1 (consumer-project CLI invocations), v0.6.2 (unified CLI distribution), v0.6.3 (three-way merge + post-merge budget guard)

Credits

Synthesizes approaches from:

Cole Medin's AI Coding Summit Workshop — PIV loop, context resets, plan-as-specification
Real-world battle-tested Claude Code configurations from production development

License

MIT