mertgsd

v1.3.2

Published

11 days ago

AI agent framework for structured project execution. 18 agents + 39 workflows. Plan, build, audit, deploy — fully autonomous or human-in-the-loop. https://github.com/mertdlkr/MertGSD

Downloads

772

0High
0Medium
0Low

mertdlkr

ai ai-agents claude-code copilot codex project-management autonomous-coding multi-agent anti-hallucination mertgsd

MertGSD

The open-source AI agent framework for structured project execution. Plan, build, audit, and deploy — fully autonomous or human-in-the-loop.

npm i -g mertgsd && mertgsd install

MertGSD is a multi-agent system that turns AI coding assistants into complete project management engines. 18 specialized agents + 39 workflows that handle everything from initial research to production deployment, brownfield mapping, milestone ops, debugging, and maintenance.

Works with Claude Code, GitHub Copilot CLI, OpenAI Codex CLI, Cursor, Windsurf, Cline, Antigravity, and any AI coding tool that supports custom agents or slash commands.

"I built this because every AI coding tool is great at writing code but terrible at managing projects. MertGSD bridges that gap." — Mert Ali Dalkır

Why MertGSD?

The problem: AI coding assistants write code fast but lose context between sessions, hallucinate project state, skip testing, forget to audit security, and can't manage multi-phase projects.

The solution: MertGSD adds a structured execution layer on top of any AI coding tool:

Context management — .planning/ directory persists project state across sessions
Anti-hallucination — every claim verified against real files and real command output
Quality gates — security, performance, mobile, SEO, accessibility, brand audits built-in
Autonomous execution — give a prompt, walk away, get notified when done
Phased delivery — complex projects broken into manageable phases with atomic commits

Quick Start

Option A: npm (recommended)

npm i -g mertgsd
cd your-project
mertgsd install

That's it. Works on macOS, Linux, and Windows.

One-time use without global install:

cd your-project
npx mertgsd install

Update later:

npm update -g mertgsd
mertgsd update

Option B: Git clone (manual)

git clone https://github.com/mertdlkr/MertGSD.git

macOS / Linux / WSL:

bash MertGSD/mertgsd-install.sh /path/to/your-project

Windows (PowerShell):

# Use Git Bash
bash MertGSD/mertgsd-install.sh C:/Users/you/your-project

# Or manual copy
Copy-Item -Recurse MertGSD/.agent your-project/.agent

Start building

Open any AI coding tool (Claude Code, Copilot, Codex, Cursor, Antigravity, etc.) and follow this flow:

Step 1 — Plan your project:

/mertgsd-new-project

It will ask you questions about what you want to build, research the domain, and create a structured project plan. No code is written at this stage.

Step 2 — Build it autonomously:

/mertgsd-super

After planning, run this to execute the entire project autonomously — plan, build, test, audit, and deploy. Walk away and get notified when it's done.

Or skip planning and go full autonomous with a prompt:

/mertgsd-super "Build a SaaS dashboard with auth, billing, and analytics"

See all 39 commands:

/mertgsd-help

Works with any AI coding tool. For non-slash-command tools, just tell them: "Read .agent/workflows/mertgsd-new-project.md and follow the workflow."

CLI Commands

mertgsd install [path]   # Install MertGSD to a project
mertgsd update [path]    # Update to latest version
mertgsd info             # Show agent/workflow counts
mertgsd --version        # Show version
mertgsd --help           # Show help

Usage by Platform

Claude Code (CLI)

MertGSD is natively designed for Claude Code. All workflows are slash commands:

cd your-project
claude
/mertgsd-new-project          # Initialize project
/mertgsd-plan 1               # Plan phase 1
/mertgsd-execute 1            # Execute with atomic commits
/mertgsd-audit                # Full quality audit
/mertgsd-deploy               # Deploy to production

GitHub Copilot CLI / OpenAI Codex CLI

The .agent/ directory works as context files. Point the CLI to read them:

# Copilot CLI
copilot "Read .agent/workflows/mertgsd-new-project.md and follow the workflow to initialize this project"

# Codex CLI
codex "Follow the workflow in .agent/workflows/mertgsd-super.md to build: [your prompt]"

Add a .github/copilot-instructions.md referencing MertGSD:

When I say /mertgsd-[command], read and follow .agent/workflows/mertgsd-[command].md
Available agents are in .agent/agents/

Cursor / Windsurf / Cline / VS Code (IDE Agents)

Install MertGSD to your project: ./mertgsd-install.sh .
Open your project in the IDE
Reference workflows in chat:
- "Follow .agent/workflows/mertgsd-new-project.md to set up this project"
- "Use the mertgsd-security-auditor agent to scan for vulnerabilities"
The IDE agent reads the .agent/ files as context and follows the structured workflows

Any LLM / Custom Setup

MertGSD agents and workflows are plain Markdown files. Any LLM that can read files and execute commands can use them:

System prompt: "You have access to a project management framework in .agent/.
Read .agent/workflows/mertgsd-help.md for available commands. When the user requests
a workflow, read and follow the corresponding .md file step by step."

Default Workflow (The MertGSD Loop)

This is how most projects flow from idea to production:

┌─────────────────────────────────────────────────────────────┐
│                    THE MERTGSD LOOP                           │
│                                                              │
│  ┌──────────────┐                                           │
│  │ /mertgsd-new-project │  ← You describe what to build         │
│  └──────┬───────┘                                           │
│         │  Creates: PROJECT.md, REQUIREMENTS.md,            │
│         │  ROADMAP.md, STATE.md, mertgsd-config.json            │
│         ▼                                                    │
│  ┌──────────────┐                                           │
│  │ /mertgsd-plan N  │  ← Research + create task plans           │
│  └──────┬───────┘                                           │
│         │  Creates: RESEARCH.md, PLAN.md files              │
│         ▼                                                    │
│  ┌──────────────┐                                           │
│  │ /mertgsd-execute N│  ← Build with atomic commits             │
│  └──────┬───────┘                                           │
│         │  Creates: SUMMARY.md, VERIFICATION.md             │
│         │  Sends: push notification ✓                       │
│         ▼                                                    │
│  ┌──────────────┐                                           │
│  │ /mertgsd-verify N│  ← User acceptance testing                │
│  └──────┬───────┘                                           │
│         │                                                    │
│         ├── Pass? → Next phase (repeat from /mertgsd-plan N+1)  │
│         └── Gaps? → /mertgsd-plan N --gaps (fix and re-execute) │
│                                                              │
│  After all phases:                                           │
│  ┌──────────────┐                                           │
│  │ /mertgsd-audit   │  ← Security, performance, mobile, SEO,   │
│  └──────┬───────┘    accessibility, brand review             │
│         ▼                                                    │
│  ┌──────────────┐                                           │
│  │ /mertgsd-deploy  │  ← Build, deploy, verify live URL         │
│  └──────────────┘                                           │
│                                                              │
│  SHORTCUT: /mertgsd-super does ALL of this autonomously         │
└─────────────────────────────────────────────────────────────┘

For quick tasks that don't need full project setup:

/mertgsd-quick "Add dark mode toggle to the navbar"

Command Surface

Core

| Command | What it does | |---------|-------------| | /mertgsd-super [prompt] | Full autonomy — prompt to production, zero human input | | /mertgsd-new-project | Interactive setup: questions → research → requirements → roadmap → config | | /mertgsd-no-halluc [question] | Verified Q&A with mandatory external research and citations |

Build Cycle

| Command | What it does | |---------|-------------| | /mertgsd-discuss [phase] | Capture implementation decisions before planning | | /mertgsd-plan [phase] | Research and create executable task plans | | /mertgsd-execute [phase] | Execute plans with atomic git commits | | /mertgsd-verify [phase] | User acceptance testing against phase goals |

Quality & Review

| Command | What it does | |---------|-------------| | /mertgsd-audit | Full project audit — 6 agents in parallel | | /mertgsd-review [phase] | PR-style code review with multi-agent analysis | | /mertgsd-refactor [desc] | Safe refactoring with test snapshots + auto-rollback |

Operations

| Command | What it does | |---------|-------------| | /mertgsd-deploy | Build → deploy → verify (CF Pages / Vercel / Netlify) | | /mertgsd-migrate [desc] | DB migration with safety gates + rollback SQL | | /mertgsd-setup-config | Configure ntfy, deploy platform, Supabase, tests |

Utilities

| Command | What it does | |---------|-------------| | /mertgsd-quick [desc] | Small ad-hoc task with MertGSD guarantees | | /mertgsd-progress | Current state, blockers, next steps | | /mertgsd-commit-memory | Distill context into long-term memory | | /mertgsd-help | Show all commands |

Brownfield & Milestones

| Command | What it does | |---------|-------------| | /mertgsd-map-codebase | Build a real codebase map before planning existing projects | | /mertgsd-new-milestone | Start the next milestone on an existing project | | /mertgsd-audit-milestone | Verify milestone-wide requirement coverage and integration | | /mertgsd-complete-milestone | Archive a shipped milestone and prepare the next cycle | | /mertgsd-plan-milestone-gaps | Turn milestone audit gaps into roadmap work |

Continuity & Maintenance

| Command | What it does | |---------|-------------| | /mertgsd-debug [issue] | Start or resume a structured debug session | | /mertgsd-pause-work | Save a precise handoff for the next session | | /mertgsd-resume-work | Restore project context and route to the next step | | /mertgsd-add-todo | Capture follow-up work without losing context | | /mertgsd-check-todos | Review pending todos and route them into action | | /mertgsd-health | Validate .planning/ integrity and repair safe issues | | /mertgsd-update | Update a MertGSD clone or explain reinstall path | | /mertgsd-settings | Inspect or update MertGSD project settings | | /mertgsd-set-profile | Set the preferred model profile | | /mertgsd-research-phase | Run standalone phase research before planning | | /mertgsd-add-phase | Append a new roadmap phase | | /mertgsd-insert-phase | Insert an urgent decimal phase | | /mertgsd-remove-phase | Remove an unstarted future phase | | /mertgsd-add-tests | Add or strengthen tests for an area or phase | | /mertgsd-reapply-patches | Reapply locally saved patch overlays after updates |

All Agents (18)

Planning & Execution

| Agent | What it does | |-------|-------------| | mertgsd-executor | Executes plans with atomic commits, deviation handling, checkpoints | | mertgsd-planner | Creates task plans with dependency analysis and wave ordering | | mertgsd-roadmapper | Creates phased project roadmaps from requirements | | mertgsd-phase-researcher | Researches implementation approach before planning | | mertgsd-project-researcher | Researches domain, stack, ecosystem before roadmap | | mertgsd-research-synthesizer | Merges parallel research outputs into actionable summaries |

Verification

| Agent | What it does | |-------|-------------| | mertgsd-verifier | Goal-backward verification — did the code deliver what was promised? | | mertgsd-plan-checker | Pre-execution plan quality check | | mertgsd-integration-checker | Cross-phase integration and E2E flow verification | | mertgsd-debugger | Scientific method bug investigation with hypothesis testing | | mertgsd-codebase-mapper | Explores and documents codebase structure |

Quality Auditors

| Agent | What it does | |-------|-------------| | mertgsd-security-auditor | OWASP top 10, dependency audit, secrets detection, CSP | | mertgsd-performance-tester | Lighthouse, bundle size, lazy loading, render performance | | mertgsd-mobile-auditor | Responsive design, touch targets, viewport, overflow | | mertgsd-seo-checker | Metadata, sitemap, structured data, hreflang, headings | | mertgsd-accessibility-tester | WCAG 2.2 AA, ARIA, keyboard nav, contrast, focus | | mertgsd-brand-reviewer | Brand consistency, copy quality, design system, typography |

Infrastructure

| Agent | What it does | |-------|-------------| | mertgsd-notifier | Push notifications via ntfy.sh at milestones |

Anti-Hallucination System

The #1 problem with AI coding: it says it did something but didn't actually do it. MertGSD has 8 structural safeguards:

| Protection | How | |------------|-----| | File-First Context | Always re-reads files before acting — never trusts memory | | Source Verification | Technical claims verified against docs, not training data | | Confidence Levels | Research tagged HIGH (verified) / MEDIUM (searched) / LOW (memory) | | Verification Gates | Every task output is READ — not assumed to pass | | No Auto-Pass | Build/test results never marked passed without actual output | | Decision Attribution | Tracks USER-decided vs AI-suggested for audit trail | | Context Freshness | New conversation recommended between workflow steps | | Checkpoint Integrity | Waits for human approval — never skips confirmation |

Push Notifications

Configure during /mertgsd-new-project or /mertgsd-setup-config:

"Want push notifications?" → yes
"ntfy channel name?" → my-project-alerts

You'll get notified on your phone when:

A phase completes
Deploy succeeds or fails
Audit finds critical issues
Errors block execution

Uses ntfy.sh — free, open-source, no account needed.

File Structure

your-project/
├── .agent/                          ← MertGSD system (copied by install)
│   ├── agents/                      ← 18 specialized agents
│   │   ├── mertgsd-executor.md
│   │   ├── mertgsd-planner.md
│   │   ├── mertgsd-security-auditor.md
│   │   └── ... (18 total)
│   └── workflows/                   ← 39 workflows
│       ├── mertgsd-new-project.md
│       ├── mertgsd-super.md
│       ├── mertgsd-audit.md
│       └── ... (39 total)
│
└── .planning/                       ← Project state (created by mertgsd-new-project)
    ├── PROJECT.md                   ← Vision and context
    ├── REQUIREMENTS.md              ← v1/v2 requirements
    ├── ROADMAP.md                   ← Phases and progress
    ├── STATE.md                     ← Current position (living memory)
    ├── mertgsd-config.json              ← Settings (ntfy, deploy, tests)
    ├── research/                    ← Domain research
    └── phases/
        └── 01-phase-name/
            ├── 01-01-PLAN.md        ← Task plan
            ├── 01-01-SUMMARY.md     ← Execution summary
            ├── 01-VERIFICATION.md   ← Goal verification
            ├── SECURITY-AUDIT.md    ← Security report
            ├── PERFORMANCE-REPORT.md
            ├── MOBILE-AUDIT.md
            ├── SEO-REPORT.md
            ├── ACCESSIBILITY-REPORT.md
            ├── BRAND-REVIEW.md
            └── AUDIT.md             ← Unified audit

Real-World Example

This system was built while shipping real production projects:

NexVar.io — AI-first software studio site (Next.js 16, i18n, AI chatbot, newsletter, blog CMS)
Vault — Personal finance tracker (Supabase, multi-currency, AI assistant, real-time rates)

Every agent and workflow exists because it was needed during real development. Read more at mertdlkr.com.

Requirements

Required:

Git
Any AI coding CLI or IDE (Claude Code, Copilot, Codex, Cursor, Windsurf, Cline)

Optional:

Node.js 18+ (for web projects)
ntfy app (push notifications)
Supabase / Prisma (database features)
Cloudflare / Vercel / Netlify (deployment)

Model Compatibility

MertGSD works with any LLM:

| Model | Status | |-------|--------| | Claude (Anthropic) | Fully tested | | GPT / Codex (OpenAI) | Compatible | | Gemini (Google) | Compatible | | Llama / Mistral / DeepSeek | Compatible |

Model-agnostic by design — uses structural safeguards (file reads, command verification, user gates) instead of model-specific prompting.

Contributing

PRs welcome. If you add a new agent or workflow, follow the existing format in .agent/agents/ and .agent/workflows/.

Author

Mert Ali Dalkır — AI-first builder, co-founder of NexVar.

I build tools and systems that make AI-powered development faster, more reliable, and more structured. MertGSD is the execution framework behind everything I ship.

License

MIT

If MertGSD saves you time, give it a star. It helps others find it.

Built with conviction at mertdlkr.com.