@pentoai/ml-ralph-ui
v0.2.0
Published
Autonomous ML engineering agent with TUI
Readme
@pentoai/ml-ralph-ui
An autonomous ML engineering agent with a terminal user interface (TUI).
ml-ralph helps you iterate on ML projects by automating the experiment loop: planning, execution, analysis, and learning extraction. You interact with it through a clean TUI that lets you create PRDs, monitor agent execution, and review accumulated knowledge.
Key Features
- PRD-driven development: Define your ML project goals, constraints, and stories through conversational chat with Claude Code
- Autonomous execution: Agent runs continuously through stories until stopped, making decisions based on evidence
- Learning accumulation: Structured insights extracted from every iteration, searchable and actionable
- Research integration: Agent researches approaches and documents findings
- Training monitoring: Track long-running jobs with W&B integration
Architecture
ml-ralph is built as a TUI using Ink (React for terminals) with Bun as the runtime. It orchestrates Claude Code to perform actual ML engineering work.
┌─────────────────────────────────────────────────────────────┐
│ ml-ralph TUI │
│ ┌─────────────────────┐ ┌──────────────────────────────┐ │
│ │ Planning │ │ Monitor │ │
│ │ ┌───────┬───────┐ │ │ ┌─────────┬──────────────┐ │ │
│ │ │ CC │Learn- │ │ │ │ Agent │ Experiments │ │ │
│ │ │ Chat │ings/ │ │ │ │ Output │ + Metrics │ │ │
│ │ │ │Research│ │ │ │ │ │ │ │
│ │ └───────┴───────┘ │ │ └─────────┴──────────────┘ │ │
│ └─────────────────────┘ └──────────────────────────────┘ │
└─────────────────────────────────────────────────────────────┘
│
▼
┌───────────────┐
│ Claude Code │
└───────────────┘
│
▼
┌───────────────┐
│ Codebase │
│ + W&B │
└───────────────┘Two Modes
Planning Mode
- Chat with Claude Code to create/refine your PRD
- View accumulated learnings from past iterations
- Review research the agent has gathered
- See your story backlog
Monitor Mode
- Watch the agent execute stories in real-time
- View experiment metrics and training curves
- See current story and hypothesis
- Control agent (start/stop)
Quick Start
# Run directly with bunx
bunx @pentoai/ml-ralph-ui
# Or install globally
bun install -g @pentoai/ml-ralph-ui
ml-ralphRequirements
- Bun runtime
- Claude Code CLI installed and authenticated
Documentation
- Architecture - System design and layers
- Data Models - All type definitions
- File Layout - Project file structure
- Prompts - Claude Code system prompts
- MVP Plan - Development phases
Tech Stack
- Runtime: Bun
- TUI Framework: Ink (React for terminals)
- Language: TypeScript
- Agent: Claude Code
- Experiment Tracking: Weights & Biases
License
MIT
