@pentoai/ml-ralph-ui

v0.2.0

Published

13 days ago

Autonomous ML engineering agent with TUI


pento.ai

@pentoai/ml-ralph-ui

An autonomous ML engineering agent with a terminal user interface (TUI).

ml-ralph helps you iterate on ML projects by automating the experiment loop: planning, execution, analysis, and learning extraction. You interact with it through a clean TUI that lets you create PRDs, monitor agent execution, and review accumulated knowledge.

Key Features

PRD-driven development: Define your ML project goals, constraints, and stories through conversational chat with Claude Code
Autonomous execution: Agent runs continuously through stories until stopped, making decisions based on evidence
Learning accumulation: Structured insights extracted from every iteration, searchable and actionable
Research integration: Agent researches approaches and documents findings
Training monitoring: Track long-running training jobs through the Weights & Biases (W&B) integration
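
To make the PRD, story, and learning concepts above more concrete, here is a minimal TypeScript sketch of how they might be modeled. Every name in it (`Prd`, `Story`, `Learning`, `searchLearnings`, `nextStory`) is hypothetical and chosen for illustration; the package's real type definitions live in its Data Models document and may look quite different.

```typescript
// Hypothetical shapes for illustration only; not ml-ralph's actual types.
type StoryStatus = "pending" | "in_progress" | "done";

interface Story {
  id: string;
  title: string;
  hypothesis: string;
  status: StoryStatus;
}

interface Prd {
  goal: string;
  constraints: string[];
  stories: Story[];
}

interface Learning {
  storyId: string;
  insight: string;
  evidence: string;
  tags: string[];
}

// Case-insensitive substring search over insight text and tags.
// An empty or whitespace-only query matches nothing.
function searchLearnings(learnings: Learning[], query: string): Learning[] {
  const q = query.trim().toLowerCase();
  if (q === "") return [];
  return learnings.filter(
    (l) =>
      l.insight.toLowerCase().includes(q) ||
      l.tags.some((t) => t.toLowerCase().includes(q)),
  );
}

// The next story to execute: the first one still pending,
// or undefined when nothing is left in the backlog.
function nextStory(prd: Prd): Story | undefined {
  return prd.stories.find((s) => s.status === "pending");
}
```

Keeping learnings as plain records with tags is one simple way to make them searchable; a real implementation could just as well index them differently.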

Architecture

ml-ralph is built as a TUI using Ink (React for terminals) with Bun as the runtime. It orchestrates Claude Code to perform actual ML engineering work.

┌─────────────────────────────────────────────────────────────┐
│                     ml-ralph TUI                            │
│  ┌─────────────────────┐  ┌──────────────────────────────┐  │
│  │     Planning        │  │         Monitor              │  │
│  │  ┌──────┬────────┐  │  │  ┌─────────┬──────────────┐  │  │
│  │  │  CC  │Learn-  │  │  │  │ Agent   │ Experiments  │  │  │
│  │  │ Chat │ings/   │  │  │  │ Output  │ + Metrics    │  │  │
│  │  │      │Research│  │  │  │         │              │  │  │
│  │  └──────┴────────┘  │  │  └─────────┴──────────────┘  │  │
│  └─────────────────────┘  └──────────────────────────────┘  │
└─────────────────────────────────────────────────────────────┘
                            │
                            ▼
                    ┌───────────────┐
                    │  Claude Code  │
                    └───────────────┘
                            │
                            ▼
                    ┌───────────────┐
                    │   Codebase    │
                    │   + W&B       │
                    └───────────────┘
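
The experiment loop described above (planning, execution, analysis, learning extraction, repeated per story until stopped) can be sketched as a generator that decides which step comes next. This is only an illustration under assumed names: `Phase`, `Step`, and `steps` are hypothetical, and the actual call into Claude Code is deliberately left to the caller rather than guessed at here.

```typescript
// Hypothetical sketch of the experiment loop; not ml-ralph's actual implementation.
type Phase = "planning" | "execution" | "analysis" | "learning";

const PHASES: readonly Phase[] = ["planning", "execution", "analysis", "learning"];

interface Step {
  story: string;
  phase: Phase;
}

// Yields one step at a time, walking each story through all four phases.
// shouldStop is checked before every step, so a stop request takes effect
// at the next phase boundary rather than mid-phase.
function* steps(
  stories: readonly string[],
  shouldStop: () => boolean,
): Generator<Step> {
  for (const story of stories) {
    for (const phase of PHASES) {
      if (shouldStop()) return;
      yield { story, phase };
    }
  }
}
```

A driver would iterate over `steps(...)`, hand each step to the agent, and await the result before pulling the next one. Separating "what runs next" from "how it runs" keeps the scheduling logic testable without the agent installed.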

Two Modes

Planning Mode

Chat with Claude Code to create/refine your PRD
View accumulated learnings from past iterations
Review research the agent has gathered
See your story backlog

Monitor Mode

Watch the agent execute stories in real time
View experiment metrics and training curves
See current story and hypothesis
Control agent (start/stop)
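
One way to picture the two modes and the start/stop control is as a small state reducer, a pattern that fits a React-based TUI. The sketch below is an assumption about how such state could be organized; `UiState`, `Action`, and `reduce` are hypothetical names, not part of the package.

```typescript
// Hypothetical UI state sketch; not ml-ralph's actual state model.
type Mode = "planning" | "monitor";
type AgentStatus = "idle" | "running";

interface UiState {
  mode: Mode;
  agent: AgentStatus;
  currentStory: string | null;
}

type Action =
  | { type: "toggle_mode" }
  | { type: "start"; story: string }
  | { type: "stop" };

const initialState: UiState = {
  mode: "planning",
  agent: "idle",
  currentStory: null,
};

// Pure transition function: returns a new state and never mutates the input.
function reduce(state: UiState, action: Action): UiState {
  switch (action.type) {
    case "toggle_mode":
      return {
        ...state,
        mode: state.mode === "planning" ? "monitor" : "planning",
      };
    case "start":
      // Ignore a second start while the agent is already running.
      if (state.agent === "running") return state;
      return { ...state, agent: "running", currentStory: action.story };
    case "stop":
      return { ...state, agent: "idle" };
  }
}
```

In this sketch, switching modes never affects the agent, so the agent can keep running while the user returns to Planning Mode to review learnings.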

Quick Start

# Run directly with bunx
bunx @pentoai/ml-ralph-ui

# Or install globally
bun install -g @pentoai/ml-ralph-ui
ml-ralph

Requirements

Bun runtime
Claude Code CLI installed and authenticated

Documentation

Architecture - System design and layers
Data Models - All type definitions
File Layout - Project file structure
Prompts - Claude Code system prompts
MVP Plan - Development phases

Tech Stack

Runtime: Bun
TUI Framework: Ink (React for terminals)
Language: TypeScript
Agent: Claude Code
Experiment Tracking: Weights & Biases

License

MIT
