@eval-kit/ui
v0.3.1
Published
React components and pages for scoring, reviewing, and diffing eval runs. Tailwind + Radix.
Maintainers
Readme
@eval-kit/ui
React components for scoring, reviewing, and diffing eval runs. Tailwind + Radix. Built for eval-kit — the scoring cockpit for research agents.
Install
npm install @eval-kit/ui @eval-kit/core
# or
pnpm add @eval-kit/ui @eval-kit/corePeer deps: react ^18 || ^19, react-dom ^18 || ^19. Bring your own Tailwind setup.
What's in the box
- Page-level surfaces —
DashboardPage,RunReviewPage,DiffPage. Drop-in views composed from the primitives below. - Scoring primitives —
ScoreSlider,StepReviewCard,DimensionExplainer,AutosaveBadge. The 0–3 rubric UI. - Inbox —
InboxView,InboxRow. Prioritized queue with keyboard-first scoring (1/2/3,j/k,a,s,Enter). - Diff view —
DiffRow,RegressionSummary,ToolCallDelta. Two-run side-by-side comparison. - Shell —
AppShell,SidebarNav,CommandPalette,ThemeToggle,SessionTracker. The dashboard chrome. - Design primitives —
Button,Card,Dialog,Popover,Tabs,Tooltip,Pill,Badge,StatCard,Sparkline,Kbd,EmptyState,InlineHelp,ProgressRing,Select,Input,Textarea. All Tailwind + Radix-based, themed via CSS variables.
Usage
Import the styles once at your app root:
import "@eval-kit/ui/styles.css";Then use components directly:
import { RunReviewPage } from "@eval-kit/ui";
import type { ScoredRun, EvalSuite } from "@eval-kit/core";
export default function ReviewRoute({ run, suite }: { run: ScoredRun; suite: EvalSuite }) {
return (
<RunReviewPage
run={run}
suite={suite}
onSave={async (updated) => { /* server action */ }}
/>
);
}The dashboard at apps/dashboard in the main repo is a complete reference — every component has a real usage you can copy.
Tailwind setup
The primitives use a token system (bg-surface, text-fg, border-edge, etc.) defined in src/styles/tokens.css. Either:
- Import
@eval-kit/ui/styles.css(recommended — gets you the tokens and base styles) - Or copy
src/styles/tokens.cssand customize for your brand
Tailwind config: ensure content includes node_modules/@eval-kit/ui/dist/**/*.{js,mjs}.
Status
v0.3.0 — API stable for the 0.3.x line. Component surfaces may grow in v0.4 (multi-reviewer, agreement view) and v0.5 (training proposals).
Links
License
MIT
