@decocms/parity

v0.1.0

Published

2 days ago

E2E parity validator for site migrations. Compares prod vs cand and reports UI, functional, SEO, visual, and Web Vitals deltas with an LLM-ranked HTML report.

@decocms/parity

E2E parity validator for site migrations. CLI that runs comparative tests between two URLs — prod (source of truth) and cand (migrated version) — and produces a single HTML report flagging UI, functional, SEO, and Web Vitals regressions.

Built originally for Fresh → TanStack Start migrations of Deco storefronts, but the checks are generic enough for any side-by-side migration.

Status: alpha. APIs and report layout may still change.

What it checks

| Check | What it catches | | ----------------------------- | ---------------------------------------------------------------------- | | HTTP status parity | Routes that 404 / 500 in cand but worked in prod | | Console errors | New hydration mismatches, failed fetches, JS exceptions | | HTML structural diff | Section / element counts drifting beyond tolerance | | Meta / SEO parity | <title>, <meta description>, canonical, og:*, twitter:*, JSON-LD | | Visual diff (LLM Vision) | Sections missing, wrong hero, broken shelf, layout shifts | | Purchase journey | Home → PLP → PDP → CEP → cart → checkout completes in both | | Network summary | Request count / bytes / cache hit rate | | Web Vitals | LCP, FCP, TTFB, INP, CLS — mobile | | Image loading health | Missing alt text, no srcset, broken <img> | | Lazy section presence | Deco /deco/render and /_loader/* routes responding | | SEO deep audit | robots.txt, sitemap, noindex regressions | | Cache coverage | Cache hit rate, opportunities to cache |

All results are aggregated (optionally via Claude) and ranked by severity. Each issue includes screenshots, reproduction, and a suggested fix.

Quickstart

# Install
npm install -g @decocms/parity
# or run without install
npx @decocms/parity run --prod ... --cand ...

# First-time smoke run (~30s, no LLM needed)
parity run \
  --prod https://oldsite.com \
  --cand https://newsite.example.dev \
  --preset smoke --open

# Full audit with visual diff (set ANTHROPIC_API_KEY for the LLM bits)
ANTHROPIC_API_KEY=sk-... parity run \
  --prod https://oldsite.com \
  --cand https://newsite.example.dev \
  --preset full --open

Presets

--preset smoke — homepage only, mobile only, no LLM, no extra crawl. Use to validate the pipeline runs at all (~30s).

--preset full — purchase journey, mobile + desktop, 5 visual diff pages, 10 vitals pages, LLM enabled if API key is set. Use for releases.

--preset ci — purchase journey on mobile, smaller crawls (3 visual + 5 vitals). Tuned for CI runtime.

Individual flags always override the preset.

Commands

| Command | What it does | | ------------------ | ----------------------------------------------------------------------------- | | parity run | Full comparison run between two URLs | | parity journey | CI-friendly: only the purchase journey, with JUnit / GitHub annotations | | parity vitals | Crawl N pages, compare Web Vitals prod vs cand | | parity cache | CDN cache analysis, opportunities, request categorization | | parity serve | Local HTTP server with iframe proxy so side-by-side tab works for any site | | parity report | Reopen an existing run's HTML report | | parity compare | Compare a run against a baseline | | parity baseline | Manage baselines (set, list, unset) | | parity list | List saved runs | | parity prompt | Export issues as a Markdown prompt for any LLM | | parity explain | LLM deep-dive on a specific issue (needs ANTHROPIC_API_KEY) | | parity learned | Inspect the learned-selectors library |

Run any command with --help for the full flag list.

Output

./parity-output/runs/<runId>/
├── report.html       # standalone, open in any browser
├── report.json       # structured output for CI / tooling
├── screenshots/      # per-page, per-viewport, per-side; includes pixelmatch heatmaps
├── har/              # Playwright HARs (one per viewport/side)
├── traces/           # Playwright traces — drag into trace.playwright.dev
└── console/          # console messages captured per page

Plus ./parity-baselines/<name>.json for git-trackable baseline manifests.

Visual Diff tab

When --visual-pages > 0 and an LLM key is set, the report's Visual Diff tab shows, per page:

prod screenshot · cand screenshot · pixelmatch heatmap, side-by-side
list of Deco sections present in prod but missing in cand (auto-detected from data-section)
semantic differences identified by Claude Vision (region, type, severity, description)
one-click "Export visual prompt" — Markdown ready to paste into Claude / ChatGPT to generate the fix

The visual prompt focuses only on visual diffs, references the screenshot paths, and includes migration-specific guidance (register section in setup.ts, loader shape drift, useDevice hydration, etc).

Configuration (optional)

.parityrc.json at the project root — selector overrides and run defaults:

{
  "cep": "01310-100",
  "selectors": {
    "categoryLink": "header a[href*='/c/']",
    "productCard": "[data-product-card] a",
    "buyButton": "button:has-text('Comprar')",
    "minicartTrigger": "[data-minicart-trigger]",
    "cepInputPdp": "input[name='shipping-zipcode']",
    "cepInputCart": "input[name='cart-zipcode']",
    "checkoutButton": "a:has-text('Finalizar compra')"
  }
}

.parityignore — noise suppression:

{
  "ignoreSelectorsVisual": [".banner-rotativo", "#trustvox-trustbar"],
  "ignoreRequestPatterns": ["*.gif?t=*", "**/pixel*"],
  "ignoreConsolePatterns": ["ERR_BLOCKED_BY_CLIENT"]
}

Both files are gitignored by default — they're per-user, not per-repo.

LLM (optional)

Set one environment variable to unlock LLM-driven features:

ANTHROPIC_API_KEY — direct Anthropic API (Claude Sonnet 4.6 with prompt caching). Preferred.
OPENROUTER_API_KEY — OpenRouter (default model anthropic/claude-sonnet-4.5; override with PARITY_OPENROUTER_MODEL).

What LLM enables:

Issue aggregation — flat raw issues become ranked, deduplicated top issues
Selector discovery — infers categoryLink / productCard / buyButton etc. from HTML
Step recovery — when a flow step fails, suggest a working selector
Visual semantic diff — Claude Vision interprets pixel diffs as missing sections / wrong layout / etc.
PLP picker / PDP matcher — cross-site disambiguation
parity explain <issue> — root-cause analysis on demand

Without any key, the CLI still runs and outputs raw check results — only the smart bits are skipped. Deterministic fallbacks always apply.

Cost — A --preset full run with visual diff uses ~6-12 Claude calls (each with 2 screenshots). Roughly $0.05–$0.20 per run on Sonnet 4.6 with prompt caching.

CI usage

- name: Parity check
  env:
    ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}
  run: |
    npx @decocms/parity journey \
      --prod ${{ vars.PROD_URL }} \
      --cand ${{ env.PR_PREVIEW_URL }} \
      --junit parity.junit.xml \
      --github

parity journey emits GitHub Actions ::error and ::warning annotations for each failed step and writes JUnit XML.

Development

git clone [email protected]:decocms/parity.git
cd parity
bun install
bunx playwright install chromium
bun run dev run --prod ... --cand ...

Tests: bun run test (vitest). Typecheck: bun run check. Build: bun run build.

License

MIT

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

@decocms/parity

What it checks

Quickstart

Presets

Commands

Output

Visual Diff tab

Configuration (optional)

LLM (optional)

CI usage

Development

License