@riddledc/riddle-proof-packs

v0.8.26

Published

2 months ago

Reusable proof pack profiles and metadata helpers for the Riddle proof framework.

0High
0Medium
0Low

joeddjd

@riddledc/riddle-proof-packs

Reusable starter profile definitions and proof-pack metadata for Riddle Proof.

Exports

RIDDLE_PROOF_PACK_PROFILES:
- Canonical profile map keyed by profile name.
RIDDLE_PROOF_PACK_MANIFEST:
- Array of normalized profile entries with optional pack_id, pack_public_name, required_receipts, and purpose metadata.
listRiddleProofPackProfiles():
- Returns all normalized profile manifests.
listRiddleProofPacks():
- Alias for pack listing used by downstream CLI helpers.
getRiddleProofPackProfile(name):
- Fetch a profile by name.
getRiddleProofProfilesByPackId(packId):
- Fetch all profiles tagged with a given pack id.
getPackEnabledRiddleProofPackProfiles():
- Filter manifest entries that include a pack id.
getRiddleProofPackProfileManifest(name):
- Fetch full profile manifest including sourcePath, packPublicName, and metadata.
getRiddleProofPackProfileByPackId(packId):
- Returns first manifest entry for a matching pack_id.
instantiateRiddleProofProfile(name, options):
- Returns a copy of a profile with optional url, route, and target overrides.
findHumanReviewPacket(proofOrPacket):
- Recursively finds the first human_review_packet in a proof artifact or returns null.
requireHumanReviewPacket(proofOrPacket):
- Same extraction behavior, but throws if no packet is present.
formatHumanReviewPacketMarkdown(packet, options):
- Formats a compact Markdown handoff with recommendation, objective receipts, ranking role, proof boundary, listening prompts, and caveats.
createHumanReviewPacketArtifacts(proofOrPacket, options):
- Returns { packet, json, markdown } for storing a standalone review packet next to a proof run.
createMixingCanonSurrogateReview(proofOrPacket, options):
- Reviews a compact audio-mix human-review packet against conservative development-approval rules and returns an explicit mixing_canon_surrogate approval only when the recommended candidate is subtle, metric-supported, guardrail-preserving, reversible, free of candidate review warnings, and still framed as requiring listening review.
buildNeonApprovedCandidateProfileFromReviewPacket(proofOrPacket, options):
- Builds a Neon approved-candidate profile from a prior human-review packet recommendation, narrowing the approval proof to the already-selected set_mixer_level candidate instead of rerunning the full candidate search.
createNeonApprovedCandidateProfileArtifacts(proofOrPacket, options):
- Returns { packet, candidate, profile, json } for writing the generated one-candidate approval profile next to a proof run.
getNeonApprovedCandidateFromReviewPacket(proofOrPacket):
- Extracts the recommended Neon mixer-level candidate from a human-review packet.
createDurableCandidatePatchPlan(proofOrPacket, options):
- Validates an explicitly applied human_review_packet and returns a durable patch handoff plan.
formatDurableCandidatePatchPlanMarkdown(plan, options):
- Formats the durable handoff without treating approval, ranking, or source application as proof of subjective taste.
createDurableCandidatePatchPlanArtifacts(proofOrPacket, options):
- Returns { plan, json, markdown } for storing a durable source/config handoff next to a proof run.
buildNeonDurableCurrentTargetProfile(override, options):
- Builds a Neon current-target profile that checks a durable mix override against app contract state, visible mixer text, and basic render guardrails.
readActiveNeonDurableMixOverrides(document):
- Reads active durable mix override entries from an app/config document and normalizes bounded mixer levels.
createNeonDurableCurrentTargetArtifacts({ override, profileResult }, options):
- Returns { summary, json, markdown } for the final durable current-target proof receipt.
buildNeonUiMixerControlProfile(options):
- Builds a Neon interaction proof that moves a real browser mixer slider by at least the configured minAbsLevelDelta, checks proof-contract/readout/render guardrails, and restores the slider without using the proof API edit helper.
createNeonUiMixerControlArtifacts(profileResult, options):
- Returns { summary, json, markdown } for the UI mixer-control proof receipt.

Browser apps that only need the pure audio review heuristics should import the browser-safe subpath instead of the root package:

import {
  audioMixCandidateMagnitudeMatchesRequest,
  compareAudioSectionEnergy,
  computeAudioSectionReviewMetric,
  estimateLoudnessStyleLufs,
  resolveAudioMixRequestMagnitude,
} from "@riddledc/riddle-proof-packs/audio-mix-heuristics";

That subpath exposes the loudness-style and section-energy helpers without the Node-oriented profile, artifact, or CLI helpers.

It also exposes a narrow request-magnitude policy for claim-candidate loops. For example, resolveAudioMixRequestMagnitude({ intent: "turn the bass down a little" }) returns magnitude: "subtle" and maxAbsDelta: 0.12, while audioMixCandidateMagnitudeMatchesRequest(candidate, request) can reject a larger-but-otherwise-safe candidate as a claim-translation mismatch. This is a semantic guardrail for the requested change size, not a taste score.

Browser or agent code that needs the conservative development approval helper can import the review subpath:

import {
  createMixingCanonSurrogateReview,
} from "@riddledc/riddle-proof-packs/audio-mix-review";

The review helper can approve a candidate for development application, but it does not approve subjective taste. It returns needs_human_review unless the packet is already a supported listening-review candidate with objective receipts, section-energy guardrails, state restoration, review-order-only ranking, and an explicit proof/taste boundary.

Proof claims and evidence roles

Riddle Proof proves claims about a running browser/app target. A before/after change proof is one pattern built from smaller proof claims, not the core proof primitive.

1) Atomic proof

An atomic proof is the base unit:

claim
- What the profile is asserting.
target
- The URL/route/runtime state that is being tested.
setup/actions
- The steps that drive the target to the evidence-bearing state.
evidence
- Screenshots, assertions, console/HAR, route text, DOM checks, and any explicit output.
verdict
- Pass/fail/review status based on whether each atomic evidence receipt is present and valid.

2) Evidence-role patterns

current_target (current-target audit)
- One deployed/preview target. No implementation diff artifacts are required. No measured before/after visual delta is required.
reference_candidate (reference/candidate change proof)
- Reference/prod/baseline evidence plus candidate/after evidence is compared inside the same workflow. Comparison supports a change claim.
interaction_snapshots (interaction proof)
- A pre-action and post-action snapshot set is collected in one proof run, e.g. before clicking Play and after clicking Play.

3) Naming guidance

Avoid using before/after alone in user-facing docs when describing proof purpose.
Prefer:
- reference_candidate for change/variance proofs.
- pre-action / post-action for snapshots in a single run.
- current_target for audit/profile/no-diff proofs.
If you encounter before_cdn / after_cdn in existing payloads, treat those as legacy artifact role names in the change workflow, not as universal proof terminology.

4) Pack/report declarations

Each proof pack sample report should state:

Which evidence-role pattern it uses (current_target, reference_candidate, or interaction_snapshots).
What the pack does not prove.

Profiles are stored under packs/<slug>/profile.json or generated from typed helper builders when the profile needs target-specific options, then mirrored into the runtime exports.

Bundled profiles

page-content-basic
route-inventory-basic
handled-recovery-list-load
handled-recovery-action-malformed-success
terminal-result-partial-evidence
gameplay-window-call-until
spa-route-exit-state-hygiene
canvas-gameplay
mobile-layout-smoke
auth-smoke
neon-step-sequencer-fast-mix-health
neon-step-sequencer-source-readiness
neon-step-sequencer-playback-sync
neon-step-sequencer-mix-change-before-after
neon-step-sequencer-mobile-trainer-layout
neon-step-sequencer-full-mix-health-matrix
neon-step-sequencer-explore-songs-and-mixes
neon-step-sequencer-deep-explore-songs-and-mixes
neon-step-sequencer-ratchet-loop-mix-level-search
neon-step-sequencer-ratchet-loop-approved-candidate
neon-step-sequencer-durable-current-target
neon-step-sequencer-ui-mixer-control

Audio and Neon ratchet packs

The audio-mix directory contains reusable audio-proof authoring guidance, a profile template, a metrics schema, a ratchet method, and a human-review rubric.

The neon-step-sequencer directory contains the first app-specific ratchet lab under the new architecture. Its profiles declare current_target or interaction_snapshots evidence-role patterns and explicitly state what they do not prove. The ratchet-loop profiles now expect a compact humanReviewPacket for listening handoff: supported/rejected candidates, objective guardrails, state restoration, review-order ranking, taste caveats, and, when explicitly requested, an applied-candidate receipt. The case-study files record the claim, evidence, failure classification, smallest layer changed, and next sharper question for each run.

Natural mix requests should become explicit claim constraints before candidate ranking. Target, direction, and requested magnitude are separate receipts: bass/down/subtle is not the same claim as "find the largest guardrail- preserving bass cut." The shared audio-mix heuristic helper treats phrases such as "a little", "a bit", "slightly", "touch", and "small" as a subtle magnitude with a default 0.12 max absolute mixer-level delta. Candidates outside that bound can still be useful exploration data, but they should be rejected for the subtle claim before review-order ranking.

Two-speed local ratchet

Neon uses two exploration speeds:

neon-step-sequencer-explore-songs-and-mixes is the fast bounded current-target sweep for normal iteration.
neon-step-sequencer-deep-explore-songs-and-mixes is the slower pre-deploy sweep for batching deterministic app/audio guardrail failures before release.

The deep profile still proves only objective receipts: catalog coverage within bounds, active-lane/proof-window agreement, clipping/headroom, browser health, and state restoration. It does not prove subjective mix taste or that a candidate sounds better.

Bounded sweeps can also emit non-failing review warnings for objective signals that are close to a guardrail. For example, collectAudioExplorationReviewWarnings() reports a low_headroom_margin warning when a sampled part is below the configured review margin but has not necessarily failed the proof. Use formatAudioExplorationReviewWarningsMarkdown() when an app or agent needs consistent handoff copy. Treat these warnings as listening/review cues, not as automated taste decisions.

When a workflow tests several small mix intents against the same running target, use summarizeAudioMixIntentMatrix() and formatAudioMixIntentMatrixMarkdown() to produce a compact matrix handoff. The matrix format is for review ordering: it can show that bass -0.05, guitar -0.02, and chord -0.035 each had objective receipts, guardrails, finding counts, and warning counts. It must not claim that any candidate sounds better or should be applied automatically.

The same summary can record whether the workflow used separate proof launches or one browser/profile call, for example executionMode: "single_browser_intents". If a conservative mixing-canon surrogate reviewed the packets, include mixingCanonSurrogateReview so the Markdown can show approval counts and the recommended development candidate while still saying that surrogate approval is not a listener preference.

Use selectAudioMixIntentSet() when the same intent set needs both a fast smoke lane and a fuller matrix lane. For example, a smoke run can select only guitar-down-little to verify that the target, proof contract, and candidate loop are working before a later run selects the full intent set. Selection is a bounded workflow receipt: it scopes which objective claim-candidate loops ran, reports unknown ids without throwing, and does not prove subjective mix quality. Use formatAudioMixIntentSelectionMarkdown() next to the JSON receipt when a human or agent needs to inspect requested ids, selected ids, unknown ids, and the selected intent table without opening the raw plan.

Use buildAudioMixLevelIntentSet() for standard bounded level-change requests, for example bass/guitar/chord down a little smoke and matrix lanes. Apps still own the track list and target route, but the proof-pack helper owns the repeatable intent ids, claim wording, focus/target tracks, direction, and subtle magnitude metadata.

Use resolveAudioMixIntentRouteAlignment() when a smoke lane selects one intent from a broader route template. The receipt keeps compatibility fields such as requestedRoute, but also records inputRouteRole, inputRoute, and a routeAdjustment object so reviewers can distinguish an explicit route from a default route that was aligned to the selected claim target.

Use selectAudioMixProofWindows() when the same target needs both a fast focused smoke lane and a fuller review lane. A smoke run can request one known proof window such as introBed so agents can iterate quickly, while a later matrix or promotion run omits the option and uses the full profile window set. The receipt records focused_smoke_window versus full_profile_window_set, selected window names, unknown window requests, and boundary language that says focused smoke does not replace broad review or prove subjective mix quality. Use formatAudioMixProofWindowSelectionMarkdown() next to the JSON receipt when a human or agent needs to inspect the proof-window scope without opening the raw plan.

Human-review packet handoff

Human-review packets are proof artifacts for subjective follow-up. They are deliberately not taste scores. A packet should say what objective receipts passed, what was preserved, which candidate is ready for listening review, and which caveats remain.

When candidates include sectionEnergyComparison, the Markdown includes per-section baseline/candidate/delta tables. When candidates include loudnessConsequenceComparison, the Markdown includes intent-aware loudness consequence rows. When candidates include activeLaneReceipt, the Markdown includes an Active Lane Receipts table so required-lane preservation is visible without digging through raw JSON. These are deterministic review aids; they do not prove subjective mix quality.

The packet formatter also emits Packet Diagnostics. The evidence-completeness status says whether the recommended candidate actually captured the expected review layers: target movement, section energy, loudness consequences, active-lane receipt, state restoration, and proof/taste boundary language. Use collectHumanReviewPacketEvidenceCompleteness() when an agent needs the same audit in JSON.

From the CLI:

riddle-proof-review-packet \
  --proof artifacts/riddle-proof/proof.json \
  --output artifacts/riddle-proof

This writes human-review-packet.json and human-review-packet.md next to the proof run. Use --stdout when an agent should read the Markdown handoff immediately.

import {
  createHumanReviewPacketArtifacts,
  findHumanReviewPacket,
  formatHumanReviewPacketMarkdown,
} from "@riddledc/riddle-proof-packs";

const proof = JSON.parse(await fs.promises.readFile("proof.json", "utf8"));
const packet = findHumanReviewPacket(proof);
if (!packet) throw new Error("proof did not emit a human review packet");

const markdown = formatHumanReviewPacketMarkdown(packet, {
  title: "Neon Human Review Packet",
});
const artifacts = createHumanReviewPacketArtifacts(proof, {
  title: "Neon Human Review Packet",
});

console.log(markdown);
await fs.promises.writeFile("human-review-packet.json", artifacts.json);
await fs.promises.writeFile("human-review-packet.md", artifacts.markdown);

Approved-candidate profile handoff

Once a bounded ratchet loop has already produced a human-review packet, follow-on proof does not need to rerun the whole candidate search just to apply the selected candidate. Build a one-candidate approved profile from the packet recommendation instead:

import {
  buildNeonApprovedCandidateProfileFromReviewPacket,
  createHumanReviewPacketArtifacts,
} from "@riddledc/riddle-proof-packs";

const proof = JSON.parse(await fs.promises.readFile("proof.json", "utf8"));
const { packet } = createHumanReviewPacketArtifacts(proof, {
  title: "Neon Human Review Packet",
});

const approvedProfile = buildNeonApprovedCandidateProfileFromReviewPacket(packet, {
  url: "http://127.0.0.1:5173",
});

await fs.promises.writeFile("approved-candidate-profile.json", `${JSON.stringify(approvedProfile, null, 2)}\n`);

This still proves the applied candidate and preserves the listening-review caveat. It only narrows the candidate generator to the packet recommendation so local ratchet batches do not pay for a duplicate broad search.

When an agent is allowed to stand in for a human during development, make that surrogate approval explicit and evidence-backed:

import {
  buildNeonApprovedCandidateProfileFromReviewPacket,
  createMixingCanonSurrogateReview,
} from "@riddledc/riddle-proof-packs";

const surrogate = createMixingCanonSurrogateReview(packet, {
  approvedBy: "codex",
  requireLoudnessConsequenceComparison: true,
});
if (!surrogate.ok) {
  throw new Error(`needs human review: ${surrogate.failedChecks.join(", ")}`);
}

const approvedProfile = buildNeonApprovedCandidateProfileFromReviewPacket(packet, {
  approval: surrogate.approval,
});

This approval mode is intentionally narrow: subtle level edits only, objective receipts first, no clipping/headroom/low-level violations, state restored, and ranking used only for review order. When loudness consequences are required, the recommended candidate must stay within the expected intent-aware loudness range and carry no candidate-level review warnings. Rejected-candidate warnings can still appear in the packet without blocking a clean recommendation. This keeps local development moving without turning metrics into a claim that the mix is better.

Durable candidate patch handoff

A human-review packet is still not a source edit. Durable patch handoff is the next gate after explicit approval:

A bounded proof loop produces a supported candidate.
A human or visible surrogate approval mode sets applyBest.
The packet reports candidate_applied_for_listening_review.
A durable candidate plan validates that the packet is no longer transient.
The app/repo applies a scoped config/source patch.
A final current_target proof verifies the running app sees the durable state.

The durable plan refuses packets that still say candidate_ready_for_listening_review, packets without approval metadata, packets whose candidateActionsAreTransient flag is still true, and packets that lost the listening-review caveat.

From the CLI:

riddle-proof-durable-candidate-plan \
  --proof artifacts/riddle-proof/neon-approved-candidate/proof.json \
  --output artifacts/riddle-proof/neon-approved-candidate \
  --source-file src/Games/songs/neon-approved-mix-overrides.json \
  --require-mix-profile

This writes durable-candidate-patch-plan.json and durable-candidate-patch-plan.md. If the packet is not ready for durable application, the plan is still written with status: "not_ready_for_durable_patch" and the CLI exits nonzero.

import {
  createDurableCandidatePatchPlanArtifacts,
} from "@riddledc/riddle-proof-packs";

const proof = JSON.parse(await fs.promises.readFile("proof.json", "utf8"));
const artifacts = createDurableCandidatePatchPlanArtifacts(proof, {
  title: "Neon Durable Candidate Patch Plan",
  sourceFile: "src/Games/songs/neon-approved-mix-overrides.json",
  requireMixProfileId: true,
});

if (!artifacts.plan.ok) {
  throw new Error(`not ready for durable patch: ${artifacts.plan.errors.join(", ")}`);
}

await fs.promises.writeFile("durable-candidate-patch-plan.json", artifacts.json);
await fs.promises.writeFile("durable-candidate-patch-plan.md", artifacts.markdown);

Durable current-target proof

After a durable source/config patch lands, run a separate current_target proof. This is not another taste claim. It proves that the deployed or preview app sees the approved durable state through three objective surfaces:

app contract mixer levels
app contract mix-profile source levels
visible mixer text plus basic offline render guardrails

The reusable helper builds the profile from an active durable override:

import {
  buildNeonDurableCurrentTargetProfile,
  createNeonDurableCurrentTargetArtifacts,
  readActiveNeonDurableMixOverrides,
} from "@riddledc/riddle-proof-packs";

const document = JSON.parse(await fs.promises.readFile("src/Games/songs/neon-approved-mix-overrides.json", "utf8"));
const [override] = readActiveNeonDurableMixOverrides(document);
const profile = buildNeonDurableCurrentTargetProfile(override, {
  url: "https://lilarcade.com",
});

await fs.promises.writeFile("generated-profile.json", `${JSON.stringify(profile, null, 2)}\n`);

// After running the profile with a local or hosted runner:
const profileResult = JSON.parse(await fs.promises.readFile("profile-result.json", "utf8"));
const artifacts = createNeonDurableCurrentTargetArtifacts({ override, profileResult });
await fs.promises.writeFile("durable-current-target-summary.json", artifacts.json);
await fs.promises.writeFile("durable-current-target-summary.md", artifacts.markdown);

This proof can catch contract/source disagreements, stale durable overrides, visible-control drift, clipping, and silence/low-level windows. It does not prove subjective mix quality.

Usage

import {
  getRiddleProofPackProfile,
  getRiddleProofProfilesByPackId,
  instantiateRiddleProofProfile,
} from "@riddledc/riddle-proof-packs";

const profile = getRiddleProofPackProfile("spa-route-exit-state-hygiene");
if (!profile) throw new Error("missing profile");

const hygieneProfiles = getRiddleProofProfilesByPackId("state_hygiene");
const instantiated = instantiateRiddleProofProfile("mobile-layout-smoke", {
  url: "https://example.com",
  route: "/",
  target: { wait_for_selector: "body" },
});

console.log("Using profile", instantiated.name);

Published

Vulnerabilities

Links

Maintainers

Keywords

Readme

@riddledc/riddle-proof-packs

Exports

Proof claims and evidence roles

1) Atomic proof

2) Evidence-role patterns

3) Naming guidance

4) Pack/report declarations

Bundled profiles

Audio and Neon ratchet packs

Two-speed local ratchet

Human-review packet handoff

Approved-candidate profile handoff

Durable candidate patch handoff

Durable current-target proof

Usage