@etohq/workflows-input-dataset-pdf-builder
v0.0.1-next-20260318155517
Published
Builder/draft model + compilation for PdfExtractSpec (preview-first UX)
Readme
@etohq/workflows-input-dataset-pdf-builder
Builder/draft model for creating PdfExtractSpec without asking users to write regex.
The intended UX is:
- Upload a PDF
- Show extracted text preview (line-numbered)
- User selects a representative row line
- UI tokenizes the line (after normalization) and lets the user pick token ranges as columns
- Builder compiles that draft into a canonical
PdfExtractSpec - Runtime previews structured records and import validation
API
import { compilePdfExtractDraft } from "@etohq/workflows-input-dataset-pdf-builder"