npm package discovery and stats viewer.

Discover Tips

  • General search

    [free text search, go nuts!]

  • Package details

    pkg:[package-name]

  • User packages

    @[username]

Sponsor

Optimize Toolset

I’ve always been into building performant and accessible sites, but lately I’ve been taking it extremely seriously. So much so that I’ve been building a tool to help me optimize and monitor the sites that I build to make sure that I’m making an attempt to offer the best experience to those who visit them. If you’re into performant, accessible and SEO friendly sites, you might like it too! You can check it out at Optimize Toolset.

About

Hi, 👋, I’m Ryan Hefner  and I built this site for me, and you! The goal of this site was to provide an easy way for me to check the stats on my npm packages, both for prioritizing issues and updates, and to give me a little kick in the pants to keep up on stuff.

As I was building it, I realized that I was actually using the tool to build the tool, and figured I might as well put this out there and hopefully others will find it to be a fast and useful way to search and browse npm packages as I have.

If you’re interested in other things I’m working on, follow me on Twitter or check out the open source projects I’ve been publishing on GitHub.

I am also working on a Twitter bot for this site to tweet the most popular, newest, random packages from npm. Please follow that account now and it will start sending out packages soon–ish.

Open Software & Tools

This site wouldn’t be possible without the immense generosity and tireless efforts from the people who make contributions to the world and share their work via open source initiatives. Thank you 🙏

© 2026 – Pkg Stats / Ryan Hefner

content-auth-export

v1.1.0

Published

Comprehensive website content auditor with automated screenshots and deep OCR capabilities.

Readme

Content-Auth-Export (v1.1.0)

A professional-grade, comprehensive Node.js CLI tool for auditing website content. It performs an exhaustive search for text strings across all layers of a site: visible UI text, hidden accessibility attributes, SEO metadata, and text embedded within all images (including SVGs and CSS backgrounds).

Key Features

  • Comprehensive DOM Auditing: Automatically extracts and searches text from:
    • Visible page content.
    • Hidden accessibility attributes: alt, title, aria-label, and placeholder.
    • SEO & Social Metadata: <title>, <meta name="description">, Open Graph (og:), and Twitter tags.
    • Form UI: Button values and input placeholders.
  • Visual Evidence (Automatic Screenshots): For every match found, the tool automatically:
    • Highlights the matching element with a red border and yellow background.
    • Scrolls the match into view.
    • Captures a high-resolution screenshot.
  • Embedded Excel Reporting: Screenshots are embedded directly into the exported .xlsx report for instant verification.
  • Deep Image Audit (Always-On OCR): No flags required. The tool automatically reads text inside:
    • Standard <img> tags and modern <picture>/<source> elements.
    • Vector Graphics (SVGs): Automatically rasterized and processed.
    • CSS Background Images: Discovered via computed styles using Puppeteer.
  • Advanced OCR Engine: Powered by Tesseract.js with Sharp pre-processing:
    • Multi-Pass Analysis: Each image is processed through 5 different filters (Grayscale, High-Contrast, Threshold, Inverted) to maximize detection of stylized or artistic fonts.
    • Fuzzy Matching: Uses a 75% word-match heuristic to handle minor OCR misreads while maintaining high precision.
  • Dynamic Content Support: Uses Puppeteer to handle JavaScript-rendered sites (SPAs) and waits for networkidle2 to ensure all data-driven content is loaded.
  • Granular Mapping: Maps every match to its exact source (e.g., "Page HTML / Attributes" or the specific Image URL).

Installation

  1. Clone the repository.
  2. Install dependencies:
    npm install
  3. (Optional) Link the command globally:
    npm link

Usage

node bin/index.js --url https://example.com --content "Search Text" --output audit-results.xlsx

Options

  • -u, --url <url>: (Required) The base URL to start crawling from.
  • -c, --content <content>: (Required) The text content to search for.
  • -o, --output <path>: Output Excel file path (default: results.xlsx).
  • -t, --threads <number>: Number of concurrent requests (default: 5).

Examples

Audit a site for brand consistency with visual proof

node bin/index.js -u https://yoursite.com -c "Brand Name"

Architecture

The project follows a SOLID modular structure for high maintainability:

  • src/interfaces.js: Abstractions for core services.
  • src/services/HtmlSearcher.js: Logic for comprehensive DOM auditing (Attributes, Meta, Text).
  • src/services/OcrService.js: Advanced multi-pass image-to-text conversion.
  • src/services/WebCrawler.js: Headless browser-based crawler with CSS background extraction and visual capture.
  • src/services/CrawlAndSearchService.js: Orchestrator for the unified search workflow.

License

ISC