scraper-simplifier
v1.0.5
README.md for Scraper Simplifier
Scraper Simplifier
A lightweight web scraping tool that extracts data from websites and exports it as JSON or CSV.
Quick Installation
Install Scraper Simplifier globally with npm so the CLI is available on your PATH:
npm install -g scraper-simplifier
Usage
CLI Usage
You can run the scraper directly from the command line:
scraper-simplifier --url="https://example.com" --selector="h1" --method="cheerio" --output="json" --filename="example"
Programmatic Usage (Node.js API)
You can also use it inside your Node.js project:
const { scrape } = require("scraper-simplifier");

(async () => {
  // Arguments: URL, CSS selector, scraping method, output format
  await scrape("https://example.com", "h1", "cheerio", "json");
})();
Features
✅ Supports Cheerio & Puppeteer (for JavaScript-heavy pages)
✅ Exports data as JSON or CSV
✅ Handles pagination seamlessly
✅ Easy-to-use CLI interface
✅ Saves extracted data into a file automatically
✅ Proxy support to avoid rate-limiting
✅ Google Sheets & Excel integration
✅ Cloud storage support (Google Drive, etc.)
✅ Real-time dashboards (React-based UI for monitoring)
Options & Arguments

| Option | Description | Default Value |
| --- | --- | --- |
| --url | The target URL to scrape. | Required |
| --selector | The CSS selector to extract content from. | Required |
| --method | The scraping method (cheerio or puppeteer). | cheerio |
| --output | Output format (json or csv). | json |
| --filename | The name of the output file (saved in the current directory). | scraped |

Example:
scraper-simplifier --url="https://example.com" --selector="p" --method="puppeteer" --output="csv" --filename="data"
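
To make the defaults above concrete, here is a minimal sketch of how options of this shape could be parsed and validated. It is not the package's actual implementation: `parseOptions`, `DEFAULTS`, and `REQUIRED` are hypothetical names, and the only facts taken from this README are the option names, their allowed values, and their defaults.

```javascript
// Hypothetical sketch: defaults and required options copied from the table above.
const DEFAULTS = { method: "cheerio", output: "json", filename: "scraped" };
const REQUIRED = ["url", "selector"];

// Parse arguments of the form --name=value (the shell strips surrounding quotes).
function parseOptions(argv) {
  const options = { ...DEFAULTS };
  for (const arg of argv) {
    const match = /^--([a-z]+)=(.*)$/.exec(arg);
    if (!match) {
      throw new Error(`Unrecognized argument: ${arg}`);
    }
    const [, name, value] = match;
    options[name] = value;
  }
  // Options without a default must be supplied by the caller.
  for (const name of REQUIRED) {
    if (!options[name]) {
      throw new Error(`Missing required option: --${name}`);
    }
  }
  if (!["cheerio", "puppeteer"].includes(options.method)) {
    throw new Error(`Unsupported method: ${options.method}`);
  }
  if (!["json", "csv"].includes(options.output)) {
    throw new Error(`Unsupported output format: ${options.output}`);
  }
  return options;
}

// Omitted options fall back to their defaults (method and filename here).
console.log(parseOptions(["--url=https://example.com", "--selector=p", "--output=csv"]));
```

In a real script the array would come from `process.argv.slice(2)`.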
File Output Example
JSON Output (scraped.json)
[
  {
    "text": "Example Heading"
  },
  {
    "text": "Another Heading"
  }
]
CSV Output (scraped.csv)
text
Example Heading
Another Heading
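
The two outputs above hold the same records in different shapes. The following sketch shows one way an array of such records could be turned into the CSV form. It is an illustration only: `toCsv` and `escapeCsvField` are hypothetical helpers, not part of the package's documented API, and the quoting rules follow common CSV practice (RFC 4180 style) rather than anything this README specifies.

```javascript
// Quote a field when it contains a comma, double quote, or line break,
// doubling any embedded double quotes.
function escapeCsvField(value) {
  const text = String(value ?? "");
  return /[",\r\n]/.test(text) ? `"${text.replace(/"/g, '""')}"` : text;
}

// Convert an array of flat objects to CSV, using the first record's keys as the header row.
function toCsv(records) {
  if (records.length === 0) return "";
  const headers = Object.keys(records[0]);
  const lines = [headers.map(escapeCsvField).join(",")];
  for (const record of records) {
    lines.push(headers.map((header) => escapeCsvField(record[header])).join(","));
  }
  return lines.join("\n");
}

const records = [{ text: "Example Heading" }, { text: "Another Heading" }];
console.log(toCsv(records));
```

For the sample records this prints the same three lines shown under CSV Output.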
Licensing
This package is MIT Licensed. Always ensure compliance with website terms of service before scraping.
Contributing
If you have ideas for performance improvements or new features, feel free to star ⭐ the repository and contribute on GitHub!
GitHub Repository: https://github.com/adilsheikhh/scraper-simplifier
Changelog & Versioning
v1.0.0 - Initial release
v1.0.1 - Added README and improved CLI options
v1.1.0 - Introduced Puppeteer support & pagination handling
You can track all changes in the Changelog.