npm package discovery and stats viewer.

Discover Tips

  • General search

    [free text search, go nuts!]

  • Package details

    pkg:[package-name]

  • User packages

    @[username]

Sponsor

Optimize Toolset

I’ve always been into building performant and accessible sites, but lately I’ve been taking it extremely seriously. So much so that I’ve been building a tool to help me optimize and monitor the sites that I build to make sure that I’m making an attempt to offer the best experience to those who visit them. If you’re into performant, accessible and SEO friendly sites, you might like it too! You can check it out at Optimize Toolset.

About

Hi, 👋, I’m Ryan Hefner  and I built this site for me, and you! The goal of this site was to provide an easy way for me to check the stats on my npm packages, both for prioritizing issues and updates, and to give me a little kick in the pants to keep up on stuff.

As I was building it, I realized that I was actually using the tool to build the tool, and figured I might as well put this out there and hopefully others will find it to be a fast and useful way to search and browse npm packages as I have.

If you’re interested in other things I’m working on, follow me on Twitter or check out the open source projects I’ve been publishing on GitHub.

I am also working on a Twitter bot for this site to tweet the most popular, newest, random packages from npm. Please follow that account now and it will start sending out packages soon–ish.

Open Software & Tools

This site wouldn’t be possible without the immense generosity and tireless efforts from the people who make contributions to the world and share their work via open source initiatives. Thank you 🙏

© 2026 – Pkg Stats / Ryan Hefner

icecrawl

v0.4.0

Published

Web scraping application with HTTP API (incl. Dashboard), CLI, and MCP Server interfaces.

Readme

Icecrawl

Buy Me a Coffee

A powerful web scraping application offering multiple interfaces: HTTP API (with Dashboard), CLI, and MCP Server.


Features

  • Multiple Interfaces:
    • HTTP API Server: RESTful API for integration, includes a web dashboard.
    • CLI Tool: icecrawl command for terminal-based scraping.
    • MCP Server: icecrawl-mcp command for programmatic use with MCP clients.
  • Web Dashboard: User-friendly UI for managing scrapes and viewing results.
  • Authentication: User management with role-based access control.
  • Database Storage: Persistent storage using Prisma ORM (SQLite default).
  • Crawling: Asynchronous website crawling with depth and scope control.
  • Flexible Output: JSON, Markdown, raw HTML, or screenshots.
  • Performance Optimization: Caching, request pooling.
  • Proxy Support: Use proxies for requests.
  • JS Rendering: Optional headless browser usage via Puppeteer.
  • And more: Content Transformation, Exporting, Scheduled Jobs...

Sitemap Generation Mode

  • Crawl an entire website to build a hierarchical sitemap of all internal links.
  • Does not save page content or extract text.
  • Useful for visualizing site structure, auditing SEO, or link analysis.
  • Enable by setting crawl option "mode": "sitemap" via API or CLI.
  • The sitemap is saved as JSON in the crawl job record and can be retrieved via API.

Installation

From npm (Recommended)

npm install -g icecrawl
  • Creates a default data directory:
    • Windows: C:\Users\<username>\Documents\Icecrawl
    • macOS/Linux: ~/Icecrawl
  • Generates .env file, initializes database, seeds default admin user.
  • After install:
icecrawl --help
icecrawl-mcp

From Source (Development)

git clone https://github.com/wangdangel/icecrawl.git
cd icecrawl
npm install
cp .env.example .env
# Edit .env with your config
npx prisma migrate dev
npm run prisma:generate
npm run build
npm run build:dashboard
# Optionally: npm link

Usage

Start Dashboard + MCP Server (default)

icecrawl
  • Dashboard: http://localhost:6971/dashboard
  • API Docs: http://localhost:6971/api-docs
  • MCP server runs concurrently for integrations.

Start only the Dashboard server

icecrawl dashboard

Start only the MCP server

icecrawl mcp-server

Scraping via CLI

icecrawl scrape url https://example.com
echo "https://example.com" | icecrawl scrape

See docs/cli-usage.md for full CLI documentation and examples.


Troubleshooting

Permission Denied Error when running icecrawl

If you successfully install globally (npm install -g icecrawl) but get a Permission denied error when trying to run icecrawl, you may need to manually add execute permissions:

  1. Find your global npm bin directory:

    npm bin -g
  2. Run the following command, replacing the path with the one found above:

    chmod +x /path/to/your/global/bin/icecrawl

This should resolve the permission issue.


MCP Server Configuration Example

Add this to your MCP client configuration (e.g., Cline):

{
  "command": "node",
  "args": ["k:/Documents/smart_crawler/dist/mcp-server.js"],
  "cwd": "k:/Documents/smart_crawler",
  "disabled": false,
  "autoApprove": [],
  "timeout": 60,
  "transportType": "stdio"
}

Default Login Credentials

For initial access after seeding:

| Username | Password | Email | Role | |-----------|-----------|---------------------|--------| | admin | password | [email protected] | admin |


Development Commands

npm test
npm run test:coverage
npm run lint
npm run format
npm run prisma:studio

Project Structure

To be documented.

CI/CD Workflow

To be documented.

Contribution Guidelines

To be documented.

Releasing

To be documented.


License

MIT