commentscraper
v1.3.3
Research what real people say about any topic. Scrape comments and reviews from Reddit, YouTube, Hacker News, Amazon, Steam, Trustpilot, Product Hunt, Etsy, Quora, Google Maps, Notion, Shopify, phpBB forums, and Reddit profiles. Built for AI agents. Inclu
CommentScraper CLI
Scrape structured comments and reviews from supported discussion platforms for research, customer-voice analysis, and agent workflows.
Install
npm install -g commentscraper
commentscraper login
The CLI requires a paid subscription. Pro supports Reddit. All Access supports every CLI platform.
Commands
commentscraper research "back pain remedies" --quiet
commentscraper search "crm software frustrations" --quiet
commentscraper scrape "https://reddit.com/r/programming/comments/abc123/post/" --format json --quiet
commentscraper mcp # run as stdio MCP server (see below)
commentscraper whoami
commentscraper platforms
Use --quiet when piping JSON to another process so progress logs stay out of
stdout.
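As an illustration of piping the JSON into another process, here is a minimal Python sketch. It assumes the CLI is on PATH and that stdout holds only the JSON document when --quiet is set. The helper names (build_scrape_command, scrape, top_comments) are hypothetical and not part of CommentScraper; the fields read are the ones shown in the sample output under Output.

```python
import json
import subprocess
from typing import Any, Callable


def build_scrape_command(url: str) -> list[str]:
    """Argument list for a JSON scrape with progress logs kept out of stdout."""
    return ["commentscraper", "scrape", url, "--format", "json", "--quiet"]


def scrape(url: str, run: Callable[..., Any] = subprocess.run) -> dict:
    """Run the CLI and parse its stdout as JSON.

    `run` is injectable so the parsing can be exercised without the CLI installed.
    """
    completed = run(
        build_scrape_command(url),
        capture_output=True,
        text=True,
        check=True,  # raise if the process exits with a non-zero status
    )
    return json.loads(completed.stdout)


def top_comments(result: dict, n: int = 3) -> list[dict]:
    """Highest-scored comments first; a missing score counts as 0 (an assumption)."""
    comments = result.get("comments", [])
    return sorted(comments, key=lambda c: c.get("score") or 0, reverse=True)[:n]
```

A call such as top_comments(scrape("<url>")) would then return the highest-scored comments from one thread, given a valid login and a supported URL.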
Supported Platforms
| Platform | Example | Plan |
| --- | --- | --- |
| Reddit | https://reddit.com/r/startups/comments/abc123/post/ | Pro+ |
| Reddit profile | https://reddit.com/user/spez | Pro+ |
| YouTube | https://youtube.com/watch?v=jNQXAC9IVRw | All Access |
| Hacker News | https://news.ycombinator.com/item?id=12345 | All Access |
| Amazon | https://amazon.com/product-reviews/B08N5WRWNW | All Access |
| Steam | https://store.steampowered.com/app/730/ | All Access |
| Steam recommendation | https://steamcommunity.com/profiles/76561198000000000/recommended/730/ | All Access |
| Trustpilot | https://trustpilot.com/review/www.amazon.com | All Access |
| Product Hunt | https://producthunt.com/posts/some-product | All Access |
| Product Hunt product | https://producthunt.com/products/some-product | All Access |
| Etsy | https://etsy.com/listing/560282270/personalized-custom-mug | All Access |
| Quora | https://quora.com/How-do-you-define-product-market-fit | All Access |
| Google Maps | https://google.com/maps/place/Blue+Bottle+Coffee/ | All Access |
| Notion | https://example.notion.site/page-id | All Access |
| Shopify | https://mystore.myshopify.com/products/example-product | All Access |
| phpBB forum | https://example.com/phpBB3/viewforum.php?f=20 | All Access |
| phpBB thread | https://example.com/phpBB3/viewtopic.php?t=12345 | All Access |
YouTube, Amazon, Trustpilot, Etsy, Quora, Google Maps, and Shopify use a
browser-backed Chromium scraper. For pages that need an authenticated browser session, provide
one of these optional environment variables before running scrape or mcp:
COMMENTSCRAPER_<PLATFORM>_STORAGE=/path/to/playwright-state.json
COMMENTSCRAPER_<PLATFORM>_USER_DATA_DIR=/path/to/chrome-profile
Output
scrape returns JSON by default:
{
  "scrapedAt": "2026-05-10T00:00:00.000Z",
  "source": "reddit",
  "url": "https://reddit.com/r/startups/comments/abc123/post/",
  "count": 342,
  "post": { "title": "Thread title", "body": "Post body", "url": "..." },
  "comments": [
    {
      "text": "Comment body",
      "author": "username",
      "timestamp": "2026-05-10T00:00:00.000Z",
      "permalink": "https://reddit.com/...",
      "score": 42,
      "depth": 0
    }
  ]
}
CSV and text output are also available:
commentscraper scrape "<url>" --format csv --output comments.csv --quiet
commentscraper scrape "<url>" --format text --quiet
commentscraper scrape "https://example.com/phpBB3/viewforum.php?f=20" --max-threads 20 --quiet
Claude Desktop (MCP)
CommentScraper ships a stdio MCP server so Claude Desktop, Cursor, and other MCP clients can call it directly. Requires an All Access subscription.
One-time setup:
npm install -g commentscraper
commentscraper login # opens browser, stores token
Then add to Claude Desktop config at
~/Library/Application Support/Claude/claude_desktop_config.json (macOS) or
%APPDATA%\Claude\claude_desktop_config.json (Windows):
{
  "mcpServers": {
    "commentscraper": {
      "command": "commentscraper",
      "args": ["mcp"]
    }
  }
}
Restart Claude Desktop. The MCP server automatically reads the token saved by
commentscraper login, so there is nothing to copy or paste. If your MCP
client runs in a sandbox that can't see ~/.commentscraper/, pass the token
via env instead:
{
  "mcpServers": {
    "commentscraper": {
      "command": "npx",
      "args": ["-y", "commentscraper@latest", "mcp"],
      "env": { "COMMENTSCRAPER_CLI_TOKEN": "your-cli-token" }
    }
  }
}
Four tools become available in Claude:
| Tool | Description |
| --- | --- |
| scrape_url | Scrape comments from any supported URL |
| search_reddit | Find Reddit threads matching a keyword |
| research_topic | Search + scrape every matching Reddit thread |
| whoami | Inspect auth + plan |
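If the Claude Desktop config already lists other MCP servers, the commentscraper entry has to be merged in rather than pasted over the file. The following is a rough Python sketch of that merge, not an official installer. The function names are hypothetical, and the entry written is the default one shown above (command "commentscraper", args ["mcp"]).

```python
import json
from pathlib import Path


def add_commentscraper_server(config: dict) -> dict:
    """Return a copy of the config with the commentscraper entry set.

    Other top-level keys and other MCP servers are left as they were.
    """
    merged = dict(config)
    servers = dict(merged.get("mcpServers", {}))
    servers["commentscraper"] = {"command": "commentscraper", "args": ["mcp"]}
    merged["mcpServers"] = servers
    return merged


def update_config_file(path: Path) -> None:
    """Read the config (or start from an empty one), merge, and write it back."""
    text = path.read_text(encoding="utf-8") if path.exists() else ""
    existing = json.loads(text) if text.strip() else {}
    updated = add_commentscraper_server(existing)
    path.write_text(json.dumps(updated, indent=2) + "\n", encoding="utf-8")
```

On macOS this could be called as update_config_file(Path.home() / "Library/Application Support/Claude/claude_desktop_config.json"), followed by a restart of Claude Desktop. Backing up the file first is a sensible precaution.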
Automation
For CI or smoke testing, provide a token through the environment:
COMMENTSCRAPER_CLI_TOKEN=<token> commentscraper scrape "<url>" --format json --quiet
COMMENTSCRAPER_CLI_TOKEN=<token> commentscraper whoami
Without COMMENTSCRAPER_CLI_TOKEN, the CLI uses the token saved by
commentscraper login. The CLI does not auto-load a repository .env file;
export the variable in your shell, prefix the command, or use
commentscraper login.
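Since the CLI does not load .env files itself, a CI script has to place the token in the child process environment. Here is one possible Python sketch, assuming the token comes from a CI secret. The helper names are hypothetical, and the whoami output is returned as raw text because its format is not documented here.

```python
import os
import subprocess
from typing import Any, Callable, Mapping, Optional


def env_with_token(token: str, base: Optional[Mapping[str, str]] = None) -> dict:
    """Copy the given (or current) environment and set COMMENTSCRAPER_CLI_TOKEN."""
    env = dict(os.environ if base is None else base)
    env["COMMENTSCRAPER_CLI_TOKEN"] = token
    return env


def whoami(token: str, run: Callable[..., Any] = subprocess.run) -> str:
    """Run `commentscraper whoami` with the token set and return its stdout.

    `run` is injectable so the wiring can be checked without the CLI installed.
    """
    completed = run(
        ["commentscraper", "whoami"],
        env=env_with_token(token),
        capture_output=True,
        text=True,
        check=True,  # raise if the process exits with a non-zero status
    )
    return completed.stdout
```

Passing the token through env keeps it out of the command line and out of any committed file.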
