miyami-websearch-mcp
v1.4.0
MCP server providing web search and content extraction for LLMs
MiyaMi WebSearch MCP
Connect your LLM to the internet! Search the web and extract content from any webpage using the Model Context Protocol.
🌟 Features
- 🔍 Web Search - Search across Google, DuckDuckGo, Bing, Brave, Wikipedia
- 🧠 Deep Research - Multi-query parallel research with compiled reports
- 🛡️ FREE Stealth Mode - Anti-bot bypass (Cloudflare, DataDome, etc.) - NEW!
- ⏰ Time-Range Filters - Filter results by recency (day, week, month, year)
- 📄 Enhanced Content Extraction - Trafilatura-powered (Firecrawl-quality) extraction
- 📝 Markdown Output - Get structured markdown from webpages
- 🎯 Rich Metadata - Automatically extract authors, dates, site names
- ⚡ Fast & Easy - One-line installation, zero configuration
- 🤖 LLM Optimized - Formatted responses perfect for AI consumption
- 🆓 100% Free - No API keys, no signup, no configuration needed
- 🔒 Privacy-First - No tracking, no data collection
📦 Installation
Option 1: Use with npx (Recommended - No Installation)
Add to your Claude Desktop config (on macOS: ~/Library/Application Support/Claude/claude_desktop_config.json):
{
  "mcpServers": {
    "miyami-websearch": {
      "command": "npx",
      "args": ["-y", "miyami-websearch-mcp"]
    }
  }
}
Option 2: Global Installation
npm install -g miyami-websearch-mcp
Then configure Claude Desktop:
{
  "mcpServers": {
    "miyami-websearch": {
      "command": "miyami-websearch-mcp"
    }
  }
}
That's it! Restart Claude Desktop and you're ready to search the web! 🎉
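If your config file already lists other MCP servers, pasting the snippet over it would drop them. The sketch below shows one way to merge the entry instead. The helper name `add_server` is hypothetical and not part of this package; it only works on the JSON text, so you would still read and write the file yourself.

```python
import json


def add_server(config_text: str) -> str:
    """Return config JSON with the miyami-websearch entry added.

    Hypothetical helper: existing "mcpServers" entries are preserved.
    """
    # An empty or missing file is treated as an empty config.
    config = json.loads(config_text) if config_text.strip() else {}
    servers = config.setdefault("mcpServers", {})
    # Same entry as Option 1 (npx) above.
    servers["miyami-websearch"] = {
        "command": "npx",
        "args": ["-y", "miyami-websearch-mcp"],
    }
    return json.dumps(config, indent=2)


# Example: a config that already contains another (made-up) server.
existing = '{"mcpServers": {"other-server": {"command": "other"}}}'
updated = json.loads(add_server(existing))
print(sorted(updated["mcpServers"]))
```

Both the existing entry and the new one should be present in the result. Restart Claude Desktop after writing the merged text back to the config file.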
🚀 Quick Start
After adding to Claude Desktop config and restarting, try these prompts:
Search for the latest news about AI
Search for Python tutorials and summarize the top result
Fetch the content from https://example.com and summarize it
🛠️ Available Tools
1. web_search
Search the web using multiple search engines with optional time-range filtering.
Parameters:
- query (required) - Your search query
- categories (optional) - general, news, images, videos, science
- language (optional) - Language code (default: en)
- page (optional) - Page number (default: 1)
- time_range (optional) - NEW! Filter by recency: day, week, month, year
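The parameters above can be sketched as a small argument builder. This is an illustration only: `build_web_search_args` is a hypothetical helper, and it assumes `categories` takes a single value, which this README does not confirm. Normally Claude fills these arguments in for you from a plain-language prompt.

```python
ALLOWED_CATEGORIES = {"general", "news", "images", "videos", "science"}
ALLOWED_TIME_RANGES = {"day", "week", "month", "year"}


def build_web_search_args(query, categories=None, language="en", page=1, time_range=None):
    """Build a web_search argument dict, rejecting values the README does not list."""
    if not query or not query.strip():
        raise ValueError("query is required")
    if categories is not None and categories not in ALLOWED_CATEGORIES:
        raise ValueError(f"unknown category: {categories!r}")
    if time_range is not None and time_range not in ALLOWED_TIME_RANGES:
        raise ValueError(f"unknown time_range: {time_range!r}")
    if page < 1:
        raise ValueError("page must be 1 or greater")

    # Defaults from the parameter list: language "en", page 1.
    args = {"query": query.strip(), "language": language, "page": page}
    # Optional filters are only sent when the caller set them.
    if categories is not None:
        args["categories"] = categories
    if time_range is not None:
        args["time_range"] = time_range
    return args


news_args = build_web_search_args(
    "quantum computing breakthroughs", categories="news", time_range="day"
)
print(news_args)
```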
Examples:
Search for "quantum computing breakthroughs" in news category
Search for AI news from the past 24 hours with time_range: day
Find recent Python tutorials from the past week with time_range: week
2. fetch_webpage
Extract clean content from any webpage using Trafilatura (Firecrawl-quality extraction).
Parameters:
- url (required) - The webpage URL
- include_links (optional) - Include links (default: true)
- include_images (optional) - Include images (default: true)
- max_content_length (optional) - Max length in characters (default: 50000)
- format (optional) - Output format: text, markdown (default), html
- extraction_mode (optional) - Engine: trafilatura (default, best quality), readability (faster)
- stealth_mode (optional) - NEW! Anti-bot bypass: off, low, medium, high (default: off)
- auto_bypass (optional) - NEW! Auto-escalate stealth if bot protection detected (default: false)
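For readers curious what a client sends under the hood: MCP tool invocations travel as JSON-RPC 2.0 requests with the method `tools/call`. The sketch below builds such a request for fetch_webpage, using the defaults from the list above. `fetch_webpage_request` is a hypothetical helper for illustration; Claude Desktop constructs these messages for you.

```python
import json
from itertools import count

_request_ids = count(1)  # JSON-RPC requests need a unique id per call

# Defaults copied from the parameter list above.
FETCH_WEBPAGE_DEFAULTS = {
    "include_links": True,
    "include_images": True,
    "max_content_length": 50000,
    "format": "markdown",
    "extraction_mode": "trafilatura",
    "stealth_mode": "off",
    "auto_bypass": False,
}


def fetch_webpage_request(url, **overrides):
    """Build a tools/call request for fetch_webpage (illustrative only)."""
    unknown = set(overrides) - set(FETCH_WEBPAGE_DEFAULTS)
    if unknown:
        raise ValueError(f"unknown parameters: {sorted(unknown)}")
    arguments = {"url": url, **FETCH_WEBPAGE_DEFAULTS, **overrides}
    return {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {"name": "fetch_webpage", "arguments": arguments},
    }


request = fetch_webpage_request("https://example.com", format="text")
print(json.dumps(request, indent=2))
```

Spelling out the defaults is a choice made for readability here; a client could equally omit any argument it leaves at its default.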
Enhanced Features:
- 📝 Markdown output - Get structured markdown like Firecrawl
- 🎯 Rich metadata - Authors, dates, site names automatically extracted
- 📊 Extraction stats - Word count, content length, format info
- 🛡️ Stealth mode - Bypass Cloudflare, DataDome, Akamai, etc.
Example:
Fetch and summarize https://en.wikipedia.org/wiki/Artificial_intelligence in markdown format
3. search_and_fetch ⭐ RECOMMENDED
Search and automatically fetch full content from top results with Trafilatura-quality extraction.
Parameters:
- query (required) - Your search query
- num_results (optional) - How many results to fetch (1-5, default: 3)
- categories (optional) - Search categories
- time_range (optional) - Filter by recency: day, week, month, year
- format (optional) - Output format: text, markdown (default), html
- stealth_mode (optional) - NEW! Anti-bot bypass: off, low, medium, high (default: off)
- auto_bypass (optional) - NEW! Auto-escalate stealth if bot protection detected (default: false)
What it does:
- ✅ Searches for your query (with optional time filter)
- ✅ Gets top N results
- ✅ Automatically fetches full content (parallel)
- ✅ Uses Trafilatura for Firecrawl-quality extraction
- ✅ Returns both search snippets AND full webpage content
- ✅ FREE stealth mode for protected sites
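The flow described above (search, keep the top N, fetch them in parallel, return snippets plus content) can be sketched as follows. This is not the server's actual implementation, which this README does not show. The `search` and `fetch` callables are stand-ins passed in by the caller, and `fake_search` and `fake_fetch` are made-up stubs so the example runs offline.

```python
from concurrent.futures import ThreadPoolExecutor


def search_and_fetch(query, search, fetch, num_results=3):
    """Search, keep the top results, and fetch their pages in parallel."""
    if not 1 <= num_results <= 5:
        raise ValueError("num_results must be between 1 and 5")
    hits = search(query)[:num_results]
    # pool.map returns pages in the same order as the hits.
    with ThreadPoolExecutor(max_workers=num_results) as pool:
        pages = list(pool.map(lambda hit: fetch(hit["url"]), hits))
    # Each result keeps its search snippet and gains the full content.
    return [{**hit, "content": page} for hit, page in zip(hits, pages)]


# Stub backends (assumptions for illustration, no network access).
def fake_search(query):
    return [
        {"url": f"https://example.com/{i}", "snippet": f"{query} result {i}"}
        for i in range(1, 8)
    ]


def fake_fetch(url):
    return f"full text of {url}"


results = search_and_fetch("climate change solutions", fake_search, fake_fetch)
for item in results:
    print(item["url"], "|", item["snippet"])
```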
Examples:
Research "climate change solutions" and give me detailed info from top 3 sources
Get recent AI breakthroughs from past 24 hours with full articles (time_range: day, num_results: 5)
Research recent web development tutorials from past week (time_range: week, format: markdown)
4. deep_research 🧠
Perform comprehensive parallel research across multiple topics at once with AI-powered reranking and compiled markdown reports.
Parameters:
- queries (required) - Comma-separated list of research queries (max 10)
- breadth (optional) - Results to fetch per query (1-5, default: 3)
- time_range (optional) - Filter by recency: day, week, month, year
- max_content_length (optional) - Max content per result (default: 30000)
- stealth_mode (optional) - NEW! Anti-bot bypass: off, low, medium, high (default: off)
- auto_bypass (optional) - NEW! Auto-escalate stealth if bot protection detected (default: false)
What it does:
- ✅ Process up to 10 queries in parallel for speed
- ✅ AI reranking for better relevance (always enabled)
- ✅ Auto-generates compiled markdown report
- ✅ Rich metadata extraction (author, date, source)
- ✅ Server-side caching (30 minutes)
- ✅ Aggregated statistics across all queries
- ✅ FREE stealth mode for protected sites
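Two details from the lists above lend themselves to a short sketch: splitting the comma-separated `queries` string (max 10) and caching results for 30 minutes. How the server actually does either is not documented here, so `split_queries` and `TTLCache` are hypothetical names showing one plausible approach.

```python
import time

MAX_QUERIES = 10
CACHE_TTL_SECONDS = 30 * 60  # 30 minutes, as stated above


def split_queries(queries: str) -> list[str]:
    """Split a comma-separated query string, enforcing the 10-query limit."""
    parts = [q.strip() for q in queries.split(",") if q.strip()]
    if not parts:
        raise ValueError("at least one query is required")
    if len(parts) > MAX_QUERIES:
        raise ValueError(f"at most {MAX_QUERIES} queries are allowed")
    return parts


class TTLCache:
    """Tiny time-based cache; the clock is injectable to make testing easy."""

    def __init__(self, ttl=CACHE_TTL_SECONDS, clock=time.monotonic):
        self._ttl = ttl
        self._clock = clock
        self._entries = {}

    def put(self, key, value):
        self._entries[key] = (self._clock(), value)

    def get(self, key):
        entry = self._entries.get(key)
        if entry is None:
            return None
        stored_at, value = entry
        if self._clock() - stored_at >= self._ttl:
            del self._entries[key]  # expired, drop it
            return None
        return value


topics = split_queries("AI trends 2024,machine learning basics, ChatGPT use cases")
print(topics)
```

A practical consequence of comma-separated input is that an individual query cannot itself contain a comma.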
Examples:
Research "AI trends 2024,machine learning basics,ChatGPT use cases" with deep_research
Deep research on "React vs Vue,Next.js features,frontend trends" from past month
Comprehensive research: "climate solutions,renewable energy,carbon capture" with breadth: 5
💡 Usage Examples
Research a Topic
Use search_and_fetch to research "artificial general intelligence latest developments"
from the top 3 results and give me a comprehensive summary
Get Recent News (Time-Range Filter)
Search for AI breakthroughs from the past 24 hours using time_range: day
Recent Tutorials (Time-Range Filter)
Find Python tutorials from the past week using search with time_range: week
Fetch with Markdown Output
Fetch this article in markdown format: https://example.com/article
Research Recent Developments
Use search_and_fetch to research "quantum computing" from the past week
with time_range: week and get full article content in markdown
Find Specific Information
Search for "best restaurants in Tokyo" and show me the top 5 results
Multi-step Research
1. Search for "Python web scraping libraries"
2. Fetch the documentation page in markdown format
3. Explain how to use it with examples
🔧 Configuration
No configuration needed! 🎉
This MCP server connects to a free public API automatically. Just add it to your Claude Desktop config and it works immediately.
If you're looking for advanced configuration options, there aren't any - we've kept it simple on purpose!
🐛 Troubleshooting
"MCP server not appearing in Claude Desktop"
- Check that your claude_desktop_config.json is valid JSON
- Restart Claude Desktop completely (Quit and reopen)
- Check Console.app (macOS) for error messages
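A quick way to run the first check is to parse the file yourself. The sketch below is a hypothetical checker, not something shipped with this package. It takes the file's text and reports JSON syntax errors, plus a missing "miyami-websearch" entry, assuming you kept the entry name used in this README.

```python
import json


def check_config(text):
    """Return a list of problems found in a claude_desktop_config.json body."""
    try:
        config = json.loads(text)
    except json.JSONDecodeError as err:
        return [f"invalid JSON at line {err.lineno}, column {err.colno}: {err.msg}"]

    problems = []
    servers = config.get("mcpServers") if isinstance(config, dict) else None
    if not isinstance(servers, dict):
        problems.append('missing "mcpServers" object')
    elif "miyami-websearch" not in servers:
        problems.append('no "miyami-websearch" entry under "mcpServers"')
    return problems


# A trailing comma is a common mistake; strict JSON does not allow it.
print(check_config('{"mcpServers": {},}'))
print(check_config('{"mcpServers": {"miyami-websearch": {"command": "npx"}}}'))
```

An empty list means the checks passed. To check the real file, pass its contents in, for example via `pathlib.Path(...).read_text()`.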
"First search is slow (30-60 seconds)"
This is normal! The free tier API sleeps after inactivity. Subsequent requests are fast.
"Connection timeout"
The backend API is on Render free tier and may be waking up. Wait 60 seconds and retry.
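If you call the backend from your own scripts, the "wait 60 seconds and retry" advice can be automated. The sketch below is a generic retry wrapper under stated assumptions: `call_with_retry` is a hypothetical helper, and it assumes your request function raises Python's built-in `TimeoutError` on a timeout (many HTTP libraries raise their own exception types instead).

```python
import time


def call_with_retry(call, attempts=3, wait_seconds=60.0, sleep=time.sleep):
    """Run call(), waiting and retrying when it times out.

    The sleep function is injectable so the wait can be skipped in tests.
    """
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(1, attempts + 1):
        try:
            return call()
        except TimeoutError:
            if attempt == attempts:
                raise  # out of retries, surface the timeout
            sleep(wait_seconds)  # give the sleeping backend time to wake up
```

A fixed 60-second wait mirrors the advice above; exponential backoff would be a reasonable alternative if the wake-up time varies.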
"Tools not working"
- Ensure you have Node.js 18+ installed: node --version
- Try global install instead of npx
- Check GitHub Issues
📡 API Backend
This MCP server connects to a free public API:
- URL: https://websearch.miyami.tech (hardcoded, no config needed)
- Cost: 100% Free - no API keys or signup required
- Privacy: No logging, no tracking, no data collection
- Engines: Google, DuckDuckGo, Bing, Brave, Wikipedia, Startpage
- Stealth Mode: FREE anti-bot bypass (Cloudflare, DataDome, Akamai, etc.)
🤝 Contributing
Contributions welcome! Please:
- Fork the repository
- Create a feature branch
- Make your changes
- Submit a pull request
Report Issues
Found a bug? Open an issue
📄 License
MIT License - see LICENSE file for details
🌟 Star History
If this tool helps you, please star the repo! ⭐
Made with ❤️ for the LLM community
Connect your AI to the internet in seconds, not hours.
