Large community for a self-hosted search tool. Active development with rolling Docker releases.
SearXNG
Status: active
Privacy-first, self-hosted meta-search engine aggregating 70+ upstream engines. Zero cost, zero API keys, full data sovereignty.

Where it wins
Zero cost — no API keys, no per-query charges
Full data sovereignty — queries never leave your infrastructure
Aggregates 70+ search engines
Active development (last commit 2026-03-15)
Strong HN traction (302 pts + 134 pts)
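To make the "no API keys" point concrete, here is a minimal Python sketch of querying a self-hosted instance over HTTP. It assumes an instance reachable at http://localhost:8080 (a placeholder address) with JSON output enabled in its settings, which is not necessarily the default. The helper names are hypothetical. The query parameters (q, format, pageno, engines) and the result fields (title, url, engine) follow SearXNG's search API as commonly documented, so check them against your instance's version.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen


def build_search_url(base_url, query, engines=None, page=1):
    """Build a search URL for a SearXNG instance (hypothetical helper)."""
    params = {"q": query, "format": "json", "pageno": page}
    if engines:
        # Optionally restrict the query to specific upstream engines.
        params["engines"] = ",".join(engines)
    return f"{base_url.rstrip('/')}/search?{urlencode(params)}"


def summarize_results(payload, limit=5):
    """Reduce a decoded JSON response to title, URL, and source engine."""
    rows = []
    for item in payload.get("results", [])[:limit]:
        rows.append({
            "title": item.get("title", ""),
            "url": item.get("url", ""),
            "engine": item.get("engine", ""),
        })
    return rows


def search(base_url, query, **kwargs):
    """Run one query against the instance; no API key or auth header is sent."""
    url = build_search_url(base_url, query, **kwargs)
    with urlopen(url, timeout=10) as resp:
        return summarize_results(json.load(resp))
```

Calling search("http://localhost:8080", "self-hosted search", engines=["duckduckgo"]) would return up to five result summaries, provided the instance is running and allows the JSON format. Since the answers come from upstream engines, their quality will vary with whichever engines are enabled.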
Where to be skeptical
No independent benchmark data, so result quality cannot be graded
Requires self-hosting (Docker)
No managed service option
Quality depends on upstream engines
No structured extraction — pure search only
Editorial verdict
#4 in search-news — the only option where no query ever leaves your infrastructure. 26,644 stars, active development. Not independently benchmarked on quality, but unmatched on privacy and cost.
Videos
Reviews, tutorials, and comparisons from the community.
Private Internet Searches with SearXNG
Build your private Google: self-hosted AI search in 10 minutes
search results suck right now, use THIS instead
Related

Crawl4AI
93 · Free, open-source web scraping (Apache-2.0). 62K stars, 6,353 forks (nearly matches Firecrawl), actively maintained (v0.8.5, 2026-03-18), 384K weekly PyPI downloads. Best open-source alternative to Firecrawl.
Exa MCP Server
87 · Official Exa MCP for fast web search and crawling when the workflow is search-first rather than page-ops-first.
ScrapeGraphAI
82 · LLM-graph-based web scraper: describe what you want, and AI builds the extraction graph. 23K stars, 194 HN pts, active development (v1.74.0, Mar 2026). Open-source + hosted API.

Firecrawl MCP Server
72 · Official Firecrawl MCP for scraping, extraction, and deep research workflows. 95K+ GitHub stars (main repo), 1.23M combined weekly downloads, backed by $14.5M Series A. ScrapeOps 10/10.
Public evidence
Strong developer community interest, especially from privacy-conscious users.
Consistent release cadence. Not abandoned — active open-source project.