HLE designed to be near-impossible; 26.6% is a massive lead over Perplexity (21.1%) and Gemini (6.2%). Shows reasoning superiority.
OpenAI Deep Research
activeAgentic research mode powered by o3/o4-mini. 26.6% HLE (highest of any system), 72.57% GAIA, MCP support (Feb 2026). Slower (3-15 min) but deepest reasoning.

Where it wins
26.6% on Humanity's Last Exam — highest of any system (Helicone)
72.57% GAIA benchmark — best reported result
MCP support added Feb 2026 — first major platform to do so
API available via /responses endpoint with deep research models
Domain-restricted searches, real-time progress tracking
Where to be skeptical
3-15 min per query — 10-60x slower than Perplexity
$200/mo for unlimited (Pro); Plus ($20/mo) limited to 10 queries/mo
Closed source, no self-hosting option
DeepResearch Bench: o3 standalone outperformed Deep Research mode in some evals (FutureSearch)
Editorial verdict
#2 in research. Reasoning king — 26.6% HLE and 72.57% GAIA are best reported results from any system. MCP support (Feb 2026) enables enterprise toolchain integration. Slower and pricier than Perplexity but unmatched for PhD-level questions.
Related

GPT Researcher
89Open-source autonomous deep research agent. CMU DeepResearchGym #1 on citation quality, report quality, info coverage. 25.8K stars, 15.9K weekly PyPI downloads. Apache 2.0.

Tongyi DeepResearch
82First fully open-source deep research agent matching closed-source leaders on benchmarks. HLE 32.9 (exceeds OpenAI's 26.6), 30.5B params / 3.3B active (MoE), runs locally. 18.5K stars. Apache 2.0.

STORM (Stanford)
66Stanford's LLM-powered knowledge curation system. Generates Wikipedia-style articles with citations in ~3 min. 28K stars, 84.8% citation recall / 85.2% precision (peer-reviewed). MIT license.

Perplexity Deep Research
43Research-first search engine with inline citations. Fastest deep research (15-30s), 93.9% SimpleQA accuracy, 50+ sources per report. $20/mo Pro.
Public evidence
GAIA measures real-world agentic reasoning. 72.57% vs previous top of 63.64%.
MCP integration means OpenAI Deep Research can plug into enterprise toolchains. Domain-restricted searches, real-time progress tracking.
Even within OpenAI's lineup, o3 standalone > Deep Research mode in some evals.
Raw GitHub source
GitHub README could not be fetched right now.