Browser Use

active

Python library for controlling a real browser with vision and DOM extraction, built for agent workflows.

Score 76

Where it wins

Vision + DOM hybrid approach for robust page understanding

Large public traction and active development

Works with multiple LLM providers

Handles complex multi-step browser tasks

Where to be skeptical

Python-only — no native TypeScript/Node support

Reliability on complex flows is still evolving

Heavier setup than MCP-based browser tools

Editorial verdict

Category leader among open-source autonomous browser agents: 81K+ stars, 1M+ weekly PyPI downloads, and 89.1% on WebVoyager. The gap to #2 is large: roughly 4x the stars and 6x the downloads of the next autonomous agent.

Source

GitHub: browser-use/browser-use

Docs: browser-use.com


Videos

Reviews, tutorials, and comparisons from the community.

Browser Use: This New AI Agent Can Do Anything (Full AI Scraping Tutorial)

Tech With Tim·2025-03-23

Browser Use: FREE AI Agent CAN CONTROL BROWSERS & DO ANYTHING! (Beats Anthropic!)

WorldofAI·2025-01-06

Browser Use - AI Agent with the Browser

Ozgur Ozer·2025-01-05

Web Browsing / Browser Automation

#01 of 9

Full autonomous web browsing where the LLM needs complete control over unpredictable workflows

Playwright MCP

Microsoft's official MCP server for Playwright. Uses accessibility snapshots instead of screenshots for structured browser control. Auto-configured in GitHub Copilot's Coding Agent.

Chrome DevTools MCP

Google Chrome team's official MCP server for Chrome DevTools. Gives coding agents deep debugging, performance profiling, and Core Web Vitals analysis through 26 tools across 6 categories.

Stagehand

AI-native browser automation SDK by Browserbase with natural language selectors and act/extract/observe primitives.

Skyvern

Vision-LLM browser automation for enterprise workflows. Combines computer vision with LLM reasoning to handle websites never seen before. YC S23 backed with CAPTCHA solving, 2FA, and proxy networks.

Public evidence

strong·2025-03

TechCrunch: Browser Use raises $17M seed, backed by YC W25 and Paul Graham

$17M seed led by Felicis with Paul Graham participating. 20+ YC W25 batch companies used Browser Use. Manus (viral agent) built on top of it.

TechCrunch feature article·Ivan Mehta (TechCrunch), investors: Felicis, Paul Graham, A Capital

strong·2026-03

WebVoyager benchmark: 89.1% across 586 tasks (Steel.dev leaderboard)

Browser Use achieved 89.1% success rate on the WebVoyager benchmark across 586 web tasks. State-of-the-art open-source performance, though below commercial competitors (Surfer 2 at 97.1%).

Independent benchmark leaderboard·Steel.dev (independent)
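To put the headline rate in absolute terms, the sketch below converts the reported percentages into approximate task counts. The leaderboard figures quoted above are rates only; the counts are derived here by rounding, and applying the Surfer 2 rate to the same 586 tasks is an assumption made for illustration.

```python
# Figures quoted in the entry above.
TOTAL_TASKS = 586
BROWSER_USE_RATE = 0.891
SURFER_2_RATE = 0.971  # assumed to be measured on the same 586 tasks


def approx_successes(rate: float, total: int) -> int:
    """Approximate number of passed tasks implied by a success rate."""
    return round(rate * total)


browser_use_successes = approx_successes(BROWSER_USE_RATE, TOTAL_TASKS)
surfer_2_successes = approx_successes(SURFER_2_RATE, TOTAL_TASKS)

print(f"Browser Use: ~{browser_use_successes} of {TOTAL_TASKS} tasks")
print(f"Surfer 2:    ~{surfer_2_successes} of {TOTAL_TASKS} tasks")
print(f"Gap:         ~{surfer_2_successes - browser_use_successes} tasks")
```

Under these assumptions the eight-point gap corresponds to a few dozen tasks out of the full set.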

strong·2026-03

Firecrawl: 11 Best AI Browser Agents in 2026 — Browser Use ranked #1 open-source

Firecrawl's independent ranking places Browser Use as the top open-source AI browser agent. Cites vision+DOM hybrid approach and multi-step task capability.

Major AI tooling company blog·Firecrawl (independent)

strong·2026-03

1M+ weekly PyPI downloads — real adoption, not just stars

1M+ weekly PyPI downloads confirm real-world adoption at scale. No other open-source browser agent comes close: Skyvern is at 167K/wk, a roughly 6x gap.

1,015K weekly PyPI downloads·PyPI API (Mar 16, 2026)
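The "6x" figure can be checked directly from the two download counts quoted above. This is a minimal sketch of that arithmetic; both numbers are the point-in-time values cited in this entry and will drift.

```python
# Weekly PyPI downloads as quoted in this entry (Mar 16, 2026 snapshot).
BROWSER_USE_WEEKLY = 1_015_000
SKYVERN_WEEKLY = 167_000

ratio = BROWSER_USE_WEEKLY / SKYVERN_WEEKLY
print(f"Browser Use has about {ratio:.1f}x Skyvern's weekly downloads")
```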

strong·Self-reported·2026-03

YC W25 + SOC 2 Type II + cloud product at $30/month

YC W25 backing, SOC 2 Type II certified, cloud product at $30/month. Proprietary cost-optimized model BU-30B (200 tasks/$1). Enterprise runway no other open-source agent has.

Enterprise product features·Browser Use team (official)
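For budgeting, the self-reported "200 tasks/$1" rate works out to half a cent per task. The sketch below only scales that quoted figure; `estimated_cost` is a hypothetical helper, real cost per task will vary with task length, and how this rate relates to the $30/month plan is not stated in the source.

```python
TASKS_PER_DOLLAR = 200  # self-reported BU-30B figure quoted above


def estimated_cost(tasks: int, tasks_per_dollar: int = TASKS_PER_DOLLAR) -> float:
    """Estimated model spend in dollars for a given number of tasks."""
    if tasks < 0:
        raise ValueError("tasks must be non-negative")
    return tasks / tasks_per_dollar


print(f"Per task:    ${estimated_cost(1):.3f}")
print(f"1,000 tasks: ${estimated_cost(1000):.2f}")
```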

moderate·Self-reported·2026-03

browser-use/browser-use: 81K+ stars — fastest-growing OSS AI browser agent

Dominant AI browser agent framework by stars. Zero to 81K+ in ~18 months. Weekly release cadence. 314 contributors.

81K+ stars, 9.6K forks, 314 contributors·Open-source community

moderate·2026-03

Dev.to hands-on comparison: Browser Use wins for complex multi-step tasks

Hands-on comparison of 6 browser automation tools confirms Browser Use wins for 'complex multi-step tasks (form filling, autonomous workflows).'

DEV Community hands-on comparison of 6 tools·minatoplanb (independent developer)

moderate·2026-03

AWS blog, InfoWorld, Apify integration confirm production use

Multiple independent sources confirm Browser Use in production contexts. AWS blog coverage, InfoWorld feature, and official Apify integration validate real-world deployment beyond star counts.

Multiple independent sources·AWS, InfoWorld, Apify (independent)

strong·2026-02

NxCode: Stagehand vs Browser Use vs Playwright — AI Browser Automation Compared (2026)

Confirms Browser Use's unique full-autonomy positioning: 'the LLM decides what to click, what to type, when to scroll, and when the task is complete.' Independent editorial, not sponsored.

Major tech comparison site·NxCode (independent tech publication)

strong·2025-01

HN: Stagehand Show HN with Browser Use comparisons — 326 points, 86 comments

Multiple commenters compared Browser Use and Stagehand architectures in depth. Organic community discussion validates Browser Use as the autonomous agent default.

326 HN points, 86 comments·HN community

moderate·2026-03

rtrvr.ai benchmark: Browser Use Cloud 43.9% vs Skyvern 64.4% — speed vs reliability trade-off

Browser Use Cloud showed a 43.9% success rate vs Skyvern's 64.4% in cloud mode. However, Browser Use was 2x faster and 2x cheaper per task. The result reveals a reliability gap in cloud deployment, offset by a strong cost and speed profile.

Independent benchmark comparison·rtrvr.ai (independent)
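One way to weigh this trade-off is cost per successful task rather than cost per attempt. The sketch below assumes failed tasks are simply retried, that attempts are independent (so the expected number of attempts is 1 / success rate), and that "2x cheaper" means exactly half the per-attempt cost. None of these assumptions come from the benchmark itself, and the cost unit is arbitrary.

```python
def cost_per_success(cost_per_attempt: float, success_rate: float) -> float:
    """Expected spend to get one success when failures are retried independently."""
    if not 0 < success_rate <= 1:
        raise ValueError("success_rate must be in (0, 1]")
    return cost_per_attempt / success_rate


# Skyvern's per-attempt cost is normalised to 1.0; Browser Use is taken as half.
skyvern = cost_per_success(cost_per_attempt=1.0, success_rate=0.644)
browser_use = cost_per_success(cost_per_attempt=0.5, success_rate=0.439)

print(f"Skyvern:     {skyvern:.2f} cost units per success")
print(f"Browser Use: {browser_use:.2f} cost units per success")
```

Under these assumptions Browser Use Cloud still comes out cheaper per completed task, but the picture changes if a failed run cannot be retried or carries its own cost.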

Raw GitHub source

GitHub README peek

Constrained peek so you can sanity-check the source material without leaving the site.

🌤️ Want to skip the setup? Use our cloud for faster, scalable, stealth-enabled browser automation!

🤖 LLM Quickstart

Direct your favorite coding agent (Cursor, Claude Code, etc) to Agents.md
Prompt away!


👋 Human Quickstart

Browser Use 0.13 introduces a new beta agent powered by a Rust core and a browser harness built for current frontier models. It gives the model a real browser/computer action space, persistent tools, and recovery loops inspired by coding agents.

Python API -> Rust core -> Browser harness -> Web task done

1. Install Browser Use with the native core runtime (Python>=3.11):

uv add "browser-use[core]"
# or: pip install "browser-use[core]"

The [core] extra installs the native Browser Use runtime for your platform.

2. [Optional] Get your API key from Browser Use Cloud:

# .env
BROWSER_USE_API_KEY=your-key
# GOOGLE_API_KEY=your-key
# ANTHROPIC_API_KEY=your-key

3. Run your first agent:

from browser_use.beta import Agent, BrowserProfile, ChatBrowserUse
# from browser_use.beta import ChatOpenAI  # ChatOpenAI(model='gpt-5.5')
# from browser_use.beta import ChatGoogle  # ChatGoogle(model='gemini-3.1-pro-preview')
# from browser_use.beta import ChatAnthropic  # ChatAnthropic(model='claude-opus-4-8')
import asyncio

async def main():
    agent = Agent(
        task="Find the number of stars of the browser-use repo",
        llm=ChatBrowserUse(),
        # llm=ChatOpenAI(model='gpt-5.5'),
        # llm=ChatGoogle(model='gemini-3.1-pro-preview'),
        # llm=ChatAnthropic(model='claude-opus-4-8'),  # Sonnet also works well.
        browser_profile=BrowserProfile(
            headless=False,
            allowed_domains=["*.github.com"],
        ),
    )
    history = await agent.run()
    print(history.final_result())

if __name__ == "__main__":
    asyncio.run(main())
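Before running the agent, it can help to confirm that at least one of the keys from step 2 is actually visible to the process. The following is a minimal sketch, not part of the README: `configured_keys` is a hypothetical helper, and it assumes the `.env` values have already been loaded into the process environment (for example by a dotenv loader), which the standard library does not do on its own.

```python
import os

# Key names taken from the .env example in step 2.
PROVIDER_KEYS = ("BROWSER_USE_API_KEY", "GOOGLE_API_KEY", "ANTHROPIC_API_KEY")


def configured_keys(env=None):
    """Return the names of provider keys that are set to a non-empty value."""
    env = os.environ if env is None else env
    return [name for name in PROVIDER_KEYS if env.get(name)]


if __name__ == "__main__":
    found = configured_keys()
    if found:
        print("Configured:", ", ".join(found))
    else:
        print("No provider API key found in the environment")
```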

Existing Python agent users can keep using from browser_use import Agent. The new Rust-powered beta agent is from browser_use.beta import Agent.
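For code that should prefer the beta agent but still work where only the existing one is available, a guarded import is one option. This is a sketch under assumptions: only the two import paths named above come from the README, `load_agent_class` is a hypothetical helper, and it assumes an unavailable module surfaces as an `ImportError`.

```python
import importlib


def load_agent_class():
    """Return (Agent class, module path), preferring the beta agent.

    Returns (None, None) if neither module can be imported.
    """
    for module_path in ("browser_use.beta", "browser_use"):
        try:
            module = importlib.import_module(module_path)
        except ImportError:
            continue  # package or native core runtime not installed
        agent_cls = getattr(module, "Agent", None)
        if agent_cls is not None:
            return agent_cls, module_path
    return None, None


if __name__ == "__main__":
    agent_cls, source = load_agent_class()
    print(f"Using Agent from {source}" if agent_cls else "browser-use is not installed")
```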

Check out the library docs and the cloud docs for more!


Open Source vs Cloud

We benchmark Browser Use across 100 real-world browser tasks. Full benchmark is open source: browser-use/benchmark.

Use the Open-Source Agent

You need custom tools or deep code-level integration
We recommend pairing with our cloud browsers for leading stealth, proxy rotation, and scaling
Or self-host the open-source agent fully on your own machines

Use the Fully-Hosted Cloud Agent (recommended)

Much more powerful agent for complex tasks (see plot above)
Easiest way to start and scale
Best stealth with proxy rotation and captcha solving
1000+ integrations (Gmail, Slack, Notion, and more)
Persistent filesystem and memory


Demos

📋 Form-Filling

Task = "Fill in this job application with my resume and information."


🍎 Grocery-Shopping
