Strongest sustained HN presence in the entire web-browsing category. 422-point peak (highest standalone launch) plus 327-point Launch HN. Consistent engagement over 2 years validates long-term community interest.
Skyvern
activeVision-LLM browser automation for enterprise workflows. Combines computer vision with LLM reasoning to handle websites never seen before. YC S23 backed with CAPTCHA solving, 2FA, and proxy networks.
Where it wins
Vision-LLM approach — handles websites never seen before, resilient to layout changes
Enterprise features: CAPTCHA solving, 2FA handling, proxy networks, geo-targeting
Multi-step workflow engine for complex business processes
YC S23 backed with $2.7M raised
Where to be skeptical
AGPL-3.0 license limits commercial use
Enterprise/RPA focus — overkill for coding agent browser tasks
Python-only
Pricing opacity noted by independent reviewers
Editorial verdict
Best pick for enterprise workflow automation on websites without APIs — form filling, data entry, procurement. Overkill for developer/coding agent browser tasks.
Source
Videos
Reviews, tutorials, and comparisons from the community.
This Browser Agent Automates ANYTHING (N8N + Skyvern)
Related
Playwright MCP
93Microsoft's official MCP server for Playwright. Uses accessibility snapshots instead of screenshots for structured browser control. Auto-configured in GitHub Copilot's Coding Agent.
Chrome DevTools MCP
92Google Chrome team's official MCP server for Chrome DevTools. Gives coding agents deep debugging, performance profiling, and Core Web Vitals analysis through 26 tools across 6 categories.
Stagehand
90AI-native browser automation SDK by Browserbase with natural language selectors and act/extract/observe primitives.
Vercel Agent Browser
88Token-efficient browser automation CLI for AI agents. Rust core with sub-50ms boot. Claims 93% context reduction vs Playwright MCP through ref-based element selection on accessibility snapshots.
Public evidence
Highest single-thread launch score in the category. Computer vision + LLM approach differentiates from DOM-centric tools. No-code workflow builder makes it accessible to non-developers.
Skyvern scored 85.85% on the WebVoyager benchmark. Solid but below Browser Use (89.1%). Validates the vision-LLM approach for enterprise automation.
Now at v1.x (production-ready). AGPL-3.0 license. Active weekly releases. Enterprise-focused with CAPTCHA, 2FA, proxy support.
Balanced review: strengths in vision+LLM approach and natural language automation. Weaknesses in pricing opacity, steep learning curve, and AGPL license.
Skyvern achieved 64.4% success rate vs Browser Use Cloud's 43.9%. Skyvern won on reliability while Browser Use won on speed (2x faster) and cost (2x cheaper per task).
Raw GitHub source
GitHub README peek
Constrained peek so you can sanity-check the source material without leaving the site.
Skyvern automates browser-based workflows using LLMs and computer vision. It provides a Playwright-compatible SDK that adds AI functionality on top of playwright, as well as a no-code workflow builder to help both technical and non-technical users automate manual workflows on any website, replacing brittle or unreliable automation solutions.
<p align="center"> <img src="https://raw.githubusercontent.com/Skyvern-AI/skyvern/main/fern/images/geico_shu_recording_cropped.gif"/> </p>Traditional approaches to browser automations required writing custom scripts for websites, often relying on DOM parsing and XPath-based interactions which would break whenever the website layouts changed.
Instead of only relying on code-defined XPath interactions, Skyvern relies on Vision LLMs to learn and interact with the websites.
How it works
Skyvern was inspired by the Task-Driven autonomous agent design popularized by BabyAGI and AutoGPT -- with one major bonus: we give Skyvern the ability to interact with websites using browser automation libraries like Playwright.
Skyvern uses a swarm of agents to comprehend a website, and plan and execute its actions:
<picture> <source media="(prefers-color-scheme: dark)" srcset="fern/images/skyvern_2_0_system_diagram.png" /> <img src="https://raw.githubusercontent.com/Skyvern-AI/skyvern/main/fern/images/skyvern_2_0_system_diagram.png" /> </picture>This approach has a few advantages:
- Skyvern can operate on websites it's never seen before, as it's able to map visual elements to actions necessary to complete a workflow, without any customized code
- Skyvern is resistant to website layout changes, as there are no pre-determined XPaths or other selectors our system is looking for while trying to navigate
- Skyvern is able to take a single workflow and apply it to a large number of websites, as it's able to reason through the interactions necessary to complete the workflow A detailed technical report can be found here.
Demo
<!-- Redo demo -->https://github.com/user-attachments/assets/5cab4668-e8e2-4982-8551-aab05ff73a7f
Quickstart
Skyvern Cloud
Skyvern Cloud is a managed cloud version of Skyvern that allows you to run Skyvern without worrying about the infrastructure. It allows you to run multiple Skyvern instances in parallel and comes bundled with anti-bot detection mechanisms, proxy network, and CAPTCHA solvers.
If you'd like to try it out, navigate to app.skyvern.com and create an account.
Run Locally (UI + Server)
Choose your preferred setup method:
Database default:
skyvern quickstartandskyvern run serverdefault to a SQLite database at~/.skyvern/data.dbso the pip path works without Postgres or Docker. To use Postgres instead, pass--postgresfor a local container or--database-stringfor an existing database. Docker Compose always uses the bundled Postgres service.
Option A: pip install (Recommended for Python-managed local setup)
Dependencies needed:
- Python 3.11, 3.12, or 3.13
Additionally, for Windows:
- Rust
- VS Code with C++ dev tools and Windows SDK
1. Install Skyvern
pip install "skyvern[all]"
2. Run Skyvern
skyvern quickstart
The pip quickstart uses SQLite by default. For a local Postgres container, run skyvern quickstart --postgres.
Option B: Docker Compose
Use this option if you want everything containerized (Postgres, API, UI) and don't want to install Python/Node locally.
- Install Docker Desktop
- Clone the repository:
git clone https://github.com/skyvern-ai/skyvern.git && cd skyvern - Configure your LLM provider in
.env(thequickstart --docker-composecommand below will create it from.env.exampleif missing):cp .env.example .env # if not already created # edit .env to add your LLM API key - Start everything:
docker compose up -d - Open http://localhost:8080
Troubleshooting
(sqlite3.OperationalError) table organizations already exists — You hit a known bug in pip install skyvern==1.0.31. Fix:
rm ~/.skyvern/data.db # remove the leftover SQLite file
pip install --upgrade skyvern # 1.0.32+ contains the fix
skyvern quickstart
If you are still on 1.0.31 and cannot upgrade, install via uv instead:
uv pip install skyvern
pip install skyvern fails with ResolutionImpossible (litellm / fastmcp) — You hit a dependency-resolution conflict in 1.0.31. Either upgrade to 1.0.32+ or use uv: uv pip install skyvern.
SDK
Skyvern is a Playwright extension that adds AI-powered browser automation. It gives you the full power of Playwright with additional AI capabilities—use natural language prompts to interact with elements, extract data, and automate complex multi-step workflows.
Installation:
- Python SDK / cloud API:
pip install skyvern - Local server + packaged UI:
pip install "skyvern[all]"then runskyvern quickstart - Local server + packaged UI with Postgres:
pip install "skyvern[all]"then runskyvern quickstart --postgres - Packaged UI for an existing API:
pip install "skyvern[ui]"then runskyvern run ui --api-url <api-url> --api-key <api-key> - TypeScript:
npm install @skyvern/client
AI-Powered Page Commands
Skyvern adds four core AI commands directly on the page object:
| Command | Description |
|---|