skillpack.co
All skills

Factory AI (Droids)

active

Terminal-Bench #1 (58.75%), $50M Series B at $300M valuation (Sequoia, NEA, NVIDIA). Wipro partnership (tens of thousands of engineers) — largest enterprise deployment commitment. Previously claimed 84.8% SWE-bench is UNVERIFIED. Zero grassroots developer adoption.

Generator
Orchestrator

53/100

Trust

610

Stars

5

Evidence

Product screenshot

Factory AI (Droids) in action

Videos

Reviews, tutorials, and comparisons from the community.

These Factory AI Droids Built My App in 10 Minutes! (Rust + TypeScript!) 👉 Code, Debug & Ship Apps

AI LABS·2025-06-01

AI Droids, Dev Velocity, and Bulletproof Security | Inside Factory

Max Abram·2025-09-17

Editorial verdict

Enterprise-only with legitimate backing, but zero grassroots signal and an unverified benchmark claim. Large enterprises with white-glove support needs may benefit; not for individual developers or small teams.

Source

Public evidence

strong2026-03
hyperdev — 'Promising Concept, Premature Execution'

Specific technical criticisms: slow response times, inferior code quality vs Claude, unusable history UI. Directly contradicts Every.to positive review.

Detailed independent technical reviewRobert Matsuoka (hyperdev, independent reviewer)
strong2026-03
Terminal-Bench #1 — 58.8% (beat Claude Code, Codex CLI)

Droid with Opus scored 58.8% on Terminal-Bench, beating Claude Code (43.2%) and Codex CLI (42.8%). Strongest benchmark result in the autonomous platform subcategory.

Public benchmark resultsTerminal-Bench (independent benchmark)

How does this compare?

See side-by-side metrics against other skills in the same category.

COMPARE SKILLS →

Where it wins

Terminal-Bench #1 at 58.75% (beat Claude Code 43.2%, Codex CLI 42.8%)

$50M Series B at $300M valuation — Sequoia, NEA, NVIDIA, J.P. Morgan

Wipro partnership: tens of thousands of engineers — largest enterprise deployment commitment in category

Enterprise customers: MongoDB, EY, Bayer, Zapier, Clari

Danny Aziz (GM of Spiral): 'canceled Claude + ChatGPT Max plans for Droid'

Where to be skeptical

Zero grassroots developer adoption — no significant HN threads, Reddit, or independent reviews

Previously claimed 84.8% SWE-bench score UNVERIFIED by any independent source

Robert Matsuoka: 'great vision, flawed execution, not ready for serious work'

Closed-source, no self-hosting, enterprise pricing only

Only 610 stars — no open-source community path

Ranking in categories

Know a better alternative?

Submit evidence and we'll run the full pipeline.

SUBMIT →

Similar skills

Raw GitHub source

GitHub README could not be fetched right now.