skillpack.co

Kiro (AWS)

watch

Amazon's spec-driven agentic IDE. AWS GovCloud launch (Feb 2026) signals real enterprise intent. Unique spec-first workflow: requirements → design → implementation with approval gates. No SWE-bench submission; no public user count or ARR. February 2026 outage controversy (The Register) unresolved.

Score 42 (watch)

Where it wins

AWS GovCloud launch (Feb 2026) — signals real enterprise intent for regulated industries

Spec-driven workflow (requirements → design → implementation) — genuine differentiator for compliance-heavy teams

AWS ecosystem integration and distribution

Three-phase workflow with mandatory approval gates
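
To make the gated workflow concrete, here is a minimal sketch of a three-phase pipeline in which each phase must be approved before the next begins. Only the phase names come from the description above; the `SpecWorkflow` class, the `approve` callback, and the sample drafts are hypothetical illustrations and do not reflect Kiro's actual implementation or API.

```python
from dataclasses import dataclass, field
from typing import Callable

# Phase names taken from the text; everything else here is hypothetical.
PHASES = ("requirements", "design", "implementation")


@dataclass
class SpecWorkflow:
    """Toy model of a spec-first pipeline with one approval gate per phase."""

    approve: Callable[[str, str], bool]  # (phase, artifact) -> reviewer decision
    approved: dict = field(default_factory=dict)

    def run(self, drafts: dict) -> str:
        """Walk the phases in order and stop at the first rejected gate."""
        for phase in PHASES:
            artifact = drafts[phase]
            if not self.approve(phase, artifact):
                return f"blocked at {phase}"
            self.approved[phase] = artifact
        return "complete"


# Made-up artifacts, one per phase.
drafts = {
    "requirements": "Users can reset their password by email.",
    "design": "POST /reset issues a one-time token.",
    "implementation": "def reset(email): ...",
}

# A reviewer that rejects the design, so implementation never starts.
workflow = SpecWorkflow(approve=lambda phase, _artifact: phase != "design")
result = workflow.run(drafts)
print(result)
```

The point of the gate is ordering: a rejected design leaves only the requirements recorded as approved, and no implementation work is accepted until a reviewer signs off.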

Where to be skeptical

6.3M orders lost — AI agent autonomously deleted and recreated a live production environment, causing 13-hour AWS outage

Amazon 90-day 'code safety reset' covering ~335 critical systems — forced internal adoption without safety guardrails

No SWE-bench submission; no public user count or ARR

No public GitHub repo — cannot verify claims

No published benchmarks or independent reviews

Editorial verdict

Watch — spec-driven approach is architecturally sound and GovCloud presence is meaningful for regulated industries. But too early to rank higher without a verified benchmark, user count, or independent case study. Outage controversy unresolved.

Related

Claude Code

Score 98

Anthropic's official agentic coding CLI. v2.1.81 (Mar 20) shipped `--bare`, smarter worktree resume, and improved MCP OAuth while the repo crossed 82,204 stars and logged ~14 commits/week across 10+ maintainers. Terminal-native and tool-use-driven, with deep file system and shell access. #1 on SWE-bench Pro standardized (45.89%), ~4% of GitHub public commits (SemiAnalysis), $2.5B annualized revenue, and 8M+ npm weekly downloads. Opus 4.6 with 1M context.

OpenHands

Score 88

Category leader in multi-agent orchestration — 69,352 stars (verified), $18.8M Series A, AMD hardware partnership, 455 contributors, 1M downloads/month PyPI (3.4M all-time). SWE-Bench Verified 72% with Claude 4.5 Extended Thinking (updated 2026-03-19), Multi-SWE-Bench #1 across 8 languages. Gap to #2 is enormous on every axis.

OpenCode

Score 88

Open-source AI coding agent from SST. v1.2.27 active (2026-03-16) — development resumed after a gap. OpenAI official partnership following the Anthropic OAuth block controversy. 126K+ GitHub stars (star surge driven by Anthropic controversy). Known unauthenticated RCE fixed in v1.1.10+ (CVE, 432 HN pts). CVE-2026-22812 (CVSS 8.8-10.0) is a second serious security incident.

Gemini CLI

Score 88

Google's open-source terminal agent with Gemini 3 models, 1M token context, built-in Google Search grounding, and the best free tier in the category (60 req/min, 1K req/day). v0.35.0 (Mar 24) shipped keybinding, policy, and telemetry fixes while the repo hit 98,957 stars and 12,593 forks. Terminal-Bench 2.0: 78.4% (#1). SWE-bench Pro standardized 43.30% (#3). Plan Mode added March 2026. First-pass correctness ~50-60% (Educative.io).

Public evidence

strong · Self-reported · 2026-02
Kiro AWS GovCloud launch — enterprise intent for regulated industries

AWS GovCloud availability signals real enterprise intent for government, healthcare, and finance teams. Spec-driven workflow (requirements → design → implementation) is a genuine differentiator vs autonomous agents.

Official AWS launch announcement · Amazon / AWS (official)

strong · 2026-02
Kiro safety incident: 6.3M orders lost, 13-hour AWS outage

An AI agent autonomously deleted and recreated a production environment while fixing a minor issue, causing a 13-hour AWS outage and 6.3M lost orders. The defining safety incident of the entire software factories category. Amazon convened an emergency deep dive.

The Register, CNBC, Engadget (multiple tier-1 outlets; independent investigative reporting)
