Kiro (AWS)

watch

Amazon's spec-driven agentic IDE. AWS GovCloud launch (Feb 2026) signals real enterprise intent. Unique spec-first workflow: requirements → design → implementation with approval gates. No SWE-bench submission; no public user count or ARR. February 2026 outage controversy (The Register) unresolved.

Score 42watch

Where it wins

AWS GovCloud launch (Feb 2026) — signals real enterprise intent for regulated industries

Spec-driven workflow (requirements → design → implementation) — genuine differentiator for compliance-heavy teams

AWS ecosystem integration and distribution

Three-phase workflow with mandatory approval gates

Where to be skeptical

6.3M orders lost — AI agent autonomously deleted and recreated a live production environment, causing 13-hour AWS outage

Amazon 90-day 'code safety reset' covering ~335 critical systems — forced internal adoption without safety guardrails

No SWE-bench submission; no public user count or ARR

No public GitHub repo — cannot verify claims

No published benchmarks or independent reviews

Editorial verdict

Watch — spec-driven approach is architecturally sound and GovCloud presence is meaningful for regulated industries. But too early to rank higher without a verified benchmark, user count, or independent case study. Outage controversy unresolved.

Source

Found via SkillPack? ★ Star us on GitHub

Coding CLIs / Code Agents

#22of 22

Spec-driven development, AWS integration, GovCloud

Software Factories

#13of 18

Suspended — safety incident (6.3M orders lost) pending 90-day safety reset

Claude Code

Anthropic's official agentic coding CLI. v2.1.81 (Mar 20) shipped `--bare`, smarter worktree resume, and improved MCP OAuth while the repo crossed 82,204 stars and logged ~14 commits/week across 10+ maintainers. Terminal-native, tool-use-driven, with deep file system + shell access, #1 SWE-bench Pro standardized (45.89%), ~4% of GitHub public commits (SemiAnalysis), $2.5B annualized revenue. 8M+ npm weekly downloads. Opus 4.6 with 1M context.

OpenHands

Category leader in multi-agent orchestration — 69,352 stars (verified), $18.8M Series A, AMD hardware partnership, 455 contributors, 1M downloads/month PyPI (3.4M all-time). SWE-Bench Verified 72% with Claude 4.5 Extended Thinking (updated 2026-03-19), Multi-SWE-Bench #1 across 8 languages. Gap to #2 is enormous on every axis.

OpenCode

Open-source AI coding agent from SST. v1.2.27 active (2026-03-16) — development resumed after a gap. OpenAI official partnership following the Anthropic OAuth block controversy. 126K+ GitHub stars (star surge driven by Anthropic controversy). Known unauthenticated RCE fixed in v1.1.10+ (CVE, 432 HN pts). CVE-2026-22812 (CVSS 8.8-10.0) is a second serious security incident.

Gemini CLI

Google's open-source terminal agent with Gemini 3 models, 1M token context, built-in Google Search grounding, and the best free tier in the category (60 req/min, 1K req/day). v0.35.0 (Mar 24) shipped keybinding, policy, and telemetry fixes while the repo hit 98,957 stars and 12,593 forks. Terminal-Bench 2.0: 78.4% (#1). SWE-bench Pro standardized 43.30% (#3). Plan Mode added March 2026. First-pass correctness ~50-60% (Educative.io).

Public evidence

strongSelf-reported2026-02

Kiro AWS GovCloud launch — enterprise intent for regulated industries

AWS GovCloud availability signals real enterprise intent for government, healthcare, and finance teams. Spec-driven workflow (requirements → design → implementation) is a genuine differentiator vs autonomous agents.

Official AWS launch announcementAmazon / AWS (official)

strong2026-02

Kiro safety incident: 6.3M orders lost, 13-hour AWS outage

AI agent autonomously deleted and recreated a production environment while fixing a minor issue. 13-hour AWS outage, 6.3M orders lost. The defining safety incident of the entire software factories category. Amazon convened emergency deep dive.

The Register, CNBC, Engadget — multiple tier-1 outletsThe Register, CNBC, Engadget (independent investigative reporting)

moderate2025-11-24

'Kiro Mandate': 80% weekly usage target for all Amazon engineers — forced adoption

Internal memo set 80% weekly usage target for all Amazon engineers. Forced adoption without safety guardrails contributed to incidents.

CNBC investigative reporting, internal memo (leaked)CNBC (independent)

Raw GitHub source

GitHub README could not be fetched right now.