Codex CLI

active

OpenAI's open-source coding agent built in Rust. Terminal-Bench 77.3% (#2), SWE-bench Pro standardized 41.04% (GPT-5.2-Codex). GPT-5.4 shipped March 5, 2026. Codex Security agent adds appsec capabilities. 3-4x more token-efficient than Claude Code, 240+ tokens/sec. Free with ChatGPT subscription, sandbox-first execution. 1M+ first-month users. Cleanest security record in Tier 1 — no documented incidents.

Score 87

Where it wins

Terminal-Bench 77.3% (GPT-5.3-Codex) — strongest on terminal-native tasks (ForgeCode leads at 81.8%)

3-4x more token-efficient than Claude Code (Morph study), API pricing 60-75% cheaper ($1.50 vs $5.00 input)

Sandbox-first execution — genuine safety differentiator (caught SSRF that Claude Code missed per Blake Crosley)

Free with ChatGPT subscription — lowest barrier to entry

619 releases in 10 months — fastest iteration in the category

240+ tokens/sec throughput, 1M+ first-month users (Morphllm)

No documented security incidents — cleanest trust record in Tier 1

GPT-5.4 shipped March 5, 2026 — Codex Security agent adds appsec capabilities

Native macOS + Windows app (Feb-Mar 2026) for parallel agent management
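The pricing claim in the list above can be sanity-checked with quick arithmetic. This is only a sketch using the two input prices quoted there; the source does not state the units, and the variable names are illustrative.

```shell
# Input prices as quoted in the list above (units as given by the source).
codex_price=1.50
claude_price=5.00

# Relative saving = (1 - cheaper / dearer) * 100
saving=$(awk -v a="$codex_price" -v b="$claude_price" \
  'BEGIN { printf "%.1f", (1 - a / b) * 100 }')

echo "Input price saving: ${saving}%"
```

For this pair of prices the saving falls inside the quoted 60-75% range. Other price pairs, such as output tokens, are not given in the source and would need their own check.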

Where to be skeptical

SWE-bench Pro standardized: 41.04% trails Claude Code's 45.89% by ~5pp. First-pass code quality trails Claude Code (67% vs 95% in blind tests)

Users report task times of 20+ minutes where Claude finishes in under a minute (HN thread, 104 points). 'Too slow' is the consistent UX complaint

Tied to OpenAI models only

Headline 56.8% SWE-bench Pro number uses custom scaffold — not comparable to standardized results
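The "~5pp" gap quoted above is an absolute difference in percentage points, which is easy to confuse with a relative difference. A minimal sketch, using only the two standardized scores from the text (variable names are illustrative):

```shell
# SWE-bench Pro standardized scores as quoted above.
codex_score=41.04
claude_score=45.89

# Absolute gap in percentage points.
gap_pp=$(awk -v a="$codex_score" -v b="$claude_score" \
  'BEGIN { printf "%.2f", b - a }')

# Relative gap: how far Codex CLI sits below Claude Code's score.
gap_rel=$(awk -v a="$codex_score" -v b="$claude_score" \
  'BEGIN { printf "%.1f", (b - a) / b * 100 }')

echo "Gap: ${gap_pp} pp absolute, ${gap_rel}% relative"
```

The same calculation should not be applied to the 56.8% headline number, since the text notes it comes from a different scaffold.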

Editorial verdict

#2 coding CLI. Rust rewrite eliminates Node.js dependency — unique in category. Terminal-Bench 77.3% (#2) and 3-4x more token-efficient. Cleanest security record among Tier 1 tools. GPT-5.4 shipped March 2026. Trails Claude Code by ~5pp on SWE-bench Pro standardized (41.04% vs 45.89%) and first-pass quality (67% vs 95%).

Source

GitHub: openai/codex

Docs: developers.openai.com


Videos

Reviews, tutorials, and comparisons from the community.

OpenAI Codex CLI

OpenAI·2025-04-16

Getting started with Codex

OpenAI·2026-01-12

Using OpenAI Codex CLI with GPT-5-Codex

OpenAI·2025-10-14

Introducing the Codex app

OpenAI·2025-01-01

Coding CLIs / Code Agents

#04 of 22

High-volume daily coding, speed-sensitive workflows, air-gapped/locked-down environments (Rust binary)

Software Factories

#04 of 18

Developers in the OpenAI ecosystem wanting a fast, token-efficient coding CLI with strong benchmark scores

Claude Code

Anthropic's official agentic coding CLI. v2.1.81 (Mar 20) shipped `--bare`, smarter worktree resume, and improved MCP OAuth while the repo crossed 82,204 stars and logged ~14 commits/week across 10+ maintainers. Terminal-native, tool-use-driven, with deep file system + shell access, #1 SWE-bench Pro standardized (45.89%), ~4% of GitHub public commits (SemiAnalysis), $2.5B annualized revenue. 8M+ npm weekly downloads. Opus 4.6 with 1M context.

OpenHands

Category leader in multi-agent orchestration — 69,352 stars (verified), $18.8M Series A, AMD hardware partnership, 455 contributors, 1M downloads/month PyPI (3.4M all-time). SWE-Bench Verified 72% with Claude 4.5 Extended Thinking (updated 2026-03-19), Multi-SWE-Bench #1 across 8 languages. Gap to #2 is enormous on every axis.

OpenCode

Open-source AI coding agent from SST. v1.2.27 active (2026-03-16) — development resumed after a gap. OpenAI official partnership following the Anthropic OAuth block controversy. 126K+ GitHub stars (star surge driven by Anthropic controversy). Known unauthenticated RCE fixed in v1.1.10+ (CVE, 432 HN pts). CVE-2026-22812 (CVSS 8.8-10.0) is a second serious security incident.

Gemini CLI

Google's open-source terminal agent with Gemini 3 models, 1M token context, built-in Google Search grounding, and the best free tier in the category (60 req/min, 1K req/day). v0.35.0 (Mar 24) shipped keybinding, policy, and telemetry fixes while the repo hit 98,957 stars and 12,593 forks. Terminal-Bench 2.0: 78.4% (#1). SWE-bench Pro standardized 43.30% (#3). Plan Mode added March 2026. First-pass correctness ~50-60% (Educative.io).

Public evidence

strong·2026-03

Fortune: OpenAI sees Codex users spike to 1.6 million

Codex surpassed 1.6M weekly active users (tripled since GPT-5.3 launch). Enterprise adopters include Cisco, Nvidia, Ramp, Rakuten.

Fortune feature article, tier-1 publication·Jeremy Kahn (Fortune AI Editor), quotes Codex head at OpenAI

strong·2026-02

Pragmatic Engineer: How Codex is built — deep technical dive

90%+ of Codex code generated by Codex itself. Engineers run 4-8 parallel agents. Written in Rust for performance. 1M+ weekly devs, usage up 5x since Jan 2026.

Pragmatic Engineer newsletter (700K+ subscribers)·Gergely Orosz (former Uber eng manager, top engineering writer)

strong·2025-05

HN: OpenAI Codex CLI launch — 516 points, 289 comments

Massive launch discussion. Top comments noted Claude Code outperformed early Codex on docs tasks. Shows improvement trajectory since launch.

516 points, 289 comments·HN community, developers who tested head-to-head vs Claude Code

strong·2026-02

SmartScope: Codex CLI vs Claude Code 2026 — Opus 4.6 vs GPT-5.3-Codex Benchmark

Codex CLI leads Terminal-Bench 2.0 (62.9% vs Claude Code's 58.0%). Uses 3x fewer tokens. But Claude leads on OSWorld-Verified for GUI/computer-use tasks. Note: the 77.3% figure belongs to the Droid scaffold, not Codex CLI directly.

Independent benchmark comparison blog·SmartScope (independent AI benchmarking blog)

moderate·2025-12

Simon Willison: codex-cli tag — ongoing independent coverage and testing

Ongoing independent coverage: tested porting JustHTML with Codex CLI, documented skills adoption, GPT-5 model improvements.

Simon Willison's blog (massive organic reach in dev community)·Simon Willison (Django co-creator, Datasette creator, independent)

strong·2026-03

Terminal-Bench 2.0: Codex CLI at 77.3% (GPT-5.3-Codex), ForgeCode leads at 81.8%

Codex CLI scores 77.3% with GPT-5.3-Codex on Terminal-Bench 2.0 — strong but no longer #1. ForgeCode (running Opus 4.6) leads at 81.8%. Codex CLI remains the strongest consumer CLI on terminal-native tasks.

Independent benchmark·Terminal-Bench (independent benchmark suite)

strong·2026-03

SWE-bench Pro standardized: Codex CLI at 41.0% (custom scaffold 56.8% not comparable)

Codex CLI scores 41.04% on SWE-bench Pro standardized, trailing Claude Code's 45.89% by ~5pp. The 56.8% score uses custom scaffolding and is not comparable to standardized results.

Authoritative standardized benchmark (SEAL)·SWE-bench team (standardized evaluation)

Raw GitHub source

GitHub README peek

Constrained peek so you can sanity-check the source material without leaving the site.

Codex CLI is a coding agent from OpenAI that runs locally on your computer.

If you want Codex in your code editor (VS Code, Cursor, Windsurf), install in your IDE: https://developers.openai.com/codex/ide

If you want the desktop app experience, run codex app or visit the Codex App page: https://chatgpt.com/codex?app-landing-page=true

If you are looking for the cloud-based agent from OpenAI, Codex Web, go to https://chatgpt.com/codex

Quickstart

Installing and running Codex CLI

Run the following on Mac or Linux to install Codex CLI:

curl -fsSL https://chatgpt.com/codex/install.sh | sh

Run the following on Windows to install Codex CLI:

powershell -ExecutionPolicy ByPass -c "irm https://chatgpt.com/codex/install.ps1 | iex"

Codex CLI can also be installed via the following package managers:

# Install using npm
npm install -g @openai/codex

# Install using Homebrew
brew install --cask codex

Then simply run codex to get started.
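Whichever install route is used, it can help to confirm that the shell can actually find the binary before launching it. The following is a minimal sketch, not part of the README; `check_codex` is a hypothetical helper name, and it relies only on the standard `command -v` lookup rather than on any Codex-specific flag.

```shell
# Hypothetical helper: report whether a `codex` executable is reachable on PATH.
# `command -v` looks a command up without running it.
check_codex() {
  if path=$(command -v codex 2>/dev/null); then
    echo "codex found at $path"
  else
    echo "codex not found on PATH"
    return 1
  fi
}

# Print a hint when the lookup fails.
check_codex || echo "Open a new terminal or re-check the installer output."
```

If the lookup fails right after installing, the usual cause is that the current shell session has not picked up a PATH change yet.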

You can also go to the latest GitHub Release (https://github.com/openai/codex/releases/latest) and download the appropriate binary for your platform.

Each GitHub Release contains many executables, but in practice, you likely want one of these:

macOS
- Apple Silicon/arm64: codex-aarch64-apple-darwin.tar.gz
- x86_64 (older Mac hardware): codex-x86_64-apple-darwin.tar.gz

Linux
- x86_64: codex-x86_64-unknown-linux-musl.tar.gz
- arm64: codex-aarch64-unknown-linux-musl.tar.gz

Each archive contains a single entry with the platform baked into the name (e.g., codex-x86_64-unknown-linux-musl), so you likely want to rename it to codex after extracting it.
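Picking the right archive from the list above comes down to matching the operating system and CPU architecture. The sketch below maps the values printed by `uname -s` and `uname -m` to the four archive names listed; `codex_archive_name` is a hypothetical helper, and the extract-and-rename step in the comments assumes every archive follows the naming pattern of the README's example.

```shell
# Hypothetical helper: map OS name and CPU architecture (as printed by
# `uname -s` and `uname -m`) to the release archive names listed above.
codex_archive_name() {
  os="$1"
  arch="$2"
  case "$os/$arch" in
    Darwin/arm64)              echo "codex-aarch64-apple-darwin.tar.gz" ;;
    Darwin/x86_64)             echo "codex-x86_64-apple-darwin.tar.gz" ;;
    Linux/x86_64)              echo "codex-x86_64-unknown-linux-musl.tar.gz" ;;
    Linux/aarch64|Linux/arm64) echo "codex-aarch64-unknown-linux-musl.tar.gz" ;;
    *) echo "unsupported platform: $os/$arch" >&2; return 1 ;;
  esac
}

# Resolve the archive for the current machine; empty if none is listed.
archive=$(codex_archive_name "$(uname -s)" "$(uname -m)") || archive=""
echo "Archive for this machine: ${archive:-none listed}"

# After downloading "$archive" by hand from the latest release page:
#   tar -xzf "$archive"
#   mv "${archive%.tar.gz}" codex   # assumes the entry is named after the archive
```

Platforms outside this list, including Windows, are not covered by the README excerpt, so the helper reports them as unsupported rather than guessing a name.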

Using Codex with your ChatGPT plan

Run codex and select Sign in with ChatGPT. We recommend signing into your ChatGPT account to use Codex as part of your Plus, Pro, Business, Edu, or Enterprise plan. Learn more about what's included in your ChatGPT plan.

You can also use Codex with an API key, but this requires additional setup.

Docs

Codex Documentation
Contributing
Installing & building
Open source fund

This repository is licensed under the Apache-2.0 License.
