Here are the 4 most prevalent themes from the Hacker News discussion, supported by direct quotations.
1. Loss of Reasoning Context Across User Turns
A significant technical constraint discussed is that reasoning tokens are often discarded between user turns in the agent loop, leading to a loss of context and forcing users to manually track progress.
- jumploops: "One thing that surprised me when diving into the Codex internals was that the reasoning tokens persist during the agent tool call loop, but are discarded after every user turn."
- behnamoh: "Codex spends 20m only for it to do something I didn't agree on. It burns OpenAI tokens too; they could save money by supporting this feature!"
- pcwelder: "Sonnet has the same behavior: drops thinking on user message."
2. Lack of Tooling Hooks and User Control
Users frequently complained that Codex lacks essential features like hooks and real-time observability, which are available in competitors like Claude Code. This prevents users from intervening or automating tasks effectively.
- CuriouslyC: "The problem with codex right now is it doesn't have hook support. It's hard to understate how big of a deal hooks are."
- behnamoh: "CC is the clunkiest PoS software I've ever used in terminal; feels like it was vibe coded... but CC currently has features like hooks that codex team has refused to add far too many times."
- behnamoh: "CC, I am shown a nice diff that I can approve/reject. in codex, the AI makes lots of changes but doesn't pin point what changes it's doing or going to make."
3. Performance vs. Accuracy Trade-offs (Speed vs. Thoroughness)
There is a recurring tension between the speed of inference and the thoroughness of the coding tasks. While Codex is praised for its efficiency and completeness in edits, its slowness disrupts the user's "flow state."
- postalcoder: "Codex is wicked efficient with context windows, with the tradeoff of time spent. It hurts the flow state, but overall I've found that it's the best at having long conversations/coding sessions."
- postalcoder: "It tends to properly scope out changes and generate complete edits, whereas I always have to bring Opus around to fix things it didn't fix."
- karmasimida: "Codexβs only caveat is too slow. This is the biggest UX killer, unfortunately."
4. Comparisons of CLI Interfaces and Model Performance
Users actively compare the usability and performance of Codex CLI, Claude Code, and Gemini CLI. While Codex CLI is praised for being open-source, lightweight, and reliable, Claude Code is often criticized for its UI despite the underlying model's strength.
- written-beyond: "The performance of all 3 of them is utter dog shit... I decided to try codex cli... Its performance is quite literally insane, its UX is completely seamless."
- georgeven: "I found codex cli to be significantly better than claude code. It follows instructions and executes the exact change I want without going off on an 'adventure' like Claude code."
- ltbarcly3: "Claude Code is very effective... Codex is trash. It is slow, tends to fail to solve problems, gets stuck in weird places... The codex models are poor."