Key Themes from the HN discussion
-
Caveman mode trades performance for brevity
Shortening output can make the model “dumber.”
“More concise is dumber. Got it.” – taneq
-
Tokens are the currency of reasoning
Models “think” by emitting tokens; low‑entropy tokens convey little new information.
“tokens are units of thinking.” – TeMPOraL
“The LLM has no accessible state beyond its own output tokens; each pass generates a single token and does not otherwise communicate with subsequent passes.” – dTal
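To make the two points above concrete, here is a minimal sketch, not how any real model is implemented. `surprisal_bits` is the standard information-theoretic measure of how much a token tells you given the probability assigned to it; `toy_next_token` and `generate` are hypothetical names, and the toy "model" is a deterministic arithmetic stand-in chosen only so that the loop shows the single thing dTal describes: each pass sees nothing except the tokens emitted so far.

```python
import math
from typing import Callable, List


def surprisal_bits(probability: float) -> float:
    """Information (in bits) carried by a token that was assigned this probability."""
    if not 0.0 < probability <= 1.0:
        raise ValueError("probability must be in (0, 1]")
    return -math.log2(probability)


def toy_next_token(tokens: List[int]) -> int:
    """Hypothetical stand-in for one forward pass.

    It is a pure function of the token sequence: no hidden state survives
    from one call to the next.
    """
    return (sum(tokens) * 31 + len(tokens)) % 50


def generate(
    prompt: List[int],
    steps: int,
    next_token: Callable[[List[int]], int] = toy_next_token,
) -> List[int]:
    """Autoregressive loop: each pass appends exactly one token."""
    tokens = list(prompt)
    for _ in range(steps):
        # The only thing handed to the next pass is the sequence itself.
        tokens.append(next_token(tokens))
    return tokens
```

Because the sequence is the whole state, stopping after two tokens and resuming from that output gives the same result as generating four tokens in one go. A near-certain token (probability close to 1) has surprisal close to 0 bits, which is the sense in which low‑entropy tokens add little.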
-
Concise communication is valued by users
Many participants appreciate fewer fluff words, which saves context and speeds reading.
“It makes my day not to have to read through entire essays about some trivial solution.” – bhwoo48
-
The ~75 % token‑saving claim needs proper validation
The author acknowledges the figure is preliminary and calls for rigorous evaluation.
“The real eval is end‑to‑end: total input tokens, total output tokens, latency, quality/task success.” – author of the skill
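The four quantities named in that quote can be tallied with very little code. The sketch below is one possible shape for such a comparison, not the skill author's harness: `RunResult`, `summarize`, and `compare` are hypothetical names, and it assumes you have already logged per-task token counts, latency, and a pass/fail judgment for both a baseline run and a terse-mode run.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class RunResult:
    """One task attempt (hypothetical record layout)."""
    input_tokens: int
    output_tokens: int
    latency_s: float
    success: bool


def summarize(runs: List[RunResult]) -> Dict[str, float]:
    """Aggregate token totals, mean latency, and success rate."""
    if not runs:
        raise ValueError("need at least one run")
    n = len(runs)
    return {
        "input_tokens": sum(r.input_tokens for r in runs),
        "output_tokens": sum(r.output_tokens for r in runs),
        "mean_latency_s": sum(r.latency_s for r in runs) / n,
        "success_rate": sum(r.success for r in runs) / n,
    }


def compare(baseline: List[RunResult], candidate: List[RunResult]) -> Dict[str, float]:
    """Report savings and deltas of candidate relative to baseline."""
    b, c = summarize(baseline), summarize(candidate)
    if b["output_tokens"] == 0:
        raise ValueError("baseline produced no output tokens")
    total_b = b["input_tokens"] + b["output_tokens"]
    total_c = c["input_tokens"] + c["output_tokens"]
    return {
        # Output-only saving is the headline-style number.
        "output_token_saving": 1 - c["output_tokens"] / b["output_tokens"],
        # Total saving includes input tokens, which brevity does not shrink.
        "total_token_saving": 1 - total_c / total_b,
        "latency_delta_s": c["mean_latency_s"] - b["mean_latency_s"],
        "success_rate_delta": c["success_rate"] - b["success_rate"],
    }
```

Reporting the output-only and total savings side by side matters because input tokens are unaffected by terser replies, so the total saving can be much smaller than the output-only figure. The success-rate delta is what would show whether brevity cost any quality.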