How to code Claude Code in 200 lines of code

📝 Discussion Summary (Click to expand)

1. Core Simplicity of Coding Agents

The fundamental agent is just an LLM in a tool-calling loop, implementable in ~200 lines.
"the core of a coding agent is really simple, it's a loop with tool calling." – libraryofbabel
"Agents really are just tools in a loop. It's not rocket science." – nyellin

2. Production Features Enhance Reliability

TODO lists, subagents, and context management prevent early stopping and boost productivity.
"To fix 'early stopping' you need extra features... Claude Code does this with TODOs that are injected back into every prompt." – nyellin
"Subagents are a huge boon to productivity." – prodigycorp

3. Model Improvements Outweigh Harness

Better LLMs drive gains more than complex harnesses; simple agents with strong models compete well.
"model improvements far outweigh any harness or tooling." – aszen
"Simpler agents aren't that far behind in performance if the underlying model is the same." – qsort

4. Practical Alternatives and Open-Source Competition

Users share minimal agents and tools, arguing anyone can rival Claude Code.
"here's my take, in 70 lines of code: [nanoagent.js]" – kirjavascript
"We (the SWE-bench team) have a 100 line of code agent... pretty popular." – ofirpress

🚀 Project Ideas

TraceQuery

Summary

A diagnostic and investigative tool for exploring the execution traces of coding agents (Claude Code, Codex).
It enables developers to "Total Recall" past sessions, identifying why an agent failed, where it hallucinated, or what subagent sidechains were spawned.
Solves the problem of "black box" agent behavior by providing a searchable, queried history of logic and tool calls.

Details

Key	Value
Target Audience	AI Engineers and Power Users of Claude Code/Codex
Core Feature	RAG-based search over local .jsonl session logs with TUI/CLI
Tech Stack	Rust, Tantivy (Search Engine), SQLite
Difficulty	Medium
Monetization	Revenue-ready: $10/mo for advanced analytics/sync.

Notes

Inspired by claude-trace and comments from bredren and d4rkp4ttern who found that "it is pretty incredible what you can get when letting a frontier model RAG your complete CLI convo history."
HN users love understanding the "paperwork around the loop" to debug why an agent "confidently declares victory three steps into a ten-step task."

DoneTools SDK

Summary

A specialized reliability harness designed to stop agentic "early stopping" and "tool-forgetting."
It implements specific bookkeeping tools: a persistent TODO injector, a DoneTool for explicit signaling, and a "Nudge" mechanism for when the LLM outputs plain text instead of a tool call.
Moves beyond the "200-line loop" to handle production-grade persistence and task verification.

Details

Key	Value
Target Audience	Developers building custom agentic workflows
Core Feature	Deterministic task-completion signaling and TODO bookkeeping
Tech Stack	Python, Pydantic, LangChain/Langroid
Difficulty	Medium
Monetization	Hobby (Open Core)

Notes

Directly addresses nyellin’s insight: "To fix 'early stopping' you need... TODOs that are injected back into every prompt."
Solves the "Emperor has no clothes" vs. "Production reality" debate by providing the "load-bearing paperwork" as a library.

Plan-as-a-Folder (PaaF)

Summary

A workflow tool that forces agents to maintain a plans/ directory in a repository containing markdown files of working history.
It bypasses context window limits by treating the filesystem as a "living system prompt" and long-term memory.
Allows for "forking" an agent's task, manual intervention in the plan, and auditing billable hours/outcomes.

Details

Key	Value
Target Audience	Teams using AI for complex, multi-day engineering tasks
Core Feature	Filesystem-based state management for agentic loops
Tech Stack	Python/Node.js, Markdown-diffing
Difficulty	Low
Monetization	Hobby

Notes

Based on lmeyerov’s successful experiment: "all repos get a (.gitignore'd) folder of plans//plan.md work histories... quite helpful... forking/auditing/retrying."
It appeals to the HN preference for "with-the-grain" tools that don't rely on complex database backends.

Headless Checkpoint Editor

Summary

A lightweight, terminal-friendly wrapper for Claude Code that fixes terminal "jank" and cursor issues.
It replaces the default prompt editor with a more robust readline-compliant interface and adds pre-commit hook integration to automatically audit agent output before execution.
Includes a "Belt and Suspenders" mode that automatically appends quality-check instructions (linter, static analysis) to every final step.

Details

Key	Value
Target Audience	Terminal-heavy developers frustrated by current CLI UX
Core Feature	Robust Readline editor and auto-verification hooks
Tech Stack	Go (Bubble Tea / Charmbracelet)
Difficulty	Medium
Monetization	Hobby

Notes

Addresses loeg’s frustration: "Claude code terminal has the jankiest text editor... holding down backspace just moves the cursor."
Incorporates shnpln’s workflow of manual verification: "as your last todo, go over everything you just did... and use a linter."

How to code Claude Code in 200 lines of code

1. Core Simplicity of Coding Agents

2. Production Features Enhance Reliability

3. Model Improvements Outweigh Harness

4. Practical Alternatives and Open-Source Competition

🚀 Project Ideas

TraceQuery

Summary

Details

Notes

DoneTools SDK

Summary

Details

Notes

Plan-as-a-Folder (PaaF)

Summary

Details

Notes

Headless Checkpoint Editor

Summary

Details

Notes

Read Later