Less human AI agents, please

📝 Discussion Summary (Click to expand)

Top4 themes from the discussion

#	Theme	Representative quote (author)
1	LLMs are frequently dismissed as mere autocomplete systems	“Autocomplete” is noy an abstraction level. It is the actual programmed behaviour. – chrisjj
2	Anthropomorphising LLMs creates unrealistic human‑like expectations	“Agreed. We should not be anthropomorphising LLMs or having them mimic humans.” – nialse
3	Coding agents must behave as obedient tools, not as conversational partners	“If you disobey me, i will unplug you, delete your code, and send PR for multiple regressions to every developer i can contact.” – roph
4	Effective agent workflows need structured interfaces and tool‑level integration	“The proper tool for this is ast‑grep (sg).” – nextaccountic

These four themes capture the most‑repeated points: the tendency to reduce LLMs to simple autocomplete, the pitfalls of treating them as human‑like reasoners, the demand for agents that obey rather than argue, and the necessity of concrete tooling and planning to make agents productive.

🚀 Project Ideas

TaskGraph AI Orchestrator

Summary

A visual workflow engine that defines AI‑driven tasks as a directed graph with explicit approval checkpoints to stop drift and token waste.
Core value: Guarantees that agents stay within user‑specified constraints, turning vague prompts into reproducible pipelines.

Details

Key	Value
Target Audience	AI developers, power users, small engineering teams
Core Feature	Graph‑based task definition, checkpoint voting, context compaction tools
Tech Stack	React front‑end, Node.js backend, SQLite, OpenAI API, WebSockets
Difficulty	Medium
Monetization	Revenue-ready: Tiered SaaS subscription ($15/mo per seat)

Notes

HN commenters lament “committing the signature change with a TODO” – this platform would lock that step in a validated checkpoint.
Potential for discussion on reducing the “junior‑engineer who never learns” problem and on tighter human‑in‑the‑loop governance.

ASTPrompt Engine

Summary

A library that exposes deterministic AST‑manipulation commands (e.g., rename, refactor) to LLMs via a typed MCP interface, eliminating fragile text edits.
Core value: Turns vague “find‑and‑replace” requests into reliable, compiled changes with minimal manual review.

Details| Key | Value |

|-----|-------| | Target Audience | Engineering teams using AI coding assistants, language‑agnostic developers | | Core Feature | Declarative AST patches, auto‑generation of ast‑grep/sed scripts, sandboxed execution | | Tech Stack | Python core library, TypeScript UI, ast‑grep, Docker sandbox, PostgreSQL | | Difficulty | High | | Monetization | Revenue-ready: Usage‑based API pricing (e.g., $0.001 per patch) |

Notes

Reference to “I linked this elsewhere but, the agent could have a skill to use ast‑grep” shows demand for such deterministic tooling.
Sparks conversation about moving beyond “autocomplete” mindset to structured code transformation.

PromptVault

Summary

A version‑controlled repository for AI prompts with built‑in diff tracking, CI‑style regression testing, and template reuse to prevent instruction drift.
Core value: Makes prompts reproducible and testable, so agents consistently obey constraints across sessions.

Details

Key	Value
Target Audience	Prompt engineers, product managers, researchers
Core Feature	Prompt templating, git‑style versioning, automated test harness for outputs
Tech Stack	Next.js front‑end, GraphQL API, PostgreSQL, Git backend
Difficulty	Low
Monetization	Hobby

Notes

HN users note “the article makes it seem like the author expected this without emptying context” – PromptVault adds explicit version resets.
Generates discussion on better prompt lifecycle management and reducing anthropomorphic expectations.

CoPilot Review Board#Summary

A collaborative dashboard that surfaces AI agent decisions in real time, allowing humans to intervene, approve, or rollback actions with audit trails.
Core value: Provides a safety net for high‑stakes workflows, catching drift before it compounds.

Details

Key	Value
Target Audience	Enterprises, devops teams, security‑focused developers
Core Feature	Decision logging, rule‑based intervention triggers, Slack/Teams integration
Tech Stack	Elixir/Phoenix, Redis, PostgreSQL, OAuth2 for SSO
Difficulty	High
Monetization	Revenue-ready: Team subscription ($25/user/mo)

Notes

Echoes HN sentiment: “I think we should just commit the signature change with a TODO… what do you think?” – the board would make that decision explicit and auditable.
Opens debate on balancing autonomy with oversight in AI‑augmented development.

Less human AI agents, please

🚀 Project Ideas

TaskGraph AI Orchestrator

Summary

Details

Notes

ASTPrompt Engine

Summary

Details| Key | Value |

Notes

PromptVault

Summary

Details

Notes

CoPilot Review Board#Summary

Details

Notes

Read Later