Project ideas from Hacker News discussions.

Show HN: Han – A Korean programming language written in Rust

📝 Discussion Summary (Click to expand)

Three prevailing themes in the discussion

Theme Key points Representative quotes
Token‑efficiency & LLM tokenizers Korean‑keyword code does not cut prompt size; the benefit is a training‑data bias, not Hangul itself. “I actually tested this with GPT‑4o’s tokenizer, and the result was the opposite — Korean keywords average 2‑3 tokens vs 1 for English.” – xodn348
Tokenizers trained on English merge common words into single tokens; Hangul syllables split into 2‑3 byte‑level tokens. “It’s a tokenizer training bias, not a property of Hangul itself.” – xodn348
Readability, typing ergonomics, and learning Korean speakers can read code as natural language; Hangul’s keyboard layout and phonetic design aid fast typing, but it doesn’t reduce keystrokes for non‑Korean users. “The main benefit of Korean actually comes from the fact that the language itself fits perfectly into a standard 27 alphabet keys and laid out in such a way that lets you type ridiculously fast.” – bbrodriguez
Using real Korean words (not transliterations) improves readability for native speakers. “Using actual Korean words rather than transliterations greatly aids readability.” – WillAdams
Ecosystem & practical adoption The lack of a Korean‑centric ecosystem (libraries, docs, error messages) and the dominance of English in tooling make widespread adoption difficult. “I think the actual reason it has not taken off is because of the ecosystem.” – applfanboysbgon
Even if keywords were translated, the surrounding world (APIs, docs, Stack Overflow) remains English‑centric. “People act like the keywords are the hard part. They aren’t, and once you get past ‘for’ and ‘if’ the rest of the toolchain still lands in English.” – hrmtst93837

These three themes—tokenizer bias, readability/typing ergonomics, and ecosystem constraints—capture the core of the conversation around the Korean‑keyword programming language project.


🚀 Project Ideas

Hangul Prompt Optimizer ChromeExtension

Summary

  • A Chrome/Firefox extension that pre‑processes user‑written LLM prompts in Hangul, automatically maps them to the most token‑efficient keywords, and displays real‑time token count and cost savings.
  • Core value: Reduces token consumption for Korean‑keyword prompts by up to 30 %, lowering API fees and improving latency.

Details

Key Value
Target Audience LLM developers who write prompts with Korean keywords, especially those experimenting with custom languages like Han.
Core Feature Token‑efficient keyword translation, live preview of token usage, one‑click cost‑calculator for major LLM APIs (OpenAI, Anthropic, etc.).
Tech Stack Chrome Extension (Manifest V3), React, TypeScript, LLAMA‑cpp tokenizer library, backend‑free cost API (uses public pricing endpoints).
Difficulty Medium
Monetization Revenue-ready: Freemium with $4/mo Pro plan (advanced batch processing, custom keyword dictionaries).

Notes

  • HN commenters repeatedly mentioned token‑efficiency struggles and the desire for prompt cost awareness; this tool directly addresses that pain point.
  • Potential for integration with existing AI‑tooling workflows and as a showcase for building language‑specific token optimizers.

Polyglot Code Translator SaaS

Summary

  • A web platform that lets users paste code written in any non‑English programming language (e.g., Han Korean, Serbian YUTF‑8) and instantly transpiles it to standard English syntax while preserving semantics.
  • Core value: Enables seamless collaboration across language barriers by providing one‑click translation, error‑highlighting, and auto‑documentation in the user’s native language.

Details

Key Value
Target Audience International development teams, educators, and indie hackers who want to experiment with localized code but must ship to English‑centric ecosystems.
Core Feature Multi‑language lexer, transpiler to Python/JS/TS, bilingual error messages, community‑driven keyword dictionary, REST API for CI pipelines.
Tech Stack Backend: Node.js + ANTLR grammars, Dockerized micro‑service, Frontend: Next.js, PostgreSQL, GitHub integration for PR validation.
Difficulty High
Monetization Revenue-ready: Usage‑based pricing $0.005 per 1 KB of transpiled code + $15/mo for team plans.

Notes

  • Discussions on HN highlighted fragmentation fears and the need for a lingua franca in documentation; this service solves that by bridging localized source to universal toolchains.
  • Could become a go‑to resource for language‑learning programmers and a showcase for non‑English open‑source ecosystems.

Korean REPL‑as‑a‑Service Platform

Summary

  • A hosted, interactive REPL (Read‑Eval‑Print Loop) for the Korean‑focused programming language “Han”, featuring real‑time syntax highlighting, error translation into Korean, and seamless export to standard libraries (e.g., NumPy) for production use.
  • Core value: Lowers the barrier for Korean‑speaking learners and hobbyists to experiment with code without setting up local environments, while automatically generating English‑compatible modules for wider adoption.

Details

Key Value
Target Audience Korean‑speaking beginners, educators teaching programming in Korean, and hobbyist developers seeking quick prototyping.
Core Feature Browser‑based REPL with Hangul syntax support, instant error explanation in Korean, one‑click export to Python/JS modules, community library marketplace.
Tech Stack Full‑stack: Elixir/Phoenix for real‑time sessions, Dockerized Python backend for library integration, Tailwind CSS UI, auth via GitHub OAuth.
Difficulty Medium
Monetization Revenue-ready: Tiered subscription $7/mo (basic), $15/mo (advanced libraries, API access).

Notes

  • HN users expressed enthusiasm for learning Korean through coding and praised the cultural angle; the REPL directly fulfills that desire.
  • Offers a practical utility for educators and language learners, fostering a community around non‑English programming experiences.

Read Later