<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: bhu8</title><link>https://news.ycombinator.com/user?id=bhu8</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Tue, 16 Jun 2026 17:20:31 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=bhu8" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by bhu8 in "Ask HN: What are you working on? (June 2026)"]]></title><description><![CDATA[
<p>Factorio/SimCity-like interface for managing multiple agents: <a href="https://getviberia.com" rel="nofollow">https://getviberia.com</a><p>It's like the love child of Polytopia and Conductor. Like many other agent management platforms/harnesses, Viberia has been building itself, and honestly this has been too much fun to stop.</p>
]]></description><pubDate>Sun, 14 Jun 2026 21:28:27 +0000</pubDate><link>https://news.ycombinator.com/item?id=48532958</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=48532958</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48532958</guid></item><item><title><![CDATA[New comment by bhu8 in "Anthropic's model naming, extrapolated"]]></title><description><![CDATA[
<p>I like how the original triplet's initials represent their behavior well:<p>- Opus is OP, like OverPowered<p>- Sonnet is SO, like your significant other (this was more meaningful in Sonnet 3.7 days)<p>- Haiku is HA, like the reaction to a bad joke<p>The latest model, naturally, needed three letters: FAB.<p>I'm now looking forward to ABS and LO</p>
]]></description><pubDate>Wed, 10 Jun 2026 19:58:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=48481766</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=48481766</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48481766</guid></item><item><title><![CDATA[New comment by bhu8 in "Open source Kanban desktop app that runs parallel agents on every card"]]></title><description><![CDATA[
<p>Just use direnv? You’ll probably need to adjust the port you are hosting the local page on, but that’s just N = hash(worktree name) mod some fixed range, and then port = default_port + N.<p>Tell your Claude to set this up. It should do it in a single prompt.</p>
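<p>A minimal sketch of that port formula, under some assumptions that are mine and not part of any tool: the base port (3000), the size of the port range (1000), and the function name <code>worktree_port</code> are all hypothetical. It uses a SHA-256 digest rather than Python's built-in <code>hash()</code>, because the built-in is salted per process for strings and would give a different port on every run.</p>

```python
import hashlib
import sys

DEFAULT_PORT = 3000  # assumed base port of the local dev server
PORT_RANGE = 1000    # assumed number of ports set aside for worktrees


def worktree_port(worktree_name: str,
                  default_port: int = DEFAULT_PORT,
                  port_range: int = PORT_RANGE) -> int:
    """Map a worktree name to a stable port in [default_port, default_port + port_range)."""
    # A cryptographic digest is stable across runs and machines.
    digest = hashlib.sha256(worktree_name.encode("utf-8")).digest()
    # N = hash mod range, then port = default_port + N.
    n = int.from_bytes(digest[:4], "big") % port_range
    return default_port + n


if __name__ == "__main__":
    # Print the port for the worktree name given on the command line.
    name = sys.argv[1] if len(sys.argv) > 1 else "main"
    print(worktree_port(name))
```

<p>Two worktree names can still land on the same port, so a wider range or a collision check may be worth adding. A direnv <code>.envrc</code> could then export the printed value as an environment variable for the dev server to read.</p>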
]]></description><pubDate>Sat, 23 May 2026 05:02:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=48244852</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=48244852</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48244852</guid></item><item><title><![CDATA[Show HN: Viberia – Civ/Polytopia-like command center for AI agents (BYOK/BYOS)]]></title><description><![CDATA[
<p>Hey HN,<p>This is my take on the agent harness. Everything on an isometric map. Agents are grouped into "buildings" that run in a sequence or a loop; e.g., the CodeForge has an agent that writes a PRD, another one that implements, and a third that reviews. Everything is customizable: you build your own buildings/teams however you want.<p>It's a Tauri app, really light (about 8x less energy than the closest competitor I benchmarked, so it actually runs from a coffee shop on battery). It's macOS only for now, but ping me if you are willing to test the Windows or Linux version.<p>I've been dogfooding this for months and would love to get some feedback, feature requests, and bug reports so I know what to focus on next.</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48190531">https://news.ycombinator.com/item?id=48190531</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 19 May 2026 08:02:28 +0000</pubDate><link>https://getviberia.com/</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=48190531</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48190531</guid></item><item><title><![CDATA[New comment by bhu8 in "New Claude Code programmatic usage restrictions"]]></title><description><![CDATA[
<p>It would unfortunately impact it. ACP uses the Claude SDK and is developed by a third party.</p>
]]></description><pubDate>Wed, 13 May 2026 21:58:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=48128159</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=48128159</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48128159</guid></item><item><title><![CDATA[New comment by bhu8 in "GPT-5.5"]]></title><description><![CDATA[
<p>GPT-5.3-codex is miles better than 5.4 in that regard. It’s better at orchestration, and it actually does the things it says it did. Haven’t tested 5.5 yet, but using 5.4 for exploration + brainstorming and handing over the findings to 5.3-codex works pretty well.</p>
]]></description><pubDate>Thu, 23 Apr 2026 20:53:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=47881791</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=47881791</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47881791</guid></item><item><title><![CDATA[New comment by bhu8 in "Elevated errors on Claude.ai, API, Claude Code"]]></title><description><![CDATA[
<p>Feels like an issue in their caching. The first, non-cached turn is sent properly, but every turn after that fails.</p>
]]></description><pubDate>Wed, 15 Apr 2026 16:42:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=47781647</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=47781647</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47781647</guid></item><item><title><![CDATA[New comment by bhu8 in "Ask HN: What Are You Working On? (April 2026)"]]></title><description><![CDATA[
<p>I am working on (yet another) local app for managing multiple Claude/Codex/Gemini sessions in a game-like environment: <a href="https://getviberia.com/" rel="nofollow">https://getviberia.com/</a></p>
]]></description><pubDate>Mon, 13 Apr 2026 16:50:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=47754791</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=47754791</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47754791</guid></item><item><title><![CDATA[New comment by bhu8 in "Show HN: The Lottery of Life"]]></title><description><![CDATA[
<p>I think something is broken though. I got 20 nematodes in a row. That has a probability of around 1%.</p>
]]></description><pubDate>Wed, 18 Mar 2026 07:19:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=47422559</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=47422559</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47422559</guid></item><item><title><![CDATA[New comment by bhu8 in "Show HN: Claude Code skills that build complete Godot games"]]></title><description><![CDATA[
<p>Ah thanks, I see. This was 8-9 months ago.<p>I was starting from scratch and mainly relying on Opus/Sonnet 4.<p>I also kept running into the Godot 3 vs 4 issue before adding specific guidance about this into CLAUDE.md</p>
]]></description><pubDate>Tue, 17 Mar 2026 03:30:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=47408265</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=47408265</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47408265</guid></item><item><title><![CDATA[New comment by bhu8 in "Show HN: Claude Code skills that build complete Godot games"]]></title><description><![CDATA[
<p>Great work, but why not use C# instead of GDScript?<p>LLMs are really good at C# (and tscn files for some reason), so that solves the "LLMs suck at GDScript" problem. Also, C# can be cheaper in terms of token usage (even accounting for not having to load the additional APIs): one agent writes the interfaces, another one fills in the details.<p>Saying this because I really enjoyed vibecoding a Godot game in C# - and it was REALLY painful to vibecode with GDScript.</p>
]]></description><pubDate>Mon, 16 Mar 2026 19:41:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=47403789</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=47403789</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47403789</guid></item><item><title><![CDATA[New comment by bhu8 in "Show HN: I taught LLMs to play Magic: The Gathering against each other"]]></title><description><![CDATA[
<p>This is amazing. I checked some games and the blunders make me think that the LLMs are not really great at forecasting what happens if they play X on Y.<p>Can you actually introduce that into the decision making? That is, you would:<p>1. Have the LLM come up with N potential actions<p>2. Run XMage in parallel and provide the outcome for each different action<p>3. Revert XMage to the original state<p>4. Provide the LLM with the different outcomes and have it choose the action/outcome pair rather than just the action<p>This would actually help them analyze the counterfactual outcomes more effectively and should prevent 99% of the blunders.<p>If you happen to be token rich, you could even do this in an MCTS manner and have them think really deep.</p>
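<p>The four steps above could be sketched roughly as follows. This is only an illustration under assumptions: <code>choose_with_lookahead</code>, <code>propose</code>, <code>simulate</code> and <code>choose</code> are hypothetical names, not XMage or LLM APIs, and the toy functions stand in for the model and the game engine.</p>

```python
from typing import Callable, List, Tuple, TypeVar

S = TypeVar("S")  # game state
A = TypeVar("A")  # action


def choose_with_lookahead(
    state: S,
    propose: Callable[[S, int], List[A]],
    simulate: Callable[[S, A], S],
    choose: Callable[[S, List[Tuple[A, S]]], A],
    n: int = 3,
) -> A:
    """Pick an action after seeing the simulated outcome of each candidate."""
    # Step 1: the model proposes up to n candidate actions.
    candidates = propose(state, n)
    # Steps 2-3: simulate every candidate from the same starting state.
    # simulate() must not mutate `state`; that stands in for reverting the engine.
    outcomes = [(action, simulate(state, action)) for action in candidates]
    # Step 4: the model chooses among (action, outcome) pairs.
    return choose(state, outcomes)


# Toy stand-ins: the state is the opponent's life total, an action is a damage amount.
def toy_propose(state: int, n: int) -> List[int]:
    return [1, 3, 2][:n]


def toy_simulate(state: int, damage: int) -> int:
    return state - damage


def toy_choose(state: int, outcomes: List[Tuple[int, int]]) -> int:
    # Prefer the action whose outcome leaves the opponent with the least life.
    return min(outcomes, key=lambda pair: pair[1])[0]


best = choose_with_lookahead(20, toy_propose, toy_simulate, toy_choose)
```

<p>In a real setup, <code>propose</code> and <code>choose</code> would be LLM calls and <code>simulate</code> would run the game engine on a copy of the state. The MCTS variant would apply the same loop recursively to the simulated states.</p>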
]]></description><pubDate>Wed, 18 Feb 2026 06:10:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=47057759</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=47057759</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47057759</guid></item><item><title><![CDATA[New comment by bhu8 in "RTS for Agents"]]></title><description><![CDATA[
<p>Very nice! I am working on something very similar at www.viberia.net<p>My take was that it’s easier to trace who is doing what (and what the agent hierarchy looks like) when agents’ locations are fixed.</p>
]]></description><pubDate>Wed, 21 Jan 2026 14:42:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=46706370</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=46706370</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46706370</guid></item><item><title><![CDATA[Token Laundering]]></title><description><![CDATA[
<p>Article URL: <a href="https://llemre.com/token-laundering/">https://llemre.com/token-laundering/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46289310">https://news.ycombinator.com/item?id=46289310</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 16 Dec 2025 14:58:19 +0000</pubDate><link>https://llemre.com/token-laundering/</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=46289310</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46289310</guid></item><item><title><![CDATA[New comment by bhu8 in "Context is the bottleneck for coding agents now"]]></title><description><![CDATA[
<p>Noted. Thanks!</p>
]]></description><pubDate>Fri, 26 Sep 2025 15:53:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=45387921</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=45387921</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45387921</guid></item><item><title><![CDATA[New comment by bhu8 in "Context is the bottleneck for coding agents now"]]></title><description><![CDATA[
<p>> Opting to introduce them sooner will almost certainly increase the complexity of your codebase prematurely<p>Agreed, but how else are you going to scale mostly AI-written code? Relying mostly on AI agents gives you that organizational complexity.<p>> Given how long gpt codex 5 has been out, there’s no way you’ve followed these practices for a reasonable enough time to consider them definitive<p>Yeah, fair. Codex has been out for less than 2 weeks at this point. I was relying on gpt-5 in August and opus before that.</p>
]]></description><pubDate>Fri, 26 Sep 2025 15:39:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=45387747</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=45387747</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45387747</guid></item><item><title><![CDATA[New comment by bhu8 in "Context is the bottleneck for coding agents now"]]></title><description><![CDATA[
<p>Not yet unfortunately, but I'm in the process of building one.<p>This was my journey: I vibe-coded an Electron app and ended up with a terrible monolithic architecture and mostly badly written code. Then, I took the app's architecture docs and spent a lot of my time shouting "MAKE THIS ARCHITECTURE MORE ORTHOGONAL, SOLID, KISS, DRY" to gpt-5-pro, and ended up with a 1500+ line monster doc.<p>I'm now turning this into a Tauri app and following the new architecture to a T. I would say that it has a pretty clean structure with multiple microservices.<p>Now, new features are gated based on the architecture doc, so I'm always maintaining a single source of truth that serves as the main context for any new discussions/features. Also, each microservice has its own README file(s), which are updated with each code change.</p>
]]></description><pubDate>Fri, 26 Sep 2025 15:32:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=45387678</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=45387678</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45387678</guid></item><item><title><![CDATA[New comment by bhu8 in "Context is the bottleneck for coding agents now"]]></title><description><![CDATA[
<p>IMHO, jumping from Level 2 to Level 5 is a matter of:<p>- Better structured codebases - we need hierarchical codebases with minimal depth, maximal orthogonality and reasonable width. Think microservices.<p>- Better documentation - most code documentation is not built to handle updates. We need a proper graph structure with few sources of truth that get propagated downstream. Again, some optimal sort of hierarchy is crucial here.<p>At this point, I really don't think that we necessarily need better agents.<p>Set up your codebase optimally, spin up 5-10 instances of gpt-5-codex-high for each issue/feature/refactor (pick the best according to some criteria), and your life will go smoothly.</p>
]]></description><pubDate>Fri, 26 Sep 2025 15:16:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=45387487</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=45387487</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45387487</guid></item><item><title><![CDATA[New comment by bhu8 in "Jürgen Schmidhuber：the Father of Generative AI Without Turing Award"]]></title><description><![CDATA[
<p>I'm Schmidhuber neutral, but the word on the street is that he is a major asshole and sometimes impossible to work with. His research might be more solid than the Turing award winners but his personality truly kept him behind.</p>
]]></description><pubDate>Sat, 21 Jun 2025 11:32:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=44336688</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=44336688</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44336688</guid></item><item><title><![CDATA[New comment by bhu8 in "AI agents: Less capability, more reliability, please"]]></title><description><![CDATA[
<p>I have been thinking about the exact same problem for a while and was literally hours away from publishing a blogpost on the subject.<p>+100 on the footnote:<p>> agents or workflows?<p>Workflows. Workflows, all the way.<p>The agents can start using these workflows once they are actually ready to execute stuff with high precision. And by then, we will have figured out how to create effective, accurate and easily diagnosable workflows, so people will stop complaining about "I want to know what's going on inside the black box".</p>
]]></description><pubDate>Mon, 31 Mar 2025 16:15:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=43536700</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=43536700</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43536700</guid></item></channel></rss>