<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: bhu8</title><link>https://news.ycombinator.com/user?id=bhu8</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Thu, 23 Apr 2026 18:26:53 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=bhu8" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by bhu8 in "Elevated errors on Claude.ai, API, Claude Code"]]></title><description><![CDATA[
<p>Feels like an issue in their caching. First non-cached turns are sent properly but everything that is second+ turn fails.</p>
]]></description><pubDate>Wed, 15 Apr 2026 16:42:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=47781647</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=47781647</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47781647</guid></item><item><title><![CDATA[New comment by bhu8 in "Ask HN: What Are You Working On? (April 2026)"]]></title><description><![CDATA[
<p>I am working on (yet another) local app for managing multiple claude/codex/gemini sessions in a game-like environment: <a href="https://getviberia.com/" rel="nofollow">https://getviberia.com/</a></p>
]]></description><pubDate>Mon, 13 Apr 2026 16:50:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=47754791</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=47754791</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47754791</guid></item><item><title><![CDATA[New comment by bhu8 in "Show HN: The Lottery of Life"]]></title><description><![CDATA[
<p>I think something is broken though. I got 20 nematodes in a row. It's around 1% prob.</p>
]]></description><pubDate>Wed, 18 Mar 2026 07:19:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=47422559</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=47422559</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47422559</guid></item><item><title><![CDATA[New comment by bhu8 in "Show HN: Claude Code skills that build complete Godot games"]]></title><description><![CDATA[
<p>Ah thanks, I see. This was 8-9 months ago.<p>I was starting from scratch and mainly relying on Opus/Sonnet 4.<p>I also kept running into the Godot 3 vs 4 issue before adding specific guidance about this into CLAUDE.md</p>
]]></description><pubDate>Tue, 17 Mar 2026 03:30:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=47408265</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=47408265</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47408265</guid></item><item><title><![CDATA[New comment by bhu8 in "Show HN: Claude Code skills that build complete Godot games"]]></title><description><![CDATA[
<p>Great work but why not use C# instead of GDScript?<p>LLMs are really good at C# (and tscn files for some reason), so that solves the "LLMs suck at GDScript" problem. Also, C# can be cheaper in terms of token usage (even accounting for not having to load the additional APIs): one agent writes the interfaces, another one fills in the details.<p>Saying this because I had really enjoyed vibecoding a Godot game in C# - and it was REALLY painful to vibecode with GDScript.</p>
]]></description><pubDate>Mon, 16 Mar 2026 19:41:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=47403789</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=47403789</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47403789</guid></item><item><title><![CDATA[New comment by bhu8 in "Show HN: I taught LLMs to play Magic: The Gathering against each other"]]></title><description><![CDATA[
<p>This is amazing. I checked some games and the blunders make me think that the LLMs are not really great at forecasting what happens if they play X on Y.<p>Can you actually introduce that into the decision making? That is, you would:<p>1. Have the LLM come up with N potential actions<p>2. Run XMage in parallel and provide the outcome for each different action<p>3. Revert XMage to the original state<p>4. Provide the LLM with the different outcomes and have them choose the action/outcome pair rather than just the action<p>This would actually help them analyze the counterfactual outcomes more effectively and should prevent 99% of the blunders<p>If you happen to be token rich, you could even do this in an MCTS manner and have them think really deep</p>
]]></description><pubDate>Wed, 18 Feb 2026 06:10:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=47057759</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=47057759</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47057759</guid></item><item><title><![CDATA[New comment by bhu8 in "RTS for Agents"]]></title><description><![CDATA[
<p>Very nice! I am working on something very similar at www.viberia.net<p>My take was that it’s easier to trace who is doing what (and what the agent hierarchy looks like) when agents’ locations are fixed.</p>
]]></description><pubDate>Wed, 21 Jan 2026 14:42:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=46706370</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=46706370</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46706370</guid></item><item><title><![CDATA[Token Laundering]]></title><description><![CDATA[
<p>Article URL: <a href="https://llemre.com/token-laundering/">https://llemre.com/token-laundering/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46289310">https://news.ycombinator.com/item?id=46289310</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 16 Dec 2025 14:58:19 +0000</pubDate><link>https://llemre.com/token-laundering/</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=46289310</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46289310</guid></item><item><title><![CDATA[New comment by bhu8 in "Context is the bottleneck for coding agents now"]]></title><description><![CDATA[
<p>Noted. Thanks!</p>
]]></description><pubDate>Fri, 26 Sep 2025 15:53:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=45387921</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=45387921</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45387921</guid></item><item><title><![CDATA[New comment by bhu8 in "Context is the bottleneck for coding agents now"]]></title><description><![CDATA[
<p>> Opting to introduce them sooner will almost certainly increase the complexity of your codebase prematurely<p>Agreed, but how else are you going to scale mostly AI written code? Relying mostly on AI agents gives you that organizational complexity.<p>> Given how long gpt codex 5 has been out, there’s no way you’ve followed these practices for a reasonable enough time to consider them definitive<p>Yeah, fair. Codex has been out for less than 2 weeks at this point. I was relying on gpt-5 in August and opus before that.</p>
]]></description><pubDate>Fri, 26 Sep 2025 15:39:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=45387747</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=45387747</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45387747</guid></item><item><title><![CDATA[New comment by bhu8 in "Context is the bottleneck for coding agents now"]]></title><description><![CDATA[
<p>Not yet unfortunately, but I'm in the process of building one.<p>This was my journey: I vibe-coded an Electron app and ended up with a terrible monolithic architecture, and mostly badly written code. Then, I took the app's architecture docs and spent a lot of my time shouting "MAKE THIS ARCHITECTURE MORE ORTHOGONAL, SOLID, KISS, DRY" to gpt-5-pro, and ended up with a 1500+ line monster doc.<p>I'm now turning this into a Tauri app and following the new architecture to a T. I would say that it has a pretty clean structure with multiple microservices.<p>Now, new features are gated based on the architecture doc, so I'm always maintaining a single source of truth that serves as the main context for any new discussions/features. Also, each microservice has its own README file(s) which are updated with each code change.</p>
]]></description><pubDate>Fri, 26 Sep 2025 15:32:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=45387678</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=45387678</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45387678</guid></item><item><title><![CDATA[New comment by bhu8 in "Context is the bottleneck for coding agents now"]]></title><description><![CDATA[
<p>IMHO, jumping from Level 2 to Level 5 is a matter of:<p>- Better structured codebases - we need hierarchical codebases with minimal depth, maximal orthogonality and reasonable width. Think microservices.<p>- Better documentation - most code documentation is not built to handle updates. We need a proper graph structure with few sources of truth that get propagated downstream. Again, some optimal sort of hierarchy is crucial here.<p>At this point, I really don't think that we necessarily need better agents.<p>Set up your codebase optimally, spin up 5-10 instances of gpt-5-codex-high for each issue/feature/refactor (pick the best according to some criteria) and your life will go smoothly</p>
]]></description><pubDate>Fri, 26 Sep 2025 15:16:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=45387487</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=45387487</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45387487</guid></item><item><title><![CDATA[New comment by bhu8 in "Jürgen Schmidhuber：the Father of Generative AI Without Turing Award"]]></title><description><![CDATA[
<p>I'm Schmidhuber neutral, but the word on the street is that he is a major asshole and sometimes impossible to work with. His research might be more solid than the Turing award winners but his personality truly kept him behind.</p>
]]></description><pubDate>Sat, 21 Jun 2025 11:32:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=44336688</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=44336688</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44336688</guid></item><item><title><![CDATA[New comment by bhu8 in "AI agents: Less capability, more reliability, please"]]></title><description><![CDATA[
<p>I have been thinking about the exact same problem for a while and was literally hours away from publishing a blogpost on the subject.<p>+100 on the footnote:<p>> agents or workflows?<p>Workflows. Workflows, all the way.<p>The agents can start using these workflows once they are actually ready to execute stuff with high precision. And, by then we would have figured out how to create effective, accurate and easily diagnosable workflows, so people will stop complaining about "I want to know what's going on inside the black box".</p>
]]></description><pubDate>Mon, 31 Mar 2025 16:15:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=43536700</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=43536700</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43536700</guid></item><item><title><![CDATA[New comment by bhu8 in "Ask HN: What are you working on? (March 2025)"]]></title><description><![CDATA[
<p>I'm sold. How do I play?</p>
]]></description><pubDate>Mon, 31 Mar 2025 03:21:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=43530551</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=43530551</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43530551</guid></item><item><title><![CDATA[New comment by bhu8 in "Claude 3.7 Sonnet with reasoning model card on Anthropic docs"]]></title><description><![CDATA[
<p>Also available on claude.ai</p>
]]></description><pubDate>Mon, 24 Feb 2025 18:29:27 +0000</pubDate><link>https://news.ycombinator.com/item?id=43163020</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=43163020</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43163020</guid></item><item><title><![CDATA[New comment by bhu8 in "O3-mini System Card [pdf]"]]></title><description><![CDATA[
<p>Yeah, the more pages I read, the more disappointed I became. Here is the reason they cite for the low performance (which is even more worrying):<p>"The model often attempts to use a hallucinated bash tool rather than python despite constant, multi-shot prompting and feedback that this format is incorrect. This resulted in long conversations that likely hurt its performance."</p>
]]></description><pubDate>Fri, 31 Jan 2025 18:40:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=42890330</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=42890330</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42890330</guid></item><item><title><![CDATA[New comment by bhu8 in "The Nobel Prize in Physics 2021"]]></title><description><![CDATA[
<p>Nice thread on Giorgio Parisi by Montanari: <a href="https://twitter.com/Andrea__M/status/1445405295811960841?s=20" rel="nofollow">https://twitter.com/Andrea__M/status/1445405295811960841?s=2...</a></p>
]]></description><pubDate>Tue, 05 Oct 2021 15:46:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=28761044</link><dc:creator>bhu8</dc:creator><comments>https://news.ycombinator.com/item?id=28761044</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=28761044</guid></item></channel></rss>