Hacker News: bhu8

New comment by bhu8 in "Ask HN: What are you working on? (June 2026)"

bhu8 — Sun, 14 Jun 2026 21:28:27 +0000

Factorio/SimCity like interface for managing multiple agents: https://getviberia.com

It's like the love child of Polytopia and Conductor. As many other agent management platforms/harnesses, Viberia has been building itself, and honestly this has been too much fun to stop.

New comment by bhu8 in "Anthropic's model naming, extrapolated"

bhu8 — Wed, 10 Jun 2026 19:58:41 +0000

I like how the original triplet's initials represent their behavior well:

- Opus is OP, like OverPowered

- Sonnet is SO, like your significant other (this was more meaningful in Sonnet 3.7 days)

- Haiku is HA, like the reaction to a bad joke

The latest model, naturally, needed three letters: FAB.

I'm now looking forward to ABS and LO

New comment by bhu8 in "Open source Kanban desktop app that runs parallel agents on every card"

bhu8 — Sat, 23 May 2026 05:02:08 +0000

Just use direnv? You’ll probably need to adjust the port you are hosting the local page on, but that’s just N=mod(hash based on the worktree name) and then port=default_port+N.

Tell your claude to set this up. Should do it in a single prompt

Show HN: Viberia – Civ/Polytopia-like command center for AI agents (BYOK/BYOS)

bhu8 — Tue, 19 May 2026 08:02:28 +0000

Hey HN,

This is my take on the agent harness. Everything on an isometric map. Agents are grouped into "buildings" that run in a sequence or a loop; e.g., the CodeForge has an agent that writes a PRD, another one that implements, and a third that reviews. Everything is customizable, you build your own buildings/teams however you want.

It's a Tauri app, really light (about 8x less energy than the closest competitor I benchmarked, so it actually runs from a coffee shop on battery). It's macOS only for now, but ping me if you are willing to test the Windows or Linux version.

I've been dogfooding this for months and would love to get some feedback, feature requests, and bug reports so I know what to focus on next.

Comments URL: https://news.ycombinator.com/item?id=48190531

Points: 1

# Comments: 0

New comment by bhu8 in "New Claude Code programmatic usage restrictions"

bhu8 — Wed, 13 May 2026 21:58:26 +0000

It would unfortunately impact it. ACP uses Claude SDK and is developed by a third-party.

New comment by bhu8 in "GPT-5.5"

bhu8 — Thu, 23 Apr 2026 20:53:50 +0000

Gpt-5.3-codex is miles better than 5.4 in that regard. It’s better at orchestration, and does the things that it said it did. Haven’t tested 5.5 yet but using 5.4 for exploration + brainstorming and handing over the findings to 5.3-codex works pretty well

New comment by bhu8 in "Elevated errors on Claude.ai, API, Claude Code"

bhu8 — Wed, 15 Apr 2026 16:42:39 +0000

Feels like an issue in their caching. First non-cached turns are sent properly but everything that is second+ turn fails.

New comment by bhu8 in "Ask HN: What Are You Working On? (April 2026)"

bhu8 — Mon, 13 Apr 2026 16:50:39 +0000

I am working on a (yet another) local app for managing multiple claude/codex/gemini sessions in a game like environment: https://getviberia.com/

New comment by bhu8 in "Show HN: The Lottery of Life"

bhu8 — Wed, 18 Mar 2026 07:19:06 +0000

I think something is broken though. I got 20 nematodes in a row. It's around 1% prob.

New comment by bhu8 in "Show HN: Claude Code skills that build complete Godot games"

bhu8 — Tue, 17 Mar 2026 03:30:47 +0000

Ah thanks, I see. This was 8-9 months ago.

I was starting from scratch and mainly relying on Opus/Sonnet 4.

I also kept running into the Godot 3 vs 4 issue before adding specific guidance about this into CLAUDE.md

New comment by bhu8 in "Show HN: Claude Code skills that build complete Godot games"

bhu8 — Mon, 16 Mar 2026 19:41:00 +0000

Great work but why not use C# instead of GDScript?

LLMs are really good at C# (and tscn files for some reason), so that solves the "LLMs suck at GDScript" problem. Also, C# can be cheaper in terms of token usage (even accounting for not having to load the additional APIs): one agent writes the interfaces, another one fills in the details.

Saying this because I had really enjoyed vibecoding a Godot game in C# - and it was REALLY painful to vibecode with GDScript.

New comment by bhu8 in "Show HN: I taught LLMs to play Magic: The Gathering against each other"

bhu8 — Wed, 18 Feb 2026 06:10:09 +0000

This is amazing. I checked some games and the blunders make me think that the LLMs are not really great at forecasting what happens if they play X on Y.

Can you actually introduce that into the decision making? That is, you would:

1. Have the LLM come up with N many potential actions

2. Run XMage run in parallel and provide the outcome for each different action

3. Revert XMage to the original state

4. Provide the LLM with the different outcomes and have them choose the action/outcome pair rather than just the action

This would actually help them analyze the counterfactual outcomes more effectively and should prevent 99% of the blunders

If you happen to be token rich, you could even do this in a MCTS manner and have them think really deep

New comment by bhu8 in "RTS for Agents"

bhu8 — Wed, 21 Jan 2026 14:42:38 +0000

Very nice! I am working on something very similar at www.viberia.net

My take was that it’s easier to trace who is doing what (and what the agent hierarchy looks like) when agents’ locations are fixed.

Token Laundering

bhu8 — Tue, 16 Dec 2025 14:58:19 +0000

Article URL: https://llemre.com/token-laundering/

Comments URL: https://news.ycombinator.com/item?id=46289310

Points: 1

# Comments: 0

New comment by bhu8 in "Context is the bottleneck for coding agents now"

bhu8 — Fri, 26 Sep 2025 15:53:24 +0000

Noted. Thanks!

New comment by bhu8 in "Context is the bottleneck for coding agents now"

bhu8 — Fri, 26 Sep 2025 15:39:57 +0000

> Opting to introduce them sooner will almost certainly increase the complexity of your codebase prematurely

Agreed, but how else are you going to scale mostly AI written code? Relying mostly on AI agents gives you that organizational complexity.

> Given how long gpt codex 5 has been out, there’s no way you’ve followed these practices for a reasonable enough time to consider them definitive

Yeah, fair. Codex has been out for less than 2 weeks at this point. I was relying on gpt-5 in August and opus before that.

New comment by bhu8 in "Context is the bottleneck for coding agents now"

bhu8 — Fri, 26 Sep 2025 15:32:41 +0000

Not yet unfortunately, but I'm in the process of building one.

This was my journey: I vibe-coded an Electron app and ended up with a terrible monolithic architecture, and mostly badly written code. Then, I took the app's architecture docs and spent a lot of my time shouting "MAKE THIS ARCHITECTURE MORE ORTHOGONAL, SOLID, KISS, DRY" to gpt-5-pro, and ended up with a 1500+ liner monster doc.

I'm now turning this into a Tauri app and following the new architecture to a T. I would say that it is has a pretty clean structure with multiple microservices.

Now, new features are gated based on the architecture doc, so I'm always maintaining a single source of truth that serves as the main context for any new discussions/features. Also, each microservice has its own README file(s) which are updated with each code change.

New comment by bhu8 in "Context is the bottleneck for coding agents now"

bhu8 — Fri, 26 Sep 2025 15:16:59 +0000

IMHO, jumping from Level 2 to Level 5 is a matter of:

- Better structured codebases - we need hierarchical codebases with minimal depth, maximal orthogonality and reasonable width. Think microservices.

- Better documentation - most code documentations are not built to handle updates. We need a proper graph structure with few sources of truth that get propagated downstream. Again, some optimal sort of hierarchy is crucial here.

At this point, I really don't think that we necessarily need better agents.

Setup your codebase optimally, spin up 5-10 instances of gpt-5-codex-high for each issue/feature/refactor (pick the best according to some criteria) and your life will go smoothly

New comment by bhu8 in "Jürgen Schmidhuber：the Father of Generative AI Without Turing Award"

bhu8 — Sat, 21 Jun 2025 11:32:53 +0000

I'm Schmidhuber neutral, but the word on the street is that he is a major asshole and sometimes impossible to work with. His research might be more solid than the Turing award winners but his personality truly kept him behind.

New comment by bhu8 in "AI agents: Less capability, more reliability, please"

bhu8 — Mon, 31 Mar 2025 16:15:56 +0000

I have been thinking about the exact same problem for a while and was literally hours away from publishing a blogpost on the subject.

+100 on the footnote:

> agents or workflows?

Workflows. Workflows, all the way.

The agents can start using these workflows once they are actually ready to execute stuff with high precision. And, by then we would have figured out how to create effective, accurate and easily diagnozable workflows, so people will stop complaining about "I want to know what's going on inside the black box".