Hacker News: baldai

New comment by baldai in "Google officially announces that ads will be included in AI Mode search results"

baldai — Thu, 21 May 2026 11:46:19 +0000

I share your feelings.

New comment by baldai in "DeepSeek V4 – almost on the frontier"

baldai — Sun, 03 May 2026 14:38:51 +0000

Isnt 5.5 Low just better? Its so fast, needs so little tool calls to get work done.

New comment by baldai in "DeepSeek V4—almost on the frontier"

baldai — Sat, 02 May 2026 14:29:57 +0000

Hi, I am happy it works well for you. For me personally I struggle finding good use-cases in general for these OOS models. I am lightly technical but I do not manually code. So my flow is /grill-me (can take hours), make plan, review plan with 2. model, implement, review after implementation.

Maybe it is because my tasks are usually chunkier, or because I cant code myself that I struggle using cheaper models. Feels like at every stage of this process SOTA model improves it by 5%, which adds up.

But I am maybe ignorant of Opus level. My main driver is 5.5 and Opus is there for frontend and 2. opinion. In a past I also used Claude models for the chatting phase, but 5.5 took over recently. Maybe Deepseek is closer to Opus and I just overestimated the model compared to 5.5? I tried to give it benefit of being similar.

Recently I started experimenting with Deepseek Flash, maybe hoping if plan is solid enough it can implement quickly and cheaply, but for now it feels not worth it.

How do you use the model to see the benefits? Have you tried 5.5 and can you compare to that one as well?

Thanks.

New comment by baldai in "DeepSeek V4 – almost on the frontier"

baldai — Sat, 02 May 2026 09:01:27 +0000

Only similarity it has to Opus 4.6 is the 4 in the name. I do not understand these dishonest comparisons. OOS models are vool, cheap and promising for a future -- but why are we pretending they are better than they are?

New comment by baldai in "Show HN: Stage – Putting humans back in control of code review"

baldai — Fri, 17 Apr 2026 18:35:51 +0000

I was actually recently thinking about similar idea. I am someone who started coding post LLMs and have basic technical understanding. I know what loops, variables, API, backend bla bla is. I learned bunch more since then but I am not capable of making decisions based on git diff alone. And I want to. I want to because I think increasing my skills is still super important, even in AI era. The models are getting better, but are still limited by their core design -- for now it does not seem like they will replace humans.

So getting assistance in the review, in making the decisions and giving me more clarity feels interesting.

Maybe its people like me, who became involved into coding after the LLMs who might be your niche.

One thing I dont understand, the UI/UX? Is this visible only on git itself? Or can I get it working in Codex?