<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: baldai</title><link>https://news.ycombinator.com/user?id=baldai</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Thu, 21 May 2026 17:40:57 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=baldai" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by baldai in "Google officially announces that ads will be included in AI Mode search results"]]></title><description><![CDATA[
<p>I share your feelings.</p>
]]></description><pubDate>Thu, 21 May 2026 11:46:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=48221083</link><dc:creator>baldai</dc:creator><comments>https://news.ycombinator.com/item?id=48221083</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48221083</guid></item><item><title><![CDATA[New comment by baldai in "DeepSeek V4 – almost on the frontier"]]></title><description><![CDATA[
<p>Isnt 5.5 Low just better? Its so fast, needs so little tool calls to get work done.</p>
]]></description><pubDate>Sun, 03 May 2026 14:38:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=47997375</link><dc:creator>baldai</dc:creator><comments>https://news.ycombinator.com/item?id=47997375</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47997375</guid></item><item><title><![CDATA[New comment by baldai in "DeepSeek V4—almost on the frontier"]]></title><description><![CDATA[
<p>Hi, I am happy it works well for you. For me personally I struggle finding good use-cases in general for these OOS models. I am lightly technical but I do not manually code. So my flow is /grill-me (can take hours), make plan, review plan with 2. model, implement, review after implementation.<p>Maybe it is because my tasks are usually chunkier, or because I cant code myself that I struggle using cheaper models. Feels like at every stage of this process SOTA model improves it by 5%, which adds up.<p>But I am maybe ignorant of Opus level. My main driver is 5.5 and Opus is there for  frontend and 2. opinion. In a past I also used Claude models for the chatting phase, but 5.5 took over recently. Maybe Deepseek is closer to Opus and I just overestimated the model compared to 5.5? I tried to give it benefit of being similar.<p>Recently I started experimenting with Deepseek Flash, maybe hoping if plan is solid enough it can implement quickly and cheaply, but for now it feels not worth it.<p>How do you use the model to see the benefits? Have you tried 5.5 and can you compare to that one as well?<p>Thanks.</p>
]]></description><pubDate>Sat, 02 May 2026 14:29:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=47986694</link><dc:creator>baldai</dc:creator><comments>https://news.ycombinator.com/item?id=47986694</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47986694</guid></item><item><title><![CDATA[New comment by baldai in "DeepSeek V4 – almost on the frontier"]]></title><description><![CDATA[
<p>Only similarity it has to Opus 4.6 is the 4 in the name. I do not understand these dishonest comparisons. OOS models are vool, cheap and promising for a future -- but why are we pretending they are better than they are?</p>
]]></description><pubDate>Sat, 02 May 2026 09:01:27 +0000</pubDate><link>https://news.ycombinator.com/item?id=47984698</link><dc:creator>baldai</dc:creator><comments>https://news.ycombinator.com/item?id=47984698</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47984698</guid></item><item><title><![CDATA[New comment by baldai in "Show HN: Stage – Putting humans back in control of code review"]]></title><description><![CDATA[
<p>I was actually recently thinking about similar idea. I am someone who started coding post LLMs and have basic technical understanding. I know what loops, variables, API, backend bla bla is. I learned bunch more since then but I am not capable of making decisions based on git diff alone. And I want to. I want to because I think increasing my skills is still super important, even in AI era. The models are getting better, but are still limited by their core design -- for now it does not seem like they will replace humans.<p>So getting assistance in the review, in making the decisions and giving me more clarity feels interesting.<p>Maybe its people like me, who became involved into coding after the LLMs who might be your niche.<p>One thing I dont understand, the UI/UX? Is this visible only on git itself? Or can I get it working in Codex?</p>
]]></description><pubDate>Fri, 17 Apr 2026 18:35:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=47809102</link><dc:creator>baldai</dc:creator><comments>https://news.ycombinator.com/item?id=47809102</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47809102</guid></item></channel></rss>