<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: Hfuffzehn</title><link>https://news.ycombinator.com/user?id=Hfuffzehn</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Wed, 10 Jun 2026 09:38:30 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=Hfuffzehn" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by Hfuffzehn in "German ruling declares Google liable for false answers in AI Overviews"]]></title><description><![CDATA[
<p>Yes, the monopoly is not relevant for the court.<p>It is relevant for Google though, because they want to transfer it to another product.<p>And the court is saying that whatever that new product is, Google is not allowed to mislead the public by pretending it is search.</p>
]]></description><pubDate>Wed, 10 Jun 2026 08:40:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=48473321</link><dc:creator>Hfuffzehn</dc:creator><comments>https://news.ycombinator.com/item?id=48473321</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48473321</guid></item><item><title><![CDATA[New comment by Hfuffzehn in "German ruling declares Google liable for false answers in AI Overviews"]]></title><description><![CDATA[
<p>If I get it correctly I like the ruling.<p>So Google has established a product called Search.
For that product rules have been established.
Google has monopolized that product.<p>Now Google is replacing that product with a new product.
But they keep calling it the same thing.
Because they want to keep their monopoly.<p>That is what has been deemed illegal.
Gemini is not illegal. Pretending the worst version of Gemini is Search is illegal, because it breaks the rules established for Search.<p>But IANAL.</p>
]]></description><pubDate>Wed, 10 Jun 2026 06:53:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=48472433</link><dc:creator>Hfuffzehn</dc:creator><comments>https://news.ycombinator.com/item?id=48472433</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48472433</guid></item><item><title><![CDATA[New comment by Hfuffzehn in "MAI-Code-1-Flash"]]></title><description><![CDATA[
<p>And as I am on holiday today I will try to help them out:<p><pre><code>                   GPT-5.4 mini Haiku 4.5 MAI-Code
</code></pre>
SWE-Bench Pro            54.4 %     35.2%    51.2%<p>Terminal-Bench 2.0       60.0 %     41.6%    54.8%<p>Source: <a href="https://openai.com/index/introducing-gpt-5-4-mini-and-nano/" rel="nofollow">https://openai.com/index/introducing-gpt-5-4-mini-and-nano/</a></p>
]]></description><pubDate>Wed, 03 Jun 2026 13:01:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=48383445</link><dc:creator>Hfuffzehn</dc:creator><comments>https://news.ycombinator.com/item?id=48383445</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48383445</guid></item><item><title><![CDATA[New comment by Hfuffzehn in "MAI-Code-1-Flash"]]></title><description><![CDATA[
<p>So I guess the important link the marketing department forgot is this one: <a href="https://docs.github.com/en/copilot/reference/copilot-billing/models-and-pricing#microsoft" rel="nofollow">https://docs.github.com/en/copilot/reference/copilot-billing...</a><p>Model Input Cached input Output<p>MAI-Code-1-Flash $0.75 $0.075 $4.50<p>Comparing to<p>Claude Haiku 4.5 $1.00 $0.10 $5.00<p>looks fine.<p>But they also forgot to include the benchmarks comparing to<p>GPT-5.4 mini $0.75 $0.075 $4.50<p>Those would have been helpful.</p>
]]></description><pubDate>Wed, 03 Jun 2026 12:43:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=48383228</link><dc:creator>Hfuffzehn</dc:creator><comments>https://news.ycombinator.com/item?id=48383228</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48383228</guid></item><item><title><![CDATA[New comment by Hfuffzehn in "AI outperforms law professors in Stanford Law study"]]></title><description><![CDATA[
<p>This is a very good comment.
But notice how even in software engineering there is still disagreement about these structural safeguards.<p>So yes, we can say the LLM created bad code when it does not compile or fails prewritten tests.<p>But experts might disagree what good comments, good cohesion, appropriate use of design patterns, appropriate test coverage or clear variable names are.<p>So what are we suppossed to train the LLMs towards?
Somebody still has to decide what "good" is.</p>
]]></description><pubDate>Wed, 03 Jun 2026 12:28:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=48383089</link><dc:creator>Hfuffzehn</dc:creator><comments>https://news.ycombinator.com/item?id=48383089</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48383089</guid></item><item><title><![CDATA[New comment by Hfuffzehn in "AI outperforms law professors in Stanford Law study"]]></title><description><![CDATA[
<p>I agree.
But notice that you assume that there is a metric with which you can messure improvement.
Which is fine if you are measuring against your personal taste.<p>But it might be that the optimization target itself has a ceiling. If you're training toward human approval ratings from a broad population, you converge toward what median preference selects for. The plateau is baked into what you're measuring against.</p>
]]></description><pubDate>Wed, 03 Jun 2026 11:52:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=48382796</link><dc:creator>Hfuffzehn</dc:creator><comments>https://news.ycombinator.com/item?id=48382796</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48382796</guid></item><item><title><![CDATA[New comment by Hfuffzehn in "MAI-Code-1-Flash"]]></title><description><![CDATA[
<p><a href="https://docs.github.com/en/copilot/reference/copilot-billing/models-and-pricing#microsoft" rel="nofollow">https://docs.github.com/en/copilot/reference/copilot-billing...</a><p>Model            Input  Cached input Output
MAI-Code-1-Flash $0.75  $0.075       $4.50</p>
]]></description><pubDate>Wed, 03 Jun 2026 11:07:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=48382401</link><dc:creator>Hfuffzehn</dc:creator><comments>https://news.ycombinator.com/item?id=48382401</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48382401</guid></item><item><title><![CDATA[New comment by Hfuffzehn in "MAI-Code-1-Flash"]]></title><description><![CDATA[
<p>The first time I was impressed by AI coding was when I pointed it at some switch case monster code and told it to replace it with a strategy pattern.<p>And it did just fine.<p>So no matter what you think about vibe coding, using AI for these slightly more complicated use cases is genuinely useful.</p>
]]></description><pubDate>Wed, 03 Jun 2026 11:02:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=48382362</link><dc:creator>Hfuffzehn</dc:creator><comments>https://news.ycombinator.com/item?id=48382362</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48382362</guid></item><item><title><![CDATA[New comment by Hfuffzehn in "MAI-Code-1-Flash"]]></title><description><![CDATA[
<p>Agreed.
Seems like this could have been a nice model if we would still be in the old GitHub Copilot free request/ premium multiplier mode.
It could have been a good compromise to somehow reign in the costs for Microsoft.<p>But with Copilot now just being paying per-token prices I don't see how this is competitive with Chinese models.<p>It is probably telling you can't find the costs in the announcement.
Because
Input $0.75 Cached input $0.075 Output $4.50
might be competitive with Haiku, but nobody in their right mind uses Haiku and Anthropic has abandoned it chasing the tokenmaxers who aren't thinking about budgets.<p>So I guess they are aiming for corporate customers that are bound to Microsoft through compliance approval that will soon start seeing their budgets explode that have to find some corporate compromise.</p>
]]></description><pubDate>Wed, 03 Jun 2026 09:57:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=48382004</link><dc:creator>Hfuffzehn</dc:creator><comments>https://news.ycombinator.com/item?id=48382004</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48382004</guid></item><item><title><![CDATA[New comment by Hfuffzehn in "Real-time LLM Inference on Standard GPUs: 3k tokens/s per request"]]></title><description><![CDATA[
<p>That's really nice of them.<p>That means Jensen can add another 30 times faster when comparing Rubin to Blackwell without having to actually do anything.<p>Hopefully that means he won't have any problem to make another 150 billion in profit in the next year.<p>Sorry for the sarcasm. Looks like interesting work.</p>
]]></description><pubDate>Fri, 29 May 2026 12:56:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=48322499</link><dc:creator>Hfuffzehn</dc:creator><comments>https://news.ycombinator.com/item?id=48322499</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48322499</guid></item><item><title><![CDATA[New comment by Hfuffzehn in "Various LLM Smells"]]></title><description><![CDATA[
<p>The interesting thing for me is that I do not feel like the writing of LLMs has improved very much lately stylistically.
They have reached a "good" level some time ago but the newer models havn't brought such improvements that you would prefer them to an expert human writer.<p>Will be interesting if that holds in other areas when chasing super intelligence.</p>
]]></description><pubDate>Thu, 28 May 2026 22:02:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=48316145</link><dc:creator>Hfuffzehn</dc:creator><comments>https://news.ycombinator.com/item?id=48316145</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48316145</guid></item><item><title><![CDATA[New comment by Hfuffzehn in "Using AI to write better code more slowly"]]></title><description><![CDATA[
<p>The main insight here I think is that LLMs are great tools for iterative development and iterative problem solving in general.<p>You can very effectivly iterate alone using the LLM as a mirror, rephrasing what you put in and adding a bit.<p>You can use LLMs to quickly create prototypes to give to other human beings to help you with the next iteration.<p>If you get something from someone else to iterate on you can use the LLM to help you with understanding to rephrase things in a way more suitable for your understanding.<p>But instead everything anybody seems to be talking about seems to be one shoting things and AI iterating with other AI.<p>The big problem here is that the one thing AI does not have is agency.
The naming AI agent is wishful thinking and marketing.</p>
]]></description><pubDate>Tue, 26 May 2026 06:48:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=48275988</link><dc:creator>Hfuffzehn</dc:creator><comments>https://news.ycombinator.com/item?id=48275988</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48275988</guid></item><item><title><![CDATA[New comment by Hfuffzehn in "Does anybody like React?"]]></title><description><![CDATA[
<p>I haven't really deeply thought about frontend JS for many years.<p>Back then the question we were looking at was whether it would be good idea to move away from SAP UI5.
The alternatives back then where React, Angular and Vue.<p>The conclusion we came to was that it was definitely worth to migrate, but to what was not so easy to agree on.<p>Right now I am working with a legacy Java codebase that was based on RxJava.
And every single day I am cursing the people that made that decision.
It seems so obviously a bad idea.
And the only thing that lets me keep my sanity is remembering that every decision only becomes obvious with hindsight.<p>So I guess the only thing I can contribute is that it could always be worse and sometimes making the bold and seemingly innovative decision comes back many years later to bite other people.</p>
]]></description><pubDate>Tue, 26 May 2026 05:04:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=48275221</link><dc:creator>Hfuffzehn</dc:creator><comments>https://news.ycombinator.com/item?id=48275221</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48275221</guid></item><item><title><![CDATA[New comment by Hfuffzehn in "DeepSeek reasonix, DeepSeek native coding agent with high caching and low cost"]]></title><description><![CDATA[
<p>This is really tickling the conspiracy theorist part of my brain.<p>"Independent open-source project · not affiliated with DeepSeek"
"Reasonix only targets DeepSeek because..."
"Why DeepSeek only? Can I swap to Claude / GPT? It's a design choice, not a limitation"<p>The lady doth protest too much, methinks?<p>Nicely timed shortly after the making the rebate permanent anouncement.<p>Could just be Chinese devs trying to help western devs with some software and a western facing marketing campaign to raise awareness.
Could be DeepSeek astroturfing.
Could be "someone" in China trying to get more access to western data.<p>Who knows?</p>
]]></description><pubDate>Sun, 24 May 2026 16:50:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=48258879</link><dc:creator>Hfuffzehn</dc:creator><comments>https://news.ycombinator.com/item?id=48258879</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48258879</guid></item><item><title><![CDATA[New comment by Hfuffzehn in "BambuStudio has been violating PrusaSlicer AGPL license since their fork"]]></title><description><![CDATA[
<p>With DeepSeek making their price rebates permanent we now have some data what China values data access at.<p>Western providers of the open weight models are 3 times or more as expensive as DeepSeek itself right now.<p>Of course the data access for the Chinese is not the only part valued in there, but I am pretty sure it is one.</p>
]]></description><pubDate>Sat, 23 May 2026 13:36:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=48247563</link><dc:creator>Hfuffzehn</dc:creator><comments>https://news.ycombinator.com/item?id=48247563</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48247563</guid></item><item><title><![CDATA[New comment by Hfuffzehn in "I Miss Terry Pratchett"]]></title><description><![CDATA[
<p>I miss him too.<p>Even though I had the experiences he discribes with Douglas Adams first before discovering Terry Pratchett.</p>
]]></description><pubDate>Sat, 23 May 2026 13:24:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=48247478</link><dc:creator>Hfuffzehn</dc:creator><comments>https://news.ycombinator.com/item?id=48247478</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48247478</guid></item><item><title><![CDATA[New comment by Hfuffzehn in "The last six months in LLMs in five minutes"]]></title><description><![CDATA[
<p>You have correctly identified that getting a "high-quality harness (ie preloaded instructions from md files, including custom skills)" is the (or at least a) hard part.<p>Because you have to adjust the harness to your problem space and provide that so you can say it is high-quality.<p>Many people will stop that discussion at the claude code vs. codex vs. opencode level and then merge that with discussing model performance.<p>And that is also why "Generate an SVG of a pelican riding a bicycle" is still a benchmark worth discussing. Because at least it is a defined problem space.</p>
]]></description><pubDate>Tue, 19 May 2026 06:47:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=48190068</link><dc:creator>Hfuffzehn</dc:creator><comments>https://news.ycombinator.com/item?id=48190068</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48190068</guid></item><item><title><![CDATA[New comment by Hfuffzehn in "Maybe you shouldn't install new software for a bit"]]></title><description><![CDATA[
<p>Isn't blaming AI for that similar to blaming C for buffer overflows?<p>More people are producing more code because of easier tools. Most code is bad. But that's not the tools fault.<p>And in the end it is a problem of processes and culture.</p>
]]></description><pubDate>Fri, 08 May 2026 06:26:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=48059363</link><dc:creator>Hfuffzehn</dc:creator><comments>https://news.ycombinator.com/item?id=48059363</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48059363</guid></item><item><title><![CDATA[New comment by Hfuffzehn in "Grok 4.3"]]></title><description><![CDATA[
<p>Yes, but that market is not b2b, less commercialized, more end consumer focused and more bring your own key.<p>That's why I find it interesting. Anthropic is not interested in building a moat there and OpenAI has given up on their announcement of exploring it.<p>So you can see end users making decisions.</p>
]]></description><pubDate>Fri, 01 May 2026 12:43:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=47974131</link><dc:creator>Hfuffzehn</dc:creator><comments>https://news.ycombinator.com/item?id=47974131</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47974131</guid></item><item><title><![CDATA[New comment by Hfuffzehn in "Grok 4.3"]]></title><description><![CDATA[
<p>Sure, but the best statistics about what models people are actually using when they can choose is probably from openrouter: <a href="https://openrouter.ai/apps/category/entertainment/roleplay" rel="nofollow">https://openrouter.ai/apps/category/entertainment/roleplay</a></p>
]]></description><pubDate>Fri, 01 May 2026 11:59:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=47973761</link><dc:creator>Hfuffzehn</dc:creator><comments>https://news.ycombinator.com/item?id=47973761</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47973761</guid></item></channel></rss>