<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: dippogriff</title><link>https://news.ycombinator.com/user?id=dippogriff</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Fri, 26 Jun 2026 22:24:00 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=dippogriff" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by dippogriff in "The AI backlash is only getting started"]]></title><description><![CDATA[
<p>If the labs weren't so aggressive with building datacenters in people's backyards, this could've been a different story. People don't like it when pipelines are built in their backyard either.</p>
]]></description><pubDate>Fri, 26 Jun 2026 14:37:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=48687154</link><dc:creator>dippogriff</dc:creator><comments>https://news.ycombinator.com/item?id=48687154</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48687154</guid></item><item><title><![CDATA[New comment by dippogriff in "The AI backlash is only getting started"]]></title><description><![CDATA[
<p>They tried that a few times and the mistakes have had consequences.</p>
]]></description><pubDate>Fri, 26 Jun 2026 14:29:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=48687048</link><dc:creator>dippogriff</dc:creator><comments>https://news.ycombinator.com/item?id=48687048</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48687048</guid></item><item><title><![CDATA[New comment by dippogriff in "KinetIQ Ascend: Toward 100% Reliable Manipulation and Superhuman Speed"]]></title><description><![CDATA[
<p>This is excellent! Very useful takeaways. Being able to properly do continuous training in production is key with robotics data being so hard to come by.</p>
]]></description><pubDate>Fri, 26 Jun 2026 14:24:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=48686988</link><dc:creator>dippogriff</dc:creator><comments>https://news.ycombinator.com/item?id=48686988</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48686988</guid></item><item><title><![CDATA[New comment by dippogriff in "Fixing Failures in Browser-Use Models: Why More Data Isn't Enough"]]></title><description><![CDATA[
<p>Great work showing how brittle these GUI benchmarks can be! Love the visuals.<p>I wonder if SFT is the problem here as opposed to the coordinate discretization; what happens with a continuous action space?</p>
]]></description><pubDate>Fri, 26 Jun 2026 14:07:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=48686807</link><dc:creator>dippogriff</dc:creator><comments>https://news.ycombinator.com/item?id=48686807</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48686807</guid></item><item><title><![CDATA[New comment by dippogriff in "Autodata: An agentic data scientist to create high quality synthetic data"]]></title><description><![CDATA[
<p>This is cool. Creative ways to do external verification are the only path to solving training on LLM slop.</p>
]]></description><pubDate>Thu, 25 Jun 2026 20:55:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=48679075</link><dc:creator>dippogriff</dc:creator><comments>https://news.ycombinator.com/item?id=48679075</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48679075</guid></item><item><title><![CDATA[New comment by dippogriff in "Every match of the 2026 World Cup as a generative poster"]]></title><description><![CDATA[
<p>Neat! Minor nit - would be nice if the esc button took you back to the list, instead of having to click the X button.</p>
]]></description><pubDate>Thu, 25 Jun 2026 07:23:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=48670082</link><dc:creator>dippogriff</dc:creator><comments>https://news.ycombinator.com/item?id=48670082</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48670082</guid></item><item><title><![CDATA[New comment by dippogriff in "Why eval startups fail (2025)"]]></title><description><![CDATA[
<p>The current way benchmarks are done and are accepted by the community makes for really uninspired work. Until we're willing to break out of this rigid evaluation format prone to crazy overfitting and gaming, talent will move elsewhere. It is kind of a chicken and egg problem though.</p>
]]></description><pubDate>Wed, 24 Jun 2026 18:18:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=48663715</link><dc:creator>dippogriff</dc:creator><comments>https://news.ycombinator.com/item?id=48663715</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48663715</guid></item><item><title><![CDATA[New comment by dippogriff in "For Most of the World, Open-Source AI Is the Only Way Forward"]]></title><description><![CDATA[
<p>Edge models will get much better after the current insane capex and organic data for pre-training have dried up. But it's hard to see how the best open source models will ever come close to the best closed ones.</p>
]]></description><pubDate>Wed, 24 Jun 2026 17:52:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=48663384</link><dc:creator>dippogriff</dc:creator><comments>https://news.ycombinator.com/item?id=48663384</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48663384</guid></item><item><title><![CDATA[New comment by dippogriff in "The worthlessness of Vitamin D is mildly exaggerated"]]></title><description><![CDATA[
<p>Vice versa, the exaggeration of vitamin D is mildly worthless. Some need supplements, some don't.</p>
]]></description><pubDate>Wed, 24 Jun 2026 05:50:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=48655693</link><dc:creator>dippogriff</dc:creator><comments>https://news.ycombinator.com/item?id=48655693</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48655693</guid></item><item><title><![CDATA[New comment by dippogriff in "Qwen-AgentWorld: Language World Models for General Agents"]]></title><description><![CDATA[
<p>I'm a fan of this direction. For me the most interesting use case for these world models isn't even training, it's verification. If this thing or some idealized version of it can actually reliably simulate state transitions, could you use it to verify an agent's execution path against hard constraints and replace/eclipse LLMs-as-a-judge?</p>
]]></description><pubDate>Wed, 24 Jun 2026 05:47:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=48655664</link><dc:creator>dippogriff</dc:creator><comments>https://news.ycombinator.com/item?id=48655664</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48655664</guid></item></channel></rss>