<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: ainch</title><link>https://news.ycombinator.com/user?id=ainch</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sat, 13 Jun 2026 16:05:04 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=ainch" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by ainch in "Gram Newton-Schulz: A Fast, Hardware-Aware Newton-Schulz Algorithm for Muon"]]></title><description><![CDATA[
<p>Tri Dao's lab must have saved countless watts with FlashAttention. Great to see them continuing to open-source massive efficiency gains.</p>
]]></description><pubDate>Fri, 12 Jun 2026 00:13:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=48498200</link><dc:creator>ainch</dc:creator><comments>https://news.ycombinator.com/item?id=48498200</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48498200</guid></item><item><title><![CDATA[New comment by ainch in "Anthropic apologizes for invisible Claude Fable guardrails"]]></title><description><![CDATA[
<p>Here's one that was flagged for me: a question about a niche Reinforcement Learning paper from 2012<p><i>I've been reading the option-option model paper by David Silver. It appears that they achieved quite an effective result. Why hasn't there been more work on it since?</i></p>
]]></description><pubDate>Thu, 11 Jun 2026 18:58:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=48494872</link><dc:creator>ainch</dc:creator><comments>https://news.ycombinator.com/item?id=48494872</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48494872</guid></item><item><title><![CDATA[New comment by ainch in "πFS"]]></title><description><![CDATA[
<p>I agree it's an oversimplification. The example I think of is something like Newton's law of gravitation vs Ptolemaic epicycles: one simple explanation replaced many layers of tweaks.<p>It's also a relevant example for AI - one paper tested the ability of Transformers  to model planetary orbits: unlike Newton's Law, the implicit forces they learn are nonsense.<p><a href="https://arxiv.org/pdf/2507.06952" rel="nofollow">https://arxiv.org/pdf/2507.06952</a></p>
]]></description><pubDate>Thu, 11 Jun 2026 18:40:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=48494632</link><dc:creator>ainch</dc:creator><comments>https://news.ycombinator.com/item?id=48494632</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48494632</guid></item><item><title><![CDATA[New comment by ainch in "Cybersecurity researchers aren't happy about the guardrails on Anthropic's Fable"]]></title><description><![CDATA[
<p>Anthropic's claim was that Deepseek collected ~150k conversations.<p><a href="https://www.anthropic.com/news/detecting-and-preventing-distillation-attacks" rel="nofollow">https://www.anthropic.com/news/detecting-and-preventing-dist...</a><p>I think the extent of distillation by Deepseek specifically is overstated. For comparison, Minimax collected over 13m 'exchanges', which starts to sound a lot more like large-scale distillation.</p>
]]></description><pubDate>Thu, 11 Jun 2026 00:50:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=48484892</link><dc:creator>ainch</dc:creator><comments>https://news.ycombinator.com/item?id=48484892</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48484892</guid></item><item><title><![CDATA[New comment by ainch in "πFS"]]></title><description><![CDATA[
<p>In some sense, science is the most extreme form of compression - Newtonian mechanics explains an incredible number of phenomena in a few lines of text.</p>
]]></description><pubDate>Wed, 10 Jun 2026 23:06:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=48484017</link><dc:creator>ainch</dc:creator><comments>https://news.ycombinator.com/item?id=48484017</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48484017</guid></item><item><title><![CDATA[New comment by ainch in "Claude Fable 5"]]></title><description><![CDATA[
<p>AISI did also say that GPT-5.5, which has been public for months, scores basically the same as Mythos on their cybersec evaluation. But there wasn't as much media about about that for some reason.<p><a href="https://www.aisi.gov.uk/blog/our-evaluation-of-openais-gpt-5-5-cyber-capabilities" rel="nofollow">https://www.aisi.gov.uk/blog/our-evaluation-of-openais-gpt-5...</a></p>
]]></description><pubDate>Tue, 09 Jun 2026 22:49:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=48468914</link><dc:creator>ainch</dc:creator><comments>https://news.ycombinator.com/item?id=48468914</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48468914</guid></item><item><title><![CDATA[New comment by ainch in "Claude Fable 5"]]></title><description><![CDATA[
<p>Token prices have increased, but it's not really the whole story at this point, given some models will use far more tokens to complete a task than others. One of the charts in Anthropic's blog posts shows Fable at 'low' reasoning achieving better results for less money than Opus on 'high'.</p>
]]></description><pubDate>Tue, 09 Jun 2026 22:46:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=48468869</link><dc:creator>ainch</dc:creator><comments>https://news.ycombinator.com/item?id=48468869</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48468869</guid></item><item><title><![CDATA[New comment by ainch in "Apple reveals new AI architecture built around Google Gemini models"]]></title><description><![CDATA[
<p>They say it's because the EU's DMA would require them to open up device data to third-party assistants, and they'd no longer be able to guarantee user privacy.<p><a href="https://www.apple.com/newsroom/2026/06/due-to-dma-siri-ai-delayed-in-eu-for-ios-27-and-ipados-27/" rel="nofollow">https://www.apple.com/newsroom/2026/06/due-to-dma-siri-ai-de...</a></p>
]]></description><pubDate>Mon, 08 Jun 2026 20:02:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=48451022</link><dc:creator>ainch</dc:creator><comments>https://news.ycombinator.com/item?id=48451022</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48451022</guid></item><item><title><![CDATA[New comment by ainch in "AI is slowing down"]]></title><description><![CDATA[
<p>As WIRED reported[0], despite constantly writing about how an AI collapse is <i>just about</i> to come, Zitron privately does PR for AI firms on the side. The man is an obvious hack, and it's disappointing that he has become one of the mainstream faces of AI skepticism.<p>[0]: <a href="https://www.wired.com/story/ai-pr-ed-zitron-profile/" rel="nofollow">https://www.wired.com/story/ai-pr-ed-zitron-profile/</a></p>
]]></description><pubDate>Mon, 08 Jun 2026 18:43:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=48449594</link><dc:creator>ainch</dc:creator><comments>https://news.ycombinator.com/item?id=48449594</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48449594</guid></item><item><title><![CDATA[New comment by ainch in "Ask HN: What was your "oh shit" moment with GenAI?"]]></title><description><![CDATA[
<p>I'd spent 6 hours solving a gnarly RL problem (mathematically solving divergence of off-policy TD-Lambda for any value of lambda or behaviour policy).<p>As a punt I gave it to o3 (remember LLMs were 'bad at maths') - after 15 minutes it returned with the answer that had taken me hours.</p>
]]></description><pubDate>Sun, 07 Jun 2026 10:42:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=48433556</link><dc:creator>ainch</dc:creator><comments>https://news.ycombinator.com/item?id=48433556</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48433556</guid></item><item><title><![CDATA[New comment by ainch in "Artificial intelligence is not conscious – Ted Chiang"]]></title><description><![CDATA[
<p>There's a growing body of evidence that most of what the brain does is constantly  predicting the world around us - look into the predictive brain hypothesis if you're interested.<p>As Ilya Sutskever has pointed out, if you read a mystery novel up until the reveal of the culprit, and then fill in "The killer was _____", don't you need to understand the novel to accurately predict the next word?</p>
]]></description><pubDate>Thu, 04 Jun 2026 09:34:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=48396237</link><dc:creator>ainch</dc:creator><comments>https://news.ycombinator.com/item?id=48396237</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48396237</guid></item><item><title><![CDATA[New comment by ainch in "Nvidia Cosmos 3"]]></title><description><![CDATA[
<p>You can fine-tune it so, given an image and a task description, it generates a corresponding set of actions.</p>
]]></description><pubDate>Mon, 01 Jun 2026 15:05:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=48357816</link><dc:creator>ainch</dc:creator><comments>https://news.ycombinator.com/item?id=48357816</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48357816</guid></item><item><title><![CDATA[Training a Simple World Model with Jax]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.alexinch.com/blog/simple-world-model">https://www.alexinch.com/blog/simple-world-model</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48354481">https://news.ycombinator.com/item?id=48354481</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Mon, 01 Jun 2026 09:24:29 +0000</pubDate><link>https://www.alexinch.com/blog/simple-world-model</link><dc:creator>ainch</dc:creator><comments>https://news.ycombinator.com/item?id=48354481</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48354481</guid></item><item><title><![CDATA[New comment by ainch in "Dune's Butlerian Jihad and the Future of AI"]]></title><description><![CDATA[
<p>Deep Blue's strength was leveraging massive compute to execute a task-specific, human-written algorithm. The problems which LLMs are tackling don't elude mathematicians because they require too much number-crunching, but because they demand creative problem solving. The latter seems more profound, even if it doesn't imply sentience.</p>
]]></description><pubDate>Mon, 01 Jun 2026 09:22:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=48354466</link><dc:creator>ainch</dc:creator><comments>https://news.ycombinator.com/item?id=48354466</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48354466</guid></item><item><title><![CDATA[Speeding up MuJoCo 460x with Jax]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.alexinch.com/blog/mjx">https://www.alexinch.com/blog/mjx</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48246132">https://news.ycombinator.com/item?id=48246132</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Sat, 23 May 2026 09:19:19 +0000</pubDate><link>https://www.alexinch.com/blog/mjx</link><dc:creator>ainch</dc:creator><comments>https://news.ycombinator.com/item?id=48246132</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48246132</guid></item><item><title><![CDATA[New comment by ainch in "Agora-1: The Multi-Agent World Model"]]></title><description><![CDATA[
<p>Very cool work, the learned world state is a smart way of getting consistent generation across all the views (and not having the map vanish when you 180 like some other models). Multi-agent is such an interesting field, because it's clear that humanity benefits from distributed intelligence, but I don't think MARL has really had a big breakthrough like AlphaGo or RLVR for single-agent RL.<p>Two thoughts about where this could go: first, the internal world state would need to be learned to transfer to real-life robotics, since you can't query the internals of a game engine in training. Second, an enormous challenge for many of these world models is going to be truly unbounded environmental interactivity - Agora is still mostly about a few agents interacting in a static environment. Learning interaction will be hard, because the interactions in games are intentionally added in, by hand. But we (human learners) acquire a strong model for environental interaction very efficiently, which is part of what helps us generalise so effectively.</p>
]]></description><pubDate>Mon, 18 May 2026 19:42:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=48184519</link><dc:creator>ainch</dc:creator><comments>https://news.ycombinator.com/item?id=48184519</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48184519</guid></item><item><title><![CDATA[New comment by ainch in "Futhark by example (2020)"]]></title><description><![CDATA[
<p>Thanks for the reply - you're clearly much more experienced with the internals here, but I believe we're still talking at cross-purposes. I believe you're talking about compilers like jax.jit or torch.compile performing symbolic shape inference. I'm talking about the ergonomics of tracking shape information while writing Python code that calls these libraries. I don't use Torch much, so I'll just comment on the Jax side.<p>> Jax is explicitly mentioned in your pyrefly link as having a parallel (but slightly weaker) system<p>Jaxtyping is limited to runtime-only checks (which might as well be assert statements), and doesn't infer shapes based on operations. I'm interested in Pyrefly because I've run into the limitations of Jaxtyping in my own usage.<p>> Jax is built on stablehlo which uses shape dialect, which is part of the compiler (and therefore statically known).<p>It's true that JAX does shape inference when it compiles down to HLO - but that isn't available to the Python typing system. The Pyrefly development is addressing that, so you get static analysis before even running anything, or without having to add eval_shape calls all over your codebase. I think that's helpful, and will catch bugs. When I say Jax does inference at runtime, I mean that you have to run for the jit compiler to kick in - you don't get feedback as you edit.<p>> the fact that people do not know how to use the tools does not mean the tools are lacking... almost everyone that is employed to work with these tools is aware of these features and therefore eschews those kinds of comment strings.<p>The examples I took are from Andrej Karpathy and Noam Shazeer - maybe the disconnect is that they're more on the research side. Perhaps only unsophisticated users rely on these hacks - but as one such user I'm very excited that Pyrefly is addressing a problem I have. I suspect part of the misunderstanding that's evolved here is that these tools serve audiences with different needs.</p>
]]></description><pubDate>Mon, 18 May 2026 01:01:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=48174604</link><dc:creator>ainch</dc:creator><comments>https://news.ycombinator.com/item?id=48174604</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48174604</guid></item><item><title><![CDATA[New comment by ainch in "Every AI Subscription Is a Ticking Time Bomb for Enterprise"]]></title><description><![CDATA[
<p>At the same time, the training paradigm being scaled, Reinforcement Learning, is significantly less data-efficient than next-token prediction. You basically need to run an agent for minutes (or longer if you want good long-horizon performance), only to give it a binary pass/fail - one bit of information.<p>Inference compute is definitely scaling fast, but to scale RL, training and R&D compute also needs to scale hard. I don't think it's obvious that inference will overtake R&D/training, unless there's a reputable source that states that.</p>
]]></description><pubDate>Sun, 17 May 2026 18:02:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=48171424</link><dc:creator>ainch</dc:creator><comments>https://news.ycombinator.com/item?id=48171424</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48171424</guid></item><item><title><![CDATA[New comment by ainch in "Every AI Subscription Is a Ticking Time Bomb for Enterprise"]]></title><description><![CDATA[
<p>The price for a given level of capability will fall, but the frontier has recently been getting more expensive. If you compare GPT-5 to GPT-5.5 on the Artificial Analysis benchmark, it's ~4x more expensive, but achieves a higher score. Claude 4.7 is also more expensive than predecessors because of a tokenizer change.<p>As the AI labs become more reliant on enterprise adoption, it makes sense to push capabilities at a cost that makes sense for businesses. Even if it prices out consumers or hobbyists.</p>
]]></description><pubDate>Sun, 17 May 2026 13:14:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=48168650</link><dc:creator>ainch</dc:creator><comments>https://news.ycombinator.com/item?id=48168650</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48168650</guid></item><item><title><![CDATA[New comment by ainch in "AI subscriptions are a ticking time bomb for enterprise"]]></title><description><![CDATA[
<p>Tokens can be sold at profit, but 70% of compute expenditure goes to R&D and model training[0]. Inference needs to cover all of that as well as being profitable in a vacuum.<p>[0] <a href="https://epoch.ai/data-insights/openai-compute-spend" rel="nofollow">https://epoch.ai/data-insights/openai-compute-spend</a></p>
]]></description><pubDate>Sun, 17 May 2026 13:08:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=48168604</link><dc:creator>ainch</dc:creator><comments>https://news.ycombinator.com/item?id=48168604</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48168604</guid></item></channel></rss>