<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: faurroar</title><link>https://news.ycombinator.com/user?id=faurroar</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Thu, 18 Jun 2026 11:34:40 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=faurroar" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by faurroar in "Statement on US government directive to suspend access to Fable 5 and Mythos 5"]]></title><description><![CDATA[
<p>Hinton and Bengio don't understand how LLMs work?</p>
]]></description><pubDate>Sat, 13 Jun 2026 05:14:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=48513518</link><dc:creator>faurroar</dc:creator><comments>https://news.ycombinator.com/item?id=48513518</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48513518</guid></item><item><title><![CDATA[New comment by faurroar in "How LLMs work"]]></title><description><![CDATA[
<p>Architectures have evolved significantly since then. DeepSeek v4 =/= GPT-3. Even setting architecture aside, a great deal of complexity lies in everything surrounding it: how do you implement the model performantly on modern accelerators, how do you distribute it across a set of accelerators, how do you post-train, etc. And pre-training itself is a dark art. If you legitimately think that frontier labs are doing something equivalent to whatever you wrote on your whiteboard, you’re clueless.</p>
]]></description><pubDate>Sat, 06 Jun 2026 05:14:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=48421610</link><dc:creator>faurroar</dc:creator><comments>https://news.ycombinator.com/item?id=48421610</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48421610</guid></item></channel></rss>