<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: lucidrains</title><link>https://news.ycombinator.com/user?id=lucidrains</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Fri, 17 Apr 2026 10:11:35 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=lucidrains" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by lucidrains in "The Waymo Ojai Will Soon Offer Autonomous Rides Around the U.S."]]></title><description><![CDATA[
<p>They need to have a service that allows for pets. It is the only reason I still use Ubers!</p>
]]></description><pubDate>Thu, 08 Jan 2026 17:14:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=46543574</link><dc:creator>lucidrains</dc:creator><comments>https://news.ycombinator.com/item?id=46543574</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46543574</guid></item><item><title><![CDATA[New comment by lucidrains in "BERT is just a single text diffusion step"]]></title><description><![CDATA[
<p>there has been some movement on that front, in the form of adding expand / delete tokens! <a href="https://hkunlp.github.io/blog/2025/dreamon/" rel="nofollow">https://hkunlp.github.io/blog/2025/dreamon/</a></p>
]]></description><pubDate>Mon, 20 Oct 2025 21:58:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=45649953</link><dc:creator>lucidrains</dc:creator><comments>https://news.ycombinator.com/item?id=45649953</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45649953</guid></item><item><title><![CDATA[New comment by lucidrains in "From multi-head to latent attention: The evolution of attention mechanisms"]]></title><description><![CDATA[
<p>It is a reference to the beatles song, mainly because Noam Shazeer is a music lover</p>
]]></description><pubDate>Sat, 30 Aug 2025 13:33:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=45074544</link><dc:creator>lucidrains</dc:creator><comments>https://news.ycombinator.com/item?id=45074544</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45074544</guid></item><item><title><![CDATA[New comment by lucidrains in "TREAD: Token Routing for Efficient Architecture-Agnostic Diffusion Training"]]></title><description><![CDATA[
<p>very nice, will have to try it out! this is the same research group from which Robin Rombach (of stable diffusion fame) originated from</p>
]]></description><pubDate>Mon, 18 Aug 2025 19:45:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=44944522</link><dc:creator>lucidrains</dc:creator><comments>https://news.ycombinator.com/item?id=44944522</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44944522</guid></item><item><title><![CDATA[New comment by lucidrains in "Hand: open-source Robot Hand"]]></title><description><![CDATA[
<p><a href="https://theopenexo.nau.edu/" rel="nofollow">https://theopenexo.nau.edu/</a></p>
]]></description><pubDate>Thu, 17 Jul 2025 17:20:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=44595698</link><dc:creator>lucidrains</dc:creator><comments>https://news.ycombinator.com/item?id=44595698</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44595698</guid></item><item><title><![CDATA[New comment by lucidrains in "Schizophrenia is the price we pay for minds poised near the edge of a cliff"]]></title><description><![CDATA[
<p>I am sorry to hear that.</p>
]]></description><pubDate>Sun, 29 Jun 2025 17:43:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=44414891</link><dc:creator>lucidrains</dc:creator><comments>https://news.ycombinator.com/item?id=44414891</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44414891</guid></item><item><title><![CDATA[New comment by lucidrains in "Schizophrenia is the price we pay for minds poised near the edge of a cliff"]]></title><description><![CDATA[
<p>indeed, it just becomes less likely</p>
]]></description><pubDate>Sun, 29 Jun 2025 17:43:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=44414887</link><dc:creator>lucidrains</dc:creator><comments>https://news.ycombinator.com/item?id=44414887</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44414887</guid></item><item><title><![CDATA[New comment by lucidrains in "Schizophrenia is the price we pay for minds poised near the edge of a cliff"]]></title><description><![CDATA[
<p>if you are a man and make it past age of 29 without starting to hear voices, you can breathe a sigh of relief (I did)</p>
]]></description><pubDate>Sun, 29 Jun 2025 14:10:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=44413306</link><dc:creator>lucidrains</dc:creator><comments>https://news.ycombinator.com/item?id=44413306</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44413306</guid></item><item><title><![CDATA[New comment by lucidrains in "Launch HN: Vassar Robotics (YC X25) – $219 robot arm that learns new skills"]]></title><description><![CDATA[
<p>Love to know as well!</p>
]]></description><pubDate>Tue, 10 Jun 2025 19:43:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=44240643</link><dc:creator>lucidrains</dc:creator><comments>https://news.ycombinator.com/item?id=44240643</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44240643</guid></item><item><title><![CDATA[New comment by lucidrains in "Surprisingly fast AI-generated kernels we didn't mean to publish yet"]]></title><description><![CDATA[
<p>oh wow, I was looking for use of islands or map-elites that I missed this.. thought it was the blandest mimetic evolution possible</p>
]]></description><pubDate>Sat, 31 May 2025 00:48:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=44141084</link><dc:creator>lucidrains</dc:creator><comments>https://news.ycombinator.com/item?id=44141084</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44141084</guid></item><item><title><![CDATA[New comment by lucidrains in "Differential Transformer"]]></title><description><![CDATA[
<p>does this not mean we should explore usage of talking heads (Shazeer et al) a bit more? <a href="https://arxiv.org/abs/2003.02436" rel="nofollow">https://arxiv.org/abs/2003.02436</a></p>
]]></description><pubDate>Tue, 08 Oct 2024 15:22:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=41778345</link><dc:creator>lucidrains</dc:creator><comments>https://news.ycombinator.com/item?id=41778345</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41778345</guid></item><item><title><![CDATA[New comment by lucidrains in "Psilocybin desynchronizes the human brain"]]></title><description><![CDATA[
<p>Indeed, the best setting is in nature on a beautiful day</p>
]]></description><pubDate>Wed, 17 Jul 2024 21:14:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=40990433</link><dc:creator>lucidrains</dc:creator><comments>https://news.ycombinator.com/item?id=40990433</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40990433</guid></item><item><title><![CDATA[New comment by lucidrains in "GPUs Go Brrr"]]></title><description><![CDATA[
<p>amazing work! thank you!</p>
]]></description><pubDate>Tue, 14 May 2024 00:02:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=40350038</link><dc:creator>lucidrains</dc:creator><comments>https://news.ycombinator.com/item?id=40350038</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40350038</guid></item><item><title><![CDATA[New comment by lucidrains in "GPUs Go Brrr"]]></title><description><![CDATA[
<p>would be interested to see thunderkittens (great name!) tackle the flash attention backwards pass, which is an order of magnitude harder than the forward</p>
]]></description><pubDate>Mon, 13 May 2024 15:10:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=40344230</link><dc:creator>lucidrains</dc:creator><comments>https://news.ycombinator.com/item?id=40344230</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40344230</guid></item><item><title><![CDATA[New comment by lucidrains in "xLSTM: Extended Long Short-Term Memory"]]></title><description><![CDATA[
<p><a href="https://arxiv.org/abs/2404.08819" rel="nofollow">https://arxiv.org/abs/2404.08819</a></p>
]]></description><pubDate>Wed, 08 May 2024 14:37:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=40298605</link><dc:creator>lucidrains</dc:creator><comments>https://news.ycombinator.com/item?id=40298605</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40298605</guid></item><item><title><![CDATA[New comment by lucidrains in "Better and Faster Large Language Models via Multi-Token Prediction"]]></title><description><![CDATA[
<p>wow, so prophet net does work! i spent so much time experimenting with it back in the day, but just lacked the scale to see a positive result.</p>
]]></description><pubDate>Wed, 01 May 2024 14:15:27 +0000</pubDate><link>https://news.ycombinator.com/item?id=40223550</link><dc:creator>lucidrains</dc:creator><comments>https://news.ycombinator.com/item?id=40223550</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40223550</guid></item><item><title><![CDATA[New comment by lucidrains in "Self-reasoning tokens: teaching models to think ahead"]]></title><description><![CDATA[
<p>indeed, simsiam is a great example of the effectiveness of using stop gradient</p>
]]></description><pubDate>Sun, 21 Apr 2024 19:57:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=40108709</link><dc:creator>lucidrains</dc:creator><comments>https://news.ycombinator.com/item?id=40108709</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40108709</guid></item><item><title><![CDATA[New comment by lucidrains in "Self-reasoning tokens: teaching models to think ahead"]]></title><description><![CDATA[
<p>could even try it with a fraction of the attention heads, instead of introducing new tokens</p>
]]></description><pubDate>Sun, 21 Apr 2024 00:50:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=40102347</link><dc:creator>lucidrains</dc:creator><comments>https://news.ycombinator.com/item?id=40102347</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40102347</guid></item><item><title><![CDATA[New comment by lucidrains in "Self-reasoning tokens: teaching models to think ahead"]]></title><description><![CDATA[
<p>yes, it is a stop gradient mask on the attention matrix, iiuc. worth trying</p>
]]></description><pubDate>Sun, 21 Apr 2024 00:18:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=40102199</link><dc:creator>lucidrains</dc:creator><comments>https://news.ycombinator.com/item?id=40102199</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40102199</guid></item><item><title><![CDATA[New comment by lucidrains in "But what is a GPT?  Visual intro to Transformers [video]"]]></title><description><![CDATA[
<p>I can't think of anyone better to teach attention mechanism to the masses. This is a dream come true</p>
]]></description><pubDate>Mon, 01 Apr 2024 19:47:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=39898356</link><dc:creator>lucidrains</dc:creator><comments>https://news.ycombinator.com/item?id=39898356</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39898356</guid></item></channel></rss>