<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: jephs</title><link>https://news.ycombinator.com/user?id=jephs</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Fri, 05 Jun 2026 21:05:44 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=jephs" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by jephs in "Do transformers need three projections? Systematic study of QKV variants"]]></title><description><![CDATA[
<p>Scaling curves don't need to be drawn at particularly enormous parameter counts to be useful! If you can do a 300M and a 1.2B run (like the authors do here), then you can do 150M, 300M, 600M, and 1.2B runs with only about 50% more resources (assuming cost scales roughly linearly with parameter count), and get a much better sense of whether effects amplify or diminish as scale increases.</p>
]]></description><pubDate>Fri, 05 Jun 2026 16:56:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=48415238</link><dc:creator>jephs</dc:creator><comments>https://news.ycombinator.com/item?id=48415238</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48415238</guid></item><item><title><![CDATA[New comment by jephs in "Do transformers need three projections? Systematic study of QKV variants"]]></title><description><![CDATA[
<p>I'm terribly sorry, but scaling curves or GTFO. Any random pile of linear algebra works fine-ish at small scales. Very few random piles of linear algebra push the Pareto envelope at large scales.</p>
]]></description><pubDate>Fri, 05 Jun 2026 00:27:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=48406522</link><dc:creator>jephs</dc:creator><comments>https://news.ycombinator.com/item?id=48406522</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48406522</guid></item><item><title><![CDATA[New comment by jephs in "Retro-Tech Parenting"]]></title><description><![CDATA[
<p>I've got 5- and 6-year-old kids. They have a VHS player / tiny CRT monitor with a few dozen tapes, a tiny janky mp3 player with all my ripped post-Y2K-era albums, and lots of books and art supplies.</p><p>VHS tapes are so cheap. Every thrift store has hundreds for like half a buck each. All your friends have a box in their basement they want to get rid of.</p>
]]></description><pubDate>Thu, 04 Jun 2026 18:43:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=48402882</link><dc:creator>jephs</dc:creator><comments>https://news.ycombinator.com/item?id=48402882</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48402882</guid></item></channel></rss>