<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: gdiamos</title><link>https://news.ycombinator.com/user?id=gdiamos</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Mon, 27 Apr 2026 11:39:15 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=gdiamos" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by gdiamos in "Forcing Flash Attention onto a TPU and Learning the Hard Way"]]></title><description><![CDATA[
<p>I personally don't mind letting Claude write about work.<p>You could spend 80% doing the work and 20% writing about it, or 99% doing the work and 1% copy-pasting Claude's writeup about it into a blog.<p>There is nothing wrong with writing if you are into it, and yes you can probably do better than Claude, but I can related to engineers who just want to build.</p>
]]></description><pubDate>Thu, 12 Mar 2026 23:11:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=47358536</link><dc:creator>gdiamos</dc:creator><comments>https://news.ycombinator.com/item?id=47358536</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47358536</guid></item><item><title><![CDATA[New comment by gdiamos in "Forcing Flash Attention onto a TPU and Learning the Hard Way"]]></title><description><![CDATA[
<p>One of my lessons in using different accelerators, whether they be different NVIDIA versions, or GPU->TPU, etc is that someone needs to do this work of indexing, partitioning, mapping, scheduling, and benchmarking. That work is labor intensive.<p>In this case, google has already done it, and that will be true for high resourced accelerator companies like Google working with the most popular operations like attention.<p>As long as you use those operations, you are okay. But if you do something different, you need to be prepared to do all of this yourself.</p>
]]></description><pubDate>Thu, 12 Mar 2026 21:22:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=47357311</link><dc:creator>gdiamos</dc:creator><comments>https://news.ycombinator.com/item?id=47357311</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47357311</guid></item><item><title><![CDATA[New comment by gdiamos in "Something is afoot in the land of Qwen"]]></title><description><![CDATA[
<p>Results as good as Qwen has been posting would seem to trigger a power struggle.<p>I think companies that don’t navigate these correctly eventually lose.</p>
]]></description><pubDate>Thu, 05 Mar 2026 09:24:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=47259517</link><dc:creator>gdiamos</dc:creator><comments>https://news.ycombinator.com/item?id=47259517</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47259517</guid></item><item><title><![CDATA[New comment by gdiamos in "Claude becomes number one app on the U.S. App Store"]]></title><description><![CDATA[
<p>It was inevitable.</p>
]]></description><pubDate>Sun, 01 Mar 2026 00:15:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=47202099</link><dc:creator>gdiamos</dc:creator><comments>https://news.ycombinator.com/item?id=47202099</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47202099</guid></item><item><title><![CDATA[New comment by gdiamos in "Statement from Dario Amodei on our discussions with the Department of War"]]></title><description><![CDATA[
<p>This is why I like Dario as a CEO - he has a system of ethics that is not jus about who writes the largest check.<p>You may not agree with it, but I appreciate that it exists.</p>
]]></description><pubDate>Fri, 27 Feb 2026 02:13:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=47175483</link><dc:creator>gdiamos</dc:creator><comments>https://news.ycombinator.com/item?id=47175483</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47175483</guid></item><item><title><![CDATA[New comment by gdiamos in "Fast KV Compaction via Attention Matching"]]></title><description><![CDATA[
<p>I know the frontier “labs” are holding back publications.<p>I don’t think it will last among researchers who think beyond production LLMs</p>
]]></description><pubDate>Fri, 20 Feb 2026 17:55:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=47091377</link><dc:creator>gdiamos</dc:creator><comments>https://news.ycombinator.com/item?id=47091377</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47091377</guid></item><item><title><![CDATA[New comment by gdiamos in "15 years of FP64 segmentation, and why the Blackwell Ultra breaks the pattern"]]></title><description><![CDATA[
<p>I remember it being less profitable than graphics for a long time.<p>It did make money that would be interesting to a startup, but not to a public company.</p>
]]></description><pubDate>Fri, 20 Feb 2026 04:12:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=47083625</link><dc:creator>gdiamos</dc:creator><comments>https://news.ycombinator.com/item?id=47083625</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47083625</guid></item><item><title><![CDATA[New comment by gdiamos in "15 years of FP64 segmentation, and why the Blackwell Ultra breaks the pattern"]]></title><description><![CDATA[
<p>I remember it differently. CUDA was built with the intention of finding/enabling something like deep learning. I thought it was unrealistic too and took it on faith in people more experienced than me, until I saw deep learning work.<p>Some of the near misses I remember included bitcoin. Many of the other attempts didn't ever see the light of day.<p>Luck in english often means success by chance rather than one's own efforts or abilities. I don't think that characterizes CUDA. I think it was eventual success in the face of extreme difficulty, many failures, and sacrifices. In hindsight, I'm still surprised that Jensen kept funding it as long as he did. I've never met a leader since who I think would have done that.</p>
]]></description><pubDate>Thu, 19 Feb 2026 07:07:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=47070822</link><dc:creator>gdiamos</dc:creator><comments>https://news.ycombinator.com/item?id=47070822</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47070822</guid></item><item><title><![CDATA[New comment by gdiamos in "15 years of FP64 segmentation, and why the Blackwell Ultra breaks the pattern"]]></title><description><![CDATA[
<p>Most people don't appreciate how many dead end applications NVIDIA explored before finding deep learning. It took a very long time, and it wasn't luck.</p>
]]></description><pubDate>Thu, 19 Feb 2026 04:11:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=47069830</link><dc:creator>gdiamos</dc:creator><comments>https://news.ycombinator.com/item?id=47069830</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47069830</guid></item><item><title><![CDATA[New comment by gdiamos in "15 years of FP64 segmentation, and why the Blackwell Ultra breaks the pattern"]]></title><description><![CDATA[
<p>I'm not sure why the article dismisses cost.<p>Let's say X=10% of the GPU area (~75mm^2) is dedicated to FP32 SIMD units. Assume FP64 units are ~2-4x bigger. That would be 150-300mm^2, a huge amount of area that would increase the price per GPU. You may not agree with these assumptions. Feel free to change them. It is an overhead that is replicated per core. Why would gamers want to pay for any features they don't use?<p>Not to say there isn't market segmentation going on, but FP64 cost is higher for massively parallel processors than it was in the days of high frequency single core CPUs.</p>
]]></description><pubDate>Thu, 19 Feb 2026 04:09:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=47069822</link><dc:creator>gdiamos</dc:creator><comments>https://news.ycombinator.com/item?id=47069822</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47069822</guid></item><item><title><![CDATA[New comment by gdiamos in "Vouch"]]></title><description><![CDATA[
<p>I feel like a lot of software engineering problems come out of people who refuse to talk to each other than through comments in VCS.<p>It makes sense if you are collaborating over IRC, but I feel the need to face palm when people sitting next to each other do it.<p>What is your preferred way to talk to your team?<p>No English, only code<p>Slack<p>Zoom<p>In a meeting room<p>Over lunch<p>On a walk<p>One thing I’ve learned over time is that the highest bandwidth way of talking is face to face because you can read body language in addition to words. Video chat is okay, but an artificial and often overly formal setting. Phone is faster than text. Text drops the audio/visual/emotional signal completely. Code is precise but requires reverse engineering intent.<p>I personally like a walk, and then pair programming a shared screen.</p>
]]></description><pubDate>Mon, 09 Feb 2026 01:38:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=46940577</link><dc:creator>gdiamos</dc:creator><comments>https://news.ycombinator.com/item?id=46940577</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46940577</guid></item><item><title><![CDATA[New comment by gdiamos in "Clawdbot - open source personal AI assistant"]]></title><description><![CDATA[
<p>It sounds like lack of security is the biggest feature and risk of this clawd thing.<p>I also tried using Siri to tell me the weather forcast while I was driving to the park. It asked me to auth into my phone. Then it asked me to approve location access. I guess it was secure but I never figured out what the weather forecast was.<p>Thankfully it didn't rain on my picnic. Some of the parents there asked me if their investors should be interested in clawd.</p>
]]></description><pubDate>Mon, 26 Jan 2026 04:37:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=46761924</link><dc:creator>gdiamos</dc:creator><comments>https://news.ycombinator.com/item?id=46761924</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46761924</guid></item><item><title><![CDATA[New comment by gdiamos in "Skip is now free and open source"]]></title><description><![CDATA[
<p>Here’s another perspective. Developers aren’t budget approvers in engineering organizations by choice.<p>If you are a budget approver then your inbox and calendar are full of sales teams.<p>I find that experience is too distracting to concentrate on writing good code.</p>
]]></description><pubDate>Thu, 22 Jan 2026 11:22:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=46717793</link><dc:creator>gdiamos</dc:creator><comments>https://news.ycombinator.com/item?id=46717793</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46717793</guid></item><item><title><![CDATA[New comment by gdiamos in "On the slow death of scaling"]]></title><description><![CDATA[
<p>It was an interesting read Sara, thanks for sharing it.<p>I especially agree with your point that scaling laws really killed open research. That's a shame and I personally think we could benefit from more research.<p>I originally didn't like calling them scaling laws.<p>In addition to the law part seeming a bit much, I've found that researchers often overemphasize the scale part. If scaling is predictable, then you don't need to do most experiments at very large scale. However, that doesn't seem to stop researchers from starting there.<p>Once you find something good, and you understand how it scales, then you can pour system resources into it. So I originally thought it would encourage research. I find it sad that it seems to have had the opposite effect.</p>
]]></description><pubDate>Wed, 07 Jan 2026 07:50:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=46523714</link><dc:creator>gdiamos</dc:creator><comments>https://news.ycombinator.com/item?id=46523714</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46523714</guid></item><item><title><![CDATA[New comment by gdiamos in "John Giannandrea to retire from Apple"]]></title><description><![CDATA[
<p>The iphone is so useful they can probably ride it for a couple more decades at least. I would still buy it.<p>From a technology or engineering perspective, I have no idea how to work with Apple.</p>
]]></description><pubDate>Tue, 02 Dec 2025 01:32:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=46116194</link><dc:creator>gdiamos</dc:creator><comments>https://news.ycombinator.com/item?id=46116194</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46116194</guid></item><item><title><![CDATA[New comment by gdiamos in "Ilya Sutskever, Yann LeCun and the End of “Just Add GPUs”"]]></title><description><![CDATA[
<p>A Dyson sphere brain ?</p>
]]></description><pubDate>Thu, 27 Nov 2025 05:13:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=46065908</link><dc:creator>gdiamos</dc:creator><comments>https://news.ycombinator.com/item?id=46065908</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46065908</guid></item><item><title><![CDATA[New comment by gdiamos in "Ilya Sutskever, Yann LeCun and the End of “Just Add GPUs”"]]></title><description><![CDATA[
<p>I personally don’t think the scaling hypothesis is wrong, but it is running up against real limits<p>What high quality data sources are not already tapped?<p>Where does the next 1000x flops come from?</p>
]]></description><pubDate>Thu, 27 Nov 2025 01:09:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=46064304</link><dc:creator>gdiamos</dc:creator><comments>https://news.ycombinator.com/item?id=46064304</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46064304</guid></item><item><title><![CDATA[New comment by gdiamos in "TiDAR: Think in Diffusion, Talk in Autoregression"]]></title><description><![CDATA[
<p>Diffusion is favored by current GPUs .<p>Over time we seem to have a tendency to build models that are well matched to our machines</p>
]]></description><pubDate>Sat, 22 Nov 2025 16:53:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=46016111</link><dc:creator>gdiamos</dc:creator><comments>https://news.ycombinator.com/item?id=46016111</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46016111</guid></item><item><title><![CDATA[New comment by gdiamos in "Jeff Bezos creates A.I. startup where he will be co-chief executive"]]></title><description><![CDATA[
<p>How do they break ties?</p>
]]></description><pubDate>Tue, 18 Nov 2025 04:05:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=45961260</link><dc:creator>gdiamos</dc:creator><comments>https://news.ycombinator.com/item?id=45961260</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45961260</guid></item><item><title><![CDATA[New comment by gdiamos in "Show HN: Tiny Diffusion – A character-level text diffusion model from scratch"]]></title><description><![CDATA[
<p>Did I miss something? <a href="https://github.com/NVlabs/Fast-dLLM/blob/main/llada/chat.py" rel="nofollow">https://github.com/NVlabs/Fast-dLLM/blob/main/llada/chat.py</a><p>That’s inference code, but where is the high perf web server?</p>
]]></description><pubDate>Sat, 15 Nov 2025 07:37:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=45935723</link><dc:creator>gdiamos</dc:creator><comments>https://news.ycombinator.com/item?id=45935723</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45935723</guid></item></channel></rss>