<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: LatencyKills</title><link>https://news.ycombinator.com/user?id=LatencyKills</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Tue, 16 Jun 2026 08:13:53 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=LatencyKills" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by LatencyKills in "I Am Not a Reverse Centaur"]]></title><description><![CDATA[
<p>> Who gets a sense of accoplishment from prompting an LLM?<p>I have a good friend who is a VP at a telecom company who has never written a line of code. He's been using Claude to create interactive web pages to help him understand parts of the company.<p>He was <i>so</i> excited when he got something to work he called me immediately.<p>I'm sure the code isn't what you or I would write, but it is good enough for my friend. That said, heaven help him if he loses access to Claude. ;-)</p>
]]></description><pubDate>Fri, 12 Jun 2026 19:27:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=48508427</link><dc:creator>LatencyKills</dc:creator><comments>https://news.ycombinator.com/item?id=48508427</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48508427</guid></item><item><title><![CDATA[New comment by LatencyKills in "How LLMs work"]]></title><description><![CDATA[
<p>I went this route because I had difficulty visualizing the content of the Attention Is All You Need paper. After going through both books, I can now understand every part of that paper.<p>I'm currently working on a robotics project that uses Nvidia's GR00T N1 model, and I was able to understand the research paper. [0]<p>[0]: <a href="https://arxiv.org/abs/2503.14734" rel="nofollow">https://arxiv.org/abs/2503.14734</a></p>
]]></description><pubDate>Sun, 07 Jun 2026 10:40:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=48433541</link><dc:creator>LatencyKills</dc:creator><comments>https://news.ycombinator.com/item?id=48433541</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48433541</guid></item><item><title><![CDATA[New comment by LatencyKills in "How LLMs work"]]></title><description><![CDATA[
<p>Well, as I suggested, working through the implementation yourself will give you that intuition. That said, I think the simplest way to explain why positional encodings are useful is that it gives the transformer just enough information to make attention meaningful without negatively impacting any parallel, content-based comparisons.<p>A vanilla self-attention layer is just a set of token vectors. Without positional info, swapping two identical embeddings changes very little about what attention can compute. We can "fix" this problem by using positional encodings. Text that has meaning isn't just a set of characters; the location and <i>order</i> of those characters is what provides meaning.</p>
]]></description><pubDate>Sat, 06 Jun 2026 13:32:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=48424947</link><dc:creator>LatencyKills</dc:creator><comments>https://news.ycombinator.com/item?id=48424947</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48424947</guid></item><item><title><![CDATA[New comment by LatencyKills in "How LLMs work"]]></title><description><![CDATA[
<p>I have a BS in CS (and have been in the field for 25 years). I couldn't understand the transformer architecture until I built a few myself. Here are the books I worked through. I now feel I have a very good understanding of modern LLMs.<p><a href="https://www.amazon.com/Build-Large-Language-Model-Scratch/dp/B0DNR6TH6X" rel="nofollow">https://www.amazon.com/Build-Large-Language-Model-Scratch/dp...</a><p><a href="https://www.amazon.com/Build-DeepSeek-Scratch-Abhijit-Dandekar-ebook/dp/B0GJ75VLPS" rel="nofollow">https://www.amazon.com/Build-DeepSeek-Scratch-Abhijit-Dandek...</a></p>
]]></description><pubDate>Sat, 06 Jun 2026 12:58:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=48424653</link><dc:creator>LatencyKills</dc:creator><comments>https://news.ycombinator.com/item?id=48424653</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48424653</guid></item><item><title><![CDATA[New comment by LatencyKills in "How LLMs work"]]></title><description><![CDATA[
<p>Not OP but I worked through Sebastian Raschka's "Build a Large Language Model (From Scratch)" [0] and Raj Abhijit Dandekar's "Build a DeepSeek Model (From Scratch)" [1] books.<p>I don't think there is anything in a transformer I couldn't explain in the smallest detail now.<p>[0]: <a href="https://www.amazon.com/Build-Large-Language-Model-Scratch/dp/B0DNR6TH6X" rel="nofollow">https://www.amazon.com/Build-Large-Language-Model-Scratch/dp...</a><p>[1]: <a href="https://www.amazon.com/Build-DeepSeek-Scratch-Abhijit-Dandekar-ebook/dp/B0GJ75VLPS" rel="nofollow">https://www.amazon.com/Build-DeepSeek-Scratch-Abhijit-Dandek...</a></p>
]]></description><pubDate>Sat, 06 Jun 2026 12:54:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=48424626</link><dc:creator>LatencyKills</dc:creator><comments>https://news.ycombinator.com/item?id=48424626</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48424626</guid></item><item><title><![CDATA[New comment by LatencyKills in "Nvidia announces new AI chip for personal computers"]]></title><description><![CDATA[
<p>I was an engineer at both MS and Apple, and wholeheartedly agree with you.<p>My question is, what happens to the people who use RTX cards for gaming? This new solution isn't meant for that. Do they need an "AI accelerator" <i>and</i> a gaming-centric GPU?</p>
]]></description><pubDate>Mon, 01 Jun 2026 12:31:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=48355965</link><dc:creator>LatencyKills</dc:creator><comments>https://news.ycombinator.com/item?id=48355965</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48355965</guid></item><item><title><![CDATA[New comment by LatencyKills in "Nvidia RTX Spark"]]></title><description><![CDATA[
<p>First:<p>> "Our goal is to deliver unmetered intelligence to every home and every desk with Windows," said Satya Nadella, chairman and head of Microsoft.<p>Then:<p>> However, Ian Fogg, Research Director at industry analyst firm FDM CCS Insight said the change was "likely to come with a significant price tag" and Nvidia would be targeting "those looking for workstation-class performance".<p>So... <i>not</i> every desk with Windows.</p>
]]></description><pubDate>Mon, 01 Jun 2026 12:03:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=48355711</link><dc:creator>LatencyKills</dc:creator><comments>https://news.ycombinator.com/item?id=48355711</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48355711</guid></item><item><title><![CDATA[New comment by LatencyKills in "Antigravity 2.0 Tops the OpenSCAD Architectural 3D LLM Benchmark"]]></title><description><![CDATA[
<p>As someone who's been building developer tools (Visual Studio and Xcode) for 25 years, I don't have a perspective problem. We were doing "code completion" back in the 90s and could never have predicted that an LLM would write code at the current level of quality.<p>My point is that with every new model release, the expectations grow. I don't know how else to say that.</p>
]]></description><pubDate>Fri, 22 May 2026 12:10:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=48234763</link><dc:creator>LatencyKills</dc:creator><comments>https://news.ycombinator.com/item?id=48234763</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48234763</guid></item><item><title><![CDATA[New comment by LatencyKills in "The memory shortage is causing a repricing of consumer electronics"]]></title><description><![CDATA[
<p>He didn't even know that there <i>was</i> a solution to the performance issues. He simply assumed that processing data took that long.<p>I think it is great that he now has this capability, but a total ignorance of software engineering is going to continually bite this type of user. Instead of questioning Claude's solution, my friend thought he just needed a faster computer.<p>He was also using very sketchy Python imports when <i>much</i> safer, more mature options are available. Not knowing that you shouldn't use just any random Python package is a ticking time bomb... especially when his machine is connected directly to his corporate intranet.</p>
]]></description><pubDate>Fri, 22 May 2026 11:27:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=48234406</link><dc:creator>LatencyKills</dc:creator><comments>https://news.ycombinator.com/item?id=48234406</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48234406</guid></item><item><title><![CDATA[New comment by LatencyKills in "Antigravity 2.0 Tops the OpenSCAD Architectural 3D LLM Benchmark"]]></title><description><![CDATA[
<p>Things mature, and expectations grow appropriately. That is true of more than just LLM performance.</p>
]]></description><pubDate>Fri, 22 May 2026 11:19:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=48234347</link><dc:creator>LatencyKills</dc:creator><comments>https://news.ycombinator.com/item?id=48234347</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48234347</guid></item><item><title><![CDATA[New comment by LatencyKills in "Steve Wozniak cheered after telling students they have AI – actual intelligence"]]></title><description><![CDATA[
<p>I was fortunate to get to spend time with woz when I worked at Apple. He's the type of person who is practically silent during a meeting. Then, towards the end, he spoke up and would literally solve the problem we'd been struggling with the entire time.<p>He's one of the nicest, most down-to-earth people I've ever worked with.</p>
]]></description><pubDate>Fri, 22 May 2026 11:10:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=48234276</link><dc:creator>LatencyKills</dc:creator><comments>https://news.ycombinator.com/item?id=48234276</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48234276</guid></item><item><title><![CDATA[New comment by LatencyKills in "The memory shortage is causing a repricing of consumer electronics"]]></title><description><![CDATA[
<p>I have a friend who is VP at a major telecom company. He has no technical experience but has been using Claude to create data analysis apps. He was complaining that it took three hours to process certain datasets, so I took a look.<p>He had Claude essentially create a 300MB json file and was doing all of the data processing on that data directly.<p>It never occurred to him, or Claude, that there were other ways to operate on that data. It took me less than 10 minutes to get that processing time down to under a minute.<p>These are the type of issues that worry me about vibe coding.</p>
]]></description><pubDate>Fri, 22 May 2026 11:05:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=48234241</link><dc:creator>LatencyKills</dc:creator><comments>https://news.ycombinator.com/item?id=48234241</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48234241</guid></item><item><title><![CDATA[New comment by LatencyKills in "Waymo pauses Atlanta service as its robotaxis keep driving into floods"]]></title><description><![CDATA[
<p>Deep water can still damage an EV by getting into connectors, sensors, wheel bearings, brakes, and cabin electronics.<p>They can also float just like a regular car.</p>
]]></description><pubDate>Thu, 21 May 2026 18:08:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=48226762</link><dc:creator>LatencyKills</dc:creator><comments>https://news.ycombinator.com/item?id=48226762</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48226762</guid></item><item><title><![CDATA[New comment by LatencyKills in "AI is just unauthorised plagiarism at a bigger scale"]]></title><description><![CDATA[
<p>> Which statement of mine do you think is not settled law?<p>I object to your statement that "copyright doesn’t get involved at all" when that is objectively untrue. If that <i>was</i> true, many of the world's largest companies wouldn't be spending tens of millions of dollars to have that question answered in court. Go to any law-focused forum, and you will find attorneys arguing over these questions.<p>To train a model using a book, you must first obtain a copy of that book. Did OpenAI purchase a copy of every book not already in the public domain used during training? They did not.<p>Some of the suits I mentioned claim that OpenAI literally stole copies of books to train its models.<p>My point is that the copyright question has <i>not</i> been answered. If the NYT, et. al. win, it will be a watershed moment for how AI companies pay for training data moving forward.</p>
]]></description><pubDate>Thu, 21 May 2026 15:34:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=48224519</link><dc:creator>LatencyKills</dc:creator><comments>https://news.ycombinator.com/item?id=48224519</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48224519</guid></item><item><title><![CDATA[New comment by LatencyKills in "AI is just unauthorised plagiarism at a bigger scale"]]></title><description><![CDATA[
<p>To do that training, you must <i>first</i> obtain the item with the content you require. Did OpenAI purchase a copy of every book they trained their models on?<p>Answer: They did not. That is literally why there are dozens of ongoing lawsuits in progress.</p>
]]></description><pubDate>Thu, 21 May 2026 15:28:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=48224407</link><dc:creator>LatencyKills</dc:creator><comments>https://news.ycombinator.com/item?id=48224407</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48224407</guid></item><item><title><![CDATA[New comment by LatencyKills in "AI is just unauthorised plagiarism at a bigger scale"]]></title><description><![CDATA[
<p>I appreciate your comment, but you answered as if this question had been answered legally. It has not.<p>The New York Times is suing both OpenAI and Microsoft for copyright infringement. The Authors Guild is suing OpenAI. Getty Images is suing Stability AI. Disney is suing Midjourney. Universal Music Group and Sony have filed suits against multiple AI companies.<p>> so copyright doesn’t get involved at all.<p>The dozens of ongoing cases that discredit that statement.</p>
]]></description><pubDate>Thu, 21 May 2026 15:17:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=48224238</link><dc:creator>LatencyKills</dc:creator><comments>https://news.ycombinator.com/item?id=48224238</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48224238</guid></item><item><title><![CDATA[New comment by LatencyKills in "AI is just unauthorised plagiarism at a bigger scale"]]></title><description><![CDATA[
<p>> I'm trained on protected works.<p>That someone, at some point, paid for.<p>I'd like to understand why I can't use a song in one of my videos without permission/payment, but an AI company can train models using that song without having either.<p>I'm not anti-AI. I'd just like to see companies play by the rules everyone else has to follow.</p>
]]></description><pubDate>Thu, 21 May 2026 14:41:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=48223498</link><dc:creator>LatencyKills</dc:creator><comments>https://news.ycombinator.com/item?id=48223498</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48223498</guid></item><item><title><![CDATA[New comment by LatencyKills in "AI is just unauthorised plagiarism at a bigger scale"]]></title><description><![CDATA[
<p>Having an original thought is in no way related to breaking copyright laws.<p>I don't think we should "get over" the fact that modern SOTA models couldn't exist without being trained on protected works.</p>
]]></description><pubDate>Thu, 21 May 2026 14:33:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=48223328</link><dc:creator>LatencyKills</dc:creator><comments>https://news.ycombinator.com/item?id=48223328</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48223328</guid></item><item><title><![CDATA[New comment by LatencyKills in "I’ve joined Anthropic"]]></title><description><![CDATA[
<p>I understand where you are coming from, but at least when I was there, we were still trying to develop solutions that had never been implemented at that scale before (just like Anthropic today). I helped create the first version of Visual Studio (Boston). People tend to forget that even by the 90s we still didn't really understand how to solve a lot of the main technical problems. That's what I loved about the work. Everything seems easy/obvious after the fact.<p>When I left MS, a full Windows build was about 18M LOC. The fact that 18 million lines of code, written by tens of thousands of engineers, worked at all was a mini miracle.<p>With regard to compensation: like Karpathy, I had already earned enough to be comfortable for the rest of my life. Once money stopped being the primary driver, I was able to focus on what made me happy. Building things, even if you don't like them, brought me happiness and fulfillment. I hope Andrej finds the same at Anthropic.</p>
]]></description><pubDate>Tue, 19 May 2026 18:13:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=48197047</link><dc:creator>LatencyKills</dc:creator><comments>https://news.ycombinator.com/item?id=48197047</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48197047</guid></item><item><title><![CDATA[New comment by LatencyKills in "I’ve joined Anthropic"]]></title><description><![CDATA[
<p>I worked for MS and Apple for 20 years and heard that opinion constantly; i.e., "People only work there for the money."<p>I have no idea if Andrej "sold out" but perhaps he realizes that if he wants to work on the cutting edge alongside talented people, with a seemingly endless budget, Anthropic is a good choice.<p>I chose my employers for the same reason; the compensation was secondary.</p>
]]></description><pubDate>Tue, 19 May 2026 15:55:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=48195038</link><dc:creator>LatencyKills</dc:creator><comments>https://news.ycombinator.com/item?id=48195038</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48195038</guid></item></channel></rss>