<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: androiddrew</title><link>https://news.ycombinator.com/user?id=androiddrew</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Tue, 14 Apr 2026 16:46:49 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=androiddrew" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by androiddrew in "Taking on CUDA with ROCm: 'One Step After Another'"]]></title><description><![CDATA[
<p>I’ll try and get in touch with them. Thank you.</p>
]]></description><pubDate>Tue, 14 Apr 2026 10:41:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=47763833</link><dc:creator>androiddrew</dc:creator><comments>https://news.ycombinator.com/item?id=47763833</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47763833</guid></item><item><title><![CDATA[New comment by androiddrew in "GAIA – Open-source framework for building AI agents that run on local hardware"]]></title><description><![CDATA[
<p>You got it right I think. I’m sitting with two “AI Ready Radeon AI Pro 9700 workstation cards, which are RDNA4 not CDNA. My experience is that my cards are not a priority. Individual engineers at AMD may care, the company  doesn’t. I have been trying since February to get ahold of anyone responsible for shipping tuned Tensile gfx1201 kernels in rocm-libs, which is used by Ollama.its been three weeks since I raised enough hell on the discord to get a response, but they still can’t find “who” is responsible for Tensile tuning, and “if” they are even going to do it for the gfx12* cards.<p>Don’t get me started with vLLM and AITER.</p>
]]></description><pubDate>Tue, 14 Apr 2026 00:49:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=47759866</link><dc:creator>androiddrew</dc:creator><comments>https://news.ycombinator.com/item?id=47759866</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47759866</guid></item><item><title><![CDATA[New comment by androiddrew in "GAIA – Open-source framework for building AI agents that run on local hardware"]]></title><description><![CDATA[
<p>Yup, meanwhile Jensen is on the Lexfriedman podcast stating the reason why CUDA is successful is because all thier devices run it. The on ramp is at the individual user.<p>I have and RDNA4 card and they certainly are prioritizing CDNA over a CDNA + RDNA strategy or a unification strategy.</p>
]]></description><pubDate>Tue, 14 Apr 2026 00:00:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=47759539</link><dc:creator>androiddrew</dc:creator><comments>https://news.ycombinator.com/item?id=47759539</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47759539</guid></item><item><title><![CDATA[New comment by androiddrew in "Taking on CUDA with ROCm: 'One Step After Another'"]]></title><description><![CDATA[
<p>I have been trying since February to get someone at AMD to shipped tuned Tensile kernels in the rcom-libs for the gfx1201. They are used by Ollama but no one on the Developer Discord knows who is responsible for that. It has been pretty frustrating and it shows that AMD has an organizational problem to overcome in addition to all the things technically that they want rocm to do.</p>
]]></description><pubDate>Mon, 13 Apr 2026 12:10:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=47750862</link><dc:creator>androiddrew</dc:creator><comments>https://news.ycombinator.com/item?id=47750862</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47750862</guid></item><item><title><![CDATA[New comment by androiddrew in "Will I ever own a zettaflop?"]]></title><description><![CDATA[
<p>Not with the price of silicon being what it is</p>
]]></description><pubDate>Fri, 10 Apr 2026 00:06:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=47711931</link><dc:creator>androiddrew</dc:creator><comments>https://news.ycombinator.com/item?id=47711931</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47711931</guid></item><item><title><![CDATA[New comment by androiddrew in "Things I Think I Think... Preferring Local OSS LLMs"]]></title><description><![CDATA[
<p>I love local first. I am finding that a 120B MoE is hitting the sweet spot for local hosted. Right now that takes a 2K strix halo, a 4k GB10 machine, or a 5k Mac Pro. 2 years from now I think hardware will take us back to the 2k ish range with good performance.<p>I love my dual GPU setup (2AMD Radeon r9700 64GB vram) but it costs 5x electricity than my GX10 (GB10 chip inside) and since layers are landing in system memory my TPS is half the GX10.<p>Now a dense model like Devstral2 24B slaps on the Dual GPU setup. I just haven’t gotten as much out of that as I have the 120 MoEs</p>
]]></description><pubDate>Thu, 02 Apr 2026 11:54:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=47613193</link><dc:creator>androiddrew</dc:creator><comments>https://news.ycombinator.com/item?id=47613193</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47613193</guid></item><item><title><![CDATA[New comment by androiddrew in "Ollama is now powered by MLX on Apple Silicon in preview"]]></title><description><![CDATA[
<p>Get turboquant 4 bit implemented and this would be game changer.</p>
]]></description><pubDate>Tue, 31 Mar 2026 11:14:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=47585661</link><dc:creator>androiddrew</dc:creator><comments>https://news.ycombinator.com/item?id=47585661</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47585661</guid></item><item><title><![CDATA[New comment by androiddrew in "The Little Book of C"]]></title><description><![CDATA[
<p>Wish they had this for zig</p>
]]></description><pubDate>Thu, 26 Mar 2026 22:33:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=47536680</link><dc:creator>androiddrew</dc:creator><comments>https://news.ycombinator.com/item?id=47536680</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47536680</guid></item><item><title><![CDATA[New comment by androiddrew in "Jury finds Meta liable in case over child sexual exploitation on its platforms"]]></title><description><![CDATA[
<p>Alternative headline: household spyware cash machine forced to pay $20 for being bad.<p>If you want to punish Meta then you have to punish the wonder boy who runs it. Not even share holders can fight off the guy spending 80B on the metaverse.</p>
]]></description><pubDate>Wed, 25 Mar 2026 10:55:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=47515684</link><dc:creator>androiddrew</dc:creator><comments>https://news.ycombinator.com/item?id=47515684</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47515684</guid></item><item><title><![CDATA[New comment by androiddrew in "Microsoft weighs legal action over $50B Amazon-OpenAI cloud deal"]]></title><description><![CDATA[
<p>The poster probably is hoping someone will post the archived version in the comments</p>
]]></description><pubDate>Wed, 25 Mar 2026 00:10:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=47511430</link><dc:creator>androiddrew</dc:creator><comments>https://news.ycombinator.com/item?id=47511430</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47511430</guid></item><item><title><![CDATA[New comment by androiddrew in "So where are all the AI apps?"]]></title><description><![CDATA[
<p>I heard a delightful term for building apps only for yourself “houseplant programming”.</p>
]]></description><pubDate>Wed, 25 Mar 2026 00:01:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=47511355</link><dc:creator>androiddrew</dc:creator><comments>https://news.ycombinator.com/item?id=47511355</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47511355</guid></item><item><title><![CDATA[New comment by androiddrew in "Tinybox – A powerful computer for deep learning"]]></title><description><![CDATA[
<p>Could you share what you are using for inference and how you are running it? I have a 64G VRAM/128G system RAM setup.</p>
]]></description><pubDate>Sun, 22 Mar 2026 13:26:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=47477305</link><dc:creator>androiddrew</dc:creator><comments>https://news.ycombinator.com/item?id=47477305</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47477305</guid></item><item><title><![CDATA[New comment by androiddrew in "Ozempic Is About to Go Generic for Billions of People"]]></title><description><![CDATA[
<p>Good, because it’s fucking ridiculous that pharma gets special patent loop holes to maintain a monopoly beyond what the basic protection grants.</p>
]]></description><pubDate>Sat, 21 Mar 2026 13:58:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=47467092</link><dc:creator>androiddrew</dc:creator><comments>https://news.ycombinator.com/item?id=47467092</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47467092</guid></item><item><title><![CDATA[New comment by androiddrew in "Ubuntu 26.04 Ends 46 Years of Silent sudo Passwords"]]></title><description><![CDATA[
<p>I don’t know why this keeps coming up. Has this been a big deal for everyone else? Like ok usability improvement, but the number of times I have read an article about this is silly.</p>
]]></description><pubDate>Sat, 21 Mar 2026 13:56:27 +0000</pubDate><link>https://news.ycombinator.com/item?id=47467073</link><dc:creator>androiddrew</dc:creator><comments>https://news.ycombinator.com/item?id=47467073</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47467073</guid></item><item><title><![CDATA[New comment by androiddrew in "Thoughts on OpenAI acquiring Astral and uv/ruff/ty"]]></title><description><![CDATA[
<p>Yes, when the poetry people purposely added a feature to fail CI with a 1/10 chance because they wanted to depreciate a feature, I depreciated poetry.</p>
]]></description><pubDate>Fri, 20 Mar 2026 14:43:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=47455303</link><dc:creator>androiddrew</dc:creator><comments>https://news.ycombinator.com/item?id=47455303</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47455303</guid></item><item><title><![CDATA[New comment by androiddrew in "Leanstral: Open-source agent for trustworthy coding and formal proof engineering"]]></title><description><![CDATA[
<p>MOE but 120B range. Man I wish it was an 80B. I have 2 GPUs with 62Gib of usable VRAM. A 4bit 80B gives me some context window, but 120B puts me into system RAM</p>
]]></description><pubDate>Mon, 16 Mar 2026 23:47:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=47406652</link><dc:creator>androiddrew</dc:creator><comments>https://news.ycombinator.com/item?id=47406652</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47406652</guid></item><item><title><![CDATA[New comment by androiddrew in "How "Hardwired" AI Will Destroy Nvidia's Empire and Change the World"]]></title><description><![CDATA[
<p>Yeah, well might just come on your new laptop</p>
]]></description><pubDate>Sat, 14 Mar 2026 22:24:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=47381928</link><dc:creator>androiddrew</dc:creator><comments>https://news.ycombinator.com/item?id=47381928</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47381928</guid></item><item><title><![CDATA[New comment by androiddrew in "How "Hardwired" AI Will Destroy Nvidia's Empire and Change the World"]]></title><description><![CDATA[
<p>Give me a 120B dense model on one of these and yeah my API use will probably drop.</p>
]]></description><pubDate>Sat, 14 Mar 2026 22:21:27 +0000</pubDate><link>https://news.ycombinator.com/item?id=47381893</link><dc:creator>androiddrew</dc:creator><comments>https://news.ycombinator.com/item?id=47381893</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47381893</guid></item><item><title><![CDATA[New comment by androiddrew in "New KV cache compaction technique cuts LLM memory 50x without accuracy loss"]]></title><description><![CDATA[
<p>I hope this is real.</p>
]]></description><pubDate>Sun, 08 Mar 2026 01:03:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=47293227</link><dc:creator>androiddrew</dc:creator><comments>https://news.ycombinator.com/item?id=47293227</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47293227</guid></item><item><title><![CDATA[New comment by androiddrew in "Why developers using AI are working longer hours"]]></title><description><![CDATA[
<p>I have never been in a flow state with an agent running. I use agents, but that isn’t flow.</p>
]]></description><pubDate>Sun, 08 Mar 2026 00:37:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=47293080</link><dc:creator>androiddrew</dc:creator><comments>https://news.ycombinator.com/item?id=47293080</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47293080</guid></item></channel></rss>