<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: intothemild</title><link>https://news.ycombinator.com/user?id=intothemild</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Fri, 15 May 2026 15:28:18 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=intothemild" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by intothemild in "What's in a GGUF, besides the weights – and what's still missing?"]]></title><description><![CDATA[
<p>Well considering right now MTP support is being developed, there was a conversation in that that seemed to throw around the idea of separating the MTP model out of the main GGUF, like with Mmproj. This was rejected.<p>Which I'm happy for. So given that decision, I don't think it's unreasonable to think that they might be open to including Mmproj files in the GGUF.<p>Only issue I can think of is, which one? BF16, F16? Etc</p>
]]></description><pubDate>Thu, 14 May 2026 23:08:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=48142435</link><dc:creator>intothemild</dc:creator><comments>https://news.ycombinator.com/item?id=48142435</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48142435</guid></item><item><title><![CDATA[New comment by intothemild in "Local AI needs to be the norm"]]></title><description><![CDATA[
<p>There's a percentage of people who love to question how the open models were trained.. they are almost always going to try and make some argument about using the closed frontier models for distillation as some form of theft.<p>Just totally forgetting that the frontier models themselves stole an insane amount to get to where they are.<p>It's theft all the way across the board, and when someone tries to make the argument that open models theft is bad, but Altman or Amodei's theft is good.. they are revealing a lot about themselves</p>
]]></description><pubDate>Mon, 11 May 2026 06:06:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=48091525</link><dc:creator>intothemild</dc:creator><comments>https://news.ycombinator.com/item?id=48091525</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48091525</guid></item><item><title><![CDATA[New comment by intothemild in "Local AI needs to be the norm"]]></title><description><![CDATA[
<p>That's already happening. Qwen3.6 and Gemma4.<p>Basically small and medium models that are crazy well trained for their sizes.<p>Then we have a lot of specular decoding stuff like MTP and others coming to speed up responses, and finally better quantisation to use less memory.<p>Local LLM is the future, and the larger labs know that the open models will eat their lunch once people realise that the gap is only a few months. If we were good with LLMs a couple months ago, we're good with the open models now.</p>
]]></description><pubDate>Sun, 10 May 2026 20:24:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=48087559</link><dc:creator>intothemild</dc:creator><comments>https://news.ycombinator.com/item?id=48087559</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48087559</guid></item><item><title><![CDATA[New comment by intothemild in "Accelerating Gemma 4: faster inference with multi-token prediction drafters"]]></title><description><![CDATA[
<p>Don't forget to update the gguf you have too. The templates in them were updated recently too</p>
]]></description><pubDate>Wed, 06 May 2026 13:32:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=48036049</link><dc:creator>intothemild</dc:creator><comments>https://news.ycombinator.com/item?id=48036049</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48036049</guid></item><item><title><![CDATA[New comment by intothemild in "A report on burnout in open source software communities (2025) [pdf]"]]></title><description><![CDATA[
<p>I like it, only one problem.. the fix it now types also are the same ones that didn't read anything.</p>
]]></description><pubDate>Sat, 02 May 2026 12:34:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=47985867</link><dc:creator>intothemild</dc:creator><comments>https://news.ycombinator.com/item?id=47985867</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47985867</guid></item><item><title><![CDATA[New comment by intothemild in "Spirit Airlines Is Winding Down All Operations"]]></title><description><![CDATA[
<p>If only they flapped. Maybe they'd still be in the air.</p>
]]></description><pubDate>Sat, 02 May 2026 07:36:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=47984248</link><dc:creator>intothemild</dc:creator><comments>https://news.ycombinator.com/item?id=47984248</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47984248</guid></item><item><title><![CDATA[New comment by intothemild in "A report on burnout in open source software communities (2025) [pdf]"]]></title><description><![CDATA[
<p>> We're talking about code that users can modify themselves to solve their own problems. That's it. I don't need to hear about the struggle.<p>That's exactly the kind of attitude that this discusses.<p>You create something that solves your problems, you put it up on GitHub, free, and open... Suddenly it turns out others have the same problems you did, your software solves them.<p>It starts ok. People are nice. But as it gains traction, a certain kind of toxic person becomes more and more common. The "YOU FIX IT NOW! I DONT KNOW" Kind of person.<p>You wake in the morning, look at your email, and it's a stream of being screamed at. That takes a toll.<p>All because you had an idea one time to build something that solved your problem you thought "hey I might just open source this".<p>> That's it. I don't need to hear about the struggle.</p>
]]></description><pubDate>Sat, 02 May 2026 07:28:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=47984198</link><dc:creator>intothemild</dc:creator><comments>https://news.ycombinator.com/item?id=47984198</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47984198</guid></item><item><title><![CDATA[New comment by intothemild in "Anthropic's Champion Kit for engineers pushing Claude Code at their company"]]></title><description><![CDATA[
<p>How many pieces of flair is the minimum?</p>
]]></description><pubDate>Wed, 29 Apr 2026 11:07:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=47946682</link><dc:creator>intothemild</dc:creator><comments>https://news.ycombinator.com/item?id=47946682</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47946682</guid></item><item><title><![CDATA[New comment by intothemild in "Making RAM at Home [video]"]]></title><description><![CDATA[
<p>I only have raw RAM, pastured RAM is wrong.<p>I get my DRAM needs at the RAM ranch.</p>
]]></description><pubDate>Wed, 22 Apr 2026 05:58:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=47859630</link><dc:creator>intothemild</dc:creator><comments>https://news.ycombinator.com/item?id=47859630</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47859630</guid></item><item><title><![CDATA[New comment by intothemild in "Anthropic takes $5B from Amazon and pledges $100B in cloud spending in return"]]></title><description><![CDATA[
<p>The thing is, mythos found those with multiple passes, thousands of passes... So using thousands of passes or perhaps the same budgets, yes, cheaper open weight models could potentially (and have) found the same/similar vulnerabilities.<p>Mythos screams of marketing hype, and nothing more. Opus 4.7 isn't really a meaningful upgrade in any sense, other than being more expensive.<p>Once you can see what something like Qwen3.6-35B-A3B can do... with just a FRACTION of the size of the larger models, You'll understand that the future is open weight models you can run yourself.<p>Same goes for companies, bringing inference onsite isn't hard, I'm actively building tooling to orchestrate it.</p>
]]></description><pubDate>Tue, 21 Apr 2026 18:17:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=47852409</link><dc:creator>intothemild</dc:creator><comments>https://news.ycombinator.com/item?id=47852409</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47852409</guid></item><item><title><![CDATA[New comment by intothemild in "Apple ignores DMA interoperability requests and contradicts own documentation"]]></title><description><![CDATA[
<p>I wonder how much this will change now that Tim Apple, is out and John Apple is in.
(probably none)</p>
]]></description><pubDate>Tue, 21 Apr 2026 13:46:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=47848756</link><dc:creator>intothemild</dc:creator><comments>https://news.ycombinator.com/item?id=47848756</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47848756</guid></item><item><title><![CDATA[New comment by intothemild in "Palantir Wants to Reinstate the Draft"]]></title><description><![CDATA[
<p>oh what a shame.. I'm sure he can reach out to Elon to ask him how to inflate it</p>
]]></description><pubDate>Mon, 20 Apr 2026 16:45:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=47836919</link><dc:creator>intothemild</dc:creator><comments>https://news.ycombinator.com/item?id=47836919</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47836919</guid></item><item><title><![CDATA[New comment by intothemild in "Palantir Wants to Reinstate the Draft"]]></title><description><![CDATA[
<p>OK, Lets put Alex Karp in the frontlines.</p>
]]></description><pubDate>Mon, 20 Apr 2026 16:33:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=47836701</link><dc:creator>intothemild</dc:creator><comments>https://news.ycombinator.com/item?id=47836701</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47836701</guid></item><item><title><![CDATA[New comment by intothemild in "Qwen3.6-35B-A3B: Agentic coding power, now open to all"]]></title><description><![CDATA[
<p>We're already there IMHO.. If you have enough ram, sure.. but the ~32gig people can run models that beat sonnet 4.5</p>
]]></description><pubDate>Thu, 16 Apr 2026 19:28:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=47798296</link><dc:creator>intothemild</dc:creator><comments>https://news.ycombinator.com/item?id=47798296</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47798296</guid></item><item><title><![CDATA[New comment by intothemild in "Ask HN: What Are You Working On? (April 2026)"]]></title><description><![CDATA[
<p>Koji, a AI Backend Orchestration/Managment/Proxy layer: <a href="https://github.com/danielcherubini/koji" rel="nofollow">https://github.com/danielcherubini/koji</a><p>I've been running models on my homelab for a bit now, but none of the available options out there was what I wanted. I wanted something that I could command from the CLI, API, or Web, so have an agent go in and do work remotely via SSH or myself via a web interface.<p>I wanted the ability to know if models have been updated, and if backends (llama.cpp, ik_llama.cpp) have been updated, see what those updates are and choose to update. Also wanted the ability to switch betwen versions of those, so if I felt like there was a regression, or performance issue, I could roll back.<p>I've also published plugins for OpenCode and Pi so that model discovery is automatic too.<p>I'm building this mostly for me, as usual.</p>
]]></description><pubDate>Mon, 13 Apr 2026 09:39:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=47749764</link><dc:creator>intothemild</dc:creator><comments>https://news.ycombinator.com/item?id=47749764</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47749764</guid></item><item><title><![CDATA[New comment by intothemild in "DRAM has a design flaw from 1966. I bypassed it [video]"]]></title><description><![CDATA[
<p>Man. You really don't get it do you.</p>
]]></description><pubDate>Fri, 10 Apr 2026 18:38:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=47721987</link><dc:creator>intothemild</dc:creator><comments>https://news.ycombinator.com/item?id=47721987</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47721987</guid></item><item><title><![CDATA[New comment by intothemild in "EFF is leaving X"]]></title><description><![CDATA[
<p>When has that happened?</p>
]]></description><pubDate>Thu, 09 Apr 2026 18:48:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=47707942</link><dc:creator>intothemild</dc:creator><comments>https://news.ycombinator.com/item?id=47707942</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47707942</guid></item><item><title><![CDATA[New comment by intothemild in "Tailslayer: Library for reducing tail latency in RAM reads"]]></title><description><![CDATA[
<p>I really hope you one day re-read these comments and understand just how horrible they are. For absolutely no reason.<p>So yeah, you will be 'tone-policed' because you're clearly a very rude person.</p>
]]></description><pubDate>Thu, 09 Apr 2026 15:23:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=47704946</link><dc:creator>intothemild</dc:creator><comments>https://news.ycombinator.com/item?id=47704946</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47704946</guid></item><item><title><![CDATA[New comment by intothemild in "Claude Managed Agents"]]></title><description><![CDATA[
<p>Yeah this has been my experience too, mixing agents/models from different companies..<p>Having Opus write a spec, then send to Gemini to revise, back to Opus to fix, then to me to read and approve..<p>Send to a local model like Qwen3.5 to build, then off to Opus to review ...<p>This was such an amazing flow, until Anthropic decided to change their minds.</p>
]]></description><pubDate>Wed, 08 Apr 2026 19:56:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=47695461</link><dc:creator>intothemild</dc:creator><comments>https://news.ycombinator.com/item?id=47695461</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47695461</guid></item><item><title><![CDATA[New comment by intothemild in "Sam Altman may control our future – can he be trusted?"]]></title><description><![CDATA[
<p>I think you might be surprised that more and more Software Engineers are souring on Anthropic (the company) and the decisions the company has made recently. Not the whole drama with the US Government, but them locking down the usage of plans to their own tooling.<p>That really rubbed a lot of people the wrong way, as ultimately one might have a favorite tool, then suddenly they are forced to use another tool.</p>
]]></description><pubDate>Tue, 07 Apr 2026 14:11:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=47675695</link><dc:creator>intothemild</dc:creator><comments>https://news.ycombinator.com/item?id=47675695</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47675695</guid></item></channel></rss>