<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: refulgentis</title><link>https://news.ycombinator.com/user?id=refulgentis</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Mon, 15 Jun 2026 02:54:58 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=refulgentis" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by refulgentis in "Can I Buy Your KV Cache?"]]></title><description><![CDATA[
<p>This paper doesn't make any sense - for background, I've maintained an AI client that's cross-platform, cross-provider, and integrates llama.cpp since 2022. I don't know why they think "agents" don't share prefill work - paid providers cache on <i>the prefill text</i>, llama.cpp, same, and I specifically hooked up llama.cpp so it can do subsets as well. i.e. all the agents would reuse the cache<p>It reads like it started from an underspecification of "agents" x a strain of pop-wisdom about "KV cache" that I've seen enter mainstream discourse over the past 3 months that is Not Even Wrong, then, they solved a non-existent problem.<p>EDIT: based on the rest of comments either requesting a primer on terms, or, pointing out it makes errors in even more obvious ways, flagging.</p>
]]></description><pubDate>Fri, 12 Jun 2026 22:27:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=48510150</link><dc:creator>refulgentis</dc:creator><comments>https://news.ycombinator.com/item?id=48510150</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48510150</guid></item><item><title><![CDATA[New comment by refulgentis in "Adaptive PDFs"]]></title><description><![CDATA[
<p>Thanks Claude</p>
]]></description><pubDate>Fri, 12 Jun 2026 19:35:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=48508532</link><dc:creator>refulgentis</dc:creator><comments>https://news.ycombinator.com/item?id=48508532</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48508532</guid></item><item><title><![CDATA[New comment by refulgentis in "Claude Fable 5"]]></title><description><![CDATA[
<p>It's silly and a joke and a surprisingly good benchmark and don't take it seriously but don't take not taking it seriously seriously and if it's too good we use another prompt but don't actually because then it's not the pelican post and there's obvious ways to better it and it's not worth doing because it's not serious.<p>Only coherent move at this point: hit the minus button immediately. There's never anything about the model in the thread other than simon's post.</p>
]]></description><pubDate>Tue, 09 Jun 2026 18:19:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=48465220</link><dc:creator>refulgentis</dc:creator><comments>https://news.ycombinator.com/item?id=48465220</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48465220</guid></item><item><title><![CDATA[New comment by refulgentis in "How much of Thermo Fisher's antibody data has been manipulated?"]]></title><description><![CDATA[
<p>Nice! Just because you’re the top comment, I feel it is helpful to quick readers to point out that it has 0 to do with what happened here. :)</p>
]]></description><pubDate>Mon, 08 Jun 2026 17:00:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=48447921</link><dc:creator>refulgentis</dc:creator><comments>https://news.ycombinator.com/item?id=48447921</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48447921</guid></item><item><title><![CDATA[New comment by refulgentis in "Ask HN: What was your "oh shit" moment with GenAI?"]]></title><description><![CDATA[
<p>Using GPT-3 to translate the color science code I wrote for Google's design system from Dart to ~any language so I could get it deployed cross platform quickly, and it all worked.</p>
]]></description><pubDate>Fri, 05 Jun 2026 19:30:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=48417074</link><dc:creator>refulgentis</dc:creator><comments>https://news.ycombinator.com/item?id=48417074</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48417074</guid></item><item><title><![CDATA[New comment by refulgentis in "Gemma 4 QAT models: Optimizing compression for mobile and laptop efficiency"]]></title><description><![CDATA[
<p>I understand it. :)<p>And you're absolutely right to point out they aren't products - I hoped that was clear - when you're building a product <i>with them</i>, you end up having to do the same build loop 4 times, in this instance :)</p>
]]></description><pubDate>Fri, 05 Jun 2026 18:10:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=48416163</link><dc:creator>refulgentis</dc:creator><comments>https://news.ycombinator.com/item?id=48416163</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48416163</guid></item><item><title><![CDATA[New comment by refulgentis in "Gemma 4 QAT models: Optimizing compression for mobile and laptop efficiency"]]></title><description><![CDATA[
<p>Ah, nice, ty! My excuse is those repos were added to the collection after my comment, but perhaps not :3</p>
]]></description><pubDate>Fri, 05 Jun 2026 17:34:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=48415725</link><dc:creator>refulgentis</dc:creator><comments>https://news.ycombinator.com/item?id=48415725</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48415725</guid></item><item><title><![CDATA[New comment by refulgentis in "Gemma 4 QAT models: Optimizing compression for mobile and laptop efficiency"]]></title><description><![CDATA[
<p>It's super annoying when you have products that utilize these because there's...4? releases in 3 weeks?<p>- Gemma 4 2B/4B/27BE3B/31B<p>- Gemma 4 2B/4B/27BE3B/31B x "assistant" / MTP drafter models (i.e. multitoken prediction)<p>- Gemma 4 12B (2 days ago? 1?)<p>- Gemma 4 QAT 2B/4B/12B/27BE3B/31B x "assistant" models (i.e. multitoken prediction)<p>It probably sounds <i>silly</i> and really whiny in the abstract. It just causes a ton of work / confusion downstream that <i>feels</i> unnecessary.<p>Extremely glad for the output, not glad to have to chase it.<p>ex. llama.cpp currently supports the originals but not the MTP predictors but there is a patch for the MTP predictors but not for the small MoE models and I think it supports the 12B but maybe not media for it yet and now we have these too and the blog says there's GGUFs (llama.cpp models) but there isn't in any of the 12? repos I clicked through. and ~every consumer-facing local LLM app is built on llama.cpp or a fork of it.<p>Also if anyone at Google is taking feedback over to b/ or product, pleaseeee stop the "E"2B "E"4B thing, unless it's actually taking up less RAM on Android during CPU inference. I can't tell if I need to treat the 4B like an 8B (i.e. beyond most consumer hardware without a GPU) or a 4B (i.e. will run on most consumer hardware since 2021)<p>EDIT: And, yes, the QAT 12B x mmproj does not work with llama.cpp. I'm glad there's people who have the luxury of not having to, well, actually use these and treat me as whining :) I'll need to schedule another 4-8 hours of work for the 4th time, no fun!</p>
]]></description><pubDate>Fri, 05 Jun 2026 17:27:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=48415627</link><dc:creator>refulgentis</dc:creator><comments>https://news.ycombinator.com/item?id=48415627</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48415627</guid></item><item><title><![CDATA[New comment by refulgentis in "Gemma 4 QAT models: Optimizing compression for mobile and laptop efficiency"]]></title><description><![CDATA[
<p>@google.com'ers, there are no GGUFs (blog says there is)</p>
]]></description><pubDate>Fri, 05 Jun 2026 17:22:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=48415574</link><dc:creator>refulgentis</dc:creator><comments>https://news.ycombinator.com/item?id=48415574</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48415574</guid></item><item><title><![CDATA[New comment by refulgentis in "Iran Shock Jolts Asia and Europe to Speed Up Energy Transition"]]></title><description><![CDATA[
<p>I’m confused a bit, the premise they produce essentially all solar panels got an “indeed” with article-based assertions that exports doubled in one month, and it’s 14 GW more than the total amount installed in Spain ever, both very impressive. :)<p>Is there anything there about Chinese share?<p>I had the understanding they produce the vast vast majority as well, but that seems belied by exports doubling near instantaneously with demand? That made me wonder if there’s a lower cost floor producer(s) with, say, 10-20% of production that quickly got booked</p>
]]></description><pubDate>Thu, 04 Jun 2026 22:07:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=48405279</link><dc:creator>refulgentis</dc:creator><comments>https://news.ycombinator.com/item?id=48405279</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48405279</guid></item><item><title><![CDATA[New comment by refulgentis in "Gemma 4 12B: A unified, encoder-free multimodal model"]]></title><description><![CDATA[
<p>I find your agreement seductive because it side steps the unfounded assertions and simply asserts there must be something different and we don’t know it, which is easy for me to agree with too. Or maybe hard to <i>disagree</i> with.</p>
]]></description><pubDate>Wed, 03 Jun 2026 21:37:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=48390457</link><dc:creator>refulgentis</dc:creator><comments>https://news.ycombinator.com/item?id=48390457</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48390457</guid></item><item><title><![CDATA[New comment by refulgentis in "Gemma 4 12B: A unified, encoder-free multimodal model"]]></title><description><![CDATA[
<p>This sounds like when crystal people talk quantum physics.</p>
]]></description><pubDate>Wed, 03 Jun 2026 20:48:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=48389818</link><dc:creator>refulgentis</dc:creator><comments>https://news.ycombinator.com/item?id=48389818</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48389818</guid></item><item><title><![CDATA[New comment by refulgentis in "Atherton spent $145K to delay train electrification. The rest of us paid $400M"]]></title><description><![CDATA[
<p>That is a very uncharitable read.</p>
]]></description><pubDate>Tue, 02 Jun 2026 19:04:23 +0000</pubDate><link>https://news.ycombinator.com/item?id=48374709</link><dc:creator>refulgentis</dc:creator><comments>https://news.ycombinator.com/item?id=48374709</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48374709</guid></item><item><title><![CDATA[New comment by refulgentis in "Atherton spent $145K to delay train electrification. The rest of us paid $400M"]]></title><description><![CDATA[
<p>Note to future selfs? (-selves?)<p>Around Opus 4.6 release it got good enough people tried laundering it all the time, and around 4.8 the group dynamics were such that it was worse to call it AI writing than launder it as your own.<p>Regardless, there was still enough of a taboo around it that the way people would try to launder it began to often include an AI-generated "sources" table, which was also aggressively bad, but lord knows <i>no one</i> reads those. So it was just another sigil that misled.</p>
]]></description><pubDate>Sun, 31 May 2026 23:02:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=48350620</link><dc:creator>refulgentis</dc:creator><comments>https://news.ycombinator.com/item?id=48350620</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48350620</guid></item><item><title><![CDATA[New comment by refulgentis in "Atherton spent $145K to delay train electrification. The rest of us paid $400M"]]></title><description><![CDATA[
<p>I agree completely and empathetically and vehemently with the idea behind the message.<p>The slop & aggressively poor argumentation, the kind that I <i>think</i> would have caused me to fail it if I tried it in speech & debate in <i>middle school</i>, leaves me feeling empty.<p>They keep saying $400M, $400M, $400M, $400M, and the only cost they came up with is $20M. It makes me uncomfortable to support the overall cause if this is how it'll be played, because, setting aside morality of tactics, it's not playing to win. Anyone who is at the margins will see it plainly and be given a reason not to listen.</p>
]]></description><pubDate>Sun, 31 May 2026 22:57:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=48350584</link><dc:creator>refulgentis</dc:creator><comments>https://news.ycombinator.com/item?id=48350584</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48350584</guid></item><item><title><![CDATA[New comment by refulgentis in "SQLite is all you need for durable workflows"]]></title><description><![CDATA[
<p>Correct! I'm not "worried" about it, I've been putting SQLites in your and my pocket for the last 17 years.<p>I don't want to be glib and leave it there, even though I'm slightly annoyed you missed several sigils in my post that I was <i>well</i> past that.<p>The point is, for the <i>not</i> in your pocket case, for the <i>not</i> a singular document store case, I'm curious what the use case is.</p>
]]></description><pubDate>Fri, 29 May 2026 22:54:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=48330404</link><dc:creator>refulgentis</dc:creator><comments>https://news.ycombinator.com/item?id=48330404</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48330404</guid></item><item><title><![CDATA[New comment by refulgentis in "SQLite is all you need for durable workflows"]]></title><description><![CDATA[
<p>Excellent question, and I spent so many years asking myself it, this over and over. You asking it made me realize I just...don't anymore. So allow me to blather a bit / free associate because I won't be sure why myself until I've written it out.<p>TL;DR: whatever works for you is the right decision. (which isn't helpful, I heard this so many times and as the recipient, I thought "That's nice. Now how do I choose what works for me?")<p>I finally <i>had</i> to use Postgres a couple years ago after a career of only SQLite - startup founder & iOS app developer using SQLite, turned Googler on Android, turned doing-my-own-thing.<p>In retrospect, I have made only one bad decision:<p>I went <i>way</i> out of my way to make SQLite work at my 2009-iOS-startup. It was a restaurant point of sale system, and to allow a networked system, one of the iOS devices would act as a server. This was a really cool trick, even an advantage in marketing that was appreciated by users. It meant the restaurant could continue to operate if the internet went down. But it eventually became clear owners <i>loved</i> having internet-based access too, ex. to do reporting/financial analysis over the data. And I kept contorting, instead of moving past my fear of getting into things I didn’t know, I instead did some like rudimentary thing over port forwarding. The bad decision here was riding one horse for so long and letting it affect the product, having a real server database would have allowed for a lot more features, think, first party gift cards, and a 100 others.<p>After leaving Google I needed server-side storage and fought and fought to avoid it. Then it turned out Postgres is easy and, just like SQLite, 99.999% of the time I don’t even know I’m using it.<p>In retrospect, there’s ~0 switching cost to these, particularly in age of LLMs. If you do need something more one day, it’ll be easy to do, and if you have to do it in a rush because you’re successful, you’re in Good Problem territory.<p>Hope that helped, after writing it out, dunno how convincing it is. Feel free to follow up, I appreciate the curiosity/framing because I had the same thought for so long.</p>
]]></description><pubDate>Fri, 29 May 2026 22:52:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=48330389</link><dc:creator>refulgentis</dc:creator><comments>https://news.ycombinator.com/item?id=48330389</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48330389</guid></item><item><title><![CDATA[New comment by refulgentis in "SQLite is all you need for durable workflows"]]></title><description><![CDATA[
<p>> Isn't concurrency also limited by your machines disk speed for writes<p>Yes, in theory: given a large enough database, and a disk that can only do one operation at a time, and a large enough operation that touches enough of the database. In practice, in a SQLite single tenant scenario? No, not at all.<p>> what difference does it make if you write sequentially vs concurrently. Why does concurrency even matter for databases?<p>As soon as your codebase involves reacting to events independently of a user taking action it becomes a practical concern. Generally, this is a broad question and has 1,000,000 answers.<p>EDIT: Originally I had "I think you understand generally, no?" appended but realized that's not helpful at all, if you did, you wouldn't be asking.<p>Something that may help is imagining what'd happen if a DB <i>wasn't</i> thread safe / didn't allow multiple writers. Ex. in SQLite's case, it allows multiple write operations to take place but there's a one-at-a-time queue. If we didn't have databases that <i>were</i> able to execute multiple writes simultaneously, you'd need a separate database for each concurrent writer you expect, and you'd effectively have a global lock. Orderly scaling would be ~impossible unless you did something crazy like have a single server per user</p>
]]></description><pubDate>Fri, 29 May 2026 19:10:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=48327877</link><dc:creator>refulgentis</dc:creator><comments>https://news.ycombinator.com/item?id=48327877</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48327877</guid></item><item><title><![CDATA[New comment by refulgentis in "SQLite is all you need for durable workflows"]]></title><description><![CDATA[
<p>Strawman, no? "run an Obelisk server with a SQLite database", now we're distributed.<p>SQLite is a nice local store. It's this server stuff  that I don’t grok, well, yet. :)</p>
]]></description><pubDate>Fri, 29 May 2026 19:06:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=48327837</link><dc:creator>refulgentis</dc:creator><comments>https://news.ycombinator.com/item?id=48327837</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48327837</guid></item><item><title><![CDATA[New comment by refulgentis in "SQLite is all you need for durable workflows"]]></title><description><![CDATA[
<p>I absolutely 100% do not understand it either. At all. Every time I try to over the last year or two I come away with the conclusion its something that <i>sounds</i> cool (to me too!) but is guaranteed to cause more problems than more obvious solutions.<p>That being said I'd <i>kill</i> for someone who used it and benefited to explain it to me in a practical sense. (specifically where <i>syncing</i> is involved, and syncing a subset of the SQLite is necessary. If it's "just" a document store thats treated like a blob for syncing/backup, that's familiar. If it's all in one storage but only local, that's familiar.)<p>Re: TFA, I guess it would have helped if I knew what Obelisk was, which is on me, and a more in-depth explanation of how this ties into AI/agents, which is on the industry/writer.</p>
]]></description><pubDate>Fri, 29 May 2026 19:03:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=48327790</link><dc:creator>refulgentis</dc:creator><comments>https://news.ycombinator.com/item?id=48327790</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48327790</guid></item></channel></rss>