<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: msp26</title><link>https://news.ycombinator.com/user?id=msp26</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Mon, 15 Jun 2026 08:04:27 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=msp26" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by msp26 in "Claude Fable 5"]]></title><description><![CDATA[
<p>It triggered for me when I asked "Web search for your own model card (released today) and pick out your favourite highlights from the pdf"</p>
]]></description><pubDate>Tue, 09 Jun 2026 18:40:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=48465592</link><dc:creator>msp26</dc:creator><comments>https://news.ycombinator.com/item?id=48465592</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48465592</guid></item><item><title><![CDATA[New comment by msp26 in "Claude Fable 5"]]></title><description><![CDATA[
<p>> Pricing for both models is $10 per million input tokens and $50 per million output tokens.</p>
]]></description><pubDate>Tue, 09 Jun 2026 17:08:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=48463978</link><dc:creator>msp26</dc:creator><comments>https://news.ycombinator.com/item?id=48463978</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48463978</guid></item><item><title><![CDATA[New comment by msp26 in "I’ve joined Anthropic"]]></title><description><![CDATA[
<p>hell will freeze over before anthropic release anything meaningful to the public</p>
]]></description><pubDate>Tue, 19 May 2026 16:44:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=48195767</link><dc:creator>msp26</dc:creator><comments>https://news.ycombinator.com/item?id=48195767</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48195767</guid></item><item><title><![CDATA[New comment by msp26 in "Accelerating Gemma 4: faster inference with multi-token prediction drafters"]]></title><description><![CDATA[
<p>Interesting, I might try that, thanks!</p>
]]></description><pubDate>Wed, 06 May 2026 08:13:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=48033628</link><dc:creator>msp26</dc:creator><comments>https://news.ycombinator.com/item?id=48033628</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48033628</guid></item><item><title><![CDATA[New comment by msp26 in "Accelerating Gemma 4: faster inference with multi-token prediction drafters"]]></title><description><![CDATA[
<p>Google is singlehandedly carrying western open source models. Gemma 4 31B is fantastic.<p>However, it is a little painful to try to fit the best possible version into 24GB vram with vision + this drafter soon. My build doesn't support any more GPUs and I believe I would want another 4090 (overpriced) for best performance or otherwise just replace it altogether.</p>
]]></description><pubDate>Tue, 05 May 2026 18:06:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=48026231</link><dc:creator>msp26</dc:creator><comments>https://news.ycombinator.com/item?id=48026231</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48026231</guid></item><item><title><![CDATA[New comment by msp26 in "Show HN: Mljar Studio – local AI data analyst that saves analysis as notebooks"]]></title><description><![CDATA[
<p>I like starting most of my projects on marimo notebooks now and slowly moving parts of it to the main codebase + db.<p>By the end of it I might remove the notebook entirely but usually I keep it for some visualisation + running stuff as a cli tool.</p>
]]></description><pubDate>Sat, 02 May 2026 16:12:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=47987662</link><dc:creator>msp26</dc:creator><comments>https://news.ycombinator.com/item?id=47987662</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47987662</guid></item><item><title><![CDATA[New comment by msp26 in "Claude.ai unavailable and elevated errors on the API"]]></title><description><![CDATA[
<p>session usage limits this week feel like ass. Even when being careful to not break prefix caching.</p>
]]></description><pubDate>Tue, 28 Apr 2026 18:20:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=47938340</link><dc:creator>msp26</dc:creator><comments>https://news.ycombinator.com/item?id=47938340</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47938340</guid></item><item><title><![CDATA[New comment by msp26 in "Claude Token Counter, now with model comparisons"]]></title><description><![CDATA[
<p>Not necessarily with speculative decoding. Whitespace would be trivial to predict and they would pretty much keep using the same amount of compute as before.<p>I don't think that's their primary motive for doing this but it is a side effect.</p>
]]></description><pubDate>Mon, 20 Apr 2026 10:11:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=47832277</link><dc:creator>msp26</dc:creator><comments>https://news.ycombinator.com/item?id=47832277</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47832277</guid></item><item><title><![CDATA[New comment by msp26 in "Claude Opus 4.7"]]></title><description><![CDATA[
<p>They don't have the compute to make Mythos generally available: that's all there is to it. The exclusivity is also nice from a marketing pov.</p>
]]></description><pubDate>Thu, 16 Apr 2026 14:59:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=47794101</link><dc:creator>msp26</dc:creator><comments>https://news.ycombinator.com/item?id=47794101</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47794101</guid></item><item><title><![CDATA[New comment by msp26 in "Claude Opus 4.7"]]></title><description><![CDATA[
<p>> First, Opus 4.7 uses an updated tokenizer that improves how the model processes text<p>wow can I see it and run it locally please? Making API calls to check token counts is retarded.</p>
]]></description><pubDate>Thu, 16 Apr 2026 14:56:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=47794054</link><dc:creator>msp26</dc:creator><comments>https://news.ycombinator.com/item?id=47794054</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47794054</guid></item><item><title><![CDATA[New comment by msp26 in "If DSPy is so great, why isn't anyone using it?"]]></title><description><![CDATA[
<p>> Data extraction tasks are amongst the easiest to evaluate because there’s a known “right” answer.<p>Wrong. There can be a lot of subjectivity, and pretending that some golden answer exists does more harm than good and narrows down the scope of what you can build.<p>My other main problem with data extraction tasks, and why I'm not satisfied with any of the existing eval tools, is that the schemas I write can change drastically as my understanding of the problem increases. And nothing really seems to handle that well; I mostly just resort to reading diffs of what happens when I change something and reading the input/output data very closely. Marimo is fantastic for anything visual like this btw.<p>Also there is a difference between: the problem in reality → the business model → your db/application schema → the schema you send to the LLM. And to actually improve your schema/prompt you have to be mindful of the entire problem stack and how you might separate out things that are handled through post processing rather than by the LLM directly.<p>> Abstract model calls. Make swapping GPT-4 for Claude a one-line change.<p>And in practice random limitations like structured output API schema limits between providers can make this non-trivial. God I hate the Gemini API.</p>
]]></description><pubDate>Mon, 23 Mar 2026 16:16:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=47491546</link><dc:creator>msp26</dc:creator><comments>https://news.ycombinator.com/item?id=47491546</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47491546</guid></item><item><title><![CDATA[New comment by msp26 in "GPT‑5.4 Mini and Nano"]]></title><description><![CDATA[
<p>Man the lowest end pricing has been thoroughly hiked. It was convenient while it lasted.</p>
]]></description><pubDate>Tue, 17 Mar 2026 22:32:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=47419255</link><dc:creator>msp26</dc:creator><comments>https://news.ycombinator.com/item?id=47419255</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47419255</guid></item><item><title><![CDATA[New comment by msp26 in "Show HN: I built a tool that watches webpages and exposes changes as RSS"]]></title><description><![CDATA[
<p>I got claude to reverse engineer the extension and compare it to changedetection, and here's what it came up with. Apologies for clanker slop but I think it's in poor taste to not attribute the open source tool that the service is built on (one that's also funded by their SaaS plan)<p>---<p>Summary: What Is Objectively Provable<p>- The extension stores its config under the key changedetection_config<p>- 16 API endpoints in the extension are 1:1 matches with changedetection.io's documented API<p>- 16 data model field names are exact matches with changedetection.io's Watch model (including obscure ones like time_between_check_use_default, history_n, notification_muted, fetch_backend)<p>- The authentication mechanism (x-api-key header) is identical<p>- The default port (5000) matches changedetection.io's default<p>- Custom endpoints (/auth/, /feature-flags, /email/, /generate_key, /pregate) do NOT exist in changedetection.io - these are proprietary additions<p>- The watch limit error format is completely different from changedetection.io's, adding billing-specific fields (current_plan, upgrade_required)<p>- The extension ships with error tracking that sends telemetry (including user emails on login) to the developer's GlitchTip server at 100% sample rate<p>The extension is provably a client for a modified/extended changedetection.io backend. The open question is only the degree of modification - whether it's a fork, a proxy wrapper, or a plugin system. But the underlying engine is unambiguously changedetection.io.</p>
]]></description><pubDate>Thu, 12 Mar 2026 11:08:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=47349069</link><dc:creator>msp26</dc:creator><comments>https://news.ycombinator.com/item?id=47349069</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47349069</guid></item><item><title><![CDATA[New comment by msp26 in "Show HN: I built a tool that watches webpages and exposes changes as RSS"]]></title><description><![CDATA[
<p>see:<p><a href="https://news.ycombinator.com/item?id=47349069">https://news.ycombinator.com/item?id=47349069</a></p>
]]></description><pubDate>Thu, 12 Mar 2026 11:06:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=47349051</link><dc:creator>msp26</dc:creator><comments>https://news.ycombinator.com/item?id=47349051</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47349051</guid></item><item><title><![CDATA[New comment by msp26 in "Show HN: Argus – VSCode debugger for Claude Code sessions"]]></title><description><![CDATA[
<p>Apologies but I will use this thread as an opportunity to report CC VSCode extension bugs because I don't think there's an official channel that actually gets read by humans.<p>> yeah they're shipping too fast and everything is buggy as shit<p>- fork conversation button doesn't even work anymore in vscode extension<p>- sometimes when I reconnect to my remote SSH in VSCode, previously loaded chats become inaccessible. The chats are still there in the .jsonl files but for some reason the CC extension becomes incapable of reading them.<p>-- this issue happens so frequently that I ended up making a skill to allow CC to dig up info from the bugged sessions</p>
]]></description><pubDate>Sat, 07 Mar 2026 17:30:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=47289599</link><dc:creator>msp26</dc:creator><comments>https://news.ycombinator.com/item?id=47289599</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47289599</guid></item><item><title><![CDATA[New comment by msp26 in "Gemini 3.1 Flash-Lite: Built for intelligence at scale"]]></title><description><![CDATA[
<p>many tasks don't need any reasoning</p>
]]></description><pubDate>Tue, 03 Mar 2026 20:08:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=47238189</link><dc:creator>msp26</dc:creator><comments>https://news.ycombinator.com/item?id=47238189</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47238189</guid></item><item><title><![CDATA[New comment by msp26 in "Gemini 3.1 Flash-Lite: Built for intelligence at scale"]]></title><description><![CDATA[
<p>What the fuck is this price hike? It was such a nice low end, fast model. Who needs 10 years of reasoning on this model size??<p>I'm gonna switch some workflows to qwen3.5.<p>There's a lot of tasks that benefit from just having a mildly capable LLM and 2.5 Flash Lite worked out of the box for cheap.<p>Can we get flash lite lite please?<p>Edit:
Logan said:
"I think open source models like Gemma might be the answer here"<p>Implying that they're not interested in serving lower end Gemini models?</p>
]]></description><pubDate>Tue, 03 Mar 2026 19:48:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=47237891</link><dc:creator>msp26</dc:creator><comments>https://news.ycombinator.com/item?id=47237891</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47237891</guid></item><item><title><![CDATA[New comment by msp26 in "Anthropic Cowork feature creates 10GB VM bundle on macOS without warning"]]></title><description><![CDATA[
<p>> every single product/feature I've used other than the Claude Code CLI has been terrible<p>yeah they're shipping too fast and everything is buggy as shit<p>- fork conversation button doesn't even work anymore in vscode extension<p>- sometimes when I reconnect to my remote SSH in VSCode, previously loaded chats become inaccessible. The chats are still there in the .jsonl files but for some reason the CC extension becomes incapable of reading them.</p>
]]></description><pubDate>Mon, 02 Mar 2026 16:50:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=47220477</link><dc:creator>msp26</dc:creator><comments>https://news.ycombinator.com/item?id=47220477</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47220477</guid></item><item><title><![CDATA[New comment by msp26 in "I am directing the Department of War to designate Anthropic a supply-chain risk"]]></title><description><![CDATA[
<p>Batshit situation, respectable position from Dario throughout.<p>But there's some irony in this happening to Anthropic after all the constant hawkish fearmongering about the evil Chinese (and open source AI sentiment too).</p>
]]></description><pubDate>Fri, 27 Feb 2026 23:02:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=47187126</link><dc:creator>msp26</dc:creator><comments>https://news.ycombinator.com/item?id=47187126</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47187126</guid></item><item><title><![CDATA[New comment by msp26 in "The Future of AI Software Development"]]></title><description><![CDATA[
<p>Horrific comparison point. LLM inference is way more expensive locally for single users than running batch inference at scale in a datacenter on actual GPUs/TPUs.</p>
]]></description><pubDate>Wed, 18 Feb 2026 18:11:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=47064133</link><dc:creator>msp26</dc:creator><comments>https://news.ycombinator.com/item?id=47064133</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47064133</guid></item></channel></rss>