Hacker News: msp26

New comment by msp26 in "Running local models is good now"

msp26 — Tue, 16 Jun 2026 17:51:45 +0000

Yep agreed completely. I couldn't imagine torturing myself with a small model for local coding. But Gemma 4 31B is so fucking good for a variety of language modelling tasks.

New comment by msp26 in "Claude Fable 5"

msp26 — Tue, 09 Jun 2026 18:40:34 +0000

It triggered for me when I asked "Web search for your own model card (released today) and pick out your favourite highlights from the pdf"

New comment by msp26 in "Claude Fable 5"

msp26 — Tue, 09 Jun 2026 17:08:34 +0000

>Pricing for both models is $10 per million input tokens and $50 per million output tokens.

New comment by msp26 in "I’ve joined Anthropic"

msp26 — Tue, 19 May 2026 16:44:11 +0000

hell will freeze over before anthropic release anything meaningful to the public

New comment by msp26 in "Accelerating Gemma 4: faster inference with multi-token prediction drafters"

msp26 — Wed, 06 May 2026 08:13:57 +0000

Interesting, I might try that, thanks!

New comment by msp26 in "Accelerating Gemma 4: faster inference with multi-token prediction drafters"

msp26 — Tue, 05 May 2026 18:06:22 +0000

Google is singlehandedly carrying western open source models. Gemma 4 31B is fantastic.

However, it is a little painful to try to fit the best possible version into 24GB vram with vision + this drafter soon. My build doesn't support any more GPUs and I believe I would want another 4090 (overpriced) for best performance or otherwise just replace it altogether.

New comment by msp26 in "Show HN: Mljar Studio – local AI data analyst that saves analysis as notebooks"

msp26 — Sat, 02 May 2026 16:12:30 +0000

I like starting most of my projects on marimo notebooks now and slowly moving parts of it to the main codebase + db.

By the end of it I might remove the notebook entirely but usually I keep it for some visualisation + running stuff as a cli tool.

New comment by msp26 in "Claude.ai unavailable and elevated errors on the API"

msp26 — Tue, 28 Apr 2026 18:20:26 +0000

session usage limits this week feel like ass. Even when being careful to not break prefix caching.

New comment by msp26 in "Claude Token Counter, now with model comparisons"

msp26 — Mon, 20 Apr 2026 10:11:15 +0000

Not necessarily with speculative decoding. Whitespace would be trivial to predict and they would petty much keep using the same amount of compute as before.

I don't think that's their primary motive for doing this but it is a side effect.

New comment by msp26 in "Claude Opus 4.7"

msp26 — Thu, 16 Apr 2026 14:59:05 +0000

They don't have the compute to make Mythos generally available: that's all there is to it. The exclusivity is also nice from a marketing pov.

New comment by msp26 in "Claude Opus 4.7"

msp26 — Thu, 16 Apr 2026 14:56:47 +0000

> First, Opus 4.7 uses an updated tokenizer that improves how the model processes text

wow can I see it and run it locally please? Making API calls to check token counts is retarded.

New comment by msp26 in "If DSPy is so great, why isn't anyone using it?"

msp26 — Mon, 23 Mar 2026 16:16:58 +0000

> Data extraction tasks are amongst the easiest to evaluate because there’s a known “right” answer.

Wrong. There can be a lot of subjectivity and pretending that some golden answer exists does more harm and narrows down the scope of what you can build.

My other main problem with data extraction tasks and why I'm not satisfied with any of the existing eval tools is that the schemas I write change can drastically as my understanding of the problem increases. And nothing really seems to handle that well, I mostly just resort to reading diffs of what happens when I change something and reading the input/output data very closely. Marimo is fantastic for anything visual like this btw.

Also there is a difference between: the problem in reality → the business model → your db/application schema → the schema you send to the LLM. And to actually improve your schema/prompt you have to be mindful of the entire problem stack and how you might separate things that are handled through post processing rather than by the LLM directly.

> Abstract model calls. Make swapping GPT-4 for Claude a one-line change.

And in practice random limitations like structured output API schema limits between providers can make this non-trivial. God I hate the Gemini API.

New comment by msp26 in "GPT‑5.4 Mini and Nano"

msp26 — Tue, 17 Mar 2026 22:32:17 +0000

Man the lowest end pricing has been thoroughly hiked. It was convenient while it lasted.

New comment by msp26 in "Show HN: I built a tool that watches webpages and exposes changes as RSS"

msp26 — Thu, 12 Mar 2026 11:08:55 +0000

I got claude to reverse engineer the extension and compare to changedetection and here's what it came up with. Apologies for clanker slop but I think its in poor taste to not attribute the opensource tool that the service is built on (one that's also funded by their SaaS plan)

---

Summary: What Is Objectively Provable

- The extension stores its config under the key changedetection_config

- 16 API endpoints in the extension are 1:1 matches with changedetection.io's documented API

- 16 data model field names are exact matches with changedetection.io's Watch model (including obscure ones like time_between_check_use_default, history_n, notification_muted, fetch_backend)

- The authentication mechanism (x-api-key header) is identical

- The default port (5000) matches changedetection.io's default

- Custom endpoints (/auth/, /feature-flags, /email/, /generate_key, /pregate) do NOT exist in changedetection.io — these are proprietary additions

- The watch limit error format is completely different from changedetection.io's, adding billing-specific fields (current_plan, upgrade_required)

- The extension ships with error tracking that sends telemetry (including user emails on login) to the developer's GlitchTip server at 100% sample rate

The extension is provably a client for a modified/extended changedetection.io backend. The open question is only the degree of modification - whether it's a fork, a proxy wrapper, or a plugin system. But the underlying engine is unambiguously changedetection.io.

New comment by msp26 in "Show HN: I built a tool that watches webpages and exposes changes as RSS"

msp26 — Thu, 12 Mar 2026 11:06:12 +0000

see:

https://news.ycombinator.com/item?id=47349069

New comment by msp26 in "Show HN: Argus – VSCode debugger for Claude Code sessions"

msp26 — Sat, 07 Mar 2026 17:30:31 +0000

Apologies but I will use this thread as an opportunity to report CC VSCode extension bugs because I don't think there's an official channel that actually gets read by humans.

> yeah they're shipping too fast and everything is buggy as shit

- fork conversation button doesn't even work anymore in vscode extension

- sometimes when I reconnect to my remote SSH in VSCode, previously loaded chats become inaccessible. The chats are still there in the .jsonl files but for some reason the CC extension becomes incapable of reading them.

-- this issue happens so frequently that I ended up making a skill to allow CC to dig up info from the bugged sessions

New comment by msp26 in "Gemini 3.1 Flash-Lite: Built for intelligence at scale"

msp26 — Tue, 03 Mar 2026 20:08:00 +0000

many tasks don't need any reasoning

New comment by msp26 in "Gemini 3.1 Flash-Lite: Built for intelligence at scale"

msp26 — Tue, 03 Mar 2026 19:48:45 +0000

What the fuck is this price hike? It was such a nice low end, fast model. Who needs 10 years of reasoning on this model size??

I'm gonna switch some workflows to qwen3.5.

There's a lot of tasks that benefit from just having a mildly capable LLM and 2.5 Flash Lite worked out of the box for cheap.

Can we get flash lite lite please?

Edit: Logan said: "I think open source models like Gemma might be the answer here"

Implying that they're not interested in serving lower end Gemini models?

New comment by msp26 in "Anthropic Cowork feature creates 10GB VM bundle on macOS without warning"

msp26 — Mon, 02 Mar 2026 16:50:20 +0000

> every single product/feature I've used other than the Claude Code CLI has been terrible

yeah they're shipping too fast and everything is buggy as shit

- fork conversation button doesn't even work anymore in vscode extension

New comment by msp26 in "I am directing the Department of War to designate Anthropic a supply-chain risk"

msp26 — Fri, 27 Feb 2026 23:02:20 +0000

Batshit situation, respectable position from Dario throughout.

But there's some irony in this happening to Anthropic after all the constant hawkish fearmongering about the evil Chinese (and open source AI sentiment too).