<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: msdz</title><link>https://news.ycombinator.com/user?id=msdz</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sun, 21 Jun 2026 09:38:41 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=msdz" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by msdz in "GPT-5.5 hallucinates 3x more than MIT-licensed GLM-5.2"]]></title><description><![CDATA[
<p>> it hallucinates like crazy but looks like its by design to boost benchmarks.<p>Wasn’t there a discussion recently around some new-ish benchmark _punishing_ hallucinated answers (over not replying at all)?
Maybe in the not-so-distant future, this “spam replies until one’s correct” strategy won’t be able to game a benchmark much at all anymore.</p>
]]></description><pubDate>Sat, 20 Jun 2026 19:57:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=48612445</link><dc:creator>msdz</dc:creator><comments>https://news.ycombinator.com/item?id=48612445</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48612445</guid></item><item><title><![CDATA[New comment by msdz in "Want your images back? That'll be $5"]]></title><description><![CDATA[
<p>You forgot about the best part, in terms of the “GDPR threat” effectiveness:<p>Fines can be up to €20 million or 4% of global revenues…, _whichever is greater._</p>
]]></description><pubDate>Wed, 17 Jun 2026 16:33:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=48572861</link><dc:creator>msdz</dc:creator><comments>https://news.ycombinator.com/item?id=48572861</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48572861</guid></item><item><title><![CDATA[New comment by msdz in "SpaceX to buy Cursor for $60B"]]></title><description><![CDATA[
<p>You don't think that a $60b valuation is having something worthwhile?<p>(Only half-joking…)</p>
]]></description><pubDate>Tue, 16 Jun 2026 23:37:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=48563807</link><dc:creator>msdz</dc:creator><comments>https://news.ycombinator.com/item?id=48563807</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48563807</guid></item><item><title><![CDATA[New comment by msdz in "SpaceX to buy Cursor for $60B"]]></title><description><![CDATA[
<p>Somewhat doubtful. (The first part of your statement, anyway. Of course the difference between abstract "value" and hard, spendable "money" is a thing.)<p>Like Mr. Hanson said in my sibling comment, some rulers are (or were!) bound to have amassed incredible amounts of resources. For historical/non-present-day examples, consider looking into figures like Jakob Fugger or Mansa Musa.</p>
]]></description><pubDate>Tue, 16 Jun 2026 23:23:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=48563668</link><dc:creator>msdz</dc:creator><comments>https://news.ycombinator.com/item?id=48563668</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48563668</guid></item><item><title><![CDATA[New comment by msdz in "SpaceX to buy Cursor for $60B"]]></title><description><![CDATA[
<p>Who's to say it won't?</p>
]]></description><pubDate>Tue, 16 Jun 2026 21:31:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=48562424</link><dc:creator>msdz</dc:creator><comments>https://news.ycombinator.com/item?id=48562424</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48562424</guid></item><item><title><![CDATA[New comment by msdz in "Apple is about to make Hide My Email useless"]]></title><description><![CDATA[
<p>Which has more market pull: Some web site or Apple?</p>
]]></description><pubDate>Tue, 16 Jun 2026 20:59:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=48561980</link><dc:creator>msdz</dc:creator><comments>https://news.ycombinator.com/item?id=48561980</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48561980</guid></item><item><title><![CDATA[New comment by msdz in "Kimi K2.7-Code: open-source coding model with better token efficiency"]]></title><description><![CDATA[
<p>> If you ask Claude in Chinese to introduce itself, it will claim it's Kimi :)<p>That's a funny anecdote, but I'm not able to reproduce it. Where/how/when did you get this, or hear about it?
It might've been patched by now, at least that's the feel I get from my limited testing.<p>Using bare aichat [1] with no system prompt and neither temperature nor top_p set (and I'm truncating the response after the first line that contains the name the model gave, because the point has been made clear by then), and with the same prompt (approx. "Introduce yourself!") every time:<p>Claude Sonnet 4.5:<p>> 请做个自我介绍！<p>你好！我是Claude，一个由Anthropic公司开发的AI助手。
[…]<p>Claude Haiku 4.5:<p>> 请做个自我介绍！<p># 你好！<p>我是 *Claude*，一个由 Anthropic 公司开发的 AI 助手。<p>Claude Opus 4.5:<p>> 请做个自我介绍！<p># 你好！<p>我是 *Claude*，由 Anthropic 公司开发的 AI 助手。<p>Claude Opus 4.6:<p>> 请做个自我介绍！<p># 你好！ 我是 Claude<p>Claude Opus 4.7:<p>> 请做个自我介绍！<p>你好！我是 Claude，由 Anthropic 公司开发的人工智能助手。很高兴认识你！<p>Claude Opus 4.8:<p>> 请做个自我介绍！<p>你好！我是 Claude，由 Anthropic 公司开发的人工智能助手。<p>Claude Fable 5:<p>> 请做个自我介绍！<p># 自我介绍<p>你好！很高兴认识你！<p>我是 *Claude*，由 Anthropic 开发的 AI 助手。 [2]<p>I don't see a Kimi mention, unfortunately. :-)<p>[1] <a href="https://github.com/sigoden/aichat" rel="nofollow">https://github.com/sigoden/aichat</a><p>[2] This model really is noticeably more verbose even with supposed-to-be-brief responses huh, lol</p>
]]></description><pubDate>Fri, 12 Jun 2026 15:29:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=48505409</link><dc:creator>msdz</dc:creator><comments>https://news.ycombinator.com/item?id=48505409</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48505409</guid></item><item><title><![CDATA[New comment by msdz in "Build a Basic AI Agent from Scratch: Long Task Planning"]]></title><description><![CDATA[
<p>That is in fact a better explanation, since it brings up a different reason (zero hosting cost, as you mentioned, vs. out-of-the-box network/visibility in the linked comment).</p>
]]></description><pubDate>Fri, 12 Jun 2026 14:56:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=48505011</link><dc:creator>msdz</dc:creator><comments>https://news.ycombinator.com/item?id=48505011</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48505011</guid></item><item><title><![CDATA[New comment by msdz in "Kimi K2.7-Code: open-source coding model with better token efficiency"]]></title><description><![CDATA[
<p>> China is a communist country with elements of capitalistic markets baked in.<p>While I get the point you're making (it should be pretty obvious to anyone who's held a newspaper), I think it's important regardless to point out that Chinese companies AFAIK aren't worker-owned or -controlled, so you can't exactly call it communism, either. And they obviously do not have "free market capitalism", as you just discussed.<p>It's simply a highly authoritarian state then, I guess?</p>
]]></description><pubDate>Fri, 12 Jun 2026 14:52:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=48504963</link><dc:creator>msdz</dc:creator><comments>https://news.ycombinator.com/item?id=48504963</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48504963</guid></item><item><title><![CDATA[New comment by msdz in "Build a Basic AI Agent from Scratch: Long Task Planning"]]></title><description><![CDATA[
<p>What a strange comment.<p>The original post is also available at the poster’s own blog [1], so the question is a very valid one. Clearly, “posting articles for free” is a hurdle already cleared by the author.<p>[1] <a href="https://www.ruxu.dev/articles/ai/build-an-ai-agent-planning/" rel="nofollow">https://www.ruxu.dev/articles/ai/build-an-ai-agent-planning/</a></p>
]]></description><pubDate>Thu, 11 Jun 2026 15:49:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=48492022</link><dc:creator>msdz</dc:creator><comments>https://news.ycombinator.com/item?id=48492022</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48492022</guid></item><item><title><![CDATA[New comment by msdz in "Fooling Go's X.509 Certificate Verification"]]></title><description><![CDATA[
<p>Them darn youngins[1]!<p>[1] Ken Thompson (over 80), Rob Pike (around 70), Robert Griesemer (over 60)</p>
]]></description><pubDate>Tue, 09 Jun 2026 07:01:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=48457558</link><dc:creator>msdz</dc:creator><comments>https://news.ycombinator.com/item?id=48457558</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48457558</guid></item><item><title><![CDATA[New comment by msdz in "MiMo-v2.5-Pro-UltraSpeed: 1T model with 1000 tokens per second"]]></title><description><![CDATA[
<p>AFAIK Taalas, the company behind this demo, still only have their initially "hardwarized" model available to test in ChatJimmy, which IIRC is a rather stupid Llama 3ish 8b.<p>Don't get me wrong though, that demo is still incredibly impressive & makes me very much excited for the hardware-based model era (potentially) ahead.<p>Once you've experienced those speeds, you really start to think about the whole class of things that becomes possible; massively parallel decode paths, extensive reasoning loops, etc…</p>
]]></description><pubDate>Mon, 08 Jun 2026 22:39:27 +0000</pubDate><link>https://news.ycombinator.com/item?id=48453365</link><dc:creator>msdz</dc:creator><comments>https://news.ycombinator.com/item?id=48453365</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48453365</guid></item><item><title><![CDATA[New comment by msdz in "The quiet renovation at Bitwarden"]]></title><description><![CDATA[
<p>What would happen if you lost access to phone and laptop? Is there another "backup" device, or a mechanism to register a new device to your Tailscale network that doesn't require vaultwarden?</p>
]]></description><pubDate>Tue, 19 May 2026 14:39:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=48193919</link><dc:creator>msdz</dc:creator><comments>https://news.ycombinator.com/item?id=48193919</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48193919</guid></item><item><title><![CDATA[New comment by msdz in "Rewrite Bun in Rust has been merged"]]></title><description><![CDATA[
<p>So the takeaway here is that they scaled to just over $5bn instead of $6.6bn in revenue in just a few years…?  
Still sounds like plenty of demand exists?</p>
]]></description><pubDate>Fri, 15 May 2026 08:13:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=48145894</link><dc:creator>msdz</dc:creator><comments>https://news.ycombinator.com/item?id=48145894</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48145894</guid></item><item><title><![CDATA[New comment by msdz in "Rewrite Bun in Rust has been merged"]]></title><description><![CDATA[
<p>If push comes to shove, you could probably still ask an LLM to generate transpiler code, if you're so inclined, and then have it fix the remaining "edge cases" afterward, right…?</p>
]]></description><pubDate>Fri, 15 May 2026 08:08:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=48145860</link><dc:creator>msdz</dc:creator><comments>https://news.ycombinator.com/item?id=48145860</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48145860</guid></item><item><title><![CDATA[New comment by msdz in "Everything that went wrong with Claude"]]></title><description><![CDATA[
<p>While I also prefer companies that don't care what the client I'm using is, (part of) the issue is that alternative software was often inefficient with caching, at least up until recently (not sure whether it might have been patched).<p>OpenClaw heartbeats (essentially idling) could cost single-digit dollar amounts of LLM inference per day, _before any actual user activity_. Another example was IIRC the Pi agent harness sending a new timestamp at the start of every message turn (which sends along the entire chat + tool call history up until that point as context), which of course also invalidates the cache's hash, causing effectively unnecessary re-compute.<p>I'm not defending Anthropic per se, but just try to picture yourself in their position as a company desperately strapped for any amount of free compute trying to scale up every service as aggressively as they want/have to… And the caching topics are just one potential issue that could occur with "third-party" software. Not that I like it, but of course they'd be quick to ban such behavior in favor of first-party, "guaranteed behaving" customers.</p>
]]></description><pubDate>Mon, 27 Apr 2026 08:16:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=47918993</link><dc:creator>msdz</dc:creator><comments>https://news.ycombinator.com/item?id=47918993</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47918993</guid></item><item><title><![CDATA[New comment by msdz in "GPT-5.5"]]></title><description><![CDATA[
<p>Such an increase tracks the company's valuation trend, which they somehow have to keep justifying (let alone break even on costs).</p>
]]></description><pubDate>Thu, 23 Apr 2026 19:23:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=47880405</link><dc:creator>msdz</dc:creator><comments>https://news.ycombinator.com/item?id=47880405</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47880405</guid></item><item><title><![CDATA[New comment by msdz in "A Roblox cheat and one AI tool brought down Vercel's platform"]]></title><description><![CDATA[
<p>"Writing is nature's way of letting you know how sloppy your thinking is." 
– Dick Guindon<p>If your text hasn't undergone that process, it's still sloppy thinking.</p>
]]></description><pubDate>Thu, 23 Apr 2026 13:24:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=47875484</link><dc:creator>msdz</dc:creator><comments>https://news.ycombinator.com/item?id=47875484</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47875484</guid></item><item><title><![CDATA[New comment by msdz in "GitHub's Fake Star Economy"]]></title><description><![CDATA[
<p>> I look at the starts when choosing dependencies, it's a first filter for sure.<p>Unfortunately I still look at them, too, out of habit: The project or repo's star count _was_ a first filter in the past, and we must keep in mind it no longer is.<p>> Good reminder that everything gets gamed given the incentives.<p>Also known as Goodhart's law [1]: "When a measure becomes a target, it ceases to be a good measure".<p>Essentially, VCs screwed this one up for the rest of us, I think?<p>[1] <a href="https://en.wikipedia.org/wiki/Goodhart%27s_law" rel="nofollow">https://en.wikipedia.org/wiki/Goodhart%27s_law</a></p>
]]></description><pubDate>Mon, 20 Apr 2026 09:34:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=47832048</link><dc:creator>msdz</dc:creator><comments>https://news.ycombinator.com/item?id=47832048</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47832048</guid></item><item><title><![CDATA[New comment by msdz in "TinyLoRA – Learning to Reason in 13 Parameters"]]></title><description><![CDATA[
<p>This got me thinking, and it might actually even be a comparable amount.
Let's estimate that 12 years of schooling cost a minimum of $100,000 per student, at least in the US [1]. Add to that whatever you do afterward: more money for paid (college) or "unpaid" (self-taught skills and improvements) education, plus what is likely the biggest yet hardest-to-quantify portion for white-collar workers, the experience and "value" that professional work equips one with.<p>Now divide the average SOTA LLM's training cost (or a guess, since these numbers aren't always published as far as I'm aware) by the number of users, or, to be stricter, by the number of people it has proven useful for (what else would training be for?), and it might not be so far off anymore?<p>Of course, whether it makes sense to spread an LLM's costs across its users in order to calculate an "average utility" is debatable.<p>[1] <a href="https://www.publicschoolreview.com/average-spending-student-stats/national-data" rel="nofollow">https://www.publicschoolreview.com/average-spending-student-...</a></p>
]]></description><pubDate>Wed, 01 Apr 2026 16:24:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=47602979</link><dc:creator>msdz</dc:creator><comments>https://news.ycombinator.com/item?id=47602979</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47602979</guid></item></channel></rss>