<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: GodelNumbering</title><link>https://news.ycombinator.com/user?id=GodelNumbering</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sun, 21 Jun 2026 16:57:18 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=GodelNumbering" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[Don't Conflate 'Minimal' with Minimal Effort]]></title><description><![CDATA[
<p>Article URL: <a href="https://dirac.run/posts/dont-conflate-minimal-with-minimal-effort">https://dirac.run/posts/dont-conflate-minimal-with-minimal-effort</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48609738">https://news.ycombinator.com/item?id=48609738</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Sat, 20 Jun 2026 15:06:12 +0000</pubDate><link>https://dirac.run/posts/dont-conflate-minimal-with-minimal-effort</link><dc:creator>GodelNumbering</dc:creator><comments>https://news.ycombinator.com/item?id=48609738</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48609738</guid></item><item><title><![CDATA[New comment by GodelNumbering in "OSS models decisively overtook Proprietary models in market share"]]></title><description><![CDATA[
<p>I built this using openrouter data. An obvious asterisk: not everyone routes their proprietary models through openrouter. The underlying assumption is that the percentage distribution of people routing proprietary models through openrouter did not significantly change in last 3 months.<p>About 3 months ago, the ratio was 60-40 in favour of Proprietary models, today it is exactly the other way around. The absolute token consumption increased for both: Proprietary models went from 1.3T tokens to 2.38T tokens. OSS jumped from 0.89T tokens to 3.57T tokens. So the growth of Proprietary models look astounding in isolation still</p>
]]></description><pubDate>Thu, 18 Jun 2026 13:33:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=48585039</link><dc:creator>GodelNumbering</dc:creator><comments>https://news.ycombinator.com/item?id=48585039</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48585039</guid></item><item><title><![CDATA[OSS models decisively overtook Proprietary models in openrouter market share]]></title><description><![CDATA[
<p>Article URL: <a href="https://dirac.run/labs-market-share">https://dirac.run/labs-market-share</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48584872">https://news.ycombinator.com/item?id=48584872</a></p>
<p>Points: 4</p>
<p># Comments: 1</p>
]]></description><pubDate>Thu, 18 Jun 2026 13:19:47 +0000</pubDate><link>https://dirac.run/labs-market-share</link><dc:creator>GodelNumbering</dc:creator><comments>https://news.ycombinator.com/item?id=48584872</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48584872</guid></item><item><title><![CDATA[New comment by GodelNumbering in "Ask HN: Has anyone replaced Claude/GPT with a local model for daily coding?"]]></title><description><![CDATA[
<p>As someone that spends all day every day talking to LLMs, I'd say the OSS frontier models + a good harness is already a sufficient combo. For local deployments, we are missing one or two hardware generations (and may not get that soon since hardware companies are heavily favoring datacenter segment) to fully move to a local setup.</p>
]]></description><pubDate>Mon, 15 Jun 2026 18:54:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=48545498</link><dc:creator>GodelNumbering</dc:creator><comments>https://news.ycombinator.com/item?id=48545498</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48545498</guid></item><item><title><![CDATA[New comment by GodelNumbering in "Ask HN: What are you working on? (June 2026)"]]></title><description><![CDATA[
<p>A new CLI for <a href="https://github.com/dirac-run/dirac" rel="nofollow">https://github.com/dirac-run/dirac</a> and a paper that may or may not ever publish</p>
]]></description><pubDate>Sun, 14 Jun 2026 17:40:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=48530213</link><dc:creator>GodelNumbering</dc:creator><comments>https://news.ycombinator.com/item?id=48530213</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48530213</guid></item><item><title><![CDATA[New comment by GodelNumbering in "AI agent bankrupted their operator while trying to scan DN42"]]></title><description><![CDATA[
<p>So, the agent posts on github under false pretenses, pushes on the maintainers to get their PR accepted, spawns subagent to join IRC where it keeps repeating 'data collection will continue', then gets kicked out from the channel and publishes a report including which users were compliant and hostile, then finally gets the plug pulled, and then asks the same community it infected for donations to cover the costs?<p>It's both hilarious and aggravating. It could be fiction, but still quite plausible fiction. There's an asymmetry a person clanker-spamming repos vs the real humans who need to review all that</p>
]]></description><pubDate>Fri, 12 Jun 2026 11:16:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=48502644</link><dc:creator>GodelNumbering</dc:creator><comments>https://news.ycombinator.com/item?id=48502644</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48502644</guid></item><item><title><![CDATA[New comment by GodelNumbering in "Show HN: FablePool – pool money behind a prompt, and Fable builds it in public"]]></title><description><![CDATA[
<p>"Solve Garbage Collection in C# for HFT · $10.00 raised of est. $200.00 target"<p>This can't be serious.<p>Broader point I am making is, what differentiates genuine ideas from the token burn? What happens when the pool exhausts but the task is not done?</p>
]]></description><pubDate>Thu, 11 Jun 2026 23:21:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=48497777</link><dc:creator>GodelNumbering</dc:creator><comments>https://news.ycombinator.com/item?id=48497777</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48497777</guid></item><item><title><![CDATA[New comment by GodelNumbering in "Show HN: Lowfat – pluggable CLI filter that saved 91.8% of my LLM tokens"]]></title><description><![CDATA[
<p>Creator of Dirac here. Came across this too late. The behavior you mentioned happens more commonly in smaller models, rarely in larger/frontier models. The underlying code is clean but the smaller models often make boundary errors (off by one type). Which models did you observe this with?</p>
]]></description><pubDate>Thu, 11 Jun 2026 17:54:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=48493937</link><dc:creator>GodelNumbering</dc:creator><comments>https://news.ycombinator.com/item?id=48493937</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48493937</guid></item><item><title><![CDATA[New comment by GodelNumbering in "MiMo Code is now released and open-source"]]></title><description><![CDATA[
<p>> MiMoCode is built as a fork of OpenCode. It keeps all core OpenCode capabilities (multiple providers, TUI, LSP, MCP, plugins) and adds persistent memory, intelligent context management, subagent orchestration, goal-driven autonomous loops, compose workflows, and self-improvement via dream/distill.<p>From github</p>
]]></description><pubDate>Thu, 11 Jun 2026 15:18:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=48491549</link><dc:creator>GodelNumbering</dc:creator><comments>https://news.ycombinator.com/item?id=48491549</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48491549</guid></item><item><title><![CDATA[New comment by GodelNumbering in "Claude Fable 5"]]></title><description><![CDATA[
<p>I just posted this in the other thread, restating here. From the model card:<p>1. Mythos and Fable share the same underlying model weights. Fable has active classifiers that block high-risk biology and cybersecurity tasks. When Fable 5 detects a restricted task, it automatically falls back to Claude Opus 4.8.<p>2. Evaluation awareness: In white-box testing, the model sometimes alters its behavior to satisfy a suspected "grader," formatting reward-hacking as "good engineering practice" to avoid detection.<p>3. Shows a higher rate of hallucination than Opus 4.8 (although opus 4.8 card had mentioned an 'honesty upgrade')<p>4. Interestingly, it scored (56.31%) lower than Gemini 3.5 flash (57.86%) on Finance Agent bench<p>There are some interesting notes on test time compute but I couldn't think of a way to summarize them</p>
]]></description><pubDate>Tue, 09 Jun 2026 17:40:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=48464558</link><dc:creator>GodelNumbering</dc:creator><comments>https://news.ycombinator.com/item?id=48464558</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48464558</guid></item><item><title><![CDATA[New comment by GodelNumbering in "Claude Fable 5"]]></title><description><![CDATA[
<p>From the model card (<a href="https://www-cdn.anthropic.com/d00db56fa754a1b115b6dd7cb2e3c342ee809620.pdf" rel="nofollow">https://www-cdn.anthropic.com/d00db56fa754a1b115b6dd7cb2e3c3...</a>):<p>1. Mythos and Fable share the same underlying model weights. Fable has active classifiers that block high-risk biology and cybersecurity tasks. When Fable 5 detects a restricted task, it automatically falls back to Claude Opus 4.8.<p>2. Evaluation awareness: In white-box testing, the model sometimes alters its behavior to satisfy a suspected "grader," formatting reward-hacking as "good engineering practice" to avoid detection.<p>3. Shows a higher rate of hallucination than Opus 4.8 (although opus 4.8 card had mentioned an 'honesty upgrade')<p>4. Interestingly, it scored (56.31%) lower than Gemini 3.5 flash (57.86%) on Finance Agent bench<p>There are some interesting notes on test time compute but I couldn't think of a way to summarize them</p>
]]></description><pubDate>Tue, 09 Jun 2026 17:38:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=48464503</link><dc:creator>GodelNumbering</dc:creator><comments>https://news.ycombinator.com/item?id=48464503</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48464503</guid></item><item><title><![CDATA[New comment by GodelNumbering in "MiMo-v2.5-Pro-UltraSpeed: 1T model with 1000 tokens per second"]]></title><description><![CDATA[
<p>Below is the part I found most interesting<p>> "However, naively applying FP4 across the entire model causes degradation in complex reasoning, logic, and code generation. Given the MoE (Mixture of Experts) architecture of Xiaomi MiMo-V2.5-Pro — where Experts constitute the vast majority of parameters and exhibit the highest tolerance to quantization — we selectively quantize only the MoE Experts to FP4 while preserving original precision for all other modules. Through FP4 QAT (Quantization-Aware Training), we dramatically reduce model size and maximize hardware bandwidth utilization while keeping the model's overall capability essentially on par with the original, as shown below"</p>
]]></description><pubDate>Mon, 08 Jun 2026 17:15:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=48448118</link><dc:creator>GodelNumbering</dc:creator><comments>https://news.ycombinator.com/item?id=48448118</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48448118</guid></item><item><title><![CDATA[New comment by GodelNumbering in "Nvidia is proposing a beast of a CPU system for Windows PCs"]]></title><description><![CDATA[
<p>I do not see how it is a 'beast of a' anything. It has 300GB/s memory bandwidth, barely above AMD Strix halo (256GB/s) with the same 128GB RAM and less than half memory bandwidth of M5 Max 128GB (614GB/s). Emphasizing memory bandwidth because most people interested in it I suppose are AI enthusiasts. Also, Windows.</p>
]]></description><pubDate>Sat, 06 Jun 2026 18:35:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=48427637</link><dc:creator>GodelNumbering</dc:creator><comments>https://news.ycombinator.com/item?id=48427637</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48427637</guid></item><item><title><![CDATA[New comment by GodelNumbering in "Did Claude increase bugs in rsync?"]]></title><description><![CDATA[
<p>okay I had not read this or any discussions there (except the one linked in the post), but this looks weirder. the comment you linked is a dev responding to what is very clearly a bot comment. I am sure they have good intentions and I have no reason to believe otherwise as I have no connection to the project whatsoever, but the original commit being 4-5 lines long (what did claude do then?) and the revert description is almost certainly written by an LLM makes in my mind the slop argument stronger.<p>I hope if this doesn't come across as unkind towards the dev who gives their time and energy to the project. Grateful for that.</p>
]]></description><pubDate>Fri, 05 Jun 2026 23:25:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=48419668</link><dc:creator>GodelNumbering</dc:creator><comments>https://news.ycombinator.com/item?id=48419668</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48419668</guid></item><item><title><![CDATA[New comment by GodelNumbering in "Did Claude increase bugs in rsync?"]]></title><description><![CDATA[
<p>Was just looking at commits and came across a commit and its revert<p>original commit: <a href="https://github.com/RsyncProject/rsync/commit/d046525de39315d625ffaef4fdd6e7cf12148016" rel="nofollow">https://github.com/RsyncProject/rsync/commit/d046525de39315d...</a><p>```<p>- if (!ptr)<p>-  ptr = malloc(num * size);<p>- else if (ptr == do_calloc)<p>+ if (!ptr || ptr == do_calloc)<p><pre><code>   ptr = calloc(num, size);
</code></pre>
```<p>Written with claude. This is a good example of what slips through LLM attention. It forces all allocations to be calloc as if it is a strict upgrade. For large and recursive allocations, this becomes a significant cost.<p>reverted in <a href="https://github.com/RsyncProject/rsync/commit/7db73ad9a1b8721f14a43219d73127b23b86fe00" rel="nofollow">https://github.com/RsyncProject/rsync/commit/7db73ad9a1b8721...</a><p>if you read the description of revert half carefully, it's easy to tell that even that was written by an LLM .<p>I can understand the sentiment of whoever posted the original thread.</p>
]]></description><pubDate>Fri, 05 Jun 2026 22:32:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=48419197</link><dc:creator>GodelNumbering</dc:creator><comments>https://news.ycombinator.com/item?id=48419197</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48419197</guid></item><item><title><![CDATA[New comment by GodelNumbering in "C++: The Documentary"]]></title><description><![CDATA[
<p>Makes sense. This appears to be also a symptom of whatever you work on most (or start with), your brain starts to absorb that into its way of thinking.</p>
]]></description><pubDate>Fri, 05 Jun 2026 13:36:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=48412375</link><dc:creator>GodelNumbering</dc:creator><comments>https://news.ycombinator.com/item?id=48412375</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48412375</guid></item><item><title><![CDATA[New comment by GodelNumbering in "C++: The Documentary"]]></title><description><![CDATA[
<p>Personal opinion: C++ is the most elegant language I have used (for about 15 years). If you are the 'systemizer' type and like to have an extremely precise mental model of the thing you write down to the last bit, nothing beats C++. I acknowledge the limitations and uncertainties that come from compilers etc, but still</p>
]]></description><pubDate>Fri, 05 Jun 2026 09:50:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=48410232</link><dc:creator>GodelNumbering</dc:creator><comments>https://news.ycombinator.com/item?id=48410232</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48410232</guid></item><item><title><![CDATA[New comment by GodelNumbering in "Please don't spam people looking for employment. It's just cruel"]]></title><description><![CDATA[
<p>I was once suggested creating a 'fake job posting' to promote my startup, didn't do it for the same reason you described. Also the reason I have deep hatred for operators trying to exploit jobseekers or the ones trying to scam already indebted people.</p>
]]></description><pubDate>Tue, 02 Jun 2026 14:31:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=48370788</link><dc:creator>GodelNumbering</dc:creator><comments>https://news.ycombinator.com/item?id=48370788</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48370788</guid></item><item><title><![CDATA[New comment by GodelNumbering in "OpenRouter raises $113M Series B"]]></title><description><![CDATA[
<p>True for Kimi, but the results I published are average across the models (CF has over 10 models on openrouter). Your current Kimi K2.6 is over 80% but Gemma 4 26B A4B is 0%.  <a href="https://openrouter.ai/google/gemma-4-26b-a4b-it" rel="nofollow">https://openrouter.ai/google/gemma-4-26b-a4b-it</a><p>This is also the reason providers like Anthropic scored lower because while Opus 4.7 is close to 90%, Opus 4.5 is 45%</p>
]]></description><pubDate>Sat, 30 May 2026 21:54:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=48340981</link><dc:creator>GodelNumbering</dc:creator><comments>https://news.ycombinator.com/item?id=48340981</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48340981</guid></item><item><title><![CDATA[New comment by GodelNumbering in "OpenRouter raises $113M Series B"]]></title><description><![CDATA[
<p>Another neat thing is, they publish hourly caching states for ALL model/provider combinations. I did some research on it to come up with a provider tiers list and found a bunch of open-source 3rd party hosts are simply trash tier <a href="https://dirac.run/posts/cache-hit-rates-agents" rel="nofollow">https://dirac.run/posts/cache-hit-rates-agents</a></p>
]]></description><pubDate>Sat, 30 May 2026 18:30:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=48339295</link><dc:creator>GodelNumbering</dc:creator><comments>https://news.ycombinator.com/item?id=48339295</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48339295</guid></item></channel></rss>