<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: numeri</title><link>https://news.ycombinator.com/user?id=numeri</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Tue, 07 Apr 2026 05:38:55 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=numeri" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by numeri in "Agent Reading Test"]]></title><description><![CDATA[
<p>11/20 for qwen/qwen3.5-flash-02-23 in Claude Code, with effort set to low.</p>
]]></description><pubDate>Mon, 06 Apr 2026 23:36:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=47668843</link><dc:creator>numeri</dc:creator><comments>https://news.ycombinator.com/item?id=47668843</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47668843</guid></item><item><title><![CDATA[New comment by numeri in "Owner of ICE detention facility sees big opportunity in AI man camps"]]></title><description><![CDATA[
<p>No, that's what the headline implies, and the body of the article doesn't support at all. It's (currently, and with no indication of intent to change this) two separate branches of their business.</p>
]]></description><pubDate>Mon, 09 Mar 2026 13:53:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=47309086</link><dc:creator>numeri</dc:creator><comments>https://news.ycombinator.com/item?id=47309086</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47309086</guid></item><item><title><![CDATA[New comment by numeri in "Mercury 2: Fast reasoning LLM powered by diffusion"]]></title><description><![CDATA[
<p>but Taalas had to quantize Llama 3.1 8B to death to get it to fit. It can't produce coherent non-English text at all.</p>
]]></description><pubDate>Wed, 25 Feb 2026 15:24:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=47152765</link><dc:creator>numeri</dc:creator><comments>https://news.ycombinator.com/item?id=47152765</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47152765</guid></item><item><title><![CDATA[New comment by numeri in "Ask HN: What explains the recent surge in LLM coding capabilities?"]]></title><description><![CDATA[
<p>and if I was to guess, the latest generation of models (Claude Opus 4.6, GPT-5.3-codex, etc.) differ from Opus 4.5, GPT 5.2 primarily in the addition of deeper, more difficult (most likely agentic and coding-based, like Terminal Bench) tasks to their RLVR training.<p>I could be completely off, as my intuition here is fully based on public research papers, but it seems to explain the current state of things fairly well.</p>
]]></description><pubDate>Mon, 16 Feb 2026 17:08:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=47037500</link><dc:creator>numeri</dc:creator><comments>https://news.ycombinator.com/item?id=47037500</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47037500</guid></item><item><title><![CDATA[Petition for Recognition of Work on Open-Source as Volunteering in Germany]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.openpetition.de/petition/online/recognition-of-work-on-open-source-as-volunteering-in-germany">https://www.openpetition.de/petition/online/recognition-of-work-on-open-source-as-volunteering-in-germany</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46881568">https://news.ycombinator.com/item?id=46881568</a></p>
<p>Points: 213</p>
<p># Comments: 50</p>
]]></description><pubDate>Wed, 04 Feb 2026 04:46:15 +0000</pubDate><link>https://www.openpetition.de/petition/online/recognition-of-work-on-open-source-as-volunteering-in-germany</link><dc:creator>numeri</dc:creator><comments>https://news.ycombinator.com/item?id=46881568</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46881568</guid></item><item><title><![CDATA[Exploration Posteriors for Generative Modeling Using Only Negative Rewards]]></title><description><![CDATA[
<p>Article URL: <a href="https://arxiv.org/abs/2510.09596">https://arxiv.org/abs/2510.09596</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46879151">https://news.ycombinator.com/item?id=46879151</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 03 Feb 2026 23:47:14 +0000</pubDate><link>https://arxiv.org/abs/2510.09596</link><dc:creator>numeri</dc:creator><comments>https://news.ycombinator.com/item?id=46879151</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46879151</guid></item><item><title><![CDATA[New comment by numeri in "Ask HN: Do you still use physical calculators?"]]></title><description><![CDATA[
<p>No, Python or units[1] is always a better choice if I'm near a computer (and I nearly always am these days, unfortunately, I suppose). I do have three wonderful slide rules, though.<p>[1]: <a href="https://www.gnu.org/software/units/" rel="nofollow">https://www.gnu.org/software/units/</a></p>
]]></description><pubDate>Sun, 01 Feb 2026 23:10:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=46850343</link><dc:creator>numeri</dc:creator><comments>https://news.ycombinator.com/item?id=46850343</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46850343</guid></item><item><title><![CDATA[New comment by numeri in "Finland looks to introduce Australia-style ban on social media"]]></title><description><![CDATA[
<p>Introducing a solid zero-knowledge age verification option is the opposite direction of ending anonymity in the Internet, which other parts of the same governments are also working on.<p>So yeah, I'll gladly trust and cheer on the part working in the right direction.</p>
]]></description><pubDate>Sun, 01 Feb 2026 23:00:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=46850272</link><dc:creator>numeri</dc:creator><comments>https://news.ycombinator.com/item?id=46850272</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46850272</guid></item><item><title><![CDATA[Underrated reasons to be thankful V]]></title><description><![CDATA[
<p>Article URL: <a href="https://dynomight.net/thanks-5/">https://dynomight.net/thanks-5/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46073033">https://news.ycombinator.com/item?id=46073033</a></p>
<p>Points: 226</p>
<p># Comments: 98</p>
]]></description><pubDate>Thu, 27 Nov 2025 20:37:51 +0000</pubDate><link>https://dynomight.net/thanks-5/</link><dc:creator>numeri</dc:creator><comments>https://news.ycombinator.com/item?id=46073033</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46073033</guid></item><item><title><![CDATA[New comment by numeri in "It's OpenAI's world, we're just living in it"]]></title><description><![CDATA[
<p>I'll just throw in support for gaming on Linux – it's pretty nice feeling these days! I still have the occasional (once every 5–8 months?) update cause a short-lived bug, but it's a very justifiable trade-off to avoid Windows these days.</p>
]]></description><pubDate>Sat, 11 Oct 2025 00:15:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=45545247</link><dc:creator>numeri</dc:creator><comments>https://news.ycombinator.com/item?id=45545247</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45545247</guid></item><item><title><![CDATA[New comment by numeri in "GPT-5-Codex is a better AI researcher than me"]]></title><description><![CDATA[
<p>This is written by someone who's not an AI researcher, working with tiny models on toy datasets. It's at the level of a motivated undergraduate student in their first NLP course, but not much more.</p>
]]></description><pubDate>Tue, 07 Oct 2025 15:27:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=45504332</link><dc:creator>numeri</dc:creator><comments>https://news.ycombinator.com/item?id=45504332</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45504332</guid></item><item><title><![CDATA[New comment by numeri in "How to be a leader when the vibes are off"]]></title><description><![CDATA[
<p>One sign would be occasionally changing course in response to overwhelming employee feedback. If that never or almost never happens, the feedback is being ignored, not taken constructively and not followed.</p>
]]></description><pubDate>Thu, 25 Sep 2025 10:09:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=45371094</link><dc:creator>numeri</dc:creator><comments>https://news.ycombinator.com/item?id=45371094</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45371094</guid></item><item><title><![CDATA[New comment by numeri in "Why language models hallucinate"]]></title><description><![CDATA[
<p>This isn't right – calibration (informally, the degree to which certainty in the model's logits correlates with its chance of getting an answer correct) is well studied in LLMs of all sizes. LLMs are not (generally) well calibrated.</p>
]]></description><pubDate>Sun, 07 Sep 2025 00:09:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=45154049</link><dc:creator>numeri</dc:creator><comments>https://news.ycombinator.com/item?id=45154049</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45154049</guid></item><item><title><![CDATA[New comment by numeri in "Grok: Searching X for "From:Elonmusk (Israel or Palestine or Hamas or Gaza)""]]></title><description><![CDATA[
<p>I really like your posts, and they're generally very clearly written. Maybe this one's just the odd duck out, as it's hard for me to find what you actually meant (as clarified in your comment here) in this paragraph:<p>> This suggests that Grok may have a weird sense of identity—if asked for its own opinions it turns to search to find previous indications of opinions expressed by itself or by its ultimate owner. I think there is a good chance this behavior is unintended!<p>I'd say it's far more likely that:<p>1. Elon ordered his research scientists to "fix it" – make it agree with him<p>2. They did RL (probably just basic tool use training) to encourage checking for Elon's opinions<p>3. They did not update the UI (for whatever reason – most likely just because research scientists aren't responsible for front-end, so they forgot)<p>4. Elon is likely now upset that this is shown so obviously<p>The key difference is that I think it's incredibly unlikely that this is emergent behavior due to an "sense of identity", as opposed to direct efforts of the xAI research team. It's likely also a case of <a href="https://en.wiktionary.org/wiki/anticipatory_obedience" rel="nofollow">https://en.wiktionary.org/wiki/anticipatory_obedience</a>.</p>
]]></description><pubDate>Fri, 11 Jul 2025 14:05:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=44532300</link><dc:creator>numeri</dc:creator><comments>https://news.ycombinator.com/item?id=44532300</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44532300</guid></item><item><title><![CDATA[New comment by numeri in "Grok: Searching X for "From:Elonmusk (Israel or Palestine or Hamas or Gaza)""]]></title><description><![CDATA[
<p>I'm a little shocked at Simon's conclusion here. We have a man who bought an social media website so he could control what's said, and founded an AI lab so he could get a bot that agrees with him, and who has publicly threatened said AI with being replaced if it doesn't change its political views/agree with him.<p>His company has also been caught adding specific instructions in this vein to its prompt.<p>And now it's searching for his tweets to guide its answers on political questions, and Simon somehow thinks it could be unintended, emergent behavior? Even if it were, calling this unintended would be completely ignoring higher order system dynamics (a behavior is still intended if models are rejected until one is found that implements the behavior) and the possibility of reinforcement learning to add this behavior.</p>
]]></description><pubDate>Fri, 11 Jul 2025 09:07:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=44529934</link><dc:creator>numeri</dc:creator><comments>https://news.ycombinator.com/item?id=44529934</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44529934</guid></item><item><title><![CDATA[New comment by numeri in "I do not remember my life and it's fine"]]></title><description><![CDATA[
<p>That's a bold claim! Actually, there are plenty of scientific experiments that show actual differences between people who report aphantasia and those who don't, including different stress responses to frightening non-visual descriptions, different susceptibility to something called image priming, lower "cortical excitability in the primary visual cortex", and more: <a href="https://en.wikipedia.org/wiki/Aphantasia" rel="nofollow">https://en.wikipedia.org/wiki/Aphantasia</a><p>So we know that at least the people who claim to see nothing act differently. Could it just be that people who act differently describe the sensation differently, you might ask?<p>No, because there are actual cases of acquired aphantasia after neurological damage. These people used to belong to the group that claimed to be able to imagine visual images, got sick, then sought medical help when they could no longer visualize. For me, at least, that's pretty cut and dry evidence that it's not just differing descriptions of the same (or similar) sensations.</p>
]]></description><pubDate>Fri, 06 Jun 2025 10:32:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=44199478</link><dc:creator>numeri</dc:creator><comments>https://news.ycombinator.com/item?id=44199478</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44199478</guid></item><item><title><![CDATA[New comment by numeri in "I do not remember my life and it's fine"]]></title><description><![CDATA[
<p>That's the thing, some people <i>do</i> see things in their mind that clearly. It's about as rare as full aphantasia, but it's absolutely a spectrum.</p>
]]></description><pubDate>Fri, 06 Jun 2025 01:39:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=44197125</link><dc:creator>numeri</dc:creator><comments>https://news.ycombinator.com/item?id=44197125</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44197125</guid></item><item><title><![CDATA[New comment by numeri in "I do not remember my life and it's fine"]]></title><description><![CDATA[
<p>I think you're assuming more people are like you than actually are.<p>This is part of the classic debate around aphantasia – both sides assume the other side is speaking more metaphorically, while they're speaking literally. E.g., "Surely he doesn't mean he literally can't visualize things, he just means it's not as sharp for him." or "Surely they don't literally mean they can see it, they're just imagining the list of details/attributes and pretending to see it."</p>
]]></description><pubDate>Fri, 06 Jun 2025 01:38:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=44197124</link><dc:creator>numeri</dc:creator><comments>https://news.ycombinator.com/item?id=44197124</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44197124</guid></item><item><title><![CDATA[New comment by numeri in "I do not remember my life and it's fine"]]></title><description><![CDATA[
<p>They're definitely quite hard for me. I bet my colleagues, friends or family could answer them for me better than I can without prep (which would involve chatting with my wife). Many of the experiences in this article resonate with me, but it's definitely not quite as extreme.</p>
]]></description><pubDate>Fri, 06 Jun 2025 01:32:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=44197093</link><dc:creator>numeri</dc:creator><comments>https://news.ycombinator.com/item?id=44197093</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44197093</guid></item><item><title><![CDATA[New comment by numeri in "Claude Code: An Agentic cleanroom analysis"]]></title><description><![CDATA[
<p>Is the analysis right, or did the LLM hallucinate this?</p>
]]></description><pubDate>Sun, 01 Jun 2025 23:03:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=44154401</link><dc:creator>numeri</dc:creator><comments>https://news.ycombinator.com/item?id=44154401</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44154401</guid></item></channel></rss>