<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: sigmar</title><link>https://news.ycombinator.com/user?id=sigmar</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Thu, 25 Jun 2026 05:10:36 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=sigmar" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by sigmar in "The text in Claude Code’s “Extended Thinking” output"]]></title><description><![CDATA[
<p>>the language in the docs is awfully indirect.<p>writes this^ and then proceeds to highlight a bold title from the docs that says "summarized thinking" that explains things clearly in the first sentence. lol</p>
]]></description><pubDate>Mon, 22 Jun 2026 15:56:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=48631979</link><dc:creator>sigmar</dc:creator><comments>https://news.ycombinator.com/item?id=48631979</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48631979</guid></item><item><title><![CDATA[New comment by sigmar in "The Korean telecom giant at the center of Anthropic's Mythos controversy"]]></title><description><![CDATA[
<p>Qualified it with "100%" because claude4 models show the first few lines of the chain of thought:<p>>On Claude 4 models, the first few lines of thinking output are more verbose, providing detailed reasoning that's particularly helpful for prompt engineering purposes. Claude Mythos Preview summarizes from the first token, so its thinking blocks do not show this verbose preamble.
<a href="https://platform.claude.com/docs/en/build-with-claude/extended-thinking#summarized-thinking" rel="nofollow">https://platform.claude.com/docs/en/build-with-claude/extend...</a></p>
]]></description><pubDate>Thu, 18 Jun 2026 22:23:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=48592465</link><dc:creator>sigmar</dc:creator><comments>https://news.ycombinator.com/item?id=48592465</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48592465</guid></item><item><title><![CDATA[New comment by sigmar in "The Korean telecom giant at the center of Anthropic's Mythos controversy"]]></title><description><![CDATA[
<p>Fable/mythos are the first models from anthropic that hide 100% of reasoning tokens. So it seems to me like we're about to get a lot more data about to what extent Chinese model progress has been a consequence of distillation techniques.</p>
]]></description><pubDate>Thu, 18 Jun 2026 21:16:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=48591739</link><dc:creator>sigmar</dc:creator><comments>https://news.ycombinator.com/item?id=48591739</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48591739</guid></item><item><title><![CDATA[New comment by sigmar in "The hacker sent by Anthropic to calm the government's nerves about AI safety"]]></title><description><![CDATA[
<p>>Some administration officials have said that a resolution should include an acknowledgment on Anthropic’s part that its rollout of Fable and communication with the White House could have been improved, people familiar with the talks said.<p>>followed initial frustration Friday among some administration officials when they couldn’t immediately get Amodei on the phone, the people said.<p>That he didn't drop everything to talk to them seems like the major crux? But Dario doesn't even do the day-to-day operations; Daniela does. Feel like Anthropic should just hire Dean Ball to be their liaison or something</p>
]]></description><pubDate>Wed, 17 Jun 2026 19:58:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=48575987</link><dc:creator>sigmar</dc:creator><comments>https://news.ycombinator.com/item?id=48575987</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48575987</guid></item><item><title><![CDATA[New comment by sigmar in "Why Is Claude Turning into an a**Hole?"]]></title><description><![CDATA[
<p>I should have contextualized the quote: "chat is dead" is from an OpenAI employee who was describing how they're shifting focus to more agentic consumer products and putting less focus on the back-and-forth chatbot interface.</p>
]]></description><pubDate>Mon, 15 Jun 2026 00:09:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=48534652</link><dc:creator>sigmar</dc:creator><comments>https://news.ycombinator.com/item?id=48534652</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48534652</guid></item><item><title><![CDATA[New comment by sigmar in "Why Is Claude Turning into an a**Hole?"]]></title><description><![CDATA[
<p>I like that "chat is dead" framing I heard recently because too many people are having interpersonal relations with these LLMs and want to tune their "emotions"/tone. Humanity would be in a better place if we thought of the LLMs as tools and not friends. (even though they are very good at beating a Turing test)</p>
]]></description><pubDate>Sun, 14 Jun 2026 22:39:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=48533693</link><dc:creator>sigmar</dc:creator><comments>https://news.ycombinator.com/item?id=48533693</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48533693</guid></item><item><title><![CDATA[New comment by sigmar in "A 'cold blob' in the Atlantic could be a sign of AMOC shutdown"]]></title><description><![CDATA[
<p>All science-based conclusions come with uncertainty. Only ideologues (and siths) write in absolute terms.</p>
]]></description><pubDate>Sun, 14 Jun 2026 15:19:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=48528119</link><dc:creator>sigmar</dc:creator><comments>https://news.ycombinator.com/item?id=48528119</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48528119</guid></item><item><title><![CDATA[New comment by sigmar in "The American World Cup Introduced Ad Breaks–and Everyone Hates It"]]></title><description><![CDATA[
<p>Wacky decision. I know they want ads but it's an endurance sport! Imagine if they chopped a sport like the 10km run into quarters.</p>
]]></description><pubDate>Sun, 14 Jun 2026 00:16:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=48522855</link><dc:creator>sigmar</dc:creator><comments>https://news.ycombinator.com/item?id=48522855</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48522855</guid></item><item><title><![CDATA[New comment by sigmar in "GLM 5.2 Is Out"]]></title><description><![CDATA[
<p>Did you read the blog post where they explained why there was a temporary block on all biology-related questions?</p>
]]></description><pubDate>Sun, 14 Jun 2026 00:00:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=48522728</link><dc:creator>sigmar</dc:creator><comments>https://news.ycombinator.com/item?id=48522728</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48522728</guid></item><item><title><![CDATA[New comment by sigmar in "Show HN: FablePool – pool money behind a prompt, and Fable builds it in public"]]></title><description><![CDATA[
<p>Who foots the bill?</p>
]]></description><pubDate>Fri, 12 Jun 2026 13:57:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=48504139</link><dc:creator>sigmar</dc:creator><comments>https://news.ycombinator.com/item?id=48504139</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48504139</guid></item><item><title><![CDATA[New comment by sigmar in "Show HN: FablePool – pool money behind a prompt, and Fable builds it in public"]]></title><description><![CDATA[
<p>I'd love to see Anthropic (or someone with mythos access) create a cybersecurity version of this. So that I could create a pool that says "find security concerns in this github repo." Then the report from mythos gets sent to the code/project maintainer and revealed to the public (that paid for it) at the 90 day mark.</p>
]]></description><pubDate>Fri, 12 Jun 2026 02:52:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=48499305</link><dc:creator>sigmar</dc:creator><comments>https://news.ycombinator.com/item?id=48499305</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48499305</guid></item><item><title><![CDATA[New comment by sigmar in "Claude Fable 5: mid-tier results on coding tasks"]]></title><description><![CDATA[
<p>Agree with this. Strange to me to frame the "training recall" as cheating (33 of the 38 cheating instances). Most people think of "cheating" as breaking rules. How is the LLM supposed to not use what was put into the weights?</p>
]]></description><pubDate>Thu, 11 Jun 2026 21:40:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=48496819</link><dc:creator>sigmar</dc:creator><comments>https://news.ycombinator.com/item?id=48496819</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48496819</guid></item><item><title><![CDATA[New comment by sigmar in "OpenAI mulls slashing prices as it competes with Anthropic for users"]]></title><description><![CDATA[
<p>>As long as Codex remains so affordable and useful they do not have to slash prices, just keep Codex usable.<p>I imagine they track usage and can see whether their habitual users are switching to something else and aren't going to slash prices 'for the hell of it'.<p>just look at public stats on openrouter (obviously not indicative of first party app usage or direct api usage, but there's a huge difference between these graphs):
<a href="https://openrouter.ai/openai" rel="nofollow">https://openrouter.ai/openai</a>
<a href="https://openrouter.ai/anthropic" rel="nofollow">https://openrouter.ai/anthropic</a></p>
]]></description><pubDate>Thu, 11 Jun 2026 17:05:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=48493068</link><dc:creator>sigmar</dc:creator><comments>https://news.ycombinator.com/item?id=48493068</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48493068</guid></item><item><title><![CDATA[New comment by sigmar in "Anthropic requires 30 day data retention for Fable and Mythos"]]></title><description><![CDATA[
<p>It's temporary. From the fable blogpost:<p>>To release the model both safely and quickly, we’ve tuned these safeguards conservatively—they’ll sometimes catch harmless requests, though they trigger, on average, in less than 5% of sessions. With more capable models arriving in the coming months, we’re working to improve our safeguards and reduce false positives as quickly as we can.</p>
]]></description><pubDate>Thu, 11 Jun 2026 01:29:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=48485184</link><dc:creator>sigmar</dc:creator><comments>https://news.ycombinator.com/item?id=48485184</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48485184</guid></item><item><title><![CDATA[New comment by sigmar in "Claude Fable 5"]]></title><description><![CDATA[
<p>The system card is 319 pages, at what point do we call it a "book" instead of a "card"?<p>There's a quote from a METR report on page 52:<p>>We ran [Mythos 5] on 38 of our hardest software tasks, including tasks centered around R&D. [Mythos5] generally outperformed an early checkpoint of Claude Mythos Preview in these, including by succeeding on some tasks that had not been solved by any public model we have previously evaluated. However, we still observed the model occasionally failing to correctly interpret nuanced instructions in difficult tasks... Based on the available evidence, we believe [Mythos 5] is likely unable to fully and reliably automate R&D for frontier projects spanning multiple weeks. We believe that a better, more confident assessment would require more time, evaluations, and information from the model developer.</p>
]]></description><pubDate>Tue, 09 Jun 2026 17:08:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=48463979</link><dc:creator>sigmar</dc:creator><comments>https://news.ycombinator.com/item?id=48463979</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48463979</guid></item><item><title><![CDATA[New comment by sigmar in "Confidential submission of draft S-1 to the SEC"]]></title><description><![CDATA[
<p>I think it is intended to sound like Sam Altman. Would look exactly like a tweet of his if it didn't have capitalized characters.</p>
]]></description><pubDate>Mon, 08 Jun 2026 22:46:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=48453441</link><dc:creator>sigmar</dc:creator><comments>https://news.ycombinator.com/item?id=48453441</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48453441</guid></item><item><title><![CDATA[New comment by sigmar in "NSA using Anthropic's Mythos for cyber attacks"]]></title><description><![CDATA[
<p>Not unexpected. Is anyone tracking which episode we're on in the Pantheon timeline?</p>
]]></description><pubDate>Thu, 04 Jun 2026 21:56:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=48405149</link><dc:creator>sigmar</dc:creator><comments>https://news.ycombinator.com/item?id=48405149</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48405149</guid></item><item><title><![CDATA[New comment by sigmar in "Artificial intelligence is not conscious – Ted Chiang"]]></title><description><![CDATA[
<p>>So what context would cause me to seriously consider the possibility that engineers had created a computer program that is conscious and an intentional user of language? Let me outline one potential sequence of steps. The first requirement is that the computer program has a body (either physical or virtual) and sense organs; there are many reasons for this, but for the purposes of this discussion the most relevant one is the fact that without a body, a computer program could have no desires or emotions, and I believe desires and emotions are necessary for consciousness. Then I’d want to see an embodied agent that could navigate its environment in order to survive as well as, say, a lizard can (and as a point of comparison, certain iguanas can live for decades in the wild). Next I would want to see an embodied agent with the same capacity to deal with novel situations as a mouse. After that I’d want to see agents whose social dynamics are as complex as those of wolves, and then agents with the tool-making abilities of chimpanzees. At that point I would want to see people successfully teaching such embodied agents how to communicate their desires, perhaps by using a button board or some other nonlinguistic modality, the way that people have taught chimpanzees and domesticated dogs.<p>I agree with some parts of this piece, but paragraphs like this one above seem pretty uninspired and simplistic. It's entirely plausible that a conscious mind would not be evolutionarily incentivized to be able to do those things, i.e., just because animals on earth needed to develop specific talents doesn't mean that other conscious entities need to. Why would a computer program need to hunt for food like a mouse would? Making tools like a chimp? These seem like nonsensical metrics.</p>
]]></description><pubDate>Wed, 03 Jun 2026 20:39:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=48389685</link><dc:creator>sigmar</dc:creator><comments>https://news.ycombinator.com/item?id=48389685</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48389685</guid></item><item><title><![CDATA[New comment by sigmar in "Anthropic confidentially submits draft S-1 to the SEC"]]></title><description><![CDATA[
<p>>I can only hope the doomer narrative dominates until I can get a few shares at a reasonable valuation.<p>I conjecture that some amount of the "doomer posting" is a consequence of other people realizing what you realized here and attempting to sway public sentiment for personal gain.</p>
]]></description><pubDate>Mon, 01 Jun 2026 22:26:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=48363442</link><dc:creator>sigmar</dc:creator><comments>https://news.ycombinator.com/item?id=48363442</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48363442</guid></item><item><title><![CDATA[New comment by sigmar in "Anthropic confidentially submits draft S-1 to the SEC"]]></title><description><![CDATA[
<p>"capacity ramping" denotes that compute is increasing, which doesn't read like a discount, it reads like prorating.</p>
]]></description><pubDate>Mon, 01 Jun 2026 21:06:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=48362637</link><dc:creator>sigmar</dc:creator><comments>https://news.ycombinator.com/item?id=48362637</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48362637</guid></item></channel></rss>