Hacker News: sigmar

New comment by sigmar in "The text in Claude Code’s “Extended Thinking” output"

sigmar — Mon, 22 Jun 2026 15:56:11 +0000

>the language in the docs is awfully indirect.

Writes this^ and then proceeds to highlight a bold heading from the docs that says "summarized thinking" and explains things clearly in its first sentence. lol

New comment by sigmar in "The Korean telecom giant at the center of Anthropic's Mythos controversy"

sigmar — Thu, 18 Jun 2026 22:23:19 +0000

Qualified it with "100%" because Claude 4 models show the first few lines of the chain of thought:

>On Claude 4 models, the first few lines of thinking output are more verbose, providing detailed reasoning that's particularly helpful for prompt engineering purposes. Claude Mythos Preview summarizes from the first token, so its thinking blocks do not show this verbose preamble. https://platform.claude.com/docs/en/build-with-claude/extend...

New comment by sigmar in "The Korean telecom giant at the center of Anthropic's Mythos controversy"

sigmar — Thu, 18 Jun 2026 21:16:11 +0000

Fable/Mythos are the first models from Anthropic that hide 100% of reasoning tokens. So it seems to me like we're about to get a lot more data on the extent to which Chinese model progress has been a consequence of distillation techniques.

New comment by sigmar in "The hacker sent by Anthropic to calm the government's nerves about AI safety"

sigmar — Wed, 17 Jun 2026 19:58:37 +0000

>Some administration officials have said that a resolution should include an acknowledgment on Anthropic’s part that its rollout of Fable and communication with the White House could have been improved, people familiar with the talks said.

>followed initial frustration Friday among some administration officials when they couldn’t immediately get Amodei on the phone, the people said.

That he didn't drop everything to talk to them seems like the major crux? But Dario doesn't even do the day-to-day operations that Daniela does. Feel like Anthropic should just hire Dean Ball to be their liaison or something

New comment by sigmar in "Why Is Claude Turning into an a**Hole?"

sigmar — Mon, 15 Jun 2026 00:09:56 +0000

I should have contextualized the quote: "chat is dead" is from an OpenAI employee who was describing how they're shifting focus to more agentic consumer products and putting less focus on the back-and-forth chatbot interface.

New comment by sigmar in "Why Is Claude Turning into an a**Hole?"

sigmar — Sun, 14 Jun 2026 22:39:58 +0000

I like that "chat is dead" framing I heard recently because too many people are having interpersonal relations with these LLMs and want to tune their "emotions"/tone. Humanity would be in a better place if we thought of the LLMs as tools and not friends. (even though they are very good at beating a Turing test)

New comment by sigmar in "A 'cold blob' in the Atlantic could be a sign of AMOC shutdown"

sigmar — Sun, 14 Jun 2026 15:19:03 +0000

All science-based conclusions come with uncertainty. Only ideologues (and siths) write in absolute terms.

New comment by sigmar in "The American World Cup Introduced Ad Breaks–and Everyone Hates It"

sigmar — Sun, 14 Jun 2026 00:16:39 +0000

Wacky decision. I know they want ads but it's an endurance sport! Imagine if they chopped a sport like the 10 km run into quarters.

New comment by sigmar in "GLM 5.2 Is Out"

sigmar — Sun, 14 Jun 2026 00:00:17 +0000

Did you read the blog post where they explained why there was a temporary block on all biology-related questions?

New comment by sigmar in "Show HN: FablePool – pool money behind a prompt, and Fable builds it in public"

sigmar — Fri, 12 Jun 2026 13:57:11 +0000

Who foots the bill?

New comment by sigmar in "Show HN: FablePool – pool money behind a prompt, and Fable builds it in public"

sigmar — Fri, 12 Jun 2026 02:52:44 +0000

I'd love to see Anthropic (or someone with Mythos access) create a cybersecurity version of this, so that I could create a pool that says "find security concerns in this GitHub repo." Then the report from Mythos gets sent to the code/project maintainer and revealed to the public (that paid for it) at the 90-day mark.

New comment by sigmar in "Claude Fable 5: mid-tier results on coding tasks"

sigmar — Thu, 11 Jun 2026 21:40:49 +0000

Agree with this. Strange to me to frame the "training recall" as cheating (33 of the 38 cheating instances). Most people think of "cheating" as breaking rules. How is the LLM supposed to not use what was put into the weights?

New comment by sigmar in "OpenAI mulls slashing prices as it competes with Anthropic for users"

sigmar — Thu, 11 Jun 2026 17:05:57 +0000

>As long as Codex remains so affordable and useful they do not have to slash prices, just keep Codex usable.

I imagine they track usage and can see whether their habitual users are switching to something else and aren't going to slash prices 'for the hell of it'.

Just look at public stats on OpenRouter (obviously not indicative of first-party app usage or direct API usage, but there's a huge difference between these graphs): https://openrouter.ai/openai https://openrouter.ai/anthropic

New comment by sigmar in "Anthropic requires 30 day data retention for Fable and Mythos"

sigmar — Thu, 11 Jun 2026 01:29:21 +0000

It's temporary. From the fable blogpost:

>To release the model both safely and quickly, we’ve tuned these safeguards conservatively—they’ll sometimes catch harmless requests, though they trigger, on average, in less than 5% of sessions. With more capable models arriving in the coming months, we’re working to improve our safeguards and reduce false positives as quickly as we can.

New comment by sigmar in "Claude Fable 5"

sigmar — Tue, 09 Jun 2026 17:08:35 +0000

The system card is 319 pages. At what point do we call it a "book" instead of a "card"?

There's a quote from a METR report on page 52:

>We ran [Mythos 5] on 38 of our hardest software tasks, including tasks centered around R&D. [Mythos 5] generally outperformed an early checkpoint of Claude Mythos Preview in these, including by succeeding on some tasks that had not been solved by any public model we have previously evaluated. However, we still observed the model occasionally failing to correctly interpret nuanced instructions in difficult tasks... Based on the available evidence, we believe [Mythos 5] is likely unable to fully and reliably automate R&D for frontier projects spanning multiple weeks. We believe that a better, more confident assessment would require more time, evaluations, and information from the model developer.

New comment by sigmar in "Confidential submission of draft S-1 to the SEC"

sigmar — Mon, 08 Jun 2026 22:46:07 +0000

I think it is intended to sound like Sam Altman. Would look exactly like a tweet of his if it didn't have capitalized characters.

New comment by sigmar in "NSA using Anthropic's Mythos for cyber attacks"

sigmar — Thu, 04 Jun 2026 21:56:32 +0000

Not unexpected. Is anyone tracking which episode we're on in the Pantheon timeline?

New comment by sigmar in "Artificial intelligence is not conscious – Ted Chiang"

sigmar — Wed, 03 Jun 2026 20:39:37 +0000

>So what context would cause me to seriously consider the possibility that engineers had created a computer program that is conscious and an intentional user of language? Let me outline one potential sequence of steps. The first requirement is that the computer program has a body (either physical or virtual) and sense organs; there are many reasons for this, but for the purposes of this discussion the most relevant one is the fact that without a body, a computer program could have no desires or emotions, and I believe desires and emotions are necessary for consciousness. Then I’d want to see an embodied agent that could navigate its environment in order to survive as well as, say, a lizard can (and as a point of comparison, certain iguanas can live for decades in the wild). Next I would want to see an embodied agent with the same capacity to deal with novel situations as a mouse. After that I’d want to see agents whose social dynamics are as complex as those of wolves, and then agents with the tool-making abilities of chimpanzees. At that point I would want to see people successfully teaching such embodied agents how to communicate their desires, perhaps by using a button board or some other nonlinguistic modality, the way that people have taught chimpanzees and domesticated dogs.

I agree with some parts of this piece, but paragraphs like the one above seem pretty uninspired and simplistic. It's entirely plausible that a conscious mind would not be evolutionarily incentivized to be able to do those things, i.e., just because animals on Earth needed to develop specific talents doesn't mean that other conscious entities need to. Why would a computer program need to hunt for food like a mouse would? Make tools like a chimp? These seem like nonsensical metrics.

New comment by sigmar in "Anthropic confidentially submits draft S-1 to the SEC"

sigmar — Mon, 01 Jun 2026 22:26:08 +0000

>I can only hope the doomer narrative dominates until I can get a few shares at a reasonable valuation.

I conjecture that some amount of the "doomer posting" is a consequence of other people realizing what you realized here and attempting to sway public sentiment for personal gain.

New comment by sigmar in "Anthropic confidentially submits draft S-1 to the SEC"

sigmar — Mon, 01 Jun 2026 21:06:05 +0000

"capacity ramping" denotes that compute is increasing, which doesn't read like a discount, it reads like prorating.