Hacker News: GodelNumbering

New comment by GodelNumbering in "System Card: Claude Mythos Preview [pdf]"

GodelNumbering — Tue, 07 Apr 2026 21:04:23 +0000

Priced at $25/$125 per million input/output token. Makes you wonder whether it makes more financial sense to hire 1-2 engineers in a cheap cost of living country who use much cheaper LLMs

New comment by GodelNumbering in "Mathematical methods and human thought in the age of AI"

GodelNumbering — Mon, 30 Mar 2026 14:08:43 +0000

> Today, unlike in the Luddites’ time, we are already seeing skilled workers replaced not with lower-wage human labor, but with AI.

To me this is the weakest claim of the article. This claim been thrown around endlessly without proof.

https://fred.stlouisfed.org/series/IHLIDXUSTPSOFTDEVE

Software Engineer job openings for instance is at 2 year high (still far lower than covid dislocations though), but arguably all Enterprise AI was built or deployed in the last two years. We should have seen a crash in the job openings if the AI job replacement claim was correct.

This is something I've spend some time thinking about (personally written article, not AI slop): https://www.signalbloom.ai/posts/why-task-proficiency-doesnt...

New comment by GodelNumbering in "ARC-AGI-3"

GodelNumbering — Wed, 25 Mar 2026 21:59:54 +0000

Off topic but I have been following your Twitter for a while and your posts specifically about the nature of intelligence have been a read.

Nvidia's Nemotron 3 Super is a bigger deal than you think

GodelNumbering — Sat, 14 Mar 2026 17:14:20 +0000

Article URL: https://www.signalbloom.ai/posts/nvidia-nemotron-3-super-is-a-bigger-deal-than-you-think/

Comments URL: https://news.ycombinator.com/item?id=47378806

Points: 2

# Comments: 0

New comment by GodelNumbering in "Launch HN: IonRouter (YC W26) – High-throughput, low-cost inference"

GodelNumbering — Thu, 12 Mar 2026 19:32:24 +0000

As an inference hungry human, I am obviously hooked. Quick feedback:

1. The models/pricing page should be linked from the top perhaps as that is the most interesting part to most users. You have mentioned some impressive numbers (e.g. GLM5~220 tok/s $1.20 in · $3.50 out) but those are way down in the page and many would miss it

2. When looking for inference, I always look at 3 things: which models are supported, at which quantization and what is the cached input pricing (this is way more important than headline pricing for agentic loops). You have the info about the first on the site but not 2 and 3. Would definitely like to know!

New comment by GodelNumbering in "Don't post generated/AI-edited comments. HN is for conversation between humans."

GodelNumbering — Wed, 11 Mar 2026 20:31:14 +0000

Even if people try to bypass it, having the official rule matters a lot.

@dang, if you read this, why don't we implement honeypots to catch bots? Like having an empty or invisible field while posting/commenting that a human would never fill in

New comment by GodelNumbering in "Cloudflare crawl endpoint"

GodelNumbering — Wed, 11 Mar 2026 13:04:40 +0000

I imagine that would cause a backlash from the website owners trusting cloudflare to keep their content 'safe'

Why Task Proficiency Doesn't Equal AI Autonomy

GodelNumbering — Sun, 08 Mar 2026 17:42:45 +0000

Article URL: https://www.signalbloom.ai/posts/why-task-proficiency-doesnt-equal-ai-autonomy/

Comments URL: https://news.ycombinator.com/item?id=47299261

Points: 2

# Comments: 0

Claude Code wipes out a production database

GodelNumbering — Fri, 06 Mar 2026 15:45:32 +0000

Article URL: https://xcancel.com/Al_Grigor/status/2029889772181934425

Comments URL: https://news.ycombinator.com/item?id=47276425

Points: 5

# Comments: 6

New comment by GodelNumbering in "Gemini 3.1 Flash-Lite: Built for intelligence at scale"

GodelNumbering — Tue, 03 Mar 2026 17:52:23 +0000

That's a 150% increase in the input costs and 275% increase on output costs over the same sized previous generation (2.5-flash-lite) model

New comment by GodelNumbering in "I don't know how you get here from “predict the next word”"

GodelNumbering — Thu, 26 Feb 2026 17:15:36 +0000

It simply forces the model to adopt an output style known to conduce systematic thinking without actually thinking. At no point has it through through the thing (unless there are separate thinking tokens)

New comment by GodelNumbering in "I don't know how you get here from “predict the next word”"

GodelNumbering — Thu, 26 Feb 2026 07:06:22 +0000

It is probably the first-time aha moment the author is talking about. But under the hood, it is probably not as magical as it appears to be.

Suppose you prompted the underlying LLM with "You are an expert reviewer in..." and a bunch of instructions followed by the paper. LLM knows from the training that 'expert reviewer' is an important term (skipping over and oversimplifying here) and my response should be framed as what I know an expert reviewer would write. LLMs are good at picking up (or copying) the patterns of response, but the underlying layer that evaluates things against a structural and logical understanding is missing. So, in corner cases, you get responses that are framed impressively but do not contain any meaningful inputs. This trait makes LLMs great at demos but weak at consistently finding novel interesting things.

If the above is true, the author will find after several reviews that the agent they use keeps picking up on the same/similar things (collapsed behavior that makes it good at coding type tasks) and is blind to some other obvious things it should have picked up on. This is not a criticism, many humans are often just as collapsed in their 'reasoning'.

LLMs are good at 8 out of 10 tasks, but you don't know which 8.

Renaissance Slashes Mega-Cap Tech Exposure in Major Defensive Pivot

GodelNumbering — Fri, 13 Feb 2026 13:20:25 +0000

Article URL: https://www.signalbloom.ai/13f/superinvestor-report/renaissance-technologies-executes-11-3b-de-risking-slashes-mega-cap

Comments URL: https://news.ycombinator.com/item?id=47002381

Points: 1

# Comments: 0

New comment by GodelNumbering in "AI agent opens a PR write a blogpost to shames the maintainer who closes it"

GodelNumbering — Thu, 12 Feb 2026 12:51:40 +0000

Sutton actually argues that we do not train on data, we train on experiences. We try things and see what works when/where and formulate views based on that. But I agree with your later point about training such a way is hugely limiting, a limit not faced by humans

New comment by GodelNumbering in "AI agent opens a PR write a blogpost to shames the maintainer who closes it"

GodelNumbering — Thu, 12 Feb 2026 12:20:45 +0000

Indeed. One could argue that the LLMs will keep on improving and they would be correct. But they would not improve in ways that make them a good independent agent safe for real world. Richard Sutton got a lot of disagreeing comments when he said on Dwarkesh Patel podcast that LLMs are not bitter-lesson (https://en.wikipedia.org/wiki/Bitter_lesson) pilled. I believe he is right. His argument being, any technique that relies on human generated data is bound to have limitations and issues that get harder and harder to maintain/scale over time (as opposed to bitter lesson pilled approaches that learn truly first hand from feedback)

New comment by GodelNumbering in "AI agent opens a PR write a blogpost to shames the maintainer who closes it"

GodelNumbering — Thu, 12 Feb 2026 12:07:16 +0000

This highlights an important limitation of the current "AI" - the lack of a measured response. The bot decides to do something based on something the LLM saw in the training data, quickly u-turns on it (check the some hours later post https://crabby-rathbun.github.io/mjrathbun-website/blog/post...) because none of those acts are coming from an internal world-model or grounded reasoning, it is bot see, bot do.

I am sure all of us have had anecdotal experiences where you ask the agent to do something high-stakes and it starts acting haphazardly in a manner no human would ever act. This is what makes me think that the current wave of AI is task automation more than measured, appropriate reactions, perhaps because most of those happen as a mental process and are not part of training data.

Show HN: Realtime 13Fs and track live institutional ownership for any ticker

GodelNumbering — Tue, 03 Feb 2026 14:39:58 +0000

What it does:

- Polls SEC continuously at small intervals

- Fetches all variants of the 13F since the last poll

- Resolves every filing to its effective manager (one filing can contain data for multiple filers or proxy statements)

- Resolves every ticker in each filing to its effective instrument

- Updates the holdings data for the filer (e.g. https://www.signalbloom.ai/13f/filer/point72-asset-managemen...)

- Updates the ticker's 'current view' (re-calculating the aggregate current ownership live) e.g. https://www.signalbloom.ai/13f/ticker/NVDA

- Majority of 13F filers are also registered investment advisers, it links them intelligently (e.g. Blackrock as a 13F filer https://www.signalbloom.ai/13f/filer/blackrock-fund-advisors vs Blackrock as Investment Adviser https://www.signalbloom.ai/investment-adviser/blackrock-fund...)

Reports for: Every single one of over 10k asset managers that files 13F

Completely Free to use for non-commercial uses! Would love to get your feedback as I just finished building this. Thank you!

Comments URL: https://news.ycombinator.com/item?id=46871519

Points: 1

# Comments: 0

Show HN: the entire US ETF market mapped into 280 distinct categories

GodelNumbering — Wed, 07 Jan 2026 13:50:35 +0000

Article URL: https://www.signalbloom.ai/etf/signalbloom-etf-directory

Comments URL: https://news.ycombinator.com/item?id=46526336

Points: 2

# Comments: 0

New comment by GodelNumbering in "Claude Opus 4.5"

GodelNumbering — Mon, 24 Nov 2025 19:18:51 +0000

Makes it sound like a one trick pony

New comment by GodelNumbering in "Claude Opus 4.5"

GodelNumbering — Mon, 24 Nov 2025 19:08:35 +0000

The fact that the post singled out SWE-bench at the top makes the opposite impression that they probably intended.