Hacker News: azakai

New comment by azakai in "Show HN: Firefox in WebAssembly"

azakai — Wed, 15 Jul 2026 21:59:34 +0000

Prior art: WebKit.js, the WebKit rendering engine ported to JS

https://github.com/trevorlinton/webkit.js/

New comment by azakai in "Mechanistic interpretability researchers applying causality theory to LLMs"

azakai — Mon, 13 Jul 2026 04:45:42 +0000

The researchers in the field disagree with you. Look at conferences like NeurIPS and ICLR to see a steady stream of incremental progress in this area.

New comment by azakai in "Mechanistic interpretability researchers applying causality theory to LLMs"

azakai — Mon, 13 Jul 2026 02:54:34 +0000

The optimism is based on the successes so far, some of which are described in this article. Scientists have made progress here.

New comment by azakai in "Mechanistic interpretability researchers applying causality theory to LLMs"

azakai — Sun, 12 Jul 2026 19:37:05 +0000

Yes, we do see signs of actual reasoning, see the papers linked in the article. (There are many others too.)

Yes, we have a tendency to anthropomorphize, but (most) researchers are aware of this.

New comment by azakai in "Mechanistic interpretability researchers applying causality theory to LLMs"

azakai — Sun, 12 Jul 2026 19:16:03 +0000

The article answers this question, at least to the extent it can be answered, at this time.

We see some signs of reasoning, but also we understand little about how they work.

New comment by azakai in "What Emily Bender meant by "stochastic parrots""

azakai — Mon, 06 Jul 2026 18:37:45 +0000

They made a claim about language models in general, not just ones that had been released so far.

The point of the paper, in fact, is that language models are getting "too big", and another approach is needed to make progress, so they were certainly predicting things about later models.

With that said, they talked about "pure" language models, so it is fair to say that they didn't talk about, say, LLMs that are multimodal or that have tool use, which are advances that happened after their paper.

New comment by azakai in "What Emily Bender meant by "stochastic parrots""

azakai — Mon, 06 Jul 2026 17:17:58 +0000

My main criticism of the paper is that it says LLMs work "haphazardly", using probabilistic information. That is a hypothesis, but it is stated as a known fact, a fundamental limitation.

It is true that LLMs often behave haphazardly, and do rely on statistics. But plenty of research has shown them behaving in methodical ways too. There are findings going both ways!

Granted, many of the strongest contradictory results appeared after the Stochastic Parrots paper, so it isn't like they were ignoring the literature at the time. But they did make a very strong claim, and in the half-decade since, a lot of evidence has come out against it.

New comment by azakai in "Godot will no longer accept AI-authored code contributions"

azakai — Wed, 01 Jul 2026 16:13:39 +0000

Yes. Godot and Zig are the exceptions, and therefore newsworthy.

We Don't Understand Neural Networks at the Algorithmic Level

azakai — Tue, 23 Jun 2026 14:11:51 +0000

Article URL: https://kripken.github.io/blog/neuroscience/2026/06/20/algorithmic.html

Comments URL: https://news.ycombinator.com/item?id=48645276

Points: 2

# Comments: 0

New comment by azakai in "WASI 0.3"

azakai — Fri, 12 Jun 2026 18:04:16 +0000

> Let's keep WebAssembly lean and fast!

Note that wasm is still lean and fast - WASI is not part of core wasm, but layered on top.

That is, it is possible to implement wasm without WASI. That is also true for other wasm proposals like WasmGC. It is very possible that parts of the ecosystem will not implement certain proposals if they don't make sense there (e.g. parts of the embedded ecosystem may never add GC, etc.).

New comment by azakai in "AI is slowing down"

azakai — Mon, 08 Jun 2026 16:40:47 +0000

Not the person you are responding to, but here:

> I believe that artificial intelligence has three quarters to prove itself before the apocalypse comes, and when it does, it will be that much worse, savaging the revenues of the biggest companies in tech. Once usage drops, so will the remarkable amounts of revenue that have flowed into big tech, and so will acres of data centers sit unused, the cloud equivalent of the massive overhiring we saw in post-lockdown Silicon Valley.

We have seen 8 quarters since. Has any of that come to pass?

New comment by azakai in "If LLMs Have Human-Like Attributes, Then So Does Age of Empires II"

azakai — Sun, 07 Jun 2026 22:15:53 +0000

Exactly. Here is where this happens in the paper:

> Suppose one copies an LLM into AoE II and feeds into the AoE II-LLM ‘I feel lonely’ as an input. This AoE II-LLM replies: ‘I feel bad for you, maybe catch up with a friend? Closeness always helps in these situations’. One would be hard-pressed to make a convincing argument that, because of this response, an AoE II-LLM knows what helps in these situations

I don't see why one would be any more hard-pressed to make that conclusion about this system than a "normal" LLM.

That it is harder to "read" the data out is the only difference (the AoE II-LLM's output is encoded in game elements). But is ease of decoding an actual issue? If we can't understand a group of people that speak another language, does that say anything about them, or about us?

New comment by azakai in "Artificial intelligence is not conscious – Ted Chiang"

azakai — Thu, 04 Jun 2026 03:37:50 +0000

What about the cognitive capacity of understanding?

New comment by azakai in "Artificial intelligence is not conscious – Ted Chiang"

azakai — Thu, 04 Jun 2026 03:25:00 +0000

If you want examples of this, see the recent book "The AI Con"

https://www.goodreads.com/en/book/show/217432753-the-ai-con

which describes LLMs as "souped-up autocomplete", complex statistics that cannot truly understand anything. A more recent example is this paper:

https://zenodo.org/records/20071869

which says,

> [LLMs], as turbo-charged statistical models (recall their formal relation to logistic regression) can only but provide correlations.

And, of course, the Stochastic Parrot paper is the classic example in this area. It is from 5 years ago, but "LLMs only do statistics / can't understand" is very much alive and active among academics, even if it is a minority position.

New comment by azakai in "An OpenAI model has disproved a central conjecture in discrete geometry"

azakai — Wed, 20 May 2026 22:42:23 +0000

There was a lot new in calculus, but it also didn't come out of nowhere.

That Newton and Leibniz came up with similar ideas in parallel, independently, around the same time (what are the odds?), supports that.

https://en.wikipedia.org/wiki/Leibniz%E2%80%93Newton_calculu...

New comment by azakai in "Natural Language Autoencoders: Turning Claude's Thoughts into Text"

azakai — Fri, 08 May 2026 00:36:55 +0000

Thanks! I missed that part before.

New comment by azakai in "Natural Language Autoencoders: Turning Claude's Thoughts into Text"

azakai — Thu, 07 May 2026 23:28:52 +0000

I had the same question. I think that could be answered by using the predicted activation, but I don't see that in the paper.

That is, rather than just translate activation to text, then text to activation, that final activation could then be applied to the neural network, and it would be allowed to continue running from there.

If it kept running in a similar way, that would show that the predicted activation is close enough to the original one. Which would add some confidence here.

But a lot better would be to then do experiments with altered text. That is, if the text said "this is true" and it was changed to "this is false", and that intervention led to the final output implying it was false, that would be very interesting.

This seems obvious but I don't see it mentioned as a future direction there, so maybe there is an obvious reason it can't work.

New comment by azakai in "He asked AI to count carbs 27000 times. It couldn't give the same answer twice"

azakai — Wed, 29 Apr 2026 17:54:12 +0000

The hardware can also add nondeterminism. GPUs reorder operations, leading to different results.

Vendors might also be running A/B testing or who knows what, even when you ask for a temperature of 0.

But, if you run a fixed model with temperature 0 on your local CPU, it will be deterministic (unless there are bugs).

New comment by azakai in "He asked AI to count carbs 27000 times. It couldn't give the same answer twice"

azakai — Wed, 29 Apr 2026 15:55:21 +0000

A carb counting app might use API calls to these frontier models and then do some kind of analysis. It could see if different models agree or not, or multiple calls, and with how much variance.

So it would be more accurate to test the apps rather than the APIs, unless the goal is to warn people that just open chatgpt and ask there.

New comment by azakai in "Talkie: a 13B vintage language model from 1930"

azakai — Tue, 28 Apr 2026 02:36:21 +0000

fwiw, asking the model directly, "who is the ruler of England at present?" returns "Queen Victoria is the reigning sovereign of England."