<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: alew1</title><link>https://news.ycombinator.com/user?id=alew1</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Fri, 08 May 2026 18:35:04 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=alew1" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by alew1 in "Language may rely less on complex grammar than previously thought: study"]]></title><description><![CDATA[
<p>The article takes the fact that we appear to treat non-constituents (e.g., “in the middle of the”) as “units” to mean that language is more like “snapping legos together” than “building trees.”<p>But linguists have proposed the possibility that we store “fragments” to facilitate reuse—essentially trees with holes, or equivalently, functions that take in tree arguments and produce tree results. “In the middle of the” could take in a noun-shaped tree as an argument and produce a prepositional-phrase-shaped tree as a result, for instance. Furthermore, this accounts for the way we store idioms that are not just contiguous “Lego block” sequences of words (like “a ____ and a half” or “the more ___, the more ____”). See e.g. work on “fragment grammars.”<p>Can’t access the actual Nature Human Behaviour article, so perhaps it discusses the connections.</p>
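<p>A toy sketch of the “tree with a hole” idea (my own illustration, not code from the fragment-grammar papers; the tree encoding here is made up for the example):<pre><code># A stored "fragment" as a function: it takes a noun-shaped subtree and
# returns a completed prepositional-phrase-shaped tree.

def node(label, *children):
    """A labeled tree as a nested tuple (toy encoding)."""
    return (label, children)

def in_the_middle_of_the(noun_tree):
    return node("PP",
                node("P", "in"),
                node("NP",
                     node("Det", "the"),
                     node("N", "middle"),
                     node("PP",
                          node("P", "of"),
                          node("NP", node("Det", "the"), noun_tree))))

print(in_the_middle_of_the(node("N", "night")))
</code></pre></p>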
]]></description><pubDate>Sat, 24 Jan 2026 22:23:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=46748334</link><dc:creator>alew1</dc:creator><comments>https://news.ycombinator.com/item?id=46748334</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46748334</guid></item><item><title><![CDATA[New comment by alew1 in "Can LLMs do randomness?"]]></title><description><![CDATA[
<p>The algorithms are not deterministic: they output a probability distribution over next tokens, which is then sampled. That’s why clicking “retry” gives you a different answer. An LM could easily (in principle) compute a 50/50 distribution when asked to flip a coin.</p>
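<p>A minimal sketch of what that looks like (toy numbers and a made-up function name, not any particular model’s API):<pre><code>import random

# The model's forward pass returns a distribution over next tokens;
# generation samples from it, which is why "retry" can give a different answer.
def next_token_distribution(prompt):
    # Hypothetical output for a coin-flip prompt: nothing stops a model
    # from putting equal mass on "heads" and "tails".
    return {"heads": 0.5, "tails": 0.5}

dist = next_token_distribution("Flip a coin:")
tokens, probs = zip(*dist.items())
print(random.choices(tokens, weights=probs, k=1)[0])
</code></pre></p>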
]]></description><pubDate>Wed, 30 Apr 2025 11:19:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=43843619</link><dc:creator>alew1</dc:creator><comments>https://news.ycombinator.com/item?id=43843619</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43843619</guid></item><item><title><![CDATA[New comment by alew1 in "US Government threatens Harvard with foreign student ban"]]></title><description><![CDATA[
<p>Harvard is one of eleven American universities that practice need-blind admissions even for international students, meaning that students are admitted without regard for their financial status (i.e., no explicit preference toward richer students who can pay more tuition), and that financial aid covers full demonstrated need for all admitted students.<p><a href="https://en.wikipedia.org/wiki/Need-blind_admission" rel="nofollow">https://en.wikipedia.org/wiki/Need-blind_admission</a></p>
]]></description><pubDate>Fri, 18 Apr 2025 00:06:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=43723596</link><dc:creator>alew1</dc:creator><comments>https://news.ycombinator.com/item?id=43723596</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43723596</guid></item><item><title><![CDATA[New comment by alew1 in "US Government threatens Harvard with foreign student ban"]]></title><description><![CDATA[
<p>The taxpayer funding is for Harvard's research activities, not for its undergraduate teaching. The undergraduate teaching is funded by tuition (often paid in full by international students) and by returns on the endowment (including some earmarked for financial aid).<p>> they could vastly grow their class size without lowering standards<p>The issue isn't the quality of the students they are accepting, but the resources to educate and house them, including classroom space, dorms, and staff.</p>
]]></description><pubDate>Thu, 17 Apr 2025 19:04:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=43720862</link><dc:creator>alew1</dc:creator><comments>https://news.ycombinator.com/item?id=43720862</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43720862</guid></item><item><title><![CDATA[New comment by alew1 in "GPT-4.5"]]></title><description><![CDATA[
<p>Didn't seem to realize that "Still more coherent than the OpenAI lineup" wouldn't make sense out of context. (The actual comment quoted there is responding to someone who says they'd name their models Foo, Bar, Baz.)</p>
]]></description><pubDate>Fri, 28 Feb 2025 00:06:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=43200072</link><dc:creator>alew1</dc:creator><comments>https://news.ycombinator.com/item?id=43200072</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43200072</guid></item><item><title><![CDATA[New comment by alew1 in "Entropy of a Large Language Model output"]]></title><description><![CDATA[
<p>Any interpretation (including interpreting the <i>inputs</i> to the neural net as a "prompt") is "slapped on" in some sense—at some level, it's all just numbers being added, multiplied, and so on.<p>But I wouldn't call the probabilistic interpretation "after the fact." The entire training procedure that generated the LM weights (the pre-training as well as the RLHF post-training) is formulated based on the understanding that the LM predicts p(x_t | x_1, ..., x_{t-1}). For example, pretraining maximizes the log probability of the training data, and RLHF typically maximizes an objective that combines "expected reward [under the LLM's output probability distribution]" with "KL divergence between the pretraining distribution and the RLHF'd distribution" (a probabilistic quantity).</p>
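<p>To make those quantities concrete, here’s a toy sketch with made-up numbers (two distributions over a two-token vocabulary; the 0.1 penalty weight is purely illustrative):<pre><code>import math

p_pre  = {"a": 0.7, "b": 0.3}   # pretraining distribution over next tokens
p_rlhf = {"a": 0.4, "b": 0.6}   # distribution after RLHF-style tuning
reward = {"a": 0.0, "b": 1.0}   # reward assigned to each token

# Pretraining: maximize the log probability of observed data, e.g. log p("b").
log_prob_data = math.log(p_pre["b"])

# RLHF-style objective: expected reward under the tuned model, minus a
# KL penalty keeping it close to the pretraining distribution.
expected_reward = sum(p_rlhf[t] * reward[t] for t in p_rlhf)
kl = sum(p_rlhf[t] * math.log(p_rlhf[t] / p_pre[t]) for t in p_rlhf)
objective = expected_reward - 0.1 * kl

print(log_prob_data, expected_reward, kl, objective)
</code></pre></p>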
]]></description><pubDate>Mon, 13 Jan 2025 21:07:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=42689340</link><dc:creator>alew1</dc:creator><comments>https://news.ycombinator.com/item?id=42689340</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42689340</guid></item><item><title><![CDATA[New comment by alew1 in "Entropy of a Large Language Model output"]]></title><description><![CDATA[
<p>"Temperature" doesn't make sense unless your model is predicting a distribution. You can't "temperature sample" a calculator, for instance. The output of the LLM is a predictive distribution over the next token; this is the formulation you will see in every paper on LLMs. It's true that you can do various things with that distribution <i>other</i> than sampling it: you can compute its entropy, you can find its mode (argmax), etc., but the type signature of the LLM itself is `prompt -> probability distribution over next tokens`.</p>
]]></description><pubDate>Mon, 13 Jan 2025 16:25:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=42685170</link><dc:creator>alew1</dc:creator><comments>https://news.ycombinator.com/item?id=42685170</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42685170</guid></item><item><title><![CDATA[New comment by alew1 in "Large Enough"]]></title><description><![CDATA[
<p>If I show you a strawberry and ask how many r’s are in the name of this fruit, you can tell me, because one of the things you know about strawberries is how to spell their name.<p>Very large language models also “know” how to spell the word associated with the strawberry token, which you can test by asking them to spell the word one letter at a time. If you ask the model to spell the word and count the r’s as it goes, it can do the task. So the failure to do it when asked directly (how many r’s are in strawberry) points to a real weakness in reasoning, where one forward pass of the transformer is not sufficient to retrieve the spelling and also count the r’s.</p>
]]></description><pubDate>Wed, 24 Jul 2024 20:16:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=41061537</link><dc:creator>alew1</dc:creator><comments>https://news.ycombinator.com/item?id=41061537</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41061537</guid></item><item><title><![CDATA[New comment by alew1 in "A new old kind of R&D lab"]]></title><description><![CDATA[
<p>One thought: If you want to be able to remove the static part, you could consider fine-tuning without the static part. If you fine-tune with it included, you’re teaching the model that the desired behavior occurs only in the presence of the static part (hence the going off the rails).</p>
]]></description><pubDate>Tue, 12 Dec 2023 23:48:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=38620528</link><dc:creator>alew1</dc:creator><comments>https://news.ycombinator.com/item?id=38620528</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38620528</guid></item><item><title><![CDATA[New comment by alew1 in "A guidance language for controlling LLMs"]]></title><description><![CDATA[
<p>But the model ultimately still has to process the comma, the newline, the "job". Is the main time savings that this can be done in parallel (on a GPU), whereas in typical generation it would be sequential?</p>
]]></description><pubDate>Wed, 17 May 2023 00:26:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=35969711</link><dc:creator>alew1</dc:creator><comments>https://news.ycombinator.com/item?id=35969711</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=35969711</guid></item><item><title><![CDATA[New comment by alew1 in "We don't know what makes things sentient–so let's stop acting like we do"]]></title><description><![CDATA[
<p>> Any question of personhood should be evaluated on the basis that we evaluate ourselves and others: by action and behavior, and not on whether sentience can or cannot arise from this or that configuration of code.<p>But what is action and behavior? We have a single interface to LaMDA: given a partially completed document, predict the next word. By iterating this process, we can make it predict a sentence, or paragraph. Continuing in this way, we could have it write a hypothetical dialogue between an AI and a human, but that is hardly a "canonical" way of using LaMDA, and there is no reason to identify the AI character in the document with LaMDA itself.<p>All this to say, I am not sure what you mean when you say it "claims sentience". What does it mean for it to "claim" something? Presumably, e.g., advanced image processing networks are as internally complex as LaMDA. But the interface to an advanced image processing network is, you put in an image, it gives out a list of objects and bounding boxes it detected in the image. What would it mean for such a network to claim sentience? LaMDA is no different, in that our interface to LaMDA does not allow us to ask it to "claim" things to us, only to predict likely completions of documents.</p>
]]></description><pubDate>Wed, 15 Jun 2022 19:44:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=31758377</link><dc:creator>alew1</dc:creator><comments>https://news.ycombinator.com/item?id=31758377</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=31758377</guid></item><item><title><![CDATA[New comment by alew1 in "We don't know what makes things sentient–so let's stop acting like we do"]]></title><description><![CDATA[
<p>> Unfortunately, that argument applies to you, yourself.<p>Does it? I don’t think it would even apply to a reinforcement learning agent trained to maximize reward in a complex environment. In that setting, perhaps the agent could learn to use language to achieve its goals, via communication of its desires. But LaMDA is specifically trained to complete documents, and would face selective pressure to eliminate any behavior that hampers its ability to do that — for example, behavior that attempts to use its token predictions as a side channel to communicate its desires to sympathetic humans.<p>Again, this is not an argument that LaMDA is not sentient, just that the practice of “prompting LaMDA with partially completed dialogues between a hypothetical sentient AI and a human, and seeing what it predicts the AI will say” is not the same as “talking to LaMDA.”<p>Suppose LaMDA were powered by a person in a room, whose job it was to predict the completions of sentences. Just because you get the person to predict “I am happy” doesn’t mean the person is happy; indeed, the interface that is available to you, from outside the room, really gives you no way of probing the person’s emotions, experiences, or desires at all.</p>
]]></description><pubDate>Mon, 13 Jun 2022 16:57:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=31728203</link><dc:creator>alew1</dc:creator><comments>https://news.ycombinator.com/item?id=31728203</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=31728203</guid></item><item><title><![CDATA[New comment by alew1 in "We don't know what makes things sentient–so let's stop acting like we do"]]></title><description><![CDATA[
<p>One thing that seems missing from this discussion is that even if LLMs are sentient, there is no reason to believe that we would be able to tell by "communicating" with them. Where Lemoine goes wrong is not in entertaining the possibility that LaMDA is sentient (it might be, just like a forest might be, or a Nintendo Switch), but in mistaking predictions of document completions for an interior monologue of some sort.<p>LaMDA may or may not experience something while repeatedly predicting the next word, but ultimately, it is still optimized to predict the next word, not to communicate its thoughts and feelings. Indeed, if you run an LLM on Lemoine's prompts (including questions like, "I assume you want others to know you are sentient, is that true?"), the LLM will assign some probability to every plausible completion -- so if you sample enough times, it will eventually say, e.g., "Well, I am not sentient."</p>
]]></description><pubDate>Mon, 13 Jun 2022 16:24:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=31727700</link><dc:creator>alew1</dc:creator><comments>https://news.ycombinator.com/item?id=31727700</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=31727700</guid></item><item><title><![CDATA[New comment by alew1 in "Stack Graphs"]]></title><description><![CDATA[
<p>Very cool!<p>The Strange Loop talk includes an example where you infer that Stove() returns a Stove object. If someone writes something like `f(x).broil()`, do you need to do some kind of type inference to figure out what class f(x) is?<p>What cases do Stack Graphs fail to handle? (e.g., I assume dynamic modification of .__dict__ can't be tracked; are there other representative examples?)</p>
]]></description><pubDate>Fri, 10 Dec 2021 03:04:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=29505979</link><dc:creator>alew1</dc:creator><comments>https://news.ycombinator.com/item?id=29505979</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=29505979</guid></item><item><title><![CDATA[New comment by alew1 in "Seemingly impossible functional programs (2007)"]]></title><description><![CDATA[
<p>The trick is that your predicate can’t be implemented in Haskell, because the predicate itself requires looking at infinitely many elements.</p>
]]></description><pubDate>Fri, 29 Oct 2021 14:36:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=29038944</link><dc:creator>alew1</dc:creator><comments>https://news.ycombinator.com/item?id=29038944</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=29038944</guid></item><item><title><![CDATA[New comment by alew1 in "Launch HN: Hera (YC S21) – macOS app to prepare, join and take notes in meetings"]]></title><description><![CDATA[
<p>Hmm. It seems you posted this after apologizing for being hostile in another thread, where Hera’s capabilities (which are… very different from “Quick Notes”) were explained to you. You’ve made several comments at this point incorrectly summarizing and then dismissing the project. What’s your aim here?</p>
]]></description><pubDate>Thu, 08 Jul 2021 15:29:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=27773234</link><dc:creator>alew1</dc:creator><comments>https://news.ycombinator.com/item?id=27773234</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=27773234</guid></item><item><title><![CDATA[New comment by alew1 in "The Time Everyone “Corrected” the World’s Smartest Woman (2015)"]]></title><description><![CDATA[
<p>Ah, yep, that’s right. Another way to see it is that we’re interested in the probability that your door has a goat behind it, given that you didn’t need to start over:<p>P(you chose goat | host didn’t choose car) = P(you chose goat, host didn’t choose car) / P(host didn’t choose car).<p>The numerator is 2/3 * 1/2, and the denominator is 2/3, so the ratio is indeed 1/2.<p>(A rejection sampling loop, where you repeatedly simulate a process until a condition holds, has the same distribution over final outcomes as the conditional distribution—so repeatedly restarting the game if the host chooses the car induces the same distribution on final results as simply conditioning on the host not choosing the car.)</p>
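<p>A quick simulation of that rejection-sampling loop, if you want to sanity-check the 1/2 (door labels and trial count are arbitrary):<pre><code>import random

kept = goat_given_kept = 0
for _ in range(200_000):
    car = random.randrange(3)
    you = random.randrange(3)
    host = random.choice([d for d in range(3) if d != you])  # host opens a door at random
    if host == car:
        continue            # host revealed the car: start the game over
    kept += 1
    goat_given_kept += (you != car)

print(goat_given_kept / kept)   # ~0.5
</code></pre></p>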
]]></description><pubDate>Sun, 09 May 2021 02:51:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=27092773</link><dc:creator>alew1</dc:creator><comments>https://news.ycombinator.com/item?id=27092773</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=27092773</guid></item><item><title><![CDATA[New comment by alew1 in "Probability, Mathematical Statistics, Stochastic Processes"]]></title><description><![CDATA[
<p>I found this recently and was super impressed. A really great (and well-organized) reference!</p>
]]></description><pubDate>Thu, 04 Mar 2021 02:57:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=26338077</link><dc:creator>alew1</dc:creator><comments>https://news.ycombinator.com/item?id=26338077</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=26338077</guid></item><item><title><![CDATA[New comment by alew1 in "TurboTax’s 20-Year Fight to Stop Americans from Filing Taxes for Free (2019)"]]></title><description><![CDATA[
<p>I've always been under the IRS Free File income threshold (I've worked as a high school teacher and am now in grad school), but last year after reading this article was the first time I actually filed for free. That was after 5 years of paying for deluxe TurboTax.<p>I had heard the government required TurboTax to have a free edition. But back in 2019 (and before), if you Googled "TurboTax free" you'd be taken to a decoy free edition; the real one was called their "freedom edition," and was hidden from Google's listings. If your tax situation is 'too complicated,' the free edition tries to upsell you to the "deluxe edition," even if the "freedom edition" could have handled the situation just fine.<p>Thankfully now you can find the <i>actual</i> free version on Google, but it's still very confusing that the "free edition" is less free than the (still somewhat hidden) IRS Free File ("freedom") version. And the faux Free Edition is still being heavily advertised.</p>
]]></description><pubDate>Mon, 08 Feb 2021 06:32:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=26061431</link><dc:creator>alew1</dc:creator><comments>https://news.ycombinator.com/item?id=26061431</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=26061431</guid></item><item><title><![CDATA[New comment by alew1 in "Stripe bans Trump campaign"]]></title><description><![CDATA[
<p>(Pretty sure 100% in that sentence means “certainly.”)</p>
]]></description><pubDate>Mon, 11 Jan 2021 07:21:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=25725765</link><dc:creator>alew1</dc:creator><comments>https://news.ycombinator.com/item?id=25725765</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=25725765</guid></item></channel></rss>