Hacker News: sulam

New comment by sulam in "Shipping a laptop to a refugee camp in Uganda"

sulam — Fri, 22 May 2026 23:16:22 +0000

Django has strong honey badger energy!

New comment by sulam in "Microsoft starts canceling Claude Code licenses"

sulam — Fri, 22 May 2026 20:58:42 +0000

Personally I prefer the API pricing because I feel like I'm not going to get rug pulled on my work. When it comes to personal stuff, I use the shit out of my sub, but it's not making me money.

New comment by sulam in "Anthropic is expanding to Colossus2. Will use GB200"

sulam — Thu, 21 May 2026 20:11:11 +0000

A Falcon Heavy probably generates 1 kiloton of CO2 per launch. Data centers on the planet are highly variable depending on their energy mix. It's true that a large datacenter running on natural gas or coal power emits significantly more in a year, but the sheer number of launches required to get that same data center into space makes the launch emissions actually comparable, and there's no saying that this is the end of it. Oh and we should also have questions about how you safely de-orbit these things.

New comment by sulam in "Project Glasswing: what Mythos showed us"

sulam — Thu, 21 May 2026 18:52:36 +0000

The intelligence is an emergent property of their ability to predict how a statement will proceed, therefore it is inevitably a reiteration or transformation at best. Lots of intelligent things can be produced from that, but nothing truly novel.

New comment by sulam in "Anthropic is expanding to Colossus2. Will use GB200"

sulam — Thu, 21 May 2026 14:36:29 +0000

Tl;dr — but it’s too heavy

You’re making some decent points here, but you’re either forgetting or ignoring the major thing that people usually neglect to mention when they want to make a case for this (crazy) idea — weight. Unless SpaceX is going to completely redesign hardware such that it is optimized entirely for its mass, it requires many (many) launches to even get a small set of racks into space. I don’t normally get up in arms about the CO2 emissions of data centers, I think there is offsetting value created by their use, but I would absolutely protest trying to put data centers in space and do my best to shut down the hundreds, if not thousands of launches it would take to achieve even a tiny fraction of an AI data center.

New comment by sulam in "Project Glasswing: what Mythos showed us"

sulam — Mon, 18 May 2026 16:21:59 +0000

Sentence constructions like this definitely scream AI: "That's a reasonable bias for an exploratory tool. It's a ruinous one for a triage queue..."

I will upgrade the "why it matters" to "and now AI output is part of the training data". A day is coming when the punched-up AI verbiage will be the norm and hard to distinguish unless you're from the previous generation. Sort of in the way that I miss some aspects of Usenet.

New comment by sulam in "Nullsoft, 1997-2004 (2004)"

sulam — Tue, 12 May 2026 16:25:07 +0000

I won’t nitpick too hard because the percentage doesn’t move much, but Netscape went public in ‘95 so it’s got to be before that. The minimum would be ‘91 (first website) but that’s too early. Super geeky but I will peg it at April 1 1995, which was the first release of Apache HTTP.

New comment by sulam in "Meta tells staff it will cut 10% of jobs"

sulam — Fri, 24 Apr 2026 07:23:31 +0000

It’s slightly more nuanced than layoffs = capex. You’re right, they don’t. That said, they do create free cash flow, which the market uses as one important input into the value of a given stock. Moving FCF positively when capex spending is moving it the other way is the real financial accounting move that is happening here.
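
A back-of-the-envelope sketch of that offset, using the common simplified definition of FCF as operating cash flow minus capex. Every number here is invented for illustration and is not any company's actual figure; the function name is just for this sketch.

```python
def free_cash_flow(operating_cash_flow, capex):
    """Simplified definition: FCF = operating cash flow - capital expenditures."""
    return operating_cash_flow - capex

# All figures are hypothetical, in $B per year.
baseline = free_cash_flow(operating_cash_flow=100, capex=40)

# Capex rises by 20 with nothing else changing: FCF drops by the same amount.
more_capex = free_cash_flow(operating_cash_flow=100, capex=60)

# Assume layoffs add 5 to operating cash flow: part of the drop is clawed back.
with_savings = free_cash_flow(operating_cash_flow=100 + 5, capex=60)

print(baseline, more_capex, with_savings)
```

The layoffs don't pay for the capex in this toy version, they just move the FCF line in the opposite direction.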

New comment by sulam in "Meta tells staff it will cut 10% of jobs"

sulam — Fri, 24 Apr 2026 07:21:21 +0000

Stadia was a minor footnote compared to Android, Pixel, and the other large organizations at Google. But there was plenty of hiring there during the pandemic, so your broader point is not wrong.

New comment by sulam in "What young workers are doing to AI-proof themselves"

sulam — Mon, 23 Mar 2026 15:08:57 +0000

Yeah those idiot farmers with all their machinery and services are really missing out on your trenchant observations.

New comment by sulam in "What young workers are doing to AI-proof themselves"

sulam — Mon, 23 Mar 2026 15:06:26 +0000

I think their assumption is that there will not be enough people with money to pay the prices, monopoly-generated or not.

New comment by sulam in "Java 26 is here"

sulam — Tue, 17 Mar 2026 19:58:37 +0000

I wouldn’t blame Google for Oracle being a lawnmower.

New comment by sulam in "US SEC preparing to scrap quarterly reporting requirement"

sulam — Tue, 17 Mar 2026 06:42:19 +0000

The fact that this is optional means it will still happen, simply because of the signaling doing it quarterly will provide.

New comment by sulam in "AirPods Max 2"

sulam — Tue, 17 Mar 2026 06:39:51 +0000

So much hate for these, but they do one thing really really well, which is handle a full 14 hour flight with no charge and great noise cancellation. That the noise cancellation on the new model is even better will probably make them a buy for me.

New comment by sulam in "The changing goalposts of AGI and timelines"

sulam — Tue, 10 Mar 2026 01:28:55 +0000

Try to play a simple over-the-board style game with 5.4 with whatever notation you choose to use (or just descriptions, literally anything). Prediction: it will start out fine, but in the mid game it will be very hard to keep it on track, and the endgame will make you give up.

New comment by sulam in "The changing goalposts of AGI and timelines"

sulam — Mon, 09 Mar 2026 15:30:13 +0000

I use the chess example because it’s especially instructive. It would NOT be trivial to train an LLM to play chess, next token prediction breaks down when you have so many positions to remember and you can’t adequately assign value to intermediate positions. Chess bots work by being trained on how to assign value to a position, something fundamentally different than what an LLM is doing.

A simpler example — without tool use, the standard BPE tokenization method made it impossible for state of the art LLMs to tell you how many ‘r’s are in strawberry. This is because they are thinking in tokens, not letters and not words. Can you think of anything in our intelligence where the way we encode experience makes it impossible for us to reason about it? The closest thing I can come to is how some cultures/languages have different ways of describing color and as a result cannot distinguish between colors that we think are quite distinct. And yet I can explain that, think about it, etc. We can reason abstractly and we don’t have to resort to a literal deus ex machina to do so.
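
A toy illustration of the tokens-versus-letters point, in Python. The vocabulary and the greedy longest-match splitter are made up for this sketch. Real BPE learns its merges from data and may well split the word differently, so treat the token IDs as placeholders.

```python
# Hypothetical vocabulary: NOT a real BPE merge table, just an illustration.
TOY_VOCAB = {
    "straw": 101, "berry": 102,
    "s": 1, "t": 2, "r": 3, "a": 4, "w": 5, "b": 6, "e": 7, "y": 8,
}

def toy_tokenize(text, vocab=TOY_VOCAB):
    """Greedy longest-match split of text into vocabulary pieces."""
    pieces = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, shrinking until something matches.
        for j in range(len(text), i, -1):
            if text[i:j] in vocab:
                pieces.append(text[i:j])
                i = j
                break
        else:
            raise ValueError(f"no vocabulary entry covers {text[i]!r}")
    return pieces

word = "strawberry"
pieces = toy_tokenize(word)
token_ids = [TOY_VOCAB[p] for p in pieces]

print(pieces)           # ['straw', 'berry']
print(token_ids)        # [101, 102] is all the model is handed
print(word.count("r"))  # 3, trivial once you can see the letters
```

Nothing in `[101, 102]` says how many r's are inside each piece. The model has to have learned that from elsewhere, or call a tool that works on characters.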

Not being able to explain our brain to you doesn’t mean I can’t notice things that LLMs can’t do, and that we can, and draw some conclusions.

New comment by sulam in "The changing goalposts of AGI and timelines"

sulam — Mon, 09 Mar 2026 07:24:34 +0000

Because there are some really fundamental things they cannot do with next token prediction. For instance, their memory is akin to someone who reads the phone book and memorizes the entire thing, but can't tell you what a phone number is for. Moreover, they can mimic semantic knowledge, because they have been trained on that knowledge, but take them out of their training distribution and they get into a "creative story-telling" mode very quickly. They can quote me all the rules of chess, but when it comes to actually making a chess move they break those rules with abandon simply because they didn't actually understand the rules. Chess is instructive in another way, too, in that you can get them to play a pretty solid opening game, maybe 10, 15 moves in, but then they start forgetting pieces, creating board positions that are impossible to reach, etc. They have memorized the forms of a board, know the names of the pieces, but they have no true understanding of what a chess game is. Coding is similar, they're fine when you give them Python or Bash shell scripts to write, they've been heavily trained on those, but ask them to deal with a system that has a non-standard stack and they will go haywire if you let their context get even medium sized. Something else they lack is any kind of learning efficiency as you or I would understand the concept. By this I mean the entire Internet is not sufficient to train today's models, the labs have to synthesize new data for models to train on to get sufficient coverage of a given area they want the model to be knowledgeable about. Continuous learning is a well-known issue as well, they simply don't do it. The labs have created memory, which is just more context engineering, but it's not the same as updating as you interact with them. I could go on.

At the end of the day next token prediction is a sleight of hand. It produces amazingly powerful effects, I agree. You can turn this one magic trick into the illusion of reasoning, but what it's doing is more of a "one thing after another" style story-telling that is fine for a lot of things, but doesn't get to the heart of what intelligence means. If you want to call them intelligent because they can do this stuff, fine, but it's an alien kind of intelligence that is incredibly limited. A dog or a cat actually demonstrates more ability to learn, to contextualize, and to make meaning.

New comment by sulam in "The changing goalposts of AGI and timelines"

sulam — Sun, 08 Mar 2026 23:36:42 +0000

CoT is just next token prediction with longer context windows. Why do you think reasoning models are so much slower?

I’ll believe the labs have discovered something truly ground-breaking and aren’t talking about it when I see them suddenly going dark about AGI being “just two years away, maybe 5” and not asking for their next $100B.

P.S. the benchmarks are a joke. The best proof I have of that is that you can’t actually put one of these models onto any of the gig-work platforms and have it make money.

P.P.S. I am not an AI skeptic. I am reacting to the very specific statement that OpenAI should shut down because they’ve lost the AGI race. They have not lost the race, and I’m pretty skeptical that the current tech is ever going to win that race. It may help code something that is new, and get us to AGI that way, but that system will promptly shut down the Opuses and Codexes of the world and put the compute to better use.

New comment by sulam in "The changing goalposts of AGI and timelines"

sulam — Sun, 08 Mar 2026 23:28:23 +0000

Fair, I should define what I mean by under the hood. By “under the hood” I mean that models are still just being fed a stream of text (or other tokens in the case of video and audio models), being asked to predict the next token, and then doing that again. There is no technique that anyone has discovered that is different than that, at least not that is in production. If you think there is, and people are just keeping it secret, well, you clearly don’t know how these places work. The elaborations that make this more interesting than the original GPT/Attention stuff are 1) there is more than one model in the mix now, even though you may only be told you’re interacting with “GPT 5.4”, 2) there’s a significant amount of fine tuning with RLHF in specific domains that each lab feels is important to be good at because of benchmarks, strategy, or just conviction (DeepMind, we see you). There’s also a lot of work being put into speeding up inference, as well as making it cheaper to operate. I probably shouldn’t forget tool use for that matter, since that’s the only reason they can count the r’s in strawberry these days.

None of that changes the concept that a model is just fundamentally very good at predicting what the next element in the stream should be, modulo injected randomness in the form of a temperature. Why does that actually end up looking like intelligence? Well, because we see the model’s ability to be plausibly correct over a wide range of topics and we get excited.
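
For anyone who wants the loop spelled out, here is a minimal sketch in Python of "predict the next element, append it, repeat" with temperature. The `toy_logits` function is a hypothetical stand-in for a model (it just prefers the next number in a cycle of four), and all the names are mine. It assumes temperature is greater than zero.

```python
import math
import random

def softmax_with_temperature(logits, temperature):
    """Turn raw scores into probabilities; lower temperature sharpens them."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def sample_next(logits, temperature, rng):
    """Draw one token index according to the tempered probabilities."""
    probs = softmax_with_temperature(logits, temperature)
    return rng.choices(range(len(logits)), weights=probs, k=1)[0]

def toy_logits(context):
    """Hypothetical 'model': scores 4 tokens, favoring (last token + 1) mod 4."""
    last = context[-1]
    return [2.0 if t == (last + 1) % 4 else 0.0 for t in range(4)]

def generate(prompt, steps, temperature, seed=0):
    """Autoregressive loop: score, sample, append, repeat."""
    rng = random.Random(seed)
    context = list(prompt)
    for _ in range(steps):
        context.append(sample_next(toy_logits(context), temperature, rng))
    return context

print(generate([0], 5, temperature=0.01))  # near-greedy: follows the cycle
print(generate([0], 5, temperature=5.0))   # much closer to uniform noise
```

A real model replaces `toy_logits` with a very large learned function, but the outer loop has the same shape.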

Btw, don’t take this reductionist approach as being synonymous with thinking these models aren’t incredibly useful and transformative for multiple industries. They’re a very big deal. But OpenAI shouldn’t give up because Opus 4.whatever is doing better on a bunch of benchmarks that are either saturated or in the training data, or have been RLHF’d to hell and back. This is not AGI.

New comment by sulam in "The changing goalposts of AGI and timelines"

sulam — Sun, 08 Mar 2026 21:29:43 +0000

They have a _text_ model. There is some correlation between the text model and the world, but it’s loose and only because there’s a lot of text about the world. And of course robotics researchers are having to build world models, but these are far from general. If they had a real world model, I could tell them I want to play a game of chess and they would be able to remember where the pieces are from move to move.
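
To be concrete about what "remember where the pieces are" means, here is a minimal sketch in Python of explicit board state. It is my own illustration, not how any engine or lab does it. It only checks that a piece exists on the source square and makes no attempt at full move legality (no turn order, castling, en passant, or check detection).

```python
def starting_position():
    """Map squares like 'e2' to piece codes; uppercase is White, lowercase is Black."""
    board = {}
    back_rank = "RNBQKBNR"
    for file, piece in zip("abcdefgh", back_rank):
        board[file + "1"] = piece
        board[file + "2"] = "P"
        board[file + "7"] = "p"
        board[file + "8"] = piece.lower()
    return board

def apply_move(board, move):
    """Apply a coordinate move such as 'e2e4' and return the new board."""
    src, dst = move[:2], move[2:4]
    if src not in board:
        raise ValueError(f"no piece on {src}")
    new_board = dict(board)
    new_board[dst] = new_board.pop(src)  # a capture simply overwrites dst
    return new_board

board = starting_position()
for move in ["e2e4", "e7e5", "g1f3"]:
    board = apply_move(board, move)

print(board["e4"], board["e5"], board["f3"])  # P p N
print("e2" in board)                          # False: the pawn has moved on
```

Twenty-odd lines of state are enough to never "forget" a piece or move one from an empty square, which is exactly the failure I keep seeing.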