Hacker News: famouswaffles

New comment by famouswaffles in "A recent experience with ChatGPT 5.5 Pro"

famouswaffles — Wed, 13 May 2026 07:10:01 +0000

>If you can sic ChatGPT on a mathematics problem and it can solve it without your input, that's a different matter but that's not what's happening.

I mean that has happened so yeah ?

https://www.scientificamerican.com/article/amateur-armed-wit...

Actual GPT transcript. Zero such input https://chatgpt.com/share/69dd1c83-b164-8385-bf2e-8533e9baba...

And maybe the other guy wasn't the most polite about it but his point is very valid. Replace chatgpt with a human in both of these stories and nobody would say that timothy 'took the horse and made it drink'. The 'Horse' would be the first and likely only Author so this just sounds like denial.

That there are multiple of these stories in the last few months by the latest set of models (there are even more than these 2) should provoke this sort of consideration and discussion.

New comment by famouswaffles in "Googlebook"

famouswaffles — Wed, 13 May 2026 05:57:13 +0000

You're thinking about it the wrong way. Have you never come across some successful business idea and go, 'Huh, I never realized this problem even existed' or even 'People are paying this much for this? Wow'

These machines are general purpose technologies used by hundreds of millions of people. ChatGPT alone is used by over 900M people every week at least. You can count the technologies with that scale of users in your hand.

You'll never conceive all the sort of uses it could possibly have, much like nobody could ever conceive all the uses the internet had and will have and it would be misguided to think so. As you see, there's like 2 dozen people here telling OP the thing he thought 'No one' could possibly LLMs use for is in-fact seeing some use.

New comment by famouswaffles in "Googlebook"

famouswaffles — Tue, 12 May 2026 20:15:51 +0000

Yeah I and i suspect a lot of others email myself little files all the time because surprisingly that's the most convenient way to get those files quickly from phone to laptop.

New comment by famouswaffles in "I'm going back to writing code by hand"

famouswaffles — Mon, 11 May 2026 15:43:15 +0000

Man people really overestimate training. Claude did not 'read' any of that either. I wish frontier models behaved like people that had read and remembered everything they've trained on, but they're not.

New comment by famouswaffles in "Let's talk about LLMs"

famouswaffles — Mon, 04 May 2026 22:03:23 +0000

You're not supposed to flag a post for something like that. Ideally you downvote and move on if you feel that strongly about it. Flagging is meant to be reserved for stuff that breaks the rules or guidelines.

New comment by famouswaffles in "Where the goblins came from"

famouswaffles — Thu, 30 Apr 2026 04:55:13 +0000

>Ex: I would not be comfortable flying on any airplane where the autopilot "just zones-out sometimes", even though it's a dysfunction also seen in people.

You might if that was the best auto-pilot could be. Have you never used a bus or taken a taxi ?

The vast majority of things people are using LLMs for isn't stuff deterministic logic machines did great at, but stuff those same machines did poorly at or straight up stuff previously relegated to the domains of humans only.

If your competition also "just zones out sometimes" then it's not something you're going to focus on.

New comment by famouswaffles in "AI should elevate your thinking, not replace it"

famouswaffles — Wed, 29 Apr 2026 17:55:07 +0000

>LLM didn't solve an Erdos problem, it generated a text that a human looked at, cleaned up, corrected and used as base for a solution.

That's not at all what happened. You clearly are unable to actually understand the work it did so it would have been nice if you'd read the article and accounts of experts.

New comment by famouswaffles in "Amateur armed with ChatGPT solves an Erdős problem"

famouswaffles — Wed, 29 Apr 2026 17:43:38 +0000

This is one of a number of such results achieved only in the last few months with only the last crop of models. They have undoubtedly gotten better in this domain. Saying anything else is just denial. You can run these same problems on GPT-4 or 5 all you want, you'll get nowhere. In fact people did, and you're hearing about it now because it's these crop of models that are getting meaningful results.

New comment by famouswaffles in "How ChatGPT serves ads"

famouswaffles — Wed, 29 Apr 2026 03:05:49 +0000

>The revenue from a few ads on the free tier in exchange for limited queries to GPT-5.3 is negligible compared to what they pull in from API costs and the subscription plans.

Unless they botch the implementation, it's not going to be negligible with ~800M+ free subscribers.

New comment by famouswaffles in "Amateur armed with ChatGPT solves an Erdős problem"

famouswaffles — Sun, 26 Apr 2026 16:06:47 +0000

Glad I could clear that up for you

New comment by famouswaffles in "Amateur armed with ChatGPT solves an Erdős problem"

famouswaffles — Sun, 26 Apr 2026 15:02:41 +0000

>But you can only do that now, in hindsight.

No you could always do that. The meaning you take from it is up to you but you could always separate humans and calculators.

>No, that is not right. Fool’s gold is a thing.

I know what fools gold is. I used it for contrast. Fools gold can be tested for.

>but that doesn’t mean you know how to do it.

It doesn't matter. If you claim it exists but you don't know how to do it and you can't point to anyone who can, it's the same as something you made up.

>It’s like tasting two similar beers or sodas. You may be able to identify them by taste and understand they’re difference but be unable to articulate exactly how you know which is which to the point someone else can use your verbal instructions to know the difference.

You are still making the same mistake. Two similar beers or sodas taste different. No one is asking you to come up with a theory for intelligence. All you have to say here is the equivalent of "It tastes different" and let me taste it for myself. But even that much, you can not do. So why on earth should I treat what you say as worth anything ?

New comment by famouswaffles in "Amateur armed with ChatGPT solves an Erdős problem"

famouswaffles — Sun, 26 Apr 2026 13:16:33 +0000

>Most people would consider someone who can calculate 56863*2446 instantly in their head to be intelligent. Does that mean pocket calculators are intelligent? The result is the same.

If you wanted to insist a calculator wasn't intelligent and satisfy my conditions then you can. At the very least you can test for the sort of intelligence that is present in humans but absent from calculators and cleanly separate the two. These are very easy conditions if there is some actual real difference.

>That is the equivalent of responding to criticism with “can you do better?”. One does not need to be a chef (or even know how to cook) to know when food tastes foul.

No it's not, and this is a silly argument. Foul food tastes different. Sometimes it even looks different. You can test for it and satisfy my conditions.

You come across a shiny piece of yellow metal that you think is gold. It looks like gold, feels like gold and tests like gold. Suddenly a strange fellow comes about insisting that it's not actually gold. No, apparently there is a 'fake' gold. You are intrigued so you ask him, "Alright, what exactly is fake gold, and how can I test or tell them apart ?". But this fellow is completely unable to answer either question. What would you say about him ? He's nothing more than a mad man rambling about a distinction he made up in his head.

What I'm asking you to do is incredibly easy and basic with a real distinction. I'm not going to tell you to stop believing in your fake gold, but I am going to tell you I and no one else can be expected to take you seriously.

New comment by famouswaffles in "Amateur armed with ChatGPT solves an Erdős problem"

famouswaffles — Sun, 26 Apr 2026 06:05:57 +0000

Intelligence is Intelligence. It's intelligent because it does intelligent things. If someone feels the need to add a 'real' and 'fake' moniker to it so they can exclude the machine and make themselves feel better (or for whatever reason) then they are the one meant to be doing the defining, and to tell us how it can be tested for. If they can't, then there's no reason to pay attention to any of it. It's the equivalent of nonsensical rambling. At the end of the day, the semantic quibbling won't change anything.

New comment by famouswaffles in "Amateur armed with ChatGPT solves an Erdős problem"

famouswaffles — Sun, 26 Apr 2026 04:20:16 +0000

Yeah? Those models are creative.

New comment by famouswaffles in "Amateur armed with ChatGPT solves an Erdős problem"

famouswaffles — Sun, 26 Apr 2026 04:19:14 +0000

None of it is really from logical thought. The rationalizations don't make any sense, but they haven't for a while. It's an emotional response. Honestly, It's to be expected.

New comment by famouswaffles in "I'm done making desktop applications (2009)"

famouswaffles — Fri, 24 Apr 2026 21:58:02 +0000

>To me, in my open source projects, my "development cycle" ends when I push to git, and that can be done as often as I want.

If development ends at a git push and users are left to build/fend for themselves (granted this is a lot of open source), then yeah not much difference, but if you're building and packaging it up for users (which you will more likely to be doing if your project is an app specifically) then the difference is massive.

New comment by famouswaffles in "We gave an AI a 3 year retail lease and asked it to make a profit"

famouswaffles — Fri, 17 Apr 2026 13:08:44 +0000

Explanations can be faithful sometimes. That's the standard we can expect for any intelligence as far as we're aware.

https://arxiv.org/abs/2504.14150

New comment by famouswaffles in "We gave an AI a 3 year retail lease and asked it to make a profit"

famouswaffles — Fri, 17 Apr 2026 06:07:28 +0000

I did answer it, albeit not directly. "Guaranteed to be the motivation" isn't a standard anyone can meet, and so framing it that way doesn't really probe anything meaningful about LLMs specifically. If what you want to hear is No, then sure, have your No, but it doesn't mean anything. There's just not much to the question.

Even though you had it up as one borne of a greater understanding of LLMs, the interpretability research we have so far, and our current very little understanding of the internal computations of these models does not support your position and certainly not how assured you are about it.

New comment by famouswaffles in "We gave an AI a 3 year retail lease and asked it to make a profit"

famouswaffles — Thu, 16 Apr 2026 20:11:15 +0000

>What research shows that you can ask ChatGPT to explain its reasoning and why it said what it said, and that's guaranteed to actually be the motivation?

What research shows that you can ask a Human to explain its reasoning and why it said what it said, and that's guaranteed to actually be the motivation? Because there's no such thing. If anything, what research exists suggests any explanation we're making is a nice post-hoc rationalization after the fact even if the Human thinks otherwise.

https://transformer-circuits.pub/2025/introspection/index.ht...

New comment by famouswaffles in "We gave an AI a 3 year retail lease and asked it to make a profit"

famouswaffles — Thu, 16 Apr 2026 17:40:54 +0000

Where do you get the idea that you have a good sense of the introspective capabilities of frontier models ? Certainly not from interpretability research. Ironically, the people who make these sort of comments understand LLMs the least.