Hacker News: bogtog

New comment by bogtog in "GLM 5.2 vs. Opus"

bogtog — Mon, 22 Jun 2026 10:09:56 +0000

> In fact I think long-term autonomy (in the range of several hours) and self-correcting is going to be where we see most improvements in coming years.

Right, model intelligence defines the scope of things they can one shot

I also suspect that users naturally calibrate to a model's useful scope, gradually getting positive/negative feedback and gradually making their requests bigger/smaller than before

New comment by bogtog in "OpenAI releases GPT-5.5 and GPT-5.5 Pro in the API"

bogtog — Fri, 24 Apr 2026 19:42:57 +0000

I'm surprised I never heard people talking about using -Pro variants, even though their rates ($125-175/M?) aren't drastically larger than old Opus ($75/M), which people seemed to use

New comment by bogtog in "Caveman: Why use many token when few token do trick"

bogtog — Sun, 05 Apr 2026 10:31:55 +0000

I'd be curious if there were some measurements of the final effects, since presumably models wont in caveman speak nor code like that

New comment by bogtog in "ATMs didn’t kill bank teller jobs, but the iPhone did"

bogtog — Fri, 13 Mar 2026 10:38:38 +0000

> Not listed here is how banks themselves have changed to be almost entirely online

Sorry what? Was this not the central theme of the article? (albeit with a title that used the word "iPhone" to be catchier)

New comment by bogtog in "Judge orders government to begin refunding more than $130B in tariffs"

bogtog — Thu, 05 Mar 2026 16:48:55 +0000

> This is wrong. It's not insider trading. Lutnick didn't have inside information. His son just had a brain. Anyone who read the case knew which way the court was going, it was the least surprising decision ever. Perhaps the only surprising thing is that the court ever heard it.

If this was so obvious, wouldn't there have been more competitors pushing down the value of it?

New comment by bogtog in "OpenAI agrees with Dept. of War to deploy models in their classified network"

bogtog — Sat, 28 Feb 2026 13:49:24 +0000

Mr. Less-than-Consistently-Candid strikes again

New comment by bogtog in "ChatGPT Containers can now run bash, pip/npm install packages and download files"

bogtog — Mon, 26 Jan 2026 23:36:13 +0000

> But now that most code is written by LLMs, it's as "hard" for the LLM to write Python as it is to write Rust/Go

The LLM still benefits from the abstraction provided by Python (fewer tokens and less cognitive load). I could see a pipeline working where one model writes in Python or so, then another model is tasked to compile it into a more performant language

New comment by bogtog in "Show HN: TetrisBench – Gemini Flash reaches 66% win rate on Tetris against Opus"

bogtog — Mon, 26 Jan 2026 20:49:44 +0000

I figure OP would try and give the models pure text forms of the game?

.....

l....

l.ttt

l..t.

New comment by bogtog in "Proof of Corn"

bogtog — Fri, 23 Jan 2026 23:00:55 +0000

This is fair, but this seems like the only way to test this type of thing while avoiding the risk of harassing tons of farmers with AI emails. In the end, the performance will be judged on how much of a human harness is given

New comment by bogtog in "The 600-year-old origins of the word 'hello'"

bogtog — Sat, 17 Jan 2026 16:51:50 +0000

I associate "yello" with Homer Simpson: https://www.facebook.com/TheDoctorZaius/videos/7233283715092...

(fingers crossed I'm not somehow doxxing myself by sharing a fb link)

New comment by bogtog in "Claude Cowork exfiltrates files"

bogtog — Thu, 15 Jan 2026 11:01:50 +0000

People will pay extra for Opus over Sonnet and often describe the $200 Max plan as cheap because of the time it saves. Paying for a somewhat better harness follows the same logic

New comment by bogtog in "Spherical Snake"

bogtog — Tue, 06 Jan 2026 20:12:00 +0000

The game looks really good, although I think it'd be improved if the sphere was a bit smaller. It feels like it takes too long for the game to become difficult

New comment by bogtog in "AWS raises GPU prices 15% on a Saturday, hopes you weren't paying attention"

bogtog — Tue, 06 Jan 2026 16:07:29 +0000

Oh my reasoning was coming at this from a different angle: H200s were released in November of 2023, so they're over 2 years old at this point while still being valuable

New comment by bogtog in "AWS raises GPU prices 15% on a Saturday, hopes you weren't paying attention"

bogtog — Tue, 06 Jan 2026 12:28:03 +0000

A few months ago, there was a lot of news lambasting tech companies for extending the depreciation lifespan of GPUs from ~3 years to ~5 years. Do these price hikes suggest a longer lifespan is probably the right way to see how long these GPUs will be valuable?

New comment by bogtog in "Google's year in review: areas with research breakthroughs in 2025"

bogtog — Wed, 24 Dec 2025 13:18:25 +0000

Thanks for sharing. I'm surprised you can't just ctrl-a + copy-paste your bank statement and get it to work easily

New comment by bogtog in "Google's year in review: areas with research breakthroughs in 2025"

bogtog — Wed, 24 Dec 2025 12:40:22 +0000

> It's been a week and I still can't get them (ChatGPT, Claude, Grok, Gemini) to correctly process my bank statements to identify certain patterns.

Can you give any more details on what you mean? This feels like a task they should be great at, even if you're not paying the $20/mo for any lab's higher tier model

New comment by bogtog in "X-ray: a Python library for finding bad redactions in PDF documents"

bogtog — Tue, 23 Dec 2025 22:57:47 +0000

I don't think the commentor above is saying that an AI should necessarily apply the redaction. Rather, an AI can serve as an objective-ish way of determining what should be redacted. This seems somewhat analogous to how (non-AI) models can we used to evaluate how gerrymandered a map is

New comment by bogtog in "Ask HN: How can I get better at using AI for programming?"

bogtog — Sat, 13 Dec 2025 21:40:50 +0000

That's fair. I sometimes find myself pausing or just talking in circles as I'm deciding what I want. I think when I'm speaking, I feel freer to use less precise/formal descriptions, but the model can still correctly interpret the technical meaning

In either case, different strokes for different folks, and what ultimately matters is whether you get good results. I think the upside is high, so I broadly suggest people try it out

New comment by bogtog in "Ask HN: How can I get better at using AI for programming?"

bogtog — Sat, 13 Dec 2025 18:35:52 +0000

> Claude on macOS and iOS have native voice to text transcription

Yeah, Claude/ChatGPT/Gemini all offer this, although Gemini's is basically unusable because it will immediately send the message if you stop talking for a few seconds

I imagine you totally could use the app transcript and paste it in, but keeping the friction to an absolute minimum (e.g., just needing to press one hotkey) feels nice

New comment by bogtog in "Ask HN: How can I get better at using AI for programming?"

bogtog — Sat, 13 Dec 2025 18:26:10 +0000

I'm using Wispr flow, but I've also tried Superwhisper. Both are fine. I have a convenient hotkey to start/end recording with one hand. Having it just need one hand is nice. I'm using this with the Claude Code vscode extension in Cursor. If you go down this route, the Claude Code instance should be moved into a separate window outside your main editor or else it'll flicker a lot