<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: bogtog</title><link>https://news.ycombinator.com/user?id=bogtog</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Mon, 22 Jun 2026 22:45:12 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=bogtog" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by bogtog in "GLM 5.2 vs. Opus"]]></title><description><![CDATA[
<p>> In fact I think long-term autonomy (in the range of several hours) and self-correcting is going to be where we see most improvements in coming years.<p>Right, model intelligence defines the scope of things they can one shot<p>I also suspect that users naturally calibrate to a model's useful scope, gradually getting positive/negative feedback and gradually making their requests bigger/smaller than before</p>
]]></description><pubDate>Mon, 22 Jun 2026 10:09:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=48628153</link><dc:creator>bogtog</dc:creator><comments>https://news.ycombinator.com/item?id=48628153</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48628153</guid></item><item><title><![CDATA[New comment by bogtog in "OpenAI releases GPT-5.5 and GPT-5.5 Pro in the API"]]></title><description><![CDATA[
<p>I'm surprised I never heard people talking about using -Pro variants, even though their rates ($125-175/M?) aren't drastically larger than old Opus ($75/M), which people seemed to use</p>
]]></description><pubDate>Fri, 24 Apr 2026 19:42:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=47894903</link><dc:creator>bogtog</dc:creator><comments>https://news.ycombinator.com/item?id=47894903</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47894903</guid></item><item><title><![CDATA[New comment by bogtog in "Caveman: Why use many token when few token do trick"]]></title><description><![CDATA[
<p>I'd be curious if there were some measurements of the final effects, since presumably models wont <think> in caveman speak nor code like that</p>
]]></description><pubDate>Sun, 05 Apr 2026 10:31:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=47647994</link><dc:creator>bogtog</dc:creator><comments>https://news.ycombinator.com/item?id=47647994</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47647994</guid></item><item><title><![CDATA[New comment by bogtog in "ATMs didn’t kill bank teller jobs, but the iPhone did"]]></title><description><![CDATA[
<p>>  Not listed here is how banks themselves have changed to be almost entirely online<p>Sorry what? Was this not the central theme of the article? (albeit with a title that used the word "iPhone" to be catchier)</p>
]]></description><pubDate>Fri, 13 Mar 2026 10:38:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=47362648</link><dc:creator>bogtog</dc:creator><comments>https://news.ycombinator.com/item?id=47362648</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47362648</guid></item><item><title><![CDATA[New comment by bogtog in "Judge orders government to begin refunding more than $130B in tariffs"]]></title><description><![CDATA[
<p>> This is wrong. It's not insider trading. Lutnick didn't have inside information. His son just had a brain. Anyone who read the case knew which way the court was going, it was the least surprising decision ever. Perhaps the only surprising thing is that the court ever heard it.<p>If this was so obvious, wouldn't there have been more competitors pushing down the value of it?</p>
]]></description><pubDate>Thu, 05 Mar 2026 16:48:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=47263969</link><dc:creator>bogtog</dc:creator><comments>https://news.ycombinator.com/item?id=47263969</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47263969</guid></item><item><title><![CDATA[New comment by bogtog in "OpenAI agrees with Dept. of War to deploy models in their classified network"]]></title><description><![CDATA[
<p>Mr. Less-than-Consistently-Candid strikes again</p>
]]></description><pubDate>Sat, 28 Feb 2026 13:49:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=47195354</link><dc:creator>bogtog</dc:creator><comments>https://news.ycombinator.com/item?id=47195354</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47195354</guid></item><item><title><![CDATA[New comment by bogtog in "ChatGPT Containers can now run bash, pip/npm install packages and download files"]]></title><description><![CDATA[
<p>> But now that most code is written by LLMs, it's as "hard" for the LLM to write Python as it is to write Rust/Go<p>The LLM still benefits from the abstraction provided by Python (fewer tokens and less cognitive load). I could see a pipeline working where one model writes in Python or so, then another model is tasked to compile it into a more performant language</p>
]]></description><pubDate>Mon, 26 Jan 2026 23:36:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=46773289</link><dc:creator>bogtog</dc:creator><comments>https://news.ycombinator.com/item?id=46773289</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46773289</guid></item><item><title><![CDATA[New comment by bogtog in "Show HN: TetrisBench – Gemini Flash reaches 66% win rate on Tetris against Opus"]]></title><description><![CDATA[
<p>I figure OP would try and give the models pure text forms of the game?<p>.....<p>l....<p>l....<p>l.ttt<p>l..t.</p>
]]></description><pubDate>Mon, 26 Jan 2026 20:49:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=46771312</link><dc:creator>bogtog</dc:creator><comments>https://news.ycombinator.com/item?id=46771312</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46771312</guid></item><item><title><![CDATA[New comment by bogtog in "Proof of Corn"]]></title><description><![CDATA[
<p>This is fair, but this seems like the only way to test this type of thing while avoiding the risk of harassing tons of farmers with AI emails. In the end, the performance will be judged on how much of a human harness is given</p>
]]></description><pubDate>Fri, 23 Jan 2026 23:00:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=46739172</link><dc:creator>bogtog</dc:creator><comments>https://news.ycombinator.com/item?id=46739172</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46739172</guid></item><item><title><![CDATA[New comment by bogtog in "The 600-year-old origins of the word 'hello'"]]></title><description><![CDATA[
<p>I associate "yello" with Homer Simpson: <a href="https://www.facebook.com/TheDoctorZaius/videos/723328371509214/" rel="nofollow">https://www.facebook.com/TheDoctorZaius/videos/7233283715092...</a><p>(fingers crossed I'm not somehow doxxing myself by sharing a fb link)</p>
]]></description><pubDate>Sat, 17 Jan 2026 16:51:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=46659529</link><dc:creator>bogtog</dc:creator><comments>https://news.ycombinator.com/item?id=46659529</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46659529</guid></item><item><title><![CDATA[New comment by bogtog in "Claude Cowork exfiltrates files"]]></title><description><![CDATA[
<p>People will pay extra for Opus over Sonnet and often describe the $200 Max plan as cheap because of the time it saves. Paying for a somewhat better harness follows the same logic</p>
]]></description><pubDate>Thu, 15 Jan 2026 11:01:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=46630818</link><dc:creator>bogtog</dc:creator><comments>https://news.ycombinator.com/item?id=46630818</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46630818</guid></item><item><title><![CDATA[New comment by bogtog in "Spherical Snake"]]></title><description><![CDATA[
<p>The game looks really good, although I think it'd be improved if the sphere was a bit smaller. It feels like it takes too long for the game to become difficult</p>
]]></description><pubDate>Tue, 06 Jan 2026 20:12:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=46517949</link><dc:creator>bogtog</dc:creator><comments>https://news.ycombinator.com/item?id=46517949</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46517949</guid></item><item><title><![CDATA[New comment by bogtog in "AWS raises GPU prices 15% on a Saturday, hopes you weren't paying attention"]]></title><description><![CDATA[
<p>Oh my reasoning was coming at this from a different angle: H200s were released in November of 2023, so they're over 2 years old at this point while still being valuable</p>
]]></description><pubDate>Tue, 06 Jan 2026 16:07:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=46514126</link><dc:creator>bogtog</dc:creator><comments>https://news.ycombinator.com/item?id=46514126</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46514126</guid></item><item><title><![CDATA[New comment by bogtog in "AWS raises GPU prices 15% on a Saturday, hopes you weren't paying attention"]]></title><description><![CDATA[
<p>A few months ago, there was a lot of news lambasting tech companies for extending the depreciation lifespan of GPUs from ~3 years to ~5 years. Do these price hikes suggest a longer lifespan is probably the right way to see how long these GPUs will be valuable?</p>
]]></description><pubDate>Tue, 06 Jan 2026 12:28:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=46511428</link><dc:creator>bogtog</dc:creator><comments>https://news.ycombinator.com/item?id=46511428</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46511428</guid></item><item><title><![CDATA[New comment by bogtog in "Google's year in review: areas with research breakthroughs in 2025"]]></title><description><![CDATA[
<p>Thanks for sharing. I'm surprised you can't just ctrl-a + copy-paste your bank statement and get it to work easily</p>
]]></description><pubDate>Wed, 24 Dec 2025 13:18:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=46375313</link><dc:creator>bogtog</dc:creator><comments>https://news.ycombinator.com/item?id=46375313</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46375313</guid></item><item><title><![CDATA[New comment by bogtog in "Google's year in review: areas with research breakthroughs in 2025"]]></title><description><![CDATA[
<p>> It's been a week and I still can't get them (ChatGPT, Claude, Grok, Gemini) to correctly process my bank statements to identify certain patterns.<p>Can you give any more details on what you mean? This feels like a task they should be great at, even if you're not paying the $20/mo for any lab's higher tier model</p>
]]></description><pubDate>Wed, 24 Dec 2025 12:40:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=46375070</link><dc:creator>bogtog</dc:creator><comments>https://news.ycombinator.com/item?id=46375070</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46375070</guid></item><item><title><![CDATA[New comment by bogtog in "X-ray: a Python library for finding bad redactions in PDF documents"]]></title><description><![CDATA[
<p>I don't think the commentor above is saying that an AI should necessarily apply the redaction. Rather, an AI can serve as an objective-ish way of determining what should be redacted. This seems somewhat analogous to how (non-AI) models can we used to evaluate how gerrymandered a map is</p>
]]></description><pubDate>Tue, 23 Dec 2025 22:57:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=46370472</link><dc:creator>bogtog</dc:creator><comments>https://news.ycombinator.com/item?id=46370472</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46370472</guid></item><item><title><![CDATA[New comment by bogtog in "Ask HN: How can I get better at using AI for programming?"]]></title><description><![CDATA[
<p>That's fair. I sometimes find myself pausing or just talking in circles as I'm deciding what I want. I think when I'm speaking, I feel freer to use less precise/formal descriptions, but the model can still correctly interpret the technical meaning<p>In either case, different strokes for different folks, and what ultimately matters is whether you get good results. I think the upside is high, so I broadly suggest people try it out</p>
]]></description><pubDate>Sat, 13 Dec 2025 21:40:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=46258368</link><dc:creator>bogtog</dc:creator><comments>https://news.ycombinator.com/item?id=46258368</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46258368</guid></item><item><title><![CDATA[New comment by bogtog in "Ask HN: How can I get better at using AI for programming?"]]></title><description><![CDATA[
<p>> Claude on macOS and iOS have native voice to text transcription<p>Yeah, Claude/ChatGPT/Gemini all offer this, although Gemini's is basically unusable because it will immediately send the message if you stop talking for a few seconds<p>I imagine you totally could use the app transcript and paste it in, but keeping the friction to an absolute minimum (e.g., just needing to press one hotkey) feels nice</p>
]]></description><pubDate>Sat, 13 Dec 2025 18:35:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=46256791</link><dc:creator>bogtog</dc:creator><comments>https://news.ycombinator.com/item?id=46256791</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46256791</guid></item><item><title><![CDATA[New comment by bogtog in "Ask HN: How can I get better at using AI for programming?"]]></title><description><![CDATA[
<p>I'm using Wispr flow, but I've also tried Superwhisper. Both are fine. I have a convenient hotkey to start/end recording with one hand. Having it just need one hand is nice. I'm using this with the Claude Code vscode extension in Cursor. If you go down this route, the Claude Code instance should be moved into a separate window outside your main editor or else it'll flicker a lot</p>
]]></description><pubDate>Sat, 13 Dec 2025 18:26:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=46256697</link><dc:creator>bogtog</dc:creator><comments>https://news.ycombinator.com/item?id=46256697</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46256697</guid></item></channel></rss>