<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: saberience</title><link>https://news.ycombinator.com/user?id=saberience</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Fri, 19 Jun 2026 16:59:50 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=saberience" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by saberience in "OpenAI Losses Increased Nearly 8X in 2025, with Spending Hitting $34B"]]></title><description><![CDATA[
<p>Yeah he has zero credentials and authority and an agenda to push. Not to mention most of his articles are financially and technically illiterate and full of mistakes and inaccuracies.<p>No idea why his shit keeps getting submitted.</p>
]]></description><pubDate>Tue, 16 Jun 2026 09:18:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=48552630</link><dc:creator>saberience</dc:creator><comments>https://news.ycombinator.com/item?id=48552630</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48552630</guid></item><item><title><![CDATA[New comment by saberience in "Iroh 1.0"]]></title><description><![CDATA[
<p>This page is basically useless in explaining what Iroh is or does and why I should care.</p>
]]></description><pubDate>Mon, 15 Jun 2026 15:45:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=48543011</link><dc:creator>saberience</dc:creator><comments>https://news.ycombinator.com/item?id=48543011</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48543011</guid></item><item><title><![CDATA[New comment by saberience in "Anthropic's Safety Superpower"]]></title><description><![CDATA[
<p>This is totally inaccurate, the APIs provide the reasoning logs. You ABSOLUTELY can distill from APIs, in fact, that's the primary way distillation is done currently.</p>
]]></description><pubDate>Mon, 15 Jun 2026 12:32:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=48540317</link><dc:creator>saberience</dc:creator><comments>https://news.ycombinator.com/item?id=48540317</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48540317</guid></item><item><title><![CDATA[New comment by saberience in "Don't trust large context windows"]]></title><description><![CDATA[
<p>I never use Opus etc. after 50% token usage (and from reading other devs' blogs and Twitter, this seems to be a common sentiment) because it falls off an intelligence cliff at that point.<p>I mean, I really, really see intelligence tank at a certain amount of context usage. I always start a new session when starting a new plan or any implementation work.<p>So I clear context before writing a plan, and I clear it again before implementing that plan. My first prompt always includes enough of my own context (copy-pastes of docs, etc.) to ensure the plan is good. Once the plan is made, I clear the context and get Opus to implement it.<p>Out of all the methodologies I've tried, this seems to be the best in terms of output quality.</p>
]]></description><pubDate>Mon, 15 Jun 2026 10:14:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=48539129</link><dc:creator>saberience</dc:creator><comments>https://news.ycombinator.com/item?id=48539129</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48539129</guid></item><item><title><![CDATA[New comment by saberience in "Statement on US government directive to suspend access to Fable 5 and Mythos 5"]]></title><description><![CDATA[
<p>I got downgraded from Opus to Fable for asking why MDMA was not addictive in the same way Cocaine is, so yeah, the "guardrails" are clearly vibe-coded.</p>
]]></description><pubDate>Sat, 13 Jun 2026 07:25:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=48514427</link><dc:creator>saberience</dc:creator><comments>https://news.ycombinator.com/item?id=48514427</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48514427</guid></item><item><title><![CDATA[New comment by saberience in "Statement on US government directive to suspend access to Fable 5 and Mythos 5"]]></title><description><![CDATA[
<p>Ah right the super-smart model which had to be banned created this terrible looking website. Hmm...</p>
]]></description><pubDate>Sat, 13 Jun 2026 07:20:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=48514394</link><dc:creator>saberience</dc:creator><comments>https://news.ycombinator.com/item?id=48514394</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48514394</guid></item><item><title><![CDATA[New comment by saberience in "Statement on US government directive to suspend access to Fable 5 and Mythos 5"]]></title><description><![CDATA[
<p>No it wasn't, Fable is a general-purpose model for regular chat and analysis as well as coding.<p>And yes, the parent poster is accurate, Fable is just as prone to moronic mistakes as Opus was. Stop being so AI-pilled.<p>Codex is still a better model, and yes, for the hardest engineering problems. I use Claude for UI/GUIs and Codex for all my backend, because I have 20 years of experience of actual hard engineering, and I can see that Codex writes cleaner code and is far more steerable.<p>Bad engineers think Claude is better because it writes more lines of code and is more "proactive", but more lines of code don't make a better system.</p>
]]></description><pubDate>Sat, 13 Jun 2026 07:18:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=48514375</link><dc:creator>saberience</dc:creator><comments>https://news.ycombinator.com/item?id=48514375</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48514375</guid></item><item><title><![CDATA[New comment by saberience in "Statement on US government directive to suspend access to Fable 5 and Mythos 5"]]></title><description><![CDATA[
<p>I think you have AI psychosis, friend.<p>I've used Claude Code and Codex since release and use them both in parallel. Codex is still better (yes, even with Fable).<p>Claude Code is best for UI, nice-looking GUIs, etc., and apparently also best at impressing mediocre programmers who are prone to AI psychosis.<p>The best engineers I worked with in past jobs (FAANG folk, SpaceX, x.ai, and others) all use Codex, go figure.</p>
]]></description><pubDate>Sat, 13 Jun 2026 07:15:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=48514354</link><dc:creator>saberience</dc:creator><comments>https://news.ycombinator.com/item?id=48514354</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48514354</guid></item><item><title><![CDATA[New comment by saberience in "Claude Fable is relentlessly proactive"]]></title><description><![CDATA[
<p>Interesting, I’d love to see the comparisons of your system using Claude vs Codex. I have about 20 years of experience in distributed systems and super high scale at several FAANGs, and also building AI model serving infra for roughly 20k transactions per second.<p>For me, Claude makes boneheaded decisions all the time, like glaring errors, not even particularly subtle.<p>But the more obvious flag is the amount of irrelevant code and tests which Fable writes. Like it regularly writes 2X or 3X the amount of code and tests that are needed. It’s an expert at writing plausible but entirely useless tests.<p>But I think that if you’re a more junior engineer or haven’t been around the block you can easily think that “more code equals smarter”. Claude ends up creating a massive, hard-to-manage codebase, and if you look at the Claude Code codebase (which was leaked), you can see I’m right!<p>The Claude Code codebase is terrible. And presumably Anthropic has been using their smartest models for working on Claude Code. I wrote my own coding harness with Codex (as a fun experiment) which used a fraction of the code and is about 100X more performant and memory efficient (than Claude Code)!</p>
]]></description><pubDate>Fri, 12 Jun 2026 18:57:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=48508076</link><dc:creator>saberience</dc:creator><comments>https://news.ycombinator.com/item?id=48508076</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48508076</guid></item><item><title><![CDATA[New comment by saberience in "David Hockney, Who Restored the Human Form to Art, Dies at 88"]]></title><description><![CDATA[
<p>Thanks for sharing this, what a great story, gave a smile to my day :)</p>
]]></description><pubDate>Fri, 12 Jun 2026 17:07:23 +0000</pubDate><link>https://news.ycombinator.com/item?id=48506656</link><dc:creator>saberience</dc:creator><comments>https://news.ycombinator.com/item?id=48506656</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48506656</guid></item><item><title><![CDATA[New comment by saberience in "Claude Fable is relentlessly proactive"]]></title><description><![CDATA[
<p>I'm writing low-level Rust, distributed systems, and also sandboxing tech which has to be secure and performant.<p>The only thing I have Fable do now is create UIs or other front-ends for systems where correctness doesn't matter as much.<p>Anthropic models lead at making nice-looking UIs for sure, but when it comes to making sure my Rust code is actually 100% correct and uses 1% of CPU most of the time, Codex is king.</p>
]]></description><pubDate>Fri, 12 Jun 2026 17:04:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=48506618</link><dc:creator>saberience</dc:creator><comments>https://news.ycombinator.com/item?id=48506618</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48506618</guid></item><item><title><![CDATA[New comment by saberience in "Claude Fable is relentlessly proactive"]]></title><description><![CDATA[
<p>This is exactly what I find too. I make plans in both models and have each model review the other's plan. And Claude usually agrees (65-80% of the time) that the Codex plan included things it didn't think of, or was better in some other way.<p>Note, this is better than it was with Opus, where it was more like 90% of the time the Codex plans were obviously better.</p>
]]></description><pubDate>Fri, 12 Jun 2026 16:41:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=48506332</link><dc:creator>saberience</dc:creator><comments>https://news.ycombinator.com/item?id=48506332</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48506332</guid></item><item><title><![CDATA[New comment by saberience in "Claude Fable is relentlessly proactive"]]></title><description><![CDATA[
<p>I use Codex and Claude Code. I've used both since release with basically every model they've ever released. I try both for almost every plan that I write and benchmark the plans against each other, and Claude almost always acknowledges that the Codex plan is better! Even now with Fable, this still happens.<p>As in, I give the exact same prompt to Fable and GPT 5.5 Pro, have each produce a plan, then give each model the other's plan. Claude always realizes it missed stuff, and Codex usually ends up finding missing things in Claude's plan.<p>This situation did improve with Fable versus Opus 4.8, but in general, Codex for me is still the better model.</p>
]]></description><pubDate>Fri, 12 Jun 2026 15:51:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=48505685</link><dc:creator>saberience</dc:creator><comments>https://news.ycombinator.com/item?id=48505685</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48505685</guid></item><item><title><![CDATA[New comment by saberience in "Claude Fable is relentlessly proactive"]]></title><description><![CDATA[
<p>I use GPT 5.5 High Fast, and I often benchmark it against Fable (and previously against Opus), and it's night and day.<p>Claude still writes (and always has written) far too much code to fulfill a given spec or plan. It misses edge cases and is generally far too verbose.<p>Claude is also (even more so with Fable) super tokenmaxxing, i.e. it seems tuned to use the max amount of tokens per task, whereas Codex will simply get your job done as you specified with the minimum fuss and tokens.<p>Codex feels way more steerable and just more "professional", as though I'm working with a seasoned engineer, versus someone smart but overexcitable, like a super smart associate engineer.</p>
]]></description><pubDate>Fri, 12 Jun 2026 15:49:23 +0000</pubDate><link>https://news.ycombinator.com/item?id=48505656</link><dc:creator>saberience</dc:creator><comments>https://news.ycombinator.com/item?id=48505656</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48505656</guid></item><item><title><![CDATA[New comment by saberience in "Tailwind and slop apps"]]></title><description><![CDATA[
<p>These pages all look like Claude AI designs though, so I think you're proving the original point?</p>
]]></description><pubDate>Fri, 12 Jun 2026 11:09:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=48502572</link><dc:creator>saberience</dc:creator><comments>https://news.ycombinator.com/item?id=48502572</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48502572</guid></item><item><title><![CDATA[New comment by saberience in "Claude Fable is relentlessly proactive"]]></title><description><![CDATA[
<p>It's not being aggressive, it's just throwing shit at problems until something sticks... or doesn't.<p>That doesn't make it smart or aggressive; if anything it's just been tuned to crank out tokens until something happens, which doesn't make it a good model.<p>Why are you positively anthropomorphizing this? It's an LLM, it's been tuned via RL, and it's been tuned by engineers at Anthropic to use a metric fuck-load of sub-agents and tokens, presumably to pump their pre-IPO revenue!<p>A co-worker managed to get Fable to spin up 50 (!!!) sub-agents for a problem which Codex worked on with 3 sub-agents. What the hell is going on here? It certainly doesn't mean Fable is "smarter" than Codex.<p>I've tested it extensively and I'm still using GPT 5.5 High Fast as my primary engineering model. It's far more steerable, writes less code of higher quality, and consistently finds issues and edge cases which are not found by Fable or Opus 4.7.</p>
]]></description><pubDate>Fri, 12 Jun 2026 10:59:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=48502481</link><dc:creator>saberience</dc:creator><comments>https://news.ycombinator.com/item?id=48502481</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48502481</guid></item><item><title><![CDATA[New comment by saberience in "Claude Fable is relentlessly proactive"]]></title><description><![CDATA[
<p>And Fable is still worse than Codex.<p>I use both and the only thing (as always) that I will use Claude for is UI design.<p>Opus 4.8 and now Fable are still both worse at actually getting the job done than the Codex model. Claude models write FAR too much code when it's not needed, burn far too many tokens, write unnecessary tests, write plans which are 5 pages longer than needed, etc.<p>Have you actually compared code quality and plan quality versus Codex? It's demonstrably worse.</p>
]]></description><pubDate>Fri, 12 Jun 2026 10:54:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=48502434</link><dc:creator>saberience</dc:creator><comments>https://news.ycombinator.com/item?id=48502434</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48502434</guid></item><item><title><![CDATA[New comment by saberience in "Waymo Premier"]]></title><description><![CDATA[
<p>Yes, but realistically, it's not that likely.</p>
]]></description><pubDate>Thu, 11 Jun 2026 19:56:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=48495592</link><dc:creator>saberience</dc:creator><comments>https://news.ycombinator.com/item?id=48495592</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48495592</guid></item><item><title><![CDATA[New comment by saberience in "Waymo Premier"]]></title><description><![CDATA[
<p>Who tips on Uber?</p>
]]></description><pubDate>Thu, 11 Jun 2026 19:50:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=48495528</link><dc:creator>saberience</dc:creator><comments>https://news.ycombinator.com/item?id=48495528</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48495528</guid></item><item><title><![CDATA[New comment by saberience in "Pokémon Go Scans Trained the Navigation Tech for Military Drones"]]></title><description><![CDATA[
<p>This is one of the most dystopian things I've heard in a long while.<p>I mean, we have a lot of weird shit going down right now... like AI being used to automate art BEFORE it's being used to automate dangerous and menial jobs, but knowing that people are being killed with help from data generated by millions of kids and young adults playing a fun, cute videogame is just so freaking dark and weird.<p>We are a very strange species and I don't have a great deal of hope for our future.</p>
]]></description><pubDate>Thu, 11 Jun 2026 10:27:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=48488491</link><dc:creator>saberience</dc:creator><comments>https://news.ycombinator.com/item?id=48488491</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48488491</guid></item></channel></rss>