<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: Rudybega</title><link>https://news.ycombinator.com/user?id=Rudybega</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Thu, 09 Apr 2026 11:10:06 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=Rudybega" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by Rudybega in "Issue: Claude Code is unusable for complex engineering tasks with Feb updates"]]></title><description><![CDATA[
<p>That's the nature of abstraction.  Everything you create on a computer is built on a towering stack of black boxes.</p>
]]></description><pubDate>Mon, 06 Apr 2026 22:20:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=47668090</link><dc:creator>Rudybega</dc:creator><comments>https://news.ycombinator.com/item?id=47668090</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47668090</guid></item><item><title><![CDATA[New comment by Rudybega in "Pentagon formally labels Anthropic supply-chain risk"]]></title><description><![CDATA[
<p>Anthropic and the military had a contract. The military wanted to change the terms of that contract. Anthropic said no, which is their clearly defined contractual right. They got labeled a supply chain risk.  How is this anything other than a shakedown? Does contract law mean anything to this administration?</p>
]]></description><pubDate>Thu, 05 Mar 2026 20:57:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=47267171</link><dc:creator>Rudybega</dc:creator><comments>https://news.ycombinator.com/item?id=47267171</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47267171</guid></item><item><title><![CDATA[New comment by Rudybega in "I am directing the Department of War to designate Anthropic a supply-chain risk"]]></title><description><![CDATA[
<p>You don't actually believe in the core tenets of the USA if you think that the government should have or should exercise unchecked, abusive power.</p>
]]></description><pubDate>Sat, 28 Feb 2026 00:51:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=47188449</link><dc:creator>Rudybega</dc:creator><comments>https://news.ycombinator.com/item?id=47188449</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47188449</guid></item><item><title><![CDATA[New comment by Rudybega in "I am directing the Department of War to designate Anthropic a supply-chain risk"]]></title><description><![CDATA[
<p>Then the government should end their contract with Anthropic. The terms of the contract were clear.<p>Designating them a supply chain risk is unprecedented authoritarian strong-arming.</p>
]]></description><pubDate>Sat, 28 Feb 2026 00:16:23 +0000</pubDate><link>https://news.ycombinator.com/item?id=47188070</link><dc:creator>Rudybega</dc:creator><comments>https://news.ycombinator.com/item?id=47188070</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47188070</guid></item><item><title><![CDATA[New comment by Rudybega in "The path to ubiquitous AI (17k tokens/sec)"]]></title><description><![CDATA[
<p>Well yeah, they're running a small, outdated, older model.  That's not really the point.  This approach can be used for better, larger, newer models.</p>
]]></description><pubDate>Sat, 21 Feb 2026 02:32:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=47096891</link><dc:creator>Rudybega</dc:creator><comments>https://news.ycombinator.com/item?id=47096891</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47096891</guid></item><item><title><![CDATA[New comment by Rudybega in "Gemini 3.1 Pro"]]></title><description><![CDATA[
<p>If only there were some way to test it, like swapping the two nouns in the sentence. Alas.</p>
]]></description><pubDate>Thu, 19 Feb 2026 20:53:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=47079172</link><dc:creator>Rudybega</dc:creator><comments>https://news.ycombinator.com/item?id=47079172</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47079172</guid></item><item><title><![CDATA[New comment by Rudybega in "Beginning fully autonomous operations with the 6th-generation Waymo driver"]]></title><description><![CDATA[
<p>If a passenger says to you, "go around this car by using that private driveway", are they driving the car?</p>
]]></description><pubDate>Fri, 13 Feb 2026 20:35:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=47007493</link><dc:creator>Rudybega</dc:creator><comments>https://news.ycombinator.com/item?id=47007493</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47007493</guid></item><item><title><![CDATA[New comment by Rudybega in "Beginning fully autonomous operations with the 6th-generation Waymo driver"]]></title><description><![CDATA[
<p>There aren't remote "drivers" in the Philippines, that's not how Fleet Response works.  You can see how it works here if you're curious: <a href="https://waymo.com/blog/2024/05/fleet-response" rel="nofollow">https://waymo.com/blog/2024/05/fleet-response</a> but the TLDR is that they give the Waymo driver options in confusing situations (things like, you can go use this driveway on the side to pass this blocked traffic).</p>
]]></description><pubDate>Thu, 12 Feb 2026 19:51:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=46994109</link><dc:creator>Rudybega</dc:creator><comments>https://news.ycombinator.com/item?id=46994109</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46994109</guid></item><item><title><![CDATA[New comment by Rudybega in "GLM-5: Targeting complex systems engineering and long-horizon agentic tasks"]]></title><description><![CDATA[
<p>MMLU performance caps out around 90% because there are tons of errors in the actual test set.  There's a pretty solid post on it here: <a href="https://www.reddit.com/r/LocalLLaMA/comments/163x2wc/philip_from_ai_explained_finds_hundreds_of_errors/" rel="nofollow">https://www.reddit.com/r/LocalLLaMA/comments/163x2wc/philip_...</a><p>As far as I can tell for AIME, pretty much every frontier model gets 100% <a href="https://llm-stats.com/benchmarks/aime-2025" rel="nofollow">https://llm-stats.com/benchmarks/aime-2025</a></p>
]]></description><pubDate>Wed, 11 Feb 2026 21:50:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=46981626</link><dc:creator>Rudybega</dc:creator><comments>https://news.ycombinator.com/item?id=46981626</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46981626</guid></item><item><title><![CDATA[New comment by Rudybega in "Waymo exec reveals company uses remote workers in the Philippines"]]></title><description><![CDATA[
<p>I mean, Waymo gives a lot of examples of the situations, in their blog post about Fleet Response where they detail this, released May 21, 2024.  They're very explicit that the Waymo Driver autonomous system is in control the entire time.<p>This isn't something new.<p><a href="https://waymo.com/blog/2024/05/fleet-response" rel="nofollow">https://waymo.com/blog/2024/05/fleet-response</a></p>
]]></description><pubDate>Tue, 10 Feb 2026 22:55:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=46968169</link><dc:creator>Rudybega</dc:creator><comments>https://news.ycombinator.com/item?id=46968169</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46968169</guid></item><item><title><![CDATA[New comment by Rudybega in "We tasked Opus 4.6 using agent teams to build a C Compiler"]]></title><description><![CDATA[
<p>TinyCC can't compile a modern linux kernel.  It doesn't support a ton of the extensions they use.  That Rust compiler similarly can't do it.</p>
]]></description><pubDate>Fri, 06 Feb 2026 20:15:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=46917596</link><dc:creator>Rudybega</dc:creator><comments>https://news.ycombinator.com/item?id=46917596</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46917596</guid></item><item><title><![CDATA[New comment by Rudybega in "We tasked Opus 4.6 using agent teams to build a C Compiler"]]></title><description><![CDATA[
<p>There are two compilers that can handle the Linux kernel. GCC and LLVM.  Both are written in C, not Rust.  It's "in distribution" only if you really stretch the meaning of the term. A generic C compiler isn't going to be anywhere near the level of rigour of this one.</p>
]]></description><pubDate>Fri, 06 Feb 2026 16:43:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=46915070</link><dc:creator>Rudybega</dc:creator><comments>https://news.ycombinator.com/item?id=46915070</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46915070</guid></item><item><title><![CDATA[New comment by Rudybega in "GPT-5.3-Codex"]]></title><description><![CDATA[
<p>I think they immediately corrected their median timelines for takeoff to 2028 upon releasing the article (I believe there was a math mistake or something initially), so all those dates can probably be bumped back a few months.  Regardless, the trend seems fairly on track.</p>
]]></description><pubDate>Thu, 05 Feb 2026 22:08:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=46906072</link><dc:creator>Rudybega</dc:creator><comments>https://news.ycombinator.com/item?id=46906072</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46906072</guid></item><item><title><![CDATA[New comment by Rudybega in "GPT-5.3-Codex"]]></title><description><![CDATA[
<p>The denial of this overall trend on here and in other internet spaces is starting to really bother me.  People need to have sober conversations about the speed of this increase and what kind of effects it's going to have on the world.</p>
]]></description><pubDate>Thu, 05 Feb 2026 22:05:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=46906044</link><dc:creator>Rudybega</dc:creator><comments>https://news.ycombinator.com/item?id=46906044</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46906044</guid></item><item><title><![CDATA[New comment by Rudybega in "We tasked Opus 4.6 using agent teams to build a C Compiler"]]></title><description><![CDATA[
<p>Direct transpilation would create a ton of unsafe code (this repo doesn't have any) and fixing that would require a lot of manual fixes from the model. Even that would be a massive achievement, but it's not how this was created.</p>
]]></description><pubDate>Thu, 05 Feb 2026 22:03:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=46906023</link><dc:creator>Rudybega</dc:creator><comments>https://news.ycombinator.com/item?id=46906023</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46906023</guid></item><item><title><![CDATA[New comment by Rudybega in "Claude Opus 4.6"]]></title><description><![CDATA[
<p>You're comparing two different benchmarks.  Pro vs Verified.</p>
]]></description><pubDate>Thu, 05 Feb 2026 21:59:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=46905986</link><dc:creator>Rudybega</dc:creator><comments>https://news.ycombinator.com/item?id=46905986</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46905986</guid></item><item><title><![CDATA[New comment by Rudybega in "We tasked Opus 4.6 using agent teams to build a C Compiler"]]></title><description><![CDATA[
<p>It wrote the compiler in Rust.  As far as I know, there aren't any Rust based C compilers with the same capabilities. If you can find one that can compile the Linux kernel or get 99% on the GCC torture test suite, I would be quite surprised. I couldn't in a search.<p>Maybe read the article before being so dismissive.</p>
]]></description><pubDate>Thu, 05 Feb 2026 21:22:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=46905531</link><dc:creator>Rudybega</dc:creator><comments>https://news.ycombinator.com/item?id=46905531</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46905531</guid></item><item><title><![CDATA[New comment by Rudybega in "The Codex App"]]></title><description><![CDATA[
<p>Yeah, I'm having the same issue.  Disappointing limitations.</p>
]]></description><pubDate>Mon, 02 Feb 2026 21:24:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=46861780</link><dc:creator>Rudybega</dc:creator><comments>https://news.ycombinator.com/item?id=46861780</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46861780</guid></item><item><title><![CDATA[New comment by Rudybega in "Waymo robotaxi hits a child near an elementary school in Santa Monica"]]></title><description><![CDATA[
<p>That's probably more a function of more people being in the road than people not understanding what object they're about to hit.</p>
]]></description><pubDate>Thu, 29 Jan 2026 18:31:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=46814329</link><dc:creator>Rudybega</dc:creator><comments>https://news.ycombinator.com/item?id=46814329</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46814329</guid></item><item><title><![CDATA[New comment by Rudybega in "Waymo robotaxi hits a child near an elementary school in Santa Monica"]]></title><description><![CDATA[
<p>"Oh that obstructing object doesn't look like a child? Gun it, YOLO." Lmao.<p>I suspect the cars are trying to avoid running into anything, as that's generally considered bad.</p>
]]></description><pubDate>Thu, 29 Jan 2026 18:30:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=46814323</link><dc:creator>Rudybega</dc:creator><comments>https://news.ycombinator.com/item?id=46814323</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46814323</guid></item></channel></rss>