<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: lunar_mycroft</title><link>https://news.ycombinator.com/user?id=lunar_mycroft</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Mon, 29 Jun 2026 22:54:57 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=lunar_mycroft" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by lunar_mycroft in "Tokenmaxxing is dead, long live tokenmaxxing"]]></title><description><![CDATA[
<p>It's funny, because editor choice is also an analogy I use, to argue for the exact opposite conclusion.<p>Your hypothetical developer wouldn't be using notepad because they're unaware of other editors, they'd be using it because they evaluated other editors and concluded that, for whatever reason, they would be worse for them.  I'd be fascinated to hear <i>why</i> they came to that conclusion, but I'm not going to tell them they're wrong if they're performing acceptably, aren't constantly breaking CI because the linter rejects their code, etc.  Everyone is different, and I'm not narcissistic enough to think the fact that <i>I</i> would be way less productive without my modal editor, LSP, linter, terminal multiplexer, etc. justifies forcing everyone else has to adopt my exact setup.</p>
]]></description><pubDate>Mon, 29 Jun 2026 12:31:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=48718412</link><dc:creator>lunar_mycroft</dc:creator><comments>https://news.ycombinator.com/item?id=48718412</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48718412</guid></item><item><title><![CDATA[New comment by lunar_mycroft in "We found a bug in the hyper HTTP library"]]></title><description><![CDATA[
<p>You can set the lints to `forbid` instead of `deny`, which means they can't be `allowed` like that.</p>
]]></description><pubDate>Mon, 29 Jun 2026 12:20:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=48718320</link><dc:creator>lunar_mycroft</dc:creator><comments>https://news.ycombinator.com/item?id=48718320</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48718320</guid></item><item><title><![CDATA[New comment by lunar_mycroft in "Only 16 Percent of Americans Think AI Will Have a Positive Impact on Society"]]></title><description><![CDATA[
<p>Normal users <i>aren't using harnesses</i> in the sense developers think of them.  They're interacting with models where they've been shoehorned in for no good reason, or they're using them nearly entirely through chat interfaces.</p>
]]></description><pubDate>Wed, 17 Jun 2026 19:52:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=48575907</link><dc:creator>lunar_mycroft</dc:creator><comments>https://news.ycombinator.com/item?id=48575907</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48575907</guid></item><item><title><![CDATA[New comment by lunar_mycroft in "Only 16 Percent of Americans Think AI Will Have a Positive Impact on Society"]]></title><description><![CDATA[
<p>Is Claude really that much better than all the others for normal use?</p>
]]></description><pubDate>Wed, 17 Jun 2026 17:56:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=48574116</link><dc:creator>lunar_mycroft</dc:creator><comments>https://news.ycombinator.com/item?id=48574116</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48574116</guid></item><item><title><![CDATA[New comment by lunar_mycroft in "AI is slowing down"]]></title><description><![CDATA[
<p>(Sorry I'm being vague, but I'm not sure I'm not sure what's public knowledge)<p>The cap is moderately above the high subscription tiers, and managements/the executives were clearly extremely concerned about how expensive it would be if we all or even mostly came close to hitting it.  I heard that they originally wanted to go lower but the developers in the pilot program blew past their planned limit very quickly.<p>As for the company, its almost entirely B2B SaaS (I think it has some offerings that are used by consumers, but they're mostly/entirely paid for by another business on behalf of <i>their</i> customers), and they have developers all over the world, although the headquarters and biggest group of developers is in silicon valley (my office is in the midwest).</p>
]]></description><pubDate>Tue, 09 Jun 2026 07:24:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=48457753</link><dc:creator>lunar_mycroft</dc:creator><comments>https://news.ycombinator.com/item?id=48457753</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48457753</guid></item><item><title><![CDATA[New comment by lunar_mycroft in "AI is slowing down"]]></title><description><![CDATA[
<p>Uber is <i>not</i> the only company that's putting a per-developer limit on AI spending.  I know this because I work for another one (and we have a significantly lower limit).  You just heard about Uber first because they're high profile.</p>
]]></description><pubDate>Mon, 08 Jun 2026 23:42:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=48454024</link><dc:creator>lunar_mycroft</dc:creator><comments>https://news.ycombinator.com/item?id=48454024</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48454024</guid></item><item><title><![CDATA[New comment by lunar_mycroft in "Bun Has Been Converted to Rust. Now What?"]]></title><description><![CDATA[
<p>I'm not sure I'd classify "double" as "the same ballpark", but either way C2Rust appears to just mark everything as `unsafe` (even when it's completely unnecessary), so I don't think that's a fair comparison.  Further, I know for a fact that their are instances of the `unsafe` in the bun codebase right now that are just trivially incorrect in rust.  A vaguely competent human programmer, even one doing a 1:1 translation (which is likely a bad idea anyway) would have caught those.</p>
]]></description><pubDate>Wed, 03 Jun 2026 16:33:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=48386241</link><dc:creator>lunar_mycroft</dc:creator><comments>https://news.ycombinator.com/item?id=48386241</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48386241</guid></item><item><title><![CDATA[New comment by lunar_mycroft in "Bun Has Been Converted to Rust. Now What?"]]></title><description><![CDATA[
<p>Yes and no.  In theory, you could start to go through an factor out unnecessary unsafe blocks from the codebase now.  In practice, writing safe rust often requires significantly different design decisions than writing in garbage collected or fully manually memory managed languages, especially if you want the results to be performant.  There's a very good chance you'd be better off rewriting from scratch than trying to do a 1:1 translation/port.</p>
]]></description><pubDate>Wed, 03 Jun 2026 13:37:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=48383882</link><dc:creator>lunar_mycroft</dc:creator><comments>https://news.ycombinator.com/item?id=48383882</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48383882</guid></item><item><title><![CDATA[New comment by lunar_mycroft in "Bun has been converted to rust. Now what?"]]></title><description><![CDATA[
<p>Bun has about twice the density of `unsafe` compared to deno, which does roughly the same job (wrap a c/c++ javascript engine to make a server side runtime written in rust).  So not as massive a difference as the linked post's comparison, but still significantly more unsafe than we'd expect.</p>
]]></description><pubDate>Wed, 03 Jun 2026 13:29:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=48383766</link><dc:creator>lunar_mycroft</dc:creator><comments>https://news.ycombinator.com/item?id=48383766</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48383766</guid></item><item><title><![CDATA[New comment by lunar_mycroft in "Bun has been converted to rust. Now what?"]]></title><description><![CDATA[
<p>> Comparing unsafe counts to UV makes no sense here.<p>Agreed.  The better comparison is Deno, which does roughly the same thing (wrapping a c/c++ javascript engine to create a serverside runtime).  Deno was smaller, but per line of code/file it had just over half the unsafe as the bun slop rewrite.  There are also examples of unsafe blocks that are just <i>locally</i> incorrect [0], meaning you don't even need to read outside the function and it's definition to determine that the entity doing the porting (in this case claude) didn't understand how unsafe rust works.<p>> I am skeptical why Bun was even an acquisition target in the first place, other than pulling stunts like this.<p>From what I've heard, it sounds like the author had become fully hooked by LLM coding before the acquisition, so I don't think we know enough to conclude that this was premeditated by Anthropic.  I think the more likely trigger is that Bun had just bragged about how they'd forked the zig language for a performance increase and attacked zig for it's no-AI policy, only for a zig core team member to point out that the code couldn't be upstreamed even if it had been hand written [1].<p>The better point, IMO, is how this is an example of how AI use/addiction makes developers who fall into it loose the ability to judge code quality, particularly for code they prompted for themselves.  From what I've heard of Mr. Sumner and the team at oven from before AI, I very much doubt code of this poor quality would have been allowed anywhere near the main branch before LLMs.<p>[0] <a href="https://github.com/oven-sh/bun/blob/d2a6506dfad4c3ef3dddb9ae1adec875eb5734bd/src/bun_core/string/MutableString.rs#L413" rel="nofollow">https://github.com/oven-sh/bun/blob/d2a6506dfad4c3ef3dddb9ae...</a><p>[1] <a href="https://ziggit.dev/t/bun-s-zig-fork-got-4x-faster-compilation-times/15183/19" rel="nofollow">https://ziggit.dev/t/bun-s-zig-fork-got-4x-faster-compilatio...</a></p>
]]></description><pubDate>Wed, 03 Jun 2026 13:19:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=48383627</link><dc:creator>lunar_mycroft</dc:creator><comments>https://news.ycombinator.com/item?id=48383627</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48383627</guid></item><item><title><![CDATA[New comment by lunar_mycroft in "SQLite is all you need for durable workflows"]]></title><description><![CDATA[
<p>First, you're very likely underestimating how much load SQLite can handle.  SQLite is usually write limited, but for smallish writes it can easily handle thousands a second with very trivial optimization, and with more thought can scale to tens or hundreds of thousands of write transactions per second.  In some cases, it can actually out perform traditional server based RDMSs because of reduced overhead and because holding locks on network timescales (which will likely happen even for databases with multiple writers, because eventually you have to deal with two transactions needing to write to the same place) is very inefficient.<p>Second, I think you're overestimating how hard sharding is here.  There are plenty of use cases for which sharding isn't just easy to set up, but the natural thing you'd be likely to do even without scale.  Things like e.g. a helpdesk SaaS, where each customer/organization has it's own independent data.<p>Third, a large part of the point is that you are unlikely to know ahead of time you're going to "scale like that".  As I already pointed out, most SaaS apps do not end up having many users.  For some that's intentional, but for others the reason is that they simply never caught on.  For those cases (and they're the vast majority), cosplaying as a much larger app is a complete waste.  It's much better to wait until you're successful enough to need to switch and then use the revenue you <i>now</i> have to solve the scaling problem you ran into.<p>Fourth, as an aside, ironically the <i>other</i> (slightly less) easy way to shard superficially resembles a common way people cosplay as netflix: splitting your data by domain as "microservices" (although there's a good chance they don't need to be independent processes/on independent machines).</p>
]]></description><pubDate>Tue, 02 Jun 2026 14:57:23 +0000</pubDate><link>https://news.ycombinator.com/item?id=48371128</link><dc:creator>lunar_mycroft</dc:creator><comments>https://news.ycombinator.com/item?id=48371128</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48371128</guid></item><item><title><![CDATA[New comment by lunar_mycroft in "AI job grief: A psychological crisis hitting tech workers"]]></title><description><![CDATA[
<p>I didn't say they did.  What I said was that <i>if</i> the person I was replying to was correct that average Elo ratings had increased because of AI, the mechanism would have to be "AI makes lower ranked players give up" instead of "AI makes players better on average"</p>
]]></description><pubDate>Tue, 02 Jun 2026 07:03:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=48366956</link><dc:creator>lunar_mycroft</dc:creator><comments>https://news.ycombinator.com/item?id=48366956</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48366956</guid></item><item><title><![CDATA[New comment by lunar_mycroft in "Vibe Coding Is Not Engineering"]]></title><description><![CDATA[
<p>Two possibilities:<p>1. Your unit tests are exacting enough to fully specify the unit.  In that case, congratulations, your unit tests <i>are</i> the code.  They're also probably much more awkward to write, maintain, etc.  Also, the compilation step to go from the unit tests to the actual code is now orders of magnitude more expensive, requires a SaaS to even work, etc.<p>2. Your unit tests are not that exacting and still leave ambiguity, edge cases, etc.  In that case it very much matters what's in the blob of code, because while it could be a correct implementation of what you wanted, it could also be something else entirely that just happens to be correct <i>for the part you did specify</i>.</p>
]]></description><pubDate>Sat, 30 May 2026 17:45:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=48338836</link><dc:creator>lunar_mycroft</dc:creator><comments>https://news.ycombinator.com/item?id=48338836</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48338836</guid></item><item><title><![CDATA[New comment by lunar_mycroft in "AI job grief: A psychological crisis hitting tech workers"]]></title><description><![CDATA[
<p>Right, but you have to give the new players <i>some</i> rating, and as long as that number remains constant it will also be the mean Elo.  Therefore, "new players entering the pool" can't cause a rise in mean Elo.  As for lower rated players leaving, that could indeed raise the mean (assuming you stop counting them in the average), but that changes the point from "players have gotten better because of AI" to "worse players are more likely to just give up on chess because of AI", which is a significantly less optimistic picture IMO.</p>
]]></description><pubDate>Sat, 30 May 2026 17:11:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=48338512</link><dc:creator>lunar_mycroft</dc:creator><comments>https://news.ycombinator.com/item?id=48338512</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48338512</guid></item><item><title><![CDATA[New comment by lunar_mycroft in "AI job grief: A psychological crisis hitting tech workers"]]></title><description><![CDATA[
<p>Elo is zero sum.  Each point gained by one player is lost by another.  It follows that the mean elo is always exactly equal to the initial elo assigned to new players before they play any games, and can't change over time.  If the highest ranked player are higher elo than before, the lower ranked players must be <i>lower</i> elo.</p>
]]></description><pubDate>Sat, 30 May 2026 16:40:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=48338149</link><dc:creator>lunar_mycroft</dc:creator><comments>https://news.ycombinator.com/item?id=48338149</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48338149</guid></item><item><title><![CDATA[New comment by lunar_mycroft in "Anthropic surpasses OpenAI to become most valuable AI startup"]]></title><description><![CDATA[
<p>> This February I tried again and used Claude to generate Rust code. I have never been more stunned in my life. It's just as good as I am, and 30x faster. No fluff, the code is verbatim just as I would have written.<p>Having looked at a bunch of known or suspected (based on the intent of the code and/or what I know about the developer(s)) LLM generated rust, there's only a few explanations here:<p>1. You're <i>way</i> better at prompting than (virtually) anyone else.<p>2. You're vastly overestimating how good the rust code it produced is.<p>3. You handheld the model throughout and made lots of edits.<p>4. Your hand written rust code is very bad.<p>Because from every example I've seen, these models write <i>horrible</i> rust.  Sure, it may technically pass all the tests, but it's horribly pessimized, badly organized, doesn't even attempt to use the type system, if there aren't bugs <i>now</i> there will be the second it tries to refactor or add a new feature, etc. etc.<p>(I also strongly suspect that the same would be true for other languages, but I can detect it in rust more easily because it's my main language)</p>
]]></description><pubDate>Sat, 30 May 2026 14:48:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=48336860</link><dc:creator>lunar_mycroft</dc:creator><comments>https://news.ycombinator.com/item?id=48336860</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48336860</guid></item><item><title><![CDATA[New comment by lunar_mycroft in "SQLite is all you need for durable workflows"]]></title><description><![CDATA[
<p>The typical multi-user SaaS webapp doesn't have anywhere near enough users to overwhelm a single SQLite instance.  Of the few that do succeed to the point where that's no longer true, a significant fraction can use techniques like sharding to stretch SQLite further.</p>
]]></description><pubDate>Fri, 29 May 2026 20:32:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=48328835</link><dc:creator>lunar_mycroft</dc:creator><comments>https://news.ycombinator.com/item?id=48328835</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48328835</guid></item><item><title><![CDATA[New comment by lunar_mycroft in "Blue Origin's New Glenn blows up during static fire test"]]></title><description><![CDATA[
<p>SpaceX is "in the process" of a lot of things, not all of which pan out.  So far the cases that have actually started serious construction are in Cape Canaveral, and will absolutely be necessary assuming Starship becomes operational (because the number of launches SpaceX is allowed to do from Starbase is limited).</p>
]]></description><pubDate>Fri, 29 May 2026 19:10:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=48327880</link><dc:creator>lunar_mycroft</dc:creator><comments>https://news.ycombinator.com/item?id=48327880</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48327880</guid></item><item><title><![CDATA[New comment by lunar_mycroft in "Blue Origin's New Glenn blows up during static fire test"]]></title><description><![CDATA[
<p>Launch pads are not something you just buy on a whim to keep around just in case you need them.  They're very expensive pieces of infrastructure that you only acquire when you have an actual, known need.  That's how every launch provider that I know of behaves, including SpaceX.</p>
]]></description><pubDate>Fri, 29 May 2026 17:17:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=48326243</link><dc:creator>lunar_mycroft</dc:creator><comments>https://news.ycombinator.com/item?id=48326243</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48326243</guid></item><item><title><![CDATA[New comment by lunar_mycroft in "I think Anthropic and OpenAI have found product-market fit"]]></title><description><![CDATA[
<p>> we've had say superhuman competitive programming performance for awhile<p>Fair.  Question though, is this when compared to competitive programmers, or developers in general?<p>> extremely strong performance (super-p90-engineer) on say language-to-language porting<p>I'd need to see the methodology here and could easily be wrong, but I suspect this is largely down to "faster" and "willing to do a lot more of it without complaining"<p>> RE-bench (ML research engineering benchmark from METR) is already clearly above human perf<p>This pretty much <i>has</i> to be "relative to devs who don't specialize in that area", because if it wasn't the frontier labs wouldn't be paying a fortune to hire ML researchers.<p>> Mythos clearly has superior cyber capabilities<p>Based on Daniel Stenberg's experience with it [0], it seems like it's at best roughly on par with human experts.  It's advantage is cost/speed.<p>> Also, why do you discount speed and cost so much?<p>Because in all the domains LLMs are applicable to, getting something cheaper/faster at the expense of quality isn't new or particularly interesting.<p>> Every model iteration is a significant bump up in performance according to a lot of complementary and principled measurements. What's been the thing that hasn't been true?<p>That they were good <i>enough</i>.  To reuse the baby analogy, if every week your friend told you that their infant child was now heavier than an elephant (while acknowledging that the baby was lighter than one the previous week), and every week that turned out not to be true, it wouldn't be a defense of your friend to argue "ah, but the baby <i>was</i> heavier every week than the week before".<p>Also worth noting that as of ~8 months ago, while <i>benchmark scores</i> were steadily increasing, <i>merge rates</i> (aka whether the code was "good enough") were not [1].<p>> thats a bit of a crazy ask.<p>Why?  If you use LLMs to do anything you're basically doing that already, it's just that the scope of your Y is smaller.  Either the benchmarks are irrelevant and you're using something else to determine when that's appropriate for a given Y, or you do in fact have a value of X for the Y's you've handed over to LLMs.<p>> Yet one side has a mountain of hard evidence and the other side has...an outdated n < 20 METR study using Sonnet 3?<p>There's a lot of irony here, because by <i>far</i> the most common pro-LLM coding argument is "I <i>feel</i> like I'm producing good code faster with them", followed by "this other person <i>feels</i> like they're producing good code faster with them".<p>Also note that the most important part of the METR study you reference wasn't the slowdown they observed, it was the dramatic disagreement between what the participants <i>thought</i> the impact of AI was vs what it <i>actually</i> was.  That <i>isn't</i> dependent on the model.<p>[0] <a href="https://daniel.haxx.se/blog/2026/05/11/mythos-finds-a-curl-vulnerability/" rel="nofollow">https://daniel.haxx.se/blog/2026/05/11/mythos-finds-a-curl-v...</a><p>[1] <a href="https://entropicthoughts.com/no-swe-bench-improvement" rel="nofollow">https://entropicthoughts.com/no-swe-bench-improvement</a></p>
]]></description><pubDate>Fri, 29 May 2026 15:20:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=48324266</link><dc:creator>lunar_mycroft</dc:creator><comments>https://news.ycombinator.com/item?id=48324266</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48324266</guid></item></channel></rss>