<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: gymbeaux</title><link>https://news.ycombinator.com/user?id=gymbeaux</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sun, 21 Jun 2026 02:23:50 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=gymbeaux" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by gymbeaux in "GPT-5.5 hallucinates 3x more than MIT-licensed GLM-5.2"]]></title><description><![CDATA[
<p>Those numbers are abysmal. Should we really be using LLMs to write our code? I have a theory- LLMs can spit out code that gets the job done and looks ok, maybe even great, but contains small “anomalies” that compound over time. An enterprise app developed entirely with LLM-happy devs might end up virtually unmaintainable.<p>I’m not sure how to explain it, but the more I see LLM-written code the more I feel it’s bad code doing a good job of masquerading as good code. I think this take will become less-hot in the next year or two when we see enterprise greenfield projects that were created entirely with LLM “assistance” go to prod. I think we’ll find that the code is difficult for humans to read, understand, debug, and extend- and I think the larger the codebase the harder it will be for LLMs to maintain. More opportunity for hallucination, larger context windows needed, more tokens bought and spent for smaller and smaller code changes. I think the more code an LLM writes for an app, the worse that codebase becomes.</p>
]]></description><pubDate>Sat, 20 Jun 2026 05:49:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=48606671</link><dc:creator>gymbeaux</dc:creator><comments>https://news.ycombinator.com/item?id=48606671</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48606671</guid></item><item><title><![CDATA[New comment by gymbeaux in "S&P 500 rejects SpaceX, also blocking entry for OpenAI and Anthropic"]]></title><description><![CDATA[
<p>Sure but there’s <i>no way</i> Enron was the last Enron. Also keep in mind something can be “legal” and still misleading to shareholders. I’m sure there are loopholes, some of which may have been reintroduced by Republicans in the last 20 years.</p>
]]></description><pubDate>Sat, 06 Jun 2026 22:10:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=48429510</link><dc:creator>gymbeaux</dc:creator><comments>https://news.ycombinator.com/item?id=48429510</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48429510</guid></item><item><title><![CDATA[New comment by gymbeaux in "S&P 500 rejects SpaceX, also blocking entry for OpenAI and Anthropic"]]></title><description><![CDATA[
<p>(P/E ratios aren’t expressed in dollars)<p>So a PE of 500 means it would take 500 years for the earnings of the company to equal the current market cap (price per share X number of shares). This implies absurd (almost certainly impossible) growth over the next 500 years. Of course anyone expecting to pull their investment out and spend it on retirement can’t be looking at a 500-year investment horizon. I suppose the 1% can, though. What the hell else are they going to spend their cash on?</p>
]]></description><pubDate>Sat, 06 Jun 2026 22:08:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=48429489</link><dc:creator>gymbeaux</dc:creator><comments>https://news.ycombinator.com/item?id=48429489</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48429489</guid></item><item><title><![CDATA[New comment by gymbeaux in "U.S. midterms have a cyber problem, but it's not at the ballot box"]]></title><description><![CDATA[
<p>I think most people just vote for whoever’s mailer they like more. Where I live, the Republican candidate just sends out a mailer with him standing next to a firefighter and a cop and that’s really it. Maybe some garb about reducing crime and taxes at the same time. That’s enough for many (most?) to decide whether that’s the candidate they want to vote for.</p>
]]></description><pubDate>Wed, 03 Jun 2026 15:25:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=48385364</link><dc:creator>gymbeaux</dc:creator><comments>https://news.ycombinator.com/item?id=48385364</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48385364</guid></item><item><title><![CDATA[New comment by gymbeaux in "Skillhound: Give your AI access to every public SKILL.md"]]></title><description><![CDATA[
<p>This brings up a good point- can we just have Claude Code create a ripoff of this site and run it locally for free? Probably.</p>
]]></description><pubDate>Wed, 03 Jun 2026 15:18:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=48385270</link><dc:creator>gymbeaux</dc:creator><comments>https://news.ycombinator.com/item?id=48385270</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48385270</guid></item><item><title><![CDATA[New comment by gymbeaux in "The Eternal Sloptember"]]></title><description><![CDATA[
<p>I think it's worth noting that the "work" these LLMs are doing is digital. ChatGPT cannot plow fields, or construct buildings and roads, or otherwise interact with the physical world. LLMs are useful because humanity has evolved (devolved?) into so much intangible work- documents, code, powerpoint presentations... One solar flare and that's all wiped out, and then we can say definitively that LLMs were a waste of time and energy.</p>
]]></description><pubDate>Tue, 02 Jun 2026 19:21:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=48374933</link><dc:creator>gymbeaux</dc:creator><comments>https://news.ycombinator.com/item?id=48374933</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48374933</guid></item><item><title><![CDATA[New comment by gymbeaux in "The Eternal Sloptember"]]></title><description><![CDATA[
<p>Anecdotally I am seeing websites/web portals/et al. are much buggier. I have no doubt that software quality has declined since ChatGPT was made public, and tools like Claude Code are accelerating that "enshittification" of the world's codebase.</p>
]]></description><pubDate>Tue, 02 Jun 2026 19:18:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=48374885</link><dc:creator>gymbeaux</dc:creator><comments>https://news.ycombinator.com/item?id=48374885</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48374885</guid></item><item><title><![CDATA[New comment by gymbeaux in "The Eternal Sloptember"]]></title><description><![CDATA[
<p>I think we should investigate the backgrounds of those making claims one way or another and rely on those backgrounds for determining credibility. I suspect that we'd find that those who are saying LLMs write great, bulletproof code with "100% unit test coverage" (true story- a coworker was bragging about 100% unit test coverage) are not really qualified to be software engineers. This is a trend I have noticed in my org. Those drinking the most LLM kool aid do NOT have an engineering/comp sci degree, have relatively little experience, resumes are incredibly weak (e.g., generic stuff that we've all done as software engineers).<p>We no longer have the luxury of welcoming bootcamp engineers into our field with open arms. We need to protect our craft. Call these fools out or they'll keep spreading hype/FOMO.</p>
]]></description><pubDate>Tue, 02 Jun 2026 19:12:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=48374800</link><dc:creator>gymbeaux</dc:creator><comments>https://news.ycombinator.com/item?id=48374800</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48374800</guid></item><item><title><![CDATA[New comment by gymbeaux in "The Eternal Sloptember"]]></title><description><![CDATA[
<p>The unit test example has been my team's experience as well. The unit tests look good on the surface, but their passing or failing has little predictive value on whether there are actually bugs in the code.<p>Some people have suggested you write the unit tests by hand to basically "check" the LLM's work and keep it honest, but to write good unit tests you have to understand the underlying code, which takes time (since you didn't write it), so to me this is another bullet point that suggests LLMs will eventually be relegated to "StackOverflow+" duty - give me snippets, but I'll still write effectively all the code.</p>
]]></description><pubDate>Tue, 02 Jun 2026 18:44:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=48374432</link><dc:creator>gymbeaux</dc:creator><comments>https://news.ycombinator.com/item?id=48374432</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48374432</guid></item><item><title><![CDATA[New comment by gymbeaux in "Magnifica Humanitas"]]></title><description><![CDATA[
<p>I’ve been a software engineer for over a decade. At what point is my “residency” over? At what point does the “community” decide I can write software with some level of competency and don’t need to solve fizz buzz over and over again?</p>
]]></description><pubDate>Tue, 02 Jun 2026 02:47:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=48365367</link><dc:creator>gymbeaux</dc:creator><comments>https://news.ycombinator.com/item?id=48365367</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48365367</guid></item><item><title><![CDATA[New comment by gymbeaux in "The Eternal Sloptember"]]></title><description><![CDATA[
<p>I think of housing as a better example- KB Home cookie cutter special for $400k or 100-year-old craftsman house for $800k. Most are okay with the $400k cookie cutter special that probably has piss in the walls, because $800k is such a huge increase. I fear most companies won’t want to pay the extra cost for “the same software” to be written by hand.</p>
]]></description><pubDate>Tue, 02 Jun 2026 02:44:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=48365344</link><dc:creator>gymbeaux</dc:creator><comments>https://news.ycombinator.com/item?id=48365344</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48365344</guid></item><item><title><![CDATA[New comment by gymbeaux in "What's gonna happen to software engineers?"]]></title><description><![CDATA[
<p>Even though the synthetic benchmarks paint a picture of LLMs coming a long way since 2022, my practical experience has been that they aren’t tangibly better. No doubt someone reading this will chime in and say LLMs are way better at writing code or whatever, and maybe that’s true, but there’s no difference between ChatGPT 3.5 and Claude Opus 4.8 as far as my trusting the output. Opus 4.8 still messes up plenty. It’s particularly bad with identifying and fixing CI yaml, but it struggles in the usual areas too.<p>So I’m thinking we’ve just about reached apex with LLMs, and they have failed at replacing software engineers (companies can freeze hiring juniors at their own, future peril using any excuse they like).</p>
]]></description><pubDate>Tue, 02 Jun 2026 02:38:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=48365309</link><dc:creator>gymbeaux</dc:creator><comments>https://news.ycombinator.com/item?id=48365309</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48365309</guid></item><item><title><![CDATA[New comment by gymbeaux in "What's gonna happen to software engineers?"]]></title><description><![CDATA[
<p>LLMs seem to be best at writing web apps intended to be internal tools. I don’t need to ever really read the code because the functionality is relatively simple- but still valuable. I had Claude Code build me a CLI tool for running common kubectl commands against our EKS cluster. With EKS (AWS) you generally use the AWS CLI to authenticate and then choose the correct EKS cluster and <i>then</i> you can run your kubectl commands. So this CLI tool remembers all the AWS accounts and EKS cluster names and namespaces and pods that I care about. It’s been a huge time saver, and I never would have had the time to build it as part of my day to day. And because it’s an internal tool, I don’t have to worry about things like security/authentication, and any bugs aren’t as big a deal as if they were customer-facing. I wouldn’t want to use it for generating an app that is going on the public internet and doing anything sensitive/important/valuable with my or other people’s data (I’m sure that doesn’t stop others from vibe coding commercial products).</p>
]]></description><pubDate>Tue, 02 Jun 2026 02:33:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=48365278</link><dc:creator>gymbeaux</dc:creator><comments>https://news.ycombinator.com/item?id=48365278</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48365278</guid></item><item><title><![CDATA[New comment by gymbeaux in "What's gonna happen to software engineers?"]]></title><description><![CDATA[
<p>LLMs have taken much of the enjoyment out of coding for me, but I don’t think it has to be that way. I hope that as an industry we settle on LLMs being more like tools than human assistants. I think most of the engineers at my company are using LLMs as human assistants- most of them have agentic workflows set up and have premium subscriptions to Claude AND Gemini AND ChatGPT. Many have local LLMs running on their company MacBook Pros, but they can never manage to describe to me in plain English what those local LLMs are doing. I would compare local LLMs to Raspberry Pi clusters. They’re neat, and technically they can do <i>stuff</i> but they are incredibly impractical. I want so desperately to have the power of Claude Opus running locally, but we are incredibly, ridiculously, extremely far off (benchmarks are often misleading and nobody likes to talk about the paltry tokens/s they’re getting on their home rig).<p>So the problem for me is that I’ve noticed a trend with my coworkers- they usually don’t have a formal CS background (e.g., Comp Sci degree), their resumes are unimpressive, and they have personal websites where anyone can download a copy of their resume. I haven’t seen this at any previous job. It’s just weird, man. Anyway, so my problem is when they are purported to be the best of the best at this company, and they’re chugging the LLM kool-aid, I’m struggling to figure out whether they’re really ahead of the curve and getting the drop on the rest of us, or they’re doing basically no work and waving through each other’s PRs with little to no oversight on what the LLMs are writing. The bush league nature of many of the bugs I’m seeing going all the way to prod makes me think it’s the latter.<p>Meanwhile, these guys and their teams are completing an absurd number of story points each sprint. My team is dead last (and coincidentally we’re all pretty anti-AI for writing code).<p>I’m waiting for the other shoe to drop. I’m waiting for the C suite to call our project a failure and lay everyone off. The product <i>generally</i> works, though I think if the devs were truly rockstars, things would be much smoother. It seems like LLMs are letting middle-of-the-road or mediocre devs appear to be rockstars purely on output speed. Nobody in this org cares if the output has 4x the bugs it should have, just that it was completed quickly.<p>The main issue I think is that nobody- including the rockstar tech leads- knows how the system works. They don’t know the code. There’s a bug and it takes forever to track down. Don’t ask about unit tests.<p>I think in a perfect world, these LLMs are treated like another tool rather than another person- meaning you and I still write a large portion of the code, and we read and understand every line- especially the lines generated by LLMs. It’s still a huge productivity multiplier for me to not have to remember how to do mundane things like write out arrays into CSV files- before I’d find a close implementation on StackOverflow and tweak to my needs. Now I can just describe what I want the CSV to look like and Claude Code generally gets it right on the first try.</p>
]]></description><pubDate>Tue, 02 Jun 2026 02:28:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=48365239</link><dc:creator>gymbeaux</dc:creator><comments>https://news.ycombinator.com/item?id=48365239</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48365239</guid></item><item><title><![CDATA[New comment by gymbeaux in "Magnifica Humanitas"]]></title><description><![CDATA[
<p>Enlighten us to these drawbacks. On the surface I am inclined to say the pros would outweigh the cons. Compared to other professions, software engineering seems to struggle the most with H-1B/Green Card abuse and interview processes. Job interviews are absurdly different (easier) for doctors, lawyers, et al. than for software engineers, and that I believe is because of the licensure. I do think licensure adds overhead to an industry (e.g., malpractice insurance, governing bodies, license management) and that probably discourages anyone with real power (like FAANG) to pursue it and try to set it as an industry-wide standard. Most software engineers in the U.S. are making around $130-140k, but lawyers and medical doctors usually make significantly more (perhaps because of the licensure overhead - I'm not sure if malpractice insurance is included in a medical doctor's salary- I would imagine it's not and is taken out of each paycheck like any other industry's health insurance benefits).</p>
]]></description><pubDate>Mon, 25 May 2026 17:35:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=48269461</link><dc:creator>gymbeaux</dc:creator><comments>https://news.ycombinator.com/item?id=48269461</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48269461</guid></item><item><title><![CDATA[New comment by gymbeaux in "Magnifica Humanitas"]]></title><description><![CDATA[
<p>Great message from the Pope (I never thought I'd write those words). I'm glad he recognizes what's going on with AI (LLMs, really) as one of the highest areas of concern for humanity and the Earth and is using his influence to encourage things like renewable energy investment. It's a shame he isn't in a position of "real" power and we're still at the mercy of the worst U.S. president in history.<p>One thing that isn't talked about enough is how so much is going into AI but not much is coming out of it. The rhetoric from tech bros is still that AI is "going to" change the world. Hasn't really changed the world yet, except driven up everyone's utility bills and put hundreds of thousands of people out of work.<p>I encourage people, especially software engineers here, to remember the previous "new hotness" tech advancements - blockchain and NoSQL DBs being two recent examples. In both instances there was a flood of VC money into startups that have mostly failed because each was supposed to change the world (or at least change software). I spent a lot of my free time in those days trying to "find a problem" for blockchain and NoSQL to solve. I remember thinking I must be a lousy software engineer because I just wasn't getting the hype. Now I know it's because whenever something new comes out, people talk about how it can do X Y and Z, and there's a disconnect between what a technology CAN do, and what it SHOULD do. I can use blockchain for all sorts of things, but in most cases it wouldn't be the best option. Same for NoSQL DBs. Same for LLMs. The more you understand blockchain, the more you realize it's only good as a globally-distributed, immutable ledger - and currency is the only practical application of that in our society today (e.g., Bitcoin). NoSQL is the same way - yeah you can use MongoDB for whatever app, but it's going to be a bad time maintaining and scaling when you're storing relatively simple and consistent records. A CRUD app usually doesn't warrant a NoSQL DB.<p>LLMs are the same. I'm finding they are good at search-type tasks, where frankly not much "thinking" is involved. Therefore, with respect to writing software, they are best suited for simple, internal tools. Even then, I have to baby it, especially with today's LLMs. Claude Opus has been nerfed (most-likely quantized) to make space in the datacenters for Mythos, and eventually Mythos will be bad too for the same reasons. The question becomes, as it always has, "will LLMs rise up to the challenge" and history tells us "no." These things never live up to the hype. When you understand how LLMs work, you understand why ChatGPT 3.5 isn't that much worse than GPT 5.5, artificial benchmarks be damned.</p>
]]></description><pubDate>Mon, 25 May 2026 17:21:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=48269263</link><dc:creator>gymbeaux</dc:creator><comments>https://news.ycombinator.com/item?id=48269263</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48269263</guid></item><item><title><![CDATA[New comment by gymbeaux in "Show HN: Countries where you can leave your MacBook at a random coffee shop"]]></title><description><![CDATA[
<p>Surely they’re using a third party mapping library and they have absolutely 0 control over what the borders look like in disputed areas.</p>
]]></description><pubDate>Sun, 10 May 2026 14:14:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=48084165</link><dc:creator>gymbeaux</dc:creator><comments>https://news.ycombinator.com/item?id=48084165</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48084165</guid></item><item><title><![CDATA[New comment by gymbeaux in "I built a 516-panel financial terminal in 3 weeks using AI"]]></title><description><![CDATA[
<p>Strong disagree on AI implying it doesn’t actually work. Staff engineers at FAANG are vibe coding these days and their stuff absolutely works.</p>
]]></description><pubDate>Wed, 01 Apr 2026 04:48:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=47596944</link><dc:creator>gymbeaux</dc:creator><comments>https://news.ycombinator.com/item?id=47596944</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47596944</guid></item><item><title><![CDATA[New comment by gymbeaux in "I built a 516-panel financial terminal in 3 weeks using AI"]]></title><description><![CDATA[
<p>Fascinating. It’s called Instant Bloomberg and it’s just AOL instant messenger but only between Bloomberg Terminals. So that is absolutely the “data that insiders rely on that pedestrians will never have” and I’m sure it offers tremendous advantage and may explain why stocks trend in a direction for extended periods of time absent any public news.<p>I mean there’s no way average joes like us stand a chance doing anything but dollar cost averaging into index funds.<p>E: imagine if there was a law passed that required those IMs to be public in near-real-time (releasing the transcripts days later defeats the purpose).</p>
]]></description><pubDate>Wed, 01 Apr 2026 04:46:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=47596934</link><dc:creator>gymbeaux</dc:creator><comments>https://news.ycombinator.com/item?id=47596934</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47596934</guid></item><item><title><![CDATA[New comment by gymbeaux in "I built a 516-panel financial terminal in 3 weeks using AI"]]></title><description><![CDATA[
<p>As far as I know, finding quality data sources is virtually impossible unless you already have so much money that your involvement in the stock market is either because you have insider knowledge or severe boredom owing to your never needing to work a day in your life.<p>I remember some startup selling a stock market data API as a subscription. I don’t think they exist anymore. So anyone who spent weeks, months of their free time building an app around that API is now completely shit out of luck.<p>I suspect the <i>real</i> APIs are still running the same code they ran in the 90s and if you have to ask how much they cost, you can’t afford them.</p>
]]></description><pubDate>Wed, 01 Apr 2026 04:42:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=47596916</link><dc:creator>gymbeaux</dc:creator><comments>https://news.ycombinator.com/item?id=47596916</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47596916</guid></item></channel></rss>