<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: irthomasthomas</title><link>https://news.ycombinator.com/user?id=irthomasthomas</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Fri, 10 Apr 2026 09:25:48 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=irthomasthomas" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by irthomasthomas in "Claude mixes up who said what and that's not OK"]]></title><description><![CDATA[
<p>I have suffered a lot with this recently. I have been using llms to analyze my llm history. It frequently gets confused and responds to prompts in the data. In one case I woke up to find that it had fixed numerous bugs in a project I abandoned years ago.</p>
]]></description><pubDate>Thu, 09 Apr 2026 12:59:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=47703152</link><dc:creator>irthomasthomas</dc:creator><comments>https://news.ycombinator.com/item?id=47703152</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47703152</guid></item><item><title><![CDATA[New comment by irthomasthomas in "Project Glasswing: Securing critical software for the AI era"]]></title><description><![CDATA[
<p>That still leaves open the possibility that they reduce model quality due to profit. ;p</p>
]]></description><pubDate>Wed, 08 Apr 2026 00:04:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=47682938</link><dc:creator>irthomasthomas</dc:creator><comments>https://news.ycombinator.com/item?id=47682938</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47682938</guid></item><item><title><![CDATA[New comment by irthomasthomas in "Trump says 'a whole civilization will die tonight' if Iran does not make a deal"]]></title><description><![CDATA[
<p>Apparently they came with 75 meters of bombing an nuclear power plant already. A plant with 10x the material Chernobyl had, and in vulnerable above-ground storage. <a href="https://www.ndtv.com/world-news/middle-east-war-why-attacks-near-irans-bushehr-nuclear-plant-alarm-the-gulf-11317406" rel="nofollow">https://www.ndtv.com/world-news/middle-east-war-why-attacks-...</a></p>
]]></description><pubDate>Tue, 07 Apr 2026 19:53:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=47680519</link><dc:creator>irthomasthomas</dc:creator><comments>https://news.ycombinator.com/item?id=47680519</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47680519</guid></item><item><title><![CDATA[New comment by irthomasthomas in "Artemis computer running two instances of MS outlook; they can't figure out why"]]></title><description><![CDATA[
<p>Is this incident not reason enough? Astronauts in space are needing remote support to debug it, and taking up priceless mission time.</p>
]]></description><pubDate>Thu, 02 Apr 2026 21:34:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=47620494</link><dc:creator>irthomasthomas</dc:creator><comments>https://news.ycombinator.com/item?id=47620494</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47620494</guid></item><item><title><![CDATA[New comment by irthomasthomas in "Claude Code's source code has been leaked via a map file in their NPM registry"]]></title><description><![CDATA[
<p>Actually, this could be a case where its useful. Even it only catches half the complaints, that's still a lot of data, far more than ordinary telemetry used to collect.</p>
]]></description><pubDate>Tue, 31 Mar 2026 17:27:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=47590695</link><dc:creator>irthomasthomas</dc:creator><comments>https://news.ycombinator.com/item?id=47590695</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47590695</guid></item><item><title><![CDATA[New comment by irthomasthomas in "Claude Code's source code has been leaked via a map file in their NPM registry"]]></title><description><![CDATA[
<p>This just proves its vibe coded because LLMs love writing solutions like that. I probably have a hundred examples just like it in my history.</p>
]]></description><pubDate>Tue, 31 Mar 2026 14:13:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=47587699</link><dc:creator>irthomasthomas</dc:creator><comments>https://news.ycombinator.com/item?id=47587699</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47587699</guid></item><item><title><![CDATA[New comment by irthomasthomas in "TurboQuant: Redefining AI efficiency with extreme compression"]]></title><description><![CDATA[
<p>it enables models larger than was previously possible.</p>
]]></description><pubDate>Wed, 25 Mar 2026 17:58:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=47520939</link><dc:creator>irthomasthomas</dc:creator><comments>https://news.ycombinator.com/item?id=47520939</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47520939</guid></item><item><title><![CDATA[New comment by irthomasthomas in "TurboQuant: Redefining AI efficiency with extreme compression"]]></title><description><![CDATA[
<p>Efficiency gains can be used to make existing models more profitable, or to make new larger and more intelligent models.</p>
]]></description><pubDate>Wed, 25 Mar 2026 12:55:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=47516711</link><dc:creator>irthomasthomas</dc:creator><comments>https://news.ycombinator.com/item?id=47516711</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47516711</guid></item><item><title><![CDATA[New comment by irthomasthomas in "Vatican Rebukes Peter Thiel's Antichrist Lectures in Rome"]]></title><description><![CDATA[
<p>Its possibly just an SEO trick. People have been calling Thiel the antichrist for a long time.</p>
]]></description><pubDate>Sun, 22 Mar 2026 14:55:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=47478169</link><dc:creator>irthomasthomas</dc:creator><comments>https://news.ycombinator.com/item?id=47478169</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47478169</guid></item><item><title><![CDATA[New comment by irthomasthomas in "Profiling Hacker News users based on their comments"]]></title><description><![CDATA[
<p>A friend made a cli tool, ideal for agents, which does this and can aggregate intelligence across multiple platforms.<p><a href="https://github.com/bm-github/owasp-social-osint-agent" rel="nofollow">https://github.com/bm-github/owasp-social-osint-agent</a></p>
]]></description><pubDate>Sun, 22 Mar 2026 10:57:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=47476257</link><dc:creator>irthomasthomas</dc:creator><comments>https://news.ycombinator.com/item?id=47476257</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47476257</guid></item><item><title><![CDATA[New comment by irthomasthomas in "Many SWE-bench-Passing PRs would not be merged"]]></title><description><![CDATA[
<p>Have you tried meta-prompts e.g.
"Rewrite the prompt to improve the perceived taste and expertise of the author"</p>
]]></description><pubDate>Thu, 12 Mar 2026 15:45:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=47352494</link><dc:creator>irthomasthomas</dc:creator><comments>https://news.ycombinator.com/item?id=47352494</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47352494</guid></item><item><title><![CDATA[New comment by irthomasthomas in "No, it doesn't cost Anthropic $5k per Claude Code user"]]></title><description><![CDATA[
<p>Opus doubled in speed with version  4.5, leading me to speculate that they had promoted a sonnet size model. The new faster opus was the same speed as Gemini 3 flash running on the same TPUs. I think anthropics margins are probably the highest in the industry, but they have to chop that up with google by renting their TPUs.</p>
]]></description><pubDate>Tue, 10 Mar 2026 10:50:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=47321530</link><dc:creator>irthomasthomas</dc:creator><comments>https://news.ycombinator.com/item?id=47321530</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47321530</guid></item><item><title><![CDATA[New comment by irthomasthomas in "Pentagon formally labels Anthropic supply-chain risk"]]></title><description><![CDATA[
<p>They will rename it The Free Democratic Republic of America.</p>
]]></description><pubDate>Thu, 05 Mar 2026 22:10:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=47268007</link><dc:creator>irthomasthomas</dc:creator><comments>https://news.ycombinator.com/item?id=47268007</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47268007</guid></item><item><title><![CDATA[New comment by irthomasthomas in "Agentic Engineering Patterns"]]></title><description><![CDATA[
<p>Here is an example where the prompt was only a few hundred tokens and the output reasoning chain was correct, but the actual function call was wrong <a href="https://x.com/xundecidability/status/2005647216741105962?s=20" rel="nofollow">https://x.com/xundecidability/status/2005647216741105962?s=2...</a></p>
]]></description><pubDate>Wed, 04 Mar 2026 12:44:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=47246678</link><dc:creator>irthomasthomas</dc:creator><comments>https://news.ycombinator.com/item?id=47246678</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47246678</guid></item><item><title><![CDATA[New comment by irthomasthomas in "Agentic Engineering Patterns"]]></title><description><![CDATA[
<p>Here is an example where the prompt was only a few hundred tokens and the output reasoning chain was correct, but the actual function call was wrong <a href="https://x.com/xundecidability/status/2005647216741105962?s=20" rel="nofollow">https://x.com/xundecidability/status/2005647216741105962?s=2...</a></p>
]]></description><pubDate>Wed, 04 Mar 2026 12:43:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=47246671</link><dc:creator>irthomasthomas</dc:creator><comments>https://news.ycombinator.com/item?id=47246671</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47246671</guid></item><item><title><![CDATA[New comment by irthomasthomas in "Bet on German Train Delays"]]></title><description><![CDATA[
<p>No they don't, the practice was banned some time ago. You now require a "insurable interest".
<a href="https://en.wikipedia.org/wiki/Marine_Insurance_Act_1745#:~:text=The%20purpose%20of,policies%20against%20gambling" rel="nofollow">https://en.wikipedia.org/wiki/Marine_Insurance_Act_1745#:~:t...</a></p>
]]></description><pubDate>Wed, 04 Mar 2026 12:24:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=47246477</link><dc:creator>irthomasthomas</dc:creator><comments>https://news.ycombinator.com/item?id=47246477</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47246477</guid></item><item><title><![CDATA[New comment by irthomasthomas in "Bet on German Train Delays"]]></title><description><![CDATA[
<p>People used to bet on ships sinking and sailors drowning. 
Till they learned better.<p>Edit:
This was common until Parliament passed the Marine Insurance Act of 1745.<p>Before that, speculators could take out "wagering policies" on vessels they had no connection to. This created "coffin ships" - unseaworthy vessels sent to sea because the insurance payout for a wreck was worth more than the ship itself. The law introduced "insurable interest," meaning you cannot bet on a disaster unless you stand to lose something if it happens. This removed the incentive for sabotage and murder for profit.<p>Modern prediction markets are heading toward the same problem. Betting on train delays or bridge collapses without having any stake gives bad actors a reason to cause it. If the cost of sabotage is lower than the payout, the market effectively pays for the disaster to happen.<p>Whoever downvoted this wants you to ignore centuries of legal precedent designed to prevent exactly this kind of blood money. Those who ignore the lessons of the past learn wisdom in blood... <a href="https://en.wikipedia.org/wiki/Coffin_ship_(insurance)#:~:text=A%20coffin%20ship,sunk%20than%20afloat." rel="nofollow">https://en.wikipedia.org/wiki/Coffin_ship_(insurance)#:~:tex...</a>
<a href="https://en.wikipedia.org/wiki/Marine_Insurance_Act_1745#:~:text=The%20purpose%20of,policies%20against%20gambling" rel="nofollow">https://en.wikipedia.org/wiki/Marine_Insurance_Act_1745#:~:t...</a></p>
]]></description><pubDate>Wed, 04 Mar 2026 11:57:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=47246228</link><dc:creator>irthomasthomas</dc:creator><comments>https://news.ycombinator.com/item?id=47246228</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47246228</guid></item><item><title><![CDATA[New comment by irthomasthomas in "Agentic Engineering Patterns"]]></title><description><![CDATA[
<p>I do this too, but then you need some method to handle it, because now you have to read and test and verify multiple work streams. It can become overwhelming. In the past week I had the following problems from parallel agents:<p>Gemini running an benchmark- everything ran smoothly for an hour. But on verification it had hallucinated the model used for judging, invalidating the whole run.<p>Another task used Opus and I manually specified the model to use. It still used the wrong model.<p>This type of hallucination has happened to me at least 4-5 times in the past fortnight using opus 4.6 and gemini-3.1-pro. GLM-5 does not seem to hallucinate so much.<p>So if you are not actively monitoring your agent and making the corrections, you need something else that is.</p>
]]></description><pubDate>Wed, 04 Mar 2026 11:50:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=47246159</link><dc:creator>irthomasthomas</dc:creator><comments>https://news.ycombinator.com/item?id=47246159</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47246159</guid></item><item><title><![CDATA[New comment by irthomasthomas in "Why XML tags are so fundamental to Claude"]]></title><description><![CDATA[
<p>The main thing i use xml tags for is seperating content from instructions. Say I am doing prompt engineering, so that the content being operated on is itself a prompt then I wrap it with<p><NO_OP_DRAFT>
draft prompt
</NO_OP_DRAFT><p>instructions for modifying draft prompt<p>If I don't do this, a significant number of times it responds to the instructions in the draft.</p>
]]></description><pubDate>Mon, 02 Mar 2026 10:40:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=47216182</link><dc:creator>irthomasthomas</dc:creator><comments>https://news.ycombinator.com/item?id=47216182</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47216182</guid></item><item><title><![CDATA[New comment by irthomasthomas in "Our Agreement with the Department of War"]]></title><description><![CDATA[
<p>Why would you want a duplicitous CEO in charge of your countries terminator systems?</p>
]]></description><pubDate>Sun, 01 Mar 2026 13:33:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=47206562</link><dc:creator>irthomasthomas</dc:creator><comments>https://news.ycombinator.com/item?id=47206562</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47206562</guid></item></channel></rss>