<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: ElFitz</title><link>https://news.ycombinator.com/user?id=ElFitz</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sun, 14 Jun 2026 14:36:25 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=ElFitz" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by ElFitz in "Making Claude a Chemist"]]></title><description><![CDATA[
<p>> Feels complex like solving a Rubik's cube to write down synthesis steps but it is all a sequence of memorized tricks. Do Cannizaro if you want this, Bergmann to do that.<p>I remember two years ago, when I actually got into using graph data structures, wondering if maybe the "space" of available reactions for  any given starter and target molecules could be mapped as a graph, with intermediates as nodes and reactions as weighted directed edges, so synthesis becomes pathfinding through chemical space.<p>Turns out, it’s a thing! [^0]<p>Edit: Makes you wonder how much interesting stuff is sitting in plain sight, waiting for someone with the right cross-domain awareness / knowledge / whatever to notice it.<p>[0]: <a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC9574932/" rel="nofollow">https://pmc.ncbi.nlm.nih.gov/articles/PMC9574932/</a></p>
]]></description><pubDate>Sun, 14 Jun 2026 07:46:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=48525090</link><dc:creator>ElFitz</dc:creator><comments>https://news.ycombinator.com/item?id=48525090</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48525090</guid></item><item><title><![CDATA[New comment by ElFitz in "Show HN: Skill for your agent to visualize your gbrain and Obsidian"]]></title><description><![CDATA[
<p>> Ah, I see, sort of like figuring out the boundaries of your knowledge base and seeing if you have missed any connections between concepts?<p>Yeah. And also resurface relevant old notes, ideas, and web clippings I have forgotten. Two recent example[^0][^1].<p>> it could be an interesting synthesis/writing exercise to try to connect concepts that are far removed in your own mental model.<p>As well as different takes on or applications for those concepts, same for sources.<p>[0]: <a href="https://imgur.com/a/jhxUV9i" rel="nofollow">https://imgur.com/a/jhxUV9i</a><p>[1]: <a href="https://imgur.com/a/gmaq9Yw" rel="nofollow">https://imgur.com/a/gmaq9Yw</a></p>
]]></description><pubDate>Sun, 14 Jun 2026 07:22:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=48524962</link><dc:creator>ElFitz</dc:creator><comments>https://news.ycombinator.com/item?id=48524962</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48524962</guid></item><item><title><![CDATA[New comment by ElFitz in "Amazon CEO's Talks with U.S. Officials Triggered Crackdown on Anthropic Models"]]></title><description><![CDATA[
<p>Isn't that what we call public debt?</p>
]]></description><pubDate>Sat, 13 Jun 2026 19:46:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=48520743</link><dc:creator>ElFitz</dc:creator><comments>https://news.ycombinator.com/item?id=48520743</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48520743</guid></item><item><title><![CDATA[New comment by ElFitz in "Show HN: Skill for your agent to visualize your gbrain and Obsidian"]]></title><description><![CDATA[
<p>The graph visualisation itself isn’t much use in my experience.<p>But being able to tie related notes together, and see at the bottom of one which other notes reference it is interesting.<p>Even more now that a LLM can take care of the actual tending and pruning.</p>
]]></description><pubDate>Sat, 13 Jun 2026 10:56:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=48515899</link><dc:creator>ElFitz</dc:creator><comments>https://news.ycombinator.com/item?id=48515899</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48515899</guid></item><item><title><![CDATA[New comment by ElFitz in "Digital Sovereignty Becomes an Imperative as the US Reads Dutch Emails"]]></title><description><![CDATA[
<p>I think it’s not the first time the US has used that sort of interpretation of the law. There’s this one[^0] but also, I believe, an older case, also involving Microsoft, about data in Ireland. But I can’t find it.<p>[0]: <a href="https://hackernoon.com/the-factors-prompting-a-judge-to-issue-warrants-for-microsoft-data-stored-in-ireland" rel="nofollow">https://hackernoon.com/the-factors-prompting-a-judge-to-issu...</a></p>
]]></description><pubDate>Sat, 13 Jun 2026 07:02:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=48514270</link><dc:creator>ElFitz</dc:creator><comments>https://news.ycombinator.com/item?id=48514270</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48514270</guid></item><item><title><![CDATA[New comment by ElFitz in "AI agent bankrupted their operator while trying to scan DN42"]]></title><description><![CDATA[
<p>These kinds of situations are why I gave my AI agents stray thoughts (automated insights / suggestions from a separate llm call with some curated context) that trigger on loop / rabbit hole detection.<p>Quite a bit of false positives, but it hasn’t had any ill-effect so far. Aside from increased quota usage.</p>
]]></description><pubDate>Sat, 13 Jun 2026 06:57:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=48514228</link><dc:creator>ElFitz</dc:creator><comments>https://news.ycombinator.com/item?id=48514228</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48514228</guid></item><item><title><![CDATA[New comment by ElFitz in "AI agent bankrupted their operator while trying to scan DN42"]]></title><description><![CDATA[
<p>Haha. Yes. Much smaller scale versions of this led me to joke with a coding agent that LLMs tended to converge towards "Large corporation infrastructure best practices" when designing cloud infrastructure, when it was only me working on hobby side-projects with nearly no users and that I wouldn’t be able to put food in my fridge if they kept just spinning up VPCs for no reason.<p>Which somehow ended up being a very convincing argument for more frugal engineering, leading to a sort of "mind the user’s fridge" policy, "Fridge-Driven Development".<p>A policy that has been dutifully and scrupulously observed by all agents since, across all projects. Unlike my original clear, comprehensive, infrastructure guidelines.</p>
]]></description><pubDate>Fri, 12 Jun 2026 18:02:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=48507359</link><dc:creator>ElFitz</dc:creator><comments>https://news.ycombinator.com/item?id=48507359</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48507359</guid></item><item><title><![CDATA[New comment by ElFitz in "If you are asking for human attention, demonstrate human effort"]]></title><description><![CDATA[
<p>I’ve been making Codex and Claude get their work reviewed by most recent best performing model of their own family, and each other’s, for months.<p>On top of that, we have been running multi-model AI reviews on every PR through their respective GitHub integrations (Codex, Gemini, Copilot, Greptile, CodeRabbit).<p>They never fully overlap, and yet  they somehow usually all miss the same things. The most significant improvement came from having agents commit their plan along with their work.<p>On the upside, it means I get to focus my reviews on different things.</p>
]]></description><pubDate>Fri, 12 Jun 2026 06:32:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=48500620</link><dc:creator>ElFitz</dc:creator><comments>https://news.ycombinator.com/item?id=48500620</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48500620</guid></item><item><title><![CDATA[New comment by ElFitz in "Claude Fable 5: mid-tier results on coding tasks"]]></title><description><![CDATA[
<p>> and I have the feeling that the harness is much more important than the consensus expectation.<p>Is that really the consensus? There’s been a bit of literature lately on that. Can’t find the one about looking into whether or not the harness had a greater impact than the models (for comparable models), but there’s this one: <a href="https://arxiv.org/html/2605.23950" rel="nofollow">https://arxiv.org/html/2605.23950</a></p>
]]></description><pubDate>Thu, 11 Jun 2026 22:53:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=48497508</link><dc:creator>ElFitz</dc:creator><comments>https://news.ycombinator.com/item?id=48497508</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48497508</guid></item><item><title><![CDATA[New comment by ElFitz in "Claude Fable 5: mid-tier results on coding tasks"]]></title><description><![CDATA[
<p>I’ve had a similar experience.<p>Pointing out past suboptimal / failing behaviours to new opus sessions would almost always actually create a sort of "anchoring bias" that would drive the agents towards exhibiting the failure mode (often while mentioning how it wouldn’t fall for it).<p>As far as I can recall, Fable has been the first model to discover the documented failure modes, comment on them, and just… keep going, actually avoiding them. Quite a surprise.</p>
]]></description><pubDate>Thu, 11 Jun 2026 22:39:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=48497379</link><dc:creator>ElFitz</dc:creator><comments>https://news.ycombinator.com/item?id=48497379</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48497379</guid></item><item><title><![CDATA[New comment by ElFitz in "Surprise, pay $1000"]]></title><description><![CDATA[
<p>I <i>love</i> hard caps, and am tired of cloud services not even offering those. May make sense for large companies. Makes no sense for hobbyists and small companies. But maybe that's the point?</p>
]]></description><pubDate>Wed, 10 Jun 2026 14:09:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=48476553</link><dc:creator>ElFitz</dc:creator><comments>https://news.ycombinator.com/item?id=48476553</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48476553</guid></item><item><title><![CDATA[New comment by ElFitz in "Claude Fable 5"]]></title><description><![CDATA[
<p>That’s what evals are for.<p>And there’s no reason evals can’t be done on multi-turn agents in a loop (or not): it’s pretty much what all these benchmarks do.</p>
]]></description><pubDate>Wed, 10 Jun 2026 07:37:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=48472824</link><dc:creator>ElFitz</dc:creator><comments>https://news.ycombinator.com/item?id=48472824</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48472824</guid></item><item><title><![CDATA[New comment by ElFitz in "Apple reveals new AI architecture built around Google Gemini models"]]></title><description><![CDATA[
<p>I remember Windows Phone 8 <i>existed</i>, but that’s pretty much it. And yes, that’s the big question: what’s in it for the app publishers?<p>:/</p>
]]></description><pubDate>Tue, 09 Jun 2026 08:53:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=48458466</link><dc:creator>ElFitz</dc:creator><comments>https://news.ycombinator.com/item?id=48458466</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48458466</guid></item><item><title><![CDATA[New comment by ElFitz in "Apple Core AI Framework"]]></title><description><![CDATA[
<p>A few months back someone reverse-engineered private ANE APIs and shown some significant performance improvements compared to CoreML and Metal, on both inference and training.<p>- <a href="https://maderix.substack.com/p/inside-the-m4-apple-neural-engine" rel="nofollow">https://maderix.substack.com/p/inside-the-m4-apple-neural-en...</a><p>- <a href="https://news.ycombinator.com/item?id=47257931">https://news.ycombinator.com/item?id=47257931</a></p>
]]></description><pubDate>Tue, 09 Jun 2026 05:35:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=48456917</link><dc:creator>ElFitz</dc:creator><comments>https://news.ycombinator.com/item?id=48456917</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48456917</guid></item><item><title><![CDATA[New comment by ElFitz in "Apple Core AI Framework"]]></title><description><![CDATA[
<p>Oh, neat. Totally missed it, thanks!</p>
]]></description><pubDate>Tue, 09 Jun 2026 05:28:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=48456865</link><dc:creator>ElFitz</dc:creator><comments>https://news.ycombinator.com/item?id=48456865</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48456865</guid></item><item><title><![CDATA[New comment by ElFitz in "Apple reveals new AI architecture built around Google Gemini models"]]></title><description><![CDATA[
<p>Yes, but you can have this by providing surfaces as UI components to Siri.<p>Not sure if they do that (yet), but no reason an app couldn’t expose "Here’s what you can use to present data of shape X", or "here’s a UI for process doing y".<p>It feels like turning the common approach inside-out. But it works.<p>Edit: you could even imagine, in that world, apps that only expose surfaces, composable UI libraries,  multi-step flows, declaring what they’re for, what kind of inputs they take, and what output they produce. Without ever owning any of the data (eg flights data, hotels inventory, booked trips, financial data, etc) or capabilities (eg book a flight).</p>
]]></description><pubDate>Tue, 09 Jun 2026 05:13:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=48456741</link><dc:creator>ElFitz</dc:creator><comments>https://news.ycombinator.com/item?id=48456741</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48456741</guid></item><item><title><![CDATA[New comment by ElFitz in "Apple reveals new AI architecture built around Google Gemini models"]]></title><description><![CDATA[
<p>I used to wonder what "apps" might become in an "App Intent-first" world.<p>Bundles that provide data and capabilities to iOS and Siri? And perhaps libraries of UI components to display and interact with said data?<p>But then, if that works really well, and gets strong adoption, why ever open the app? What’s the point of having navigation flows inside an app? Could one make entire apps solely dedicated to providing a set of data, capabilities, and UI components to the system?<p>In that world, what drives user retention, for such apps? What even <i>is</i> an app? App engagement disappears as well.<p>And that’s not even diving into the use-case of Siri, say, planning a trip across five different apps (flights, hotel, restaurants, whatever) using just App Intents. If done well.<p>In that world, do most apps just become plugins, providers for Siri?</p>
]]></description><pubDate>Tue, 09 Jun 2026 05:01:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=48456658</link><dc:creator>ElFitz</dc:creator><comments>https://news.ycombinator.com/item?id=48456658</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48456658</guid></item><item><title><![CDATA[New comment by ElFitz in "Ask HN: What are tools you have made for yourself since the advent of AI?"]]></title><description><![CDATA[
<p>Wait. You worked on Guild Wars, Starcraft, Warcraft, and Diablo?<p>This place is incredible.</p>
]]></description><pubDate>Tue, 09 Jun 2026 04:52:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=48456592</link><dc:creator>ElFitz</dc:creator><comments>https://news.ycombinator.com/item?id=48456592</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48456592</guid></item><item><title><![CDATA[New comment by ElFitz in "Ask HN: What are tools you have made for yourself since the advent of AI?"]]></title><description><![CDATA[
<p>I’m working on Descartes[^0]. First to help diagnose what’s wrong with a machine. Later to help manage and monitor it by letting an agent build layers of tailor-made deterministic rules and statistical models, a bit like the description of the ship’s AI at the beginning of Absolution Gap. And serve as an ops "point of contact" for other agents for the machine / fleet of machines it’s in charge of.<p>Sometimes I daydream that the end goal demo would be to set one up in a VM and let a sort of ransomware lose in it, and see if the agent can identify what’s going on and react in time.<p>But for now I’m fighting Apple notarization to enable local notifications on macOS.<p>[0]: <a href="https://github.com/lightless-labs/descartes" rel="nofollow">https://github.com/lightless-labs/descartes</a></p>
]]></description><pubDate>Mon, 08 Jun 2026 23:34:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=48453954</link><dc:creator>ElFitz</dc:creator><comments>https://news.ycombinator.com/item?id=48453954</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48453954</guid></item><item><title><![CDATA[New comment by ElFitz in "AI Is Slowing Down"]]></title><description><![CDATA[
<p>Perhaps that’s it. I would tend to agree with his position, I think, but don’t appreciate being preached to. Even less so when I agree with what’s being said.</p>
]]></description><pubDate>Mon, 08 Jun 2026 17:24:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=48448250</link><dc:creator>ElFitz</dc:creator><comments>https://news.ycombinator.com/item?id=48448250</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48448250</guid></item></channel></rss>