<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: joshuaisaact</title><link>https://news.ycombinator.com/user?id=joshuaisaact</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Fri, 05 Jun 2026 03:19:41 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=joshuaisaact" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by joshuaisaact in "Embarrassingly simple self-distillation improves code generation"]]></title><description><![CDATA[
<p>This was a really interesting paper but there's a massive gap in what they didn't try, which is inference-time temperature changes based on the fork/lock distinction.<p>Maybe I'll try that myself, because it feels like it could be a great source of improvements. It would be really useful to see adaptive per-token sampling as an additional decode-only baseline.</p>
]]></description><pubDate>Sun, 05 Apr 2026 07:50:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=47647147</link><dc:creator>joshuaisaact</dc:creator><comments>https://news.ycombinator.com/item?id=47647147</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47647147</guid></item><item><title><![CDATA[New comment by joshuaisaact in "European Tech Alternatives"]]></title><description><![CDATA[
<p>ASML is European and is arguably the most strategically important company in the entire semiconductor supply chain.</p>
]]></description><pubDate>Thu, 19 Feb 2026 06:34:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=47070608</link><dc:creator>joshuaisaact</dc:creator><comments>https://news.ycombinator.com/item?id=47070608</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47070608</guid></item><item><title><![CDATA[New comment by joshuaisaact in "European Tech Alternatives"]]></title><description><![CDATA[
<p>Have you heard of a little company called Arm Holdings?<p>It was a travesty that the UK government let it be sold, admittedly.</p>
]]></description><pubDate>Thu, 19 Feb 2026 06:32:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=47070589</link><dc:creator>joshuaisaact</dc:creator><comments>https://news.ycombinator.com/item?id=47070589</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47070589</guid></item><item><title><![CDATA[New comment by joshuaisaact in "I proved my AI agent can't skip the approval step (196 states, zero bypasses)"]]></title><description><![CDATA[
<p>I've been exploring Petri nets as a formalism for AI agent safety, specifically, proving properties like termination and human-gate enforcement exhaustively across every reachable state, rather than testing them on sample inputs. This post benchmarks the approach against n8n and ReAct on the same agent. Tomorrow I'm open-sourcing the engine as a declarative rules DSL.</p>
]]></description><pubDate>Mon, 16 Feb 2026 14:15:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=47035216</link><dc:creator>joshuaisaact</dc:creator><comments>https://news.ycombinator.com/item?id=47035216</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47035216</guid></item><item><title><![CDATA[I proved my AI agent can't skip the approval step (196 states, zero bypasses)]]></title><description><![CDATA[
<p>Article URL: <a href="https://joshtuddenham.dev/blog/agent-safety/">https://joshtuddenham.dev/blog/agent-safety/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47035215">https://news.ycombinator.com/item?id=47035215</a></p>
<p>Points: 1</p>
<p># Comments: 1</p>
]]></description><pubDate>Mon, 16 Feb 2026 14:15:47 +0000</pubDate><link>https://joshtuddenham.dev/blog/agent-safety/</link><dc:creator>joshuaisaact</dc:creator><comments>https://news.ycombinator.com/item?id=47035215</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47035215</guid></item><item><title><![CDATA[New comment by joshuaisaact in "Ask HN: Are you using an agent orchestrator to write code?"]]></title><description><![CDATA[
<p>I don't think you need two separate models for this - I get similarly good results re-prompting with Claude. Well, not re-prompting, I just have a skill that wipes the context then gets Claude to review the current PR and make improvements before I review it.</p>
]]></description><pubDate>Fri, 13 Feb 2026 05:48:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=46999350</link><dc:creator>joshuaisaact</dc:creator><comments>https://news.ycombinator.com/item?id=46999350</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46999350</guid></item><item><title><![CDATA[You're Building Petri Nets. You're Just Building Them Badly]]></title><description><![CDATA[
<p>Article URL: <a href="https://joshtuddenham.dev/blog/petri-nets/">https://joshtuddenham.dev/blog/petri-nets/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46999339">https://news.ycombinator.com/item?id=46999339</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Fri, 13 Feb 2026 05:47:30 +0000</pubDate><link>https://joshtuddenham.dev/blog/petri-nets/</link><dc:creator>joshuaisaact</dc:creator><comments>https://news.ycombinator.com/item?id=46999339</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46999339</guid></item><item><title><![CDATA[Show HN: FizzBuzz Enterprise Edition 2026. AI-powered divisibility detection]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/joshuaisaact/fizzbuzz-enterprise-edition-2026">https://github.com/joshuaisaact/fizzbuzz-enterprise-edition-2026</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46900632">https://news.ycombinator.com/item?id=46900632</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Thu, 05 Feb 2026 15:20:52 +0000</pubDate><link>https://github.com/joshuaisaact/fizzbuzz-enterprise-edition-2026</link><dc:creator>joshuaisaact</dc:creator><comments>https://news.ycombinator.com/item?id=46900632</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46900632</guid></item><item><title><![CDATA[New comment by joshuaisaact in "The future of software engineering is SRE"]]></title><description><![CDATA[
<p>Couldn't disagree with this article more. I think the future of software engineering is more T-shaped.<p>Look at the 'Product Engineer' roles we are seeing spreading in forward-thinking startups and scaleups.<p>That's the future of SWE I think. SWEs take on more PM and design responsibilities as part of the existing role.</p>
]]></description><pubDate>Mon, 26 Jan 2026 06:00:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=46762320</link><dc:creator>joshuaisaact</dc:creator><comments>https://news.ycombinator.com/item?id=46762320</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46762320</guid></item><item><title><![CDATA[New comment by joshuaisaact in "Claude Code's new hidden feature: Swarms"]]></title><description><![CDATA[
<p>Fair push back. The distinction I'm drawing is between:<p>A. Using a role prompt to configure a single function's scope ("you are a code reviewer, focus on X") - totally reasonable, leverages training<p>B. Building an elaborate multi-agent orchestration layer with hand-offs, coordination protocols, and framework abstractions on top of that<p>I'm not arguing against A. I'm arguing that B often adds complexity without proportional benefit, especially as models get better at long-context reasoning.<p>Fairly recent research (arXiv May 2025: "Single-agent or Multi-agent Systems?" - <a href="https://arxiv.org/abs/2505.18286" rel="nofollow">https://arxiv.org/abs/2505.18286</a>) found that MAS benefits over single-agent diminish as LLM capabilities improve. The constraints that motivated swarm architectures are being outpaced by model improvements. I admit the field is moving fast, but the direction of travel appears to be that the better the models get, the simpler your abstractions need to be.<p>So yes, use roles. But maybe don't reach for a framework to orchestrate a PM handing off to an Engineer handing off to QA when a single context with scoped instructions would do.</p>
]]></description><pubDate>Sun, 25 Jan 2026 12:36:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=46753582</link><dc:creator>joshuaisaact</dc:creator><comments>https://news.ycombinator.com/item?id=46753582</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46753582</guid></item><item><title><![CDATA[New comment by joshuaisaact in "Claude Code's new hidden feature: Swarms"]]></title><description><![CDATA[
<p>Fair point on the date - the paper was updated October 2024 with Llama-3 and Qwen2.5 (up to 72B), same findings. The v1 to v3 revision is interesting. They initially found personas helped, then reversed their conclusion after expanding to more models.<p>"Comprehensively disproven" was too strong - should have said "evidence suggests the effect is largely random." There's also Gupta et al. 2024 (arxiv.org/abs/2408.08631) with similar findings if you want more data points.</p>
]]></description><pubDate>Sun, 25 Jan 2026 12:31:27 +0000</pubDate><link>https://news.ycombinator.com/item?id=46753545</link><dc:creator>joshuaisaact</dc:creator><comments>https://news.ycombinator.com/item?id=46753545</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46753545</guid></item><item><title><![CDATA[New comment by joshuaisaact in "Claude Code's new hidden feature: Swarms"]]></title><description><![CDATA[
<p>This has been pretty comprehensively disproven:<p><a href="https://arxiv.org/abs/2311.10054" rel="nofollow">https://arxiv.org/abs/2311.10054</a><p>Key findings:<p>-Tested 162 personas across 6 types of interpersonal relationships and 8 domains of expertise, with 4 LLM families and 2,410 factual questions<p>-Adding personas in system prompts does not improve model performance compared to the control setting where no persona is added<p>-Automatically identifying the best persona is challenging, with predictions often performing no better than random selection<p>-While adding a persona may lead to performance gains in certain settings, the effect of each persona can be largely random<p>Fun piece of trivia - the paper was originally designed to prove the opposite result (that personas make LLMs better). They revised it when they saw the data completely disproved their original hypothesis.</p>
]]></description><pubDate>Sun, 25 Jan 2026 09:00:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=46752125</link><dc:creator>joshuaisaact</dc:creator><comments>https://news.ycombinator.com/item?id=46752125</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46752125</guid></item><item><title><![CDATA[New comment by joshuaisaact in "Claude Code's new hidden feature: Swarms"]]></title><description><![CDATA[
<p>This feels like massively overengineering something very simple.<p>Agents are stateless functions with a limited heap (context window) that degrades in quality as it fills. Once you see it that way, the whole swarm paradigm is just function scoping and memory management cosplaying as an org chart:<p>Agent = function<p>Role = scope constraints<p>Context window = local memory<p>Shared state file = global state<p>Orchestration = control flow<p>The solution isn't assigning human-like roles to stateless functions. It's shared state (a markdown file) and clear constraints.</p>
]]></description><pubDate>Sun, 25 Jan 2026 08:45:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=46752037</link><dc:creator>joshuaisaact</dc:creator><comments>https://news.ycombinator.com/item?id=46752037</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46752037</guid></item><item><title><![CDATA[New comment by joshuaisaact in "Software engineers can no longer neglect their soft skills"]]></title><description><![CDATA[
<p>Fair. I'll retire 'bringing people along with you' before it ends up on a motivational poster with a stock photo of a rowing team.<p>Though you're right that there's no I in team. There is one in AI though, which probably tells us something.</p>
]]></description><pubDate>Sun, 18 Jan 2026 15:04:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=46668343</link><dc:creator>joshuaisaact</dc:creator><comments>https://news.ycombinator.com/item?id=46668343</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46668343</guid></item><item><title><![CDATA[Speed Vertigo: A New Kind of Engineering Debt]]></title><description><![CDATA[
<p>Article URL: <a href="https://joshtuddenham.dev/blog/vertigo/">https://joshtuddenham.dev/blog/vertigo/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46668237">https://news.ycombinator.com/item?id=46668237</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Sun, 18 Jan 2026 14:53:16 +0000</pubDate><link>https://joshtuddenham.dev/blog/vertigo/</link><dc:creator>joshuaisaact</dc:creator><comments>https://news.ycombinator.com/item?id=46668237</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46668237</guid></item><item><title><![CDATA[New comment by joshuaisaact in "Iconify: Library of Open Source Icons"]]></title><description><![CDATA[
<p>This is a brilliant library, thanks so much for sharing it</p>
]]></description><pubDate>Sun, 18 Jan 2026 14:46:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=46668183</link><dc:creator>joshuaisaact</dc:creator><comments>https://news.ycombinator.com/item?id=46668183</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46668183</guid></item><item><title><![CDATA[New comment by joshuaisaact in "Software engineers can no longer neglect their soft skills"]]></title><description><![CDATA[
<p>I may have misread your comment, but I don't think soft skills are a 'narrow thing' at all. Effective communication, building trust, bringing people along with you - these are fundamental to being an effective human, not some niche pivot.</p>
]]></description><pubDate>Sun, 18 Jan 2026 14:45:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=46668167</link><dc:creator>joshuaisaact</dc:creator><comments>https://news.ycombinator.com/item?id=46668167</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46668167</guid></item><item><title><![CDATA[New comment by joshuaisaact in "Software engineers can no longer neglect their soft skills"]]></title><description><![CDATA[
<p>This couldn't ring more true to me - I think one of the consequences of the rapid change in the profession we are seeing is that skills that typically were required only at more senior levels become required further down the stack.<p>If I was a junior today, I'd be studying business impact, effective communication, project management, skills that were previously something you could get away with under-indexing on until senior+.</p>
]]></description><pubDate>Sun, 18 Jan 2026 14:39:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=46668118</link><dc:creator>joshuaisaact</dc:creator><comments>https://news.ycombinator.com/item?id=46668118</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46668118</guid></item><item><title><![CDATA[New comment by joshuaisaact in "Show HN: KeelTest – AI-driven VS Code unit test generator with bug discovery"]]></title><description><![CDATA[
<p>I notice one of the things you don't really talk about in the blog post (or if you did, I missed it) is unnecessary tests, which is one of the key problems LLMs have with test writing.<p>In my experience, if you just ask an LLM to write tests, it'll write you a ton of boilerplate happy path tests that aren't wrong, per se, they're just pointless (one fun one in react is 'the component renders').<p>How do you plan to handle this?</p>
]]></description><pubDate>Wed, 07 Jan 2026 14:51:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=46526999</link><dc:creator>joshuaisaact</dc:creator><comments>https://news.ycombinator.com/item?id=46526999</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46526999</guid></item></channel></rss>