<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: aszen</title><link>https://news.ycombinator.com/user?id=aszen</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sun, 26 Apr 2026 09:54:40 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=aszen" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by aszen in "An update on recent Claude Code quality reports"]]></title><description><![CDATA[
<p>Claude code is not infra, the model is the infra. They changed settings to make their models faster and probably cheaper to run too. Honestly with adaptive thinking it no longer matters what model it is if you can dynamically make it do less or more work.</p>
]]></description><pubDate>Thu, 23 Apr 2026 18:16:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=47879329</link><dc:creator>aszen</dc:creator><comments>https://news.ycombinator.com/item?id=47879329</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47879329</guid></item><item><title><![CDATA[New comment by aszen in "Parallel agents in Zed"]]></title><description><![CDATA[
<p>Same here. Reviewing gets harder too and multi tasking kills any kind of productivity if you need to review the code then.<p>My approach these days is to do one change at a time, until I can fully merge it with confidence.</p>
]]></description><pubDate>Thu, 23 Apr 2026 12:48:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=47875113</link><dc:creator>aszen</dc:creator><comments>https://news.ycombinator.com/item?id=47875113</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47875113</guid></item><item><title><![CDATA[New comment by aszen in "Show HN: Mdarena – Benchmark your Claude.md against your own PRs"]]></title><description><![CDATA[
<p>This is quite interesting, will try it. I kind of expect this to be done continuously as the code base changes.</p>
]]></description><pubDate>Mon, 06 Apr 2026 09:03:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=47658484</link><dc:creator>aszen</dc:creator><comments>https://news.ycombinator.com/item?id=47658484</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47658484</guid></item><item><title><![CDATA[New comment by aszen in "Closed Source vs. Open Source AI: A Cage Fight Few People Understand"]]></title><description><![CDATA[
<p>This article doesn't mention the moat of data gathering, frontier AI labs have a huge advantage in curating proprietary datasets from actual usage of their platforms.<p>This in turn allows them to optimize their models for the long tail of tasks that open weight models can't compete with.<p>Another factor is that pure intelligence isn't enough, how the model communicates is a huge plus. An enterprise used to talking to Claude all day won't be easy to switch to another model</p>
]]></description><pubDate>Thu, 26 Mar 2026 19:21:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=47534523</link><dc:creator>aszen</dc:creator><comments>https://news.ycombinator.com/item?id=47534523</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47534523</guid></item><item><title><![CDATA[New comment by aszen in "Ask HN: How are people doing AI evals these days?"]]></title><description><![CDATA[
<p>Seems like you are testing llms genric abilities rather than your actual agent logic.<p>Llms are like vendor code you don't need to test them yourself people already created benchmarks for that.</p>
]]></description><pubDate>Wed, 11 Mar 2026 07:54:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=47332760</link><dc:creator>aszen</dc:creator><comments>https://news.ycombinator.com/item?id=47332760</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47332760</guid></item><item><title><![CDATA[New comment by aszen in "The L in "LLM" Stands for Lying"]]></title><description><![CDATA[
<p>If you buy real handcrafted scarves they are both thinner and warmer than anything factory made bcz of their choice of pashmina wool.</p>
]]></description><pubDate>Thu, 05 Mar 2026 15:41:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=47262920</link><dc:creator>aszen</dc:creator><comments>https://news.ycombinator.com/item?id=47262920</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47262920</guid></item><item><title><![CDATA[New comment by aszen in "Improving 15 LLMs at Coding in One Afternoon. Only the Harness Changed"]]></title><description><![CDATA[
<p>So the new implementation always operates at the line level, replacing one or more lines. That's not ideal for some refactorings like rename where search and replace is faster.<p>Edit<p>Checking ohmypi The model has access to str replace too so this is just a edit till</p>
]]></description><pubDate>Thu, 12 Feb 2026 16:08:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=46990551</link><dc:creator>aszen</dc:creator><comments>https://news.ycombinator.com/item?id=46990551</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46990551</guid></item><item><title><![CDATA[New comment by aszen in "After two years of vibecoding, I'm back to writing by hand [video]"]]></title><description><![CDATA[
<p>I bet writing the code directly could have been even faster, llms aren't magically fast</p>
]]></description><pubDate>Sat, 24 Jan 2026 16:51:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=46745113</link><dc:creator>aszen</dc:creator><comments>https://news.ycombinator.com/item?id=46745113</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46745113</guid></item><item><title><![CDATA[New comment by aszen in "Bubblewrap: A nimble way to prevent agents from accessing your .env files"]]></title><description><![CDATA[
<p><a href="https://devenv.sh/integrations/secretspec/" rel="nofollow">https://devenv.sh/integrations/secretspec/</a></p>
]]></description><pubDate>Fri, 16 Jan 2026 18:49:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=46650343</link><dc:creator>aszen</dc:creator><comments>https://news.ycombinator.com/item?id=46650343</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46650343</guid></item><item><title><![CDATA[New comment by aszen in "Bubblewrap: A nimble way to prevent agents from accessing your .env files"]]></title><description><![CDATA[
<p>I wonder why we are even storing secrets in .env files in plain text</p>
]]></description><pubDate>Thu, 15 Jan 2026 09:14:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=46630060</link><dc:creator>aszen</dc:creator><comments>https://news.ycombinator.com/item?id=46630060</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46630060</guid></item><item><title><![CDATA[New comment by aszen in "AI is a business model stress test"]]></title><description><![CDATA[
<p>How? You don't know what the llm was trained on and don't know if it has any bias. 
Imo llms are a disaster for knowledge work because they act like a black box.</p>
]]></description><pubDate>Sun, 11 Jan 2026 09:52:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=46574102</link><dc:creator>aszen</dc:creator><comments>https://news.ycombinator.com/item?id=46574102</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46574102</guid></item><item><title><![CDATA[New comment by aszen in "Fly's Sprites.dev addresses dev environment sandboxes and API sandboxes together"]]></title><description><![CDATA[
<p>Stupid question but why not use a local sandbox for yolo mode instead of a remote machine.<p>Is there a similar service that runs locally?</p>
]]></description><pubDate>Sun, 11 Jan 2026 01:44:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=46571954</link><dc:creator>aszen</dc:creator><comments>https://news.ycombinator.com/item?id=46571954</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46571954</guid></item><item><title><![CDATA[New comment by aszen in "How to code Claude Code in 200 lines of code"]]></title><description><![CDATA[
<p>Agreed it probably contributes to the model improving for all agents but crucially it is verifiably better against their own agent. So they get a good feedback loop to improve both</p>
]]></description><pubDate>Thu, 08 Jan 2026 23:20:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=46547920</link><dc:creator>aszen</dc:creator><comments>https://news.ycombinator.com/item?id=46547920</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46547920</guid></item><item><title><![CDATA[New comment by aszen in "How to code Claude Code in 200 lines of code"]]></title><description><![CDATA[
<p>They nailed down the UX I would say and the models themselves are a lot better even outside of CC</p>
]]></description><pubDate>Thu, 08 Jan 2026 21:54:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=46547031</link><dc:creator>aszen</dc:creator><comments>https://news.ycombinator.com/item?id=46547031</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46547031</guid></item><item><title><![CDATA[New comment by aszen in "How to code Claude Code in 200 lines of code"]]></title><description><![CDATA[
<p>Yeah that's one example, but I suspect they train the model on entire sequences of tool calls, so unless you prompt the model exactly as them you won't get the same results.<p>There's a reason they won the agent race, their models are trained to use their own tools.</p>
]]></description><pubDate>Thu, 08 Jan 2026 21:52:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=46547014</link><dc:creator>aszen</dc:creator><comments>https://news.ycombinator.com/item?id=46547014</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46547014</guid></item><item><title><![CDATA[New comment by aszen in "OpenAPI Isn't Enough"]]></title><description><![CDATA[
<p>Seems odd to not mention other semantic standards that standardize resource operations like pagination, sorting etc.<p>Jsonld, json:api</p>
]]></description><pubDate>Thu, 08 Jan 2026 21:40:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=46546883</link><dc:creator>aszen</dc:creator><comments>https://news.ycombinator.com/item?id=46546883</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46546883</guid></item><item><title><![CDATA[New comment by aszen in "How to code Claude Code in 200 lines of code"]]></title><description><![CDATA[
<p>The most imp part is editing code, to do that reliably Claude models are trained on their own str replace tool schema I think. Models find it hard to modify existing code, they also can't just rewrite whole files bcz that's expensive and doesn't scale.</p>
]]></description><pubDate>Thu, 08 Jan 2026 21:26:23 +0000</pubDate><link>https://news.ycombinator.com/item?id=46546714</link><dc:creator>aszen</dc:creator><comments>https://news.ycombinator.com/item?id=46546714</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46546714</guid></item><item><title><![CDATA[New comment by aszen in "How to code Claude Code in 200 lines of code"]]></title><description><![CDATA[
<p>Can you elaborate on this?</p>
]]></description><pubDate>Thu, 08 Jan 2026 21:23:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=46546686</link><dc:creator>aszen</dc:creator><comments>https://news.ycombinator.com/item?id=46546686</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46546686</guid></item><item><title><![CDATA[New comment by aszen in "How to code Claude Code in 200 lines of code"]]></title><description><![CDATA[
<p>I don't think so, model improvements far outweigh any harness or tooling.<p>Look at <a href="https://github.com/SWE-agent/mini-swe-agent" rel="nofollow">https://github.com/SWE-agent/mini-swe-agent</a> for proof</p>
]]></description><pubDate>Thu, 08 Jan 2026 21:18:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=46546628</link><dc:creator>aszen</dc:creator><comments>https://news.ycombinator.com/item?id=46546628</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46546628</guid></item><item><title><![CDATA[New comment by aszen in "The rise of industrial software"]]></title><description><![CDATA[
<p>A custom ERP solution won't work because of audits, you need standard solutions at large scale. That's why even Google uses oracles ERP</p>
]]></description><pubDate>Thu, 01 Jan 2026 15:06:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=46454672</link><dc:creator>aszen</dc:creator><comments>https://news.ycombinator.com/item?id=46454672</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46454672</guid></item></channel></rss>