<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: collinwilkins</title><link>https://news.ycombinator.com/user?id=collinwilkins</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Mon, 04 May 2026 01:11:35 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=collinwilkins" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by collinwilkins in "Expensively Quadratic: The LLM Agent Cost Curve"]]></title><description><![CDATA[
<p>what i've learned running multi-agent workflows... 
>use the expensive models for planning/design and the cheaper models for implementation
>stick with small/tightly scoped requests
>clear the context window often and let the AGENTS.md files control the basics</p>
]]></description><pubDate>Mon, 16 Feb 2026 16:59:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=47037381</link><dc:creator>collinwilkins</dc:creator><comments>https://news.ycombinator.com/item?id=47037381</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47037381</guid></item><item><title><![CDATA[New comment by collinwilkins in "Qwen3.5: Towards Native Multimodal Agents"]]></title><description><![CDATA[
<p>at this point it seems every new model scores within a few points of each other on SWE-bench. the actual differentiator is how well it handles multi-step tool use without losing the plot halfway through and how well it works with an existing stack</p>
]]></description><pubDate>Mon, 16 Feb 2026 16:54:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=47037310</link><dc:creator>collinwilkins</dc:creator><comments>https://news.ycombinator.com/item?id=47037310</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47037310</guid></item></channel></rss>