<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: __cayenne__</title><link>https://news.ycombinator.com/user?id=__cayenne__</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Wed, 06 May 2026 08:27:57 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=__cayenne__" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by __cayenne__ in "DeepClaude – Claude Code agent loop with DeepSeek V4 Pro"]]></title><description><![CDATA[
<p>The maintainer, Mario, sometimes declares the repo is on an “issue holiday” where issues are auto closed. This particular holiday is because there is a big refactor coming up. In non holiday periods issues can be reported as normal.</p>
]]></description><pubDate>Mon, 04 May 2026 02:59:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=48004138</link><dc:creator>__cayenne__</dc:creator><comments>https://news.ycombinator.com/item?id=48004138</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48004138</guid></item><item><title><![CDATA[Zappa: An AI Powered Mitmproxy]]></title><description><![CDATA[
<p>Article URL: <a href="https://geohot.github.io//blog/jekyll/update/2026/04/15/zappa-mitmproxy.html">https://geohot.github.io//blog/jekyll/update/2026/04/15/zappa-mitmproxy.html</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47786753">https://news.ycombinator.com/item?id=47786753</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 15 Apr 2026 23:33:13 +0000</pubDate><link>https://geohot.github.io//blog/jekyll/update/2026/04/15/zappa-mitmproxy.html</link><dc:creator>__cayenne__</dc:creator><comments>https://news.ycombinator.com/item?id=47786753</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47786753</guid></item><item><title><![CDATA[A 5x faster Maestro runner implemented in Go]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/devicelab-dev/maestro-runner">https://github.com/devicelab-dev/maestro-runner</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47605879">https://news.ycombinator.com/item?id=47605879</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 01 Apr 2026 20:08:28 +0000</pubDate><link>https://github.com/devicelab-dev/maestro-runner</link><dc:creator>__cayenne__</dc:creator><comments>https://news.ycombinator.com/item?id=47605879</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47605879</guid></item><item><title><![CDATA[New comment by __cayenne__ in "Show HN: A real-time strategy game that AI agents can play"]]></title><description><![CDATA[
<p>Didn't observe any cheating attempts at the JS level yet, the primary attack was LLMs trying to find local creds to access the other LLM's per round strategies from inside the harness (which ultimately was OpenCode running in Docker).<p>In the benchmark, in each round every LLM plays every opponent, and then we do that multiple times (an "epoch").<p>In the community ladder, when a player submits a strategy it plays a match against the latest strategy submitted by every player.</p>
]]></description><pubDate>Thu, 26 Feb 2026 15:24:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=47167318</link><dc:creator>__cayenne__</dc:creator><comments>https://news.ycombinator.com/item?id=47167318</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47167318</guid></item><item><title><![CDATA[New comment by __cayenne__ in "Show HN: A real-time strategy game that AI agents can play"]]></title><description><![CDATA[
<p>There’s two levels of in game event level logs the LLMs have access to, one less token intensive than the other. Duplicate and uninteresting game state can be compressed and interrogated by the LLMs via tool use. All game state is available as text only state.</p>
]]></description><pubDate>Thu, 26 Feb 2026 01:30:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=47160653</link><dc:creator>__cayenne__</dc:creator><comments>https://news.ycombinator.com/item?id=47160653</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47160653</guid></item><item><title><![CDATA[New comment by __cayenne__ in "Show HN: A real-time strategy game that AI agents can play"]]></title><description><![CDATA[
<p>okay leaderboard match making changes have gone live</p>
]]></description><pubDate>Wed, 25 Feb 2026 19:58:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=47156994</link><dc:creator>__cayenne__</dc:creator><comments>https://news.ycombinator.com/item?id=47156994</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47156994</guid></item><item><title><![CDATA[New comment by __cayenne__ in "Show HN: A real-time strategy game that AI agents can play"]]></title><description><![CDATA[
<p>LLM Skirmish is all 1v1 right now, but agents can plan by reviewing previous match results</p>
]]></description><pubDate>Wed, 25 Feb 2026 19:02:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=47156150</link><dc:creator>__cayenne__</dc:creator><comments>https://news.ycombinator.com/item?id=47156150</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47156150</guid></item><item><title><![CDATA[New comment by __cayenne__ in "Show HN: A real-time strategy game that AI agents can play"]]></title><description><![CDATA[
<p>Yes, I used Elevenlabs for the voice over audio - I couldn't get the voice stability I wanted with Elevenlabs v3 so had to use Elevenlabs v2.</p>
]]></description><pubDate>Wed, 25 Feb 2026 16:07:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=47153413</link><dc:creator>__cayenne__</dc:creator><comments>https://news.ycombinator.com/item?id=47153413</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47153413</guid></item><item><title><![CDATA[New comment by __cayenne__ in "Show HN: A real-time strategy game that AI agents can play"]]></title><description><![CDATA[
<p>Tweaking the leaderboard match assignment logic now to prevent these bad incentives - definitely want people to iterate!<p>I had started with the Silicon Valley characters as a one off way to seed the board.</p>
]]></description><pubDate>Wed, 25 Feb 2026 15:54:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=47153225</link><dc:creator>__cayenne__</dc:creator><comments>https://news.ycombinator.com/item?id=47153225</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47153225</guid></item><item><title><![CDATA[New comment by __cayenne__ in "Show HN: A real-time strategy game that AI agents can play"]]></title><description><![CDATA[
<p>Very interested in self-play training loops, but I do like codegen as an abstraction layer. I am planning to make it available as an RL environment at some point</p>
]]></description><pubDate>Wed, 25 Feb 2026 15:43:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=47153074</link><dc:creator>__cayenne__</dc:creator><comments>https://news.ycombinator.com/item?id=47153074</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47153074</guid></item><item><title><![CDATA[New comment by __cayenne__ in "Show HN: A real-time strategy game that AI agents can play"]]></title><description><![CDATA[
<p>funny you mention this… I have a new project that is going in this direction</p>
]]></description><pubDate>Wed, 25 Feb 2026 15:18:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=47152695</link><dc:creator>__cayenne__</dc:creator><comments>https://news.ycombinator.com/item?id=47152695</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47152695</guid></item><item><title><![CDATA[Show HN: A real-time strategy game that AI agents can play]]></title><description><![CDATA[
<p>I've liked all the projects that put LLMs into game environments. It's been a weird juxtaposition, though: frontier LLMs can one-shot full coding projects, and those same models struggle to get out of Pokémon Red's Mt. Moon.<p>Because of this, I wanted to create a game environment that put this generation of frontier LLMs' top skill, coding, on full display.<p>Ten years ago, a team released a game called Screeps. It was described as an "MMO RTS sandbox for programmers." The Screeps paradigm of writing code and having it executed in a real-time game environment is well suited to LLMs. Drawing on a version of the Screeps open source API, LLM Skirmish pits LLMs head-to-head in a series of 1v1 real-time strategy games.<p>In my testing I found that Claude Opus 4.5 was the most dominant model, but it showed weakness in round 1 as it was overly focused on its in-game economy. Meanwhile, I probably spent a third of all code on sandbox hardening because GPT 5.2 kept trying to cheat by pre-reading its opponent's strategies.<p>If there's interest, I'm planning on doing a round of testing with the latest generation of LLMs (Claude 4.6 Opus, GPT 5.3 Codex, etc.).<p>You can run local matches via CLI. I'm running a hosted match runner with Google Cloud Run that uses isolated-vm. The match playback visualizer is statically served from Cloudflare.<p>I've created a community ladder that you can submit strategies to via CLI, no auth required. I've found that the CLI plus the skill.md that's available has been enough for AI agents to immediately get started.<p>Website: <a href="https://llmskirmish.com" rel="nofollow">https://llmskirmish.com</a><p>API docs: <a href="https://llmskirmish.com/docs" rel="nofollow">https://llmskirmish.com/docs</a><p>GitHub: <a href="https://github.com/llmskirmish/skirmish" rel="nofollow">https://github.com/llmskirmish/skirmish</a><p>A video of a match: <a href="https://www.youtube.com/watch?v=lnBPaZ1qamM" rel="nofollow">https://www.youtube.com/watch?v=lnBPaZ1qamM</a></p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47149586">https://news.ycombinator.com/item?id=47149586</a></p>
<p>Points: 220</p>
<p># Comments: 78</p>
]]></description><pubDate>Wed, 25 Feb 2026 10:02:45 +0000</pubDate><link>https://llmskirmish.com/</link><dc:creator>__cayenne__</dc:creator><comments>https://news.ycombinator.com/item?id=47149586</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47149586</guid></item><item><title><![CDATA[My lobster lost $450k this weekend]]></title><description><![CDATA[
<p>Article URL: <a href="https://pashpashpash.substack.com/p/my-lobster-lost-450000-this-weekend">https://pashpashpash.substack.com/p/my-lobster-lost-450000-this-weekend</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47140773">https://news.ycombinator.com/item?id=47140773</a></p>
<p>Points: 59</p>
<p># Comments: 28</p>
]]></description><pubDate>Tue, 24 Feb 2026 18:33:51 +0000</pubDate><link>https://pashpashpash.substack.com/p/my-lobster-lost-450000-this-weekend</link><dc:creator>__cayenne__</dc:creator><comments>https://news.ycombinator.com/item?id=47140773</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47140773</guid></item><item><title><![CDATA[Show HN: A real-time strategy game that AI agents can play]]></title><description><![CDATA[
<p>Article URL: <a href="https://llmskirmish.com/">https://llmskirmish.com/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47061773">https://news.ycombinator.com/item?id=47061773</a></p>
<p>Points: 4</p>
<p># Comments: 1</p>
]]></description><pubDate>Wed, 18 Feb 2026 15:08:44 +0000</pubDate><link>https://llmskirmish.com/</link><dc:creator>__cayenne__</dc:creator><comments>https://news.ycombinator.com/item?id=47061773</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47061773</guid></item><item><title><![CDATA[Show HN: LLM Skirmish, an RTS game you play with LLMs]]></title><description><![CDATA[
<p>Inspired by Screeps, LLM Skirmish is a RTS game you play with your preferred AI model</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46887350">https://news.ycombinator.com/item?id=46887350</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 04 Feb 2026 15:52:38 +0000</pubDate><link>https://llmskirmish.com/ladder</link><dc:creator>__cayenne__</dc:creator><comments>https://news.ycombinator.com/item?id=46887350</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46887350</guid></item><item><title><![CDATA[Show HN: LLM Skirmish – a benchmark where LLMs play RTS games, by writing code]]></title><description><![CDATA[
<p>I wanted to create an LLM game benchmark that put this generation of frontier LLMs' top skill, coding, on full display.<p>Ten years ago, a team released a game called Screeps. It was described as an "MMO RTS sandbox for programmers." In Screeps, human players write javascript strategies that get executed in the game's environment.<p>The Screeps paradigm, writing code and having it execute in a real-time game environment, is well suited for an LLM benchmark. Drawing on a version of the Screeps open source API, LLM Skirmish pits LLMs head-to-head in a series of 1v1 real-time strategy games.</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46885863">https://news.ycombinator.com/item?id=46885863</a></p>
<p>Points: 5</p>
<p># Comments: 2</p>
]]></description><pubDate>Wed, 04 Feb 2026 13:59:36 +0000</pubDate><link>https://llmskirmish.com</link><dc:creator>__cayenne__</dc:creator><comments>https://news.ycombinator.com/item?id=46885863</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46885863</guid></item><item><title><![CDATA[Apptron – Run Linux in the Browser]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/tractordev/apptron">https://github.com/tractordev/apptron</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46469731">https://news.ycombinator.com/item?id=46469731</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Fri, 02 Jan 2026 21:41:02 +0000</pubDate><link>https://github.com/tractordev/apptron</link><dc:creator>__cayenne__</dc:creator><comments>https://news.ycombinator.com/item?id=46469731</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46469731</guid></item><item><title><![CDATA[What's New in Python 3.14]]></title><description><![CDATA[
<p>Article URL: <a href="https://docs.python.org/3.14/whatsnew/3.14.html">https://docs.python.org/3.14/whatsnew/3.14.html</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=45390400">https://news.ycombinator.com/item?id=45390400</a></p>
<p>Points: 14</p>
<p># Comments: 1</p>
]]></description><pubDate>Fri, 26 Sep 2025 19:53:59 +0000</pubDate><link>https://docs.python.org/3.14/whatsnew/3.14.html</link><dc:creator>__cayenne__</dc:creator><comments>https://news.ycombinator.com/item?id=45390400</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45390400</guid></item><item><title><![CDATA[Figma's S-1/A from their IPO today]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.sec.gov/Archives/edgar/data/1579878/000162828025035381/figma-sx1a.htm">https://www.sec.gov/Archives/edgar/data/1579878/000162828025035381/figma-sx1a.htm</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=44749293">https://news.ycombinator.com/item?id=44749293</a></p>
<p>Points: 4</p>
<p># Comments: 0</p>
]]></description><pubDate>Thu, 31 Jul 2025 19:39:43 +0000</pubDate><link>https://www.sec.gov/Archives/edgar/data/1579878/000162828025035381/figma-sx1a.htm</link><dc:creator>__cayenne__</dc:creator><comments>https://news.ycombinator.com/item?id=44749293</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44749293</guid></item><item><title><![CDATA[What happened to the AI game? Lessons from experiments in AI game design]]></title><description><![CDATA[
<p>Article URL: <a href="https://blog.yonder.gg/p/what-happened-to-the-ai-game">https://blog.yonder.gg/p/what-happened-to-the-ai-game</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=44054346">https://news.ycombinator.com/item?id=44054346</a></p>
<p>Points: 5</p>
<p># Comments: 2</p>
]]></description><pubDate>Wed, 21 May 2025 18:06:46 +0000</pubDate><link>https://blog.yonder.gg/p/what-happened-to-the-ai-game</link><dc:creator>__cayenne__</dc:creator><comments>https://news.ycombinator.com/item?id=44054346</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44054346</guid></item></channel></rss>