<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: Leynos</title><link>https://news.ycombinator.com/user?id=Leynos</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Tue, 14 Apr 2026 17:14:37 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=Leynos" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by Leynos in "GitHub Stacked PRs"]]></title><description><![CDATA[
<p>I just used it because I preferred the UX.</p>
]]></description><pubDate>Mon, 13 Apr 2026 21:42:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=47758188</link><dc:creator>Leynos</dc:creator><comments>https://news.ycombinator.com/item?id=47758188</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47758188</guid></item><item><title><![CDATA[New comment by Leynos in "Tell HN: OpenAI silently removed Study Mode from ChatGPT"]]></title><description><![CDATA[
<p>They got rid of "nerdy" last month too. Shame</p>
]]></description><pubDate>Mon, 13 Apr 2026 01:07:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=47746328</link><dc:creator>Leynos</dc:creator><comments>https://news.ycombinator.com/item?id=47746328</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47746328</guid></item><item><title><![CDATA[New comment by Leynos in "OpenClaw’s memory is unreliable, and you don’t know when it will break"]]></title><description><![CDATA[
<p>How well do LoRAs work for this using something like Thinking Machine's Tinker?</p>
]]></description><pubDate>Sat, 11 Apr 2026 23:34:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=47734891</link><dc:creator>Leynos</dc:creator><comments>https://news.ycombinator.com/item?id=47734891</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47734891</guid></item><item><title><![CDATA[New comment by Leynos in "How We Broke Top AI Agent Benchmarks: And What Comes Next"]]></title><description><![CDATA[
<p>Also, fuzz your benchmarks</p>
]]></description><pubDate>Sat, 11 Apr 2026 20:02:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=47733526</link><dc:creator>Leynos</dc:creator><comments>https://news.ycombinator.com/item?id=47733526</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47733526</guid></item><item><title><![CDATA[New comment by Leynos in "System Card: Claude Mythos Preview [pdf]"]]></title><description><![CDATA[
<p>Opus 4.6 currently leads the remote labor index at 4.17. GPT-5.4 isn't measured on that one though: <a href="https://www.remotelabor.ai/" rel="nofollow">https://www.remotelabor.ai/</a><p>GPT 5.4 Pro leads Frontier Maths Tier 4 at 35%: <a href="https://epoch.ai/benchmarks/frontiermath-tier-4/" rel="nofollow">https://epoch.ai/benchmarks/frontiermath-tier-4/</a></p>
]]></description><pubDate>Tue, 07 Apr 2026 20:08:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=47680706</link><dc:creator>Leynos</dc:creator><comments>https://news.ycombinator.com/item?id=47680706</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47680706</guid></item><item><title><![CDATA[New comment by Leynos in "Tell HN: Anthropic no longer allowing Claude Code subscriptions to use OpenClaw"]]></title><description><![CDATA[
<p>It's not difficult at all to burn through your weekly limit just writing code.</p>
]]></description><pubDate>Sat, 04 Apr 2026 06:09:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=47636325</link><dc:creator>Leynos</dc:creator><comments>https://news.ycombinator.com/item?id=47636325</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47636325</guid></item><item><title><![CDATA[New comment by Leynos in "Renewables reached nearly 50% of global electricity capacity last year"]]></title><description><![CDATA[
<p>Wondered if you were able to suggest what might be stopping large scale build out of sodium ion and redox flow batteries?</p>
]]></description><pubDate>Fri, 03 Apr 2026 12:29:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=47625914</link><dc:creator>Leynos</dc:creator><comments>https://news.ycombinator.com/item?id=47625914</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47625914</guid></item><item><title><![CDATA[New comment by Leynos in "Anatomy of the .claude/ folder"]]></title><description><![CDATA[
<p>You just said it. If consistency is that important, keep consistent versions of model, harness, prompts, skills, etc., and regression test changes. That way lies madness :)</p>
]]></description><pubDate>Sun, 29 Mar 2026 11:06:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=47562114</link><dc:creator>Leynos</dc:creator><comments>https://news.ycombinator.com/item?id=47562114</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47562114</guid></item><item><title><![CDATA[New comment by Leynos in "Anatomy of the .claude/ folder"]]></title><description><![CDATA[
<p>It feels to me that "walk down the design tree" has a specific meaning with respect to treating the design as a hierarchy (although whether that means BFS or DFS is still ambiguous). "Be critical" lacks that specificity.</p>
]]></description><pubDate>Sat, 28 Mar 2026 07:59:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=47552521</link><dc:creator>Leynos</dc:creator><comments>https://news.ycombinator.com/item?id=47552521</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47552521</guid></item><item><title><![CDATA[New comment by Leynos in "Anatomy of the .claude/ folder"]]></title><description><![CDATA[
<p>Use evals<p>Coming soon, unit, behavioural and regression tests for your prompts and skills :P</p>
]]></description><pubDate>Sat, 28 Mar 2026 07:56:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=47552509</link><dc:creator>Leynos</dc:creator><comments>https://news.ycombinator.com/item?id=47552509</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47552509</guid></item><item><title><![CDATA[New comment by Leynos in "$500 GPU outperforms Claude Sonnet on coding benchmarks"]]></title><description><![CDATA[
<p>Kimi is surprisingly good at Rust.</p>
]]></description><pubDate>Fri, 27 Mar 2026 07:51:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=47540012</link><dc:creator>Leynos</dc:creator><comments>https://news.ycombinator.com/item?id=47540012</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47540012</guid></item><item><title><![CDATA[New comment by Leynos in "How I'm Productive with Claude Code"]]></title><description><![CDATA[
<p>Again, here's what works for me.<p>When I get an idea for something I want to build, I will usually spend time talking to ChatGPT about it. I'll request deep research on existing implementations, relevant technologies and algorithms, and a survey of literature. I find NotebookLM helps a lot at this point, as does Elevenreader (I tend to listen to these reports while walking or doing the dishes or what have you). I feed all of those into ChatGPT Deep Research along with my own thoughts about the direction the system, and ask it to produce a design document.<p>That gets me something like this:<p><a href="https://github.com/leynos/spycatcher-harness/blob/main/docs/spycatcher-harness-design.md" rel="nofollow">https://github.com/leynos/spycatcher-harness/blob/main/docs/...</a><p>If I need further revisions, I'll ask Codex or Claude Code to do those.<p>Finally, I break that down into a roadmap of phases, steps and achievable tasks using a prompt that defines what I want from each of those.<p>That gets me this:<p><a href="https://github.com/leynos/spycatcher-harness/blob/main/docs/roadmap.md" rel="nofollow">https://github.com/leynos/spycatcher-harness/blob/main/docs/...</a><p>Then I use an adapted version of OpenAI's execplans recipe to plan out each task (<a href="https://github.com/leynos/agent-helper-scripts/blob/main/skills/execplans/SKILL.md" rel="nofollow">https://github.com/leynos/agent-helper-scripts/blob/main/ski...</a>).<p>The task plans end up looking like this:<p><a href="https://github.com/leynos/spycatcher-harness/blob/main/docs/execplans/1-2-1-cassette-schema-versioning.md" rel="nofollow">https://github.com/leynos/spycatcher-harness/blob/main/docs/...</a><p>At the moment, I use Opus or GPT-5.4 on high to generate those plans, and Sonnet or GPT-5.4 medium to implement.<p>The roadmap and the design are definitely not set in stone. Each step is a learning opportunity, and I'll often change the direction of the project based on what I learn during the planning and implementation. And of course, this is just what works for me. The fun of the last few months has been everyone finding out what works for them.</p>
]]></description><pubDate>Tue, 24 Mar 2026 00:27:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=47497116</link><dc:creator>Leynos</dc:creator><comments>https://news.ycombinator.com/item?id=47497116</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47497116</guid></item><item><title><![CDATA[New comment by Leynos in "4Chan mocks £520k fine for UK online safety breaches"]]></title><description><![CDATA[
<p>And do you have statistical evidence to back up your claim of increased enforcement, or are you just reading about it in the Daily Mail?</p>
]]></description><pubDate>Mon, 23 Mar 2026 23:10:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=47496373</link><dc:creator>Leynos</dc:creator><comments>https://news.ycombinator.com/item?id=47496373</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47496373</guid></item><item><title><![CDATA[New comment by Leynos in "4Chan mocks £520k fine for UK online safety breaches"]]></title><description><![CDATA[
<p>Yup</p>
]]></description><pubDate>Mon, 23 Mar 2026 23:07:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=47496348</link><dc:creator>Leynos</dc:creator><comments>https://news.ycombinator.com/item?id=47496348</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47496348</guid></item><item><title><![CDATA[New comment by Leynos in "How I'm Productive with Claude Code"]]></title><description><![CDATA[
<p>Here's what I suggest:<p>Serious planning. The plans should include constraints, scope, escalation criteria, completion criteria, test and documentation plan.<p>Enforce single responsibility, cqrs, domain segregation, etc. Make the code as easy for you to reason about as possible. Enforce domain naming and function / variable naming conventions to make the code as easy to talk about as possible.<p>Use code review bots (Sourcery, CodeRabbit, and Codescene). They catch the small things (violations of contract, antipatterns, etc.) and the large (ux concerns, architectural flaws, etc.).<p>Go all in on linting. Make the rules as strict as possible, and tell the review bots to call out rule subversions. Write your own lints for the things the review bots are complaining about regularly that aren't caught by lints.<p>Use BDD alongside unit tests, read the .feature files before the build and give feedback. Use property testing as part of your normal testing strategy. Snapshot testing, e2e testing with mitm proxies, etc. For functions of any non-trivial complexity, consider bounded or unbounded proofs, model checking or undefined behaviour testing.<p>I'm looking into mutation testing and fuzzing too, but I am still learning.<p>Pause for frequent code audits. Ask an agent to audit for code duplication, redundancy, poor assumptions, architectural or domain violations, TOCTOU violations. Give yourself maintenance sprints where you pay down debt before resuming new features.<p>The beauty of agentic coding is, suddenly you have time for all of this.</p>
]]></description><pubDate>Mon, 23 Mar 2026 22:50:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=47496198</link><dc:creator>Leynos</dc:creator><comments>https://news.ycombinator.com/item?id=47496198</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47496198</guid></item><item><title><![CDATA[New comment by Leynos in "4Chan mocks £520k fine for UK online safety breaches"]]></title><description><![CDATA[
<p>Okay, here goes. You can tell when someone is acting in bad faith when they talk about a law that has been in force and enforced since the 1960s is something new.<p>Of course, "touch grass" works just as well.</p>
]]></description><pubDate>Thu, 19 Mar 2026 22:26:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=47447229</link><dc:creator>Leynos</dc:creator><comments>https://news.ycombinator.com/item?id=47447229</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47447229</guid></item><item><title><![CDATA[New comment by Leynos in "Pixel-art virtual office for AI agent teams"]]></title><description><![CDATA[
<p>It's fun. That's the utility. Although you could also probably convey information in a visual shorthand in a way that is quicker to parse than a table.<p>I want one that shows my agents working in a hipster coffee shop with chrono trigger-style lighting.</p>
]]></description><pubDate>Wed, 18 Mar 2026 09:07:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=47423343</link><dc:creator>Leynos</dc:creator><comments>https://news.ycombinator.com/item?id=47423343</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47423343</guid></item><item><title><![CDATA[New comment by Leynos in "Language model teams as distributed systems"]]></title><description><![CDATA[
<p>An agent is a way of performing an action that will generate context or a useful side effect without having to worry about the intermediate context.<p>People already do this serially by having a model write a plan, clearing the context, then having the same or a cheaper model action the plan. Doing so discards the intermediate context.<p>Sub-agents just let you do this in parallel. This works best when you have a task that needs to be done multiple times that cannot be done deterministically. For example, applying the same helper class usage in multiple places across a codebase, finding something out about multiple parts of the codebase, or testing a hypothesis in multiple places across a codebase.</p>
]]></description><pubDate>Tue, 17 Mar 2026 04:59:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=47408822</link><dc:creator>Leynos</dc:creator><comments>https://news.ycombinator.com/item?id=47408822</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47408822</guid></item><item><title><![CDATA[New comment by Leynos in "LLMs can be exhausting"]]></title><description><![CDATA[
<p>It's important to enforce the rules that make the code easier to read.</p>
]]></description><pubDate>Mon, 16 Mar 2026 08:12:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=47396292</link><dc:creator>Leynos</dc:creator><comments>https://news.ycombinator.com/item?id=47396292</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47396292</guid></item><item><title><![CDATA[New comment by Leynos in "Shall I implement it? No"]]></title><description><![CDATA[
<p><a href="https://github.com/steipete/Peekaboo" rel="nofollow">https://github.com/steipete/Peekaboo</a></p>
]]></description><pubDate>Thu, 12 Mar 2026 23:49:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=47358908</link><dc:creator>Leynos</dc:creator><comments>https://news.ycombinator.com/item?id=47358908</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47358908</guid></item></channel></rss>