<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: eithed</title><link>https://news.ycombinator.com/user?id=eithed</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Thu, 11 Jun 2026 09:07:49 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=eithed" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by eithed in "What it feels like to work with Mythos"]]></title><description><![CDATA[
<p>Ah, I see what you mean now. Yes, my mind went straight to static analysis and testing (unit, feature, uat, mutation). Thanks for expanding on your point!</p>
]]></description><pubDate>Wed, 10 Jun 2026 16:49:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=48479089</link><dc:creator>eithed</dc:creator><comments>https://news.ycombinator.com/item?id=48479089</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48479089</guid></item><item><title><![CDATA[New comment by eithed in "What it feels like to work with Mythos"]]></title><description><![CDATA[
<p><a href="https://www.danielzarick.com/uploads/2018-05-draw-the-owl.jpg" rel="nofollow">https://www.danielzarick.com/uploads/2018-05-draw-the-owl.jp...</a></p>
]]></description><pubDate>Wed, 10 Jun 2026 15:25:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=48477765</link><dc:creator>eithed</dc:creator><comments>https://news.ycombinator.com/item?id=48477765</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48477765</guid></item><item><title><![CDATA[New comment by eithed in "What it feels like to work with Mythos"]]></title><description><![CDATA[
<p>Don't forget that LLMs are trained on human code. If they cannot understand what your code does then they cannot make changes to it, or at least - having them understand your codebase becomes expensive (more trips to Anthropic servers)</p>
]]></description><pubDate>Wed, 10 Jun 2026 12:03:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=48475040</link><dc:creator>eithed</dc:creator><comments>https://news.ycombinator.com/item?id=48475040</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48475040</guid></item><item><title><![CDATA[New comment by eithed in "What it feels like to work with Mythos"]]></title><description><![CDATA[
<p>I disagree. What I should have added is that with agents (as well as humans) you do need to have tests that verify what was done.</p>
]]></description><pubDate>Wed, 10 Jun 2026 11:54:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=48474958</link><dc:creator>eithed</dc:creator><comments>https://news.ycombinator.com/item?id=48474958</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48474958</guid></item><item><title><![CDATA[New comment by eithed in "What it feels like to work with Mythos"]]></title><description><![CDATA[
<p>I mirror your thoughts. I think we'll end up with "perfect map" paradox = you cannot be vague or indecisive on what you want (and if you are then these decisions don't matter) and you're creating a 1:1 representation of what the code needs to be.<p>I'd substitute "owner" for the team and in that sense the owner will not need to be human.<p>We're at this state where Claude is great at doing the "middle" part of work, but it's crap at gathering requirements and verification of what it has done. I also don't see people caring about these aspects of software development as shown in the article</p>
]]></description><pubDate>Wed, 10 Jun 2026 11:47:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=48474875</link><dc:creator>eithed</dc:creator><comments>https://news.ycombinator.com/item?id=48474875</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48474875</guid></item><item><title><![CDATA[New comment by eithed in "What it feels like to work with Mythos"]]></title><description><![CDATA[
<p>Exactly - the normalization of craft (?) is interesting</p>
]]></description><pubDate>Wed, 10 Jun 2026 07:20:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=48472688</link><dc:creator>eithed</dc:creator><comments>https://news.ycombinator.com/item?id=48472688</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48472688</guid></item><item><title><![CDATA[New comment by eithed in "What it feels like to work with Mythos"]]></title><description><![CDATA[
<p>Don't forget that you can adjust your requirements (either via plan or skill) to ensure the mistakes do not happen. The problem is that neither LLMs, nor humans (that don't work with the domain) will know they made these  mistakes. Even coders don't think about everything <i>all</i> the time</p>
]]></description><pubDate>Wed, 10 Jun 2026 07:12:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=48472598</link><dc:creator>eithed</dc:creator><comments>https://news.ycombinator.com/item?id=48472598</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48472598</guid></item><item><title><![CDATA[New comment by eithed in "What it feels like to work with Mythos"]]></title><description><![CDATA[
<p>The road will be built to <i>some</i> specs, including features nobody asked for. If the corpus was trained for roads built in Arctic, you will get penguin crossings.</p>
]]></description><pubDate>Wed, 10 Jun 2026 06:05:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=48472084</link><dc:creator>eithed</dc:creator><comments>https://news.ycombinator.com/item?id=48472084</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48472084</guid></item><item><title><![CDATA[New comment by eithed in "What it feels like to work with Mythos"]]></title><description><![CDATA[
<p>I agree. But if I'm paying for the road (even as a taxpayer) I get angry that after a year it's full of potholes and that there are unnecessary signs warning about penguin crossing, making it cost 2 times more than it should have (and dont get me started why this road is really a highway leading to my house). I'd want certain qualities. And this article is basically = you will get a road, built quickly<p>But yes, you are right - I don't build roads and don't know what is a price to build a road and how to determine the quality of correctly built one, nor I will ever care or learn.</p>
]]></description><pubDate>Tue, 09 Jun 2026 22:48:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=48468905</link><dc:creator>eithed</dc:creator><comments>https://news.ycombinator.com/item?id=48468905</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48468905</guid></item><item><title><![CDATA[New comment by eithed in "What it feels like to work with Mythos"]]></title><description><![CDATA[
<p>True, but you should say that about every thing. Does it matter to you how the car drives, as long as it takes you to your destination? Well, yes, it matters: how will it deal with a crash, and if it's possible to replace a part and if anybody can just open it if you leave it outside. I will be amazed if somebody shows me their home-printed car, but if they'll try to sell it to me like a new one...</p>
]]></description><pubDate>Tue, 09 Jun 2026 22:34:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=48468740</link><dc:creator>eithed</dc:creator><comments>https://news.ycombinator.com/item?id=48468740</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48468740</guid></item><item><title><![CDATA[New comment by eithed in "What it feels like to work with Mythos"]]></title><description><![CDATA[
<p>Fair enough, but enterpreunership should, I guess, ask questions if given Next Big Thing has substance behind it or is it just snake oil.</p>
]]></description><pubDate>Tue, 09 Jun 2026 20:13:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=48467014</link><dc:creator>eithed</dc:creator><comments>https://news.ycombinator.com/item?id=48467014</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48467014</guid></item><item><title><![CDATA[New comment by eithed in "What it feels like to work with Mythos"]]></title><description><![CDATA[
<p>What I find fascinating that there is so little substance in this article about the quality of produced code and the medium. Is the code documented and tested? Is it understandable and extendable? Is it secure? What language, framework, database was used? Author mentions judgement and taste - well, is the code tasteful? Will the model rearchitecture the entire thing if I ask it to add new functionality, spending another 9.5h in tokens? I assume that the research part is domain knowledge = how different types of travel translate to time making it presentable; how did the author verify this?<p>These questions are even not about AI: if I were to give money to a human agency and were given something they tell me works, I would ask the same questions. If I did not know how to evaluate, I would hire people that do. With LLMs the verification part is what bothers me the most.</p>
]]></description><pubDate>Tue, 09 Jun 2026 19:32:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=48466463</link><dc:creator>eithed</dc:creator><comments>https://news.ycombinator.com/item?id=48466463</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48466463</guid></item><item><title><![CDATA[New comment by eithed in "Cleaning up after AI rockstar developers"]]></title><description><![CDATA[
<p>I assumed that as it was hitting too close home (both PMs and EMs were expected to use AI, with PM trying to code in solutions that they didn't have expertise nor domain knowledge to deal with; EM was prototyping solutions that had access to prod DB that were shut down as soon as we found out). I can only symphatize nonetheless and thank you for giving your PoV.</p>
]]></description><pubDate>Tue, 09 Jun 2026 15:57:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=48462786</link><dc:creator>eithed</dc:creator><comments>https://news.ycombinator.com/item?id=48462786</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48462786</guid></item><item><title><![CDATA[New comment by eithed in "Cleaning up after AI rockstar developers"]]></title><description><![CDATA[
<p>Even if it's a mandate top-down, you can:<p>- show increase in errors and outages caused by this approach<p>- integrate manager changes into your CI pipeline (coding / reviewing / testing / documentation)<p>- discuss how your manager can do the changes they need to do without sidetracking all other work<p>Make it indeed about the money: coding by PM + fixing what was coded + dealing with fallout is greater expense than coding by PM + automated guidelines + reviewing what was coded.<p>That is - if the environment you're working in is reasonable and it's not a power play by your PM</p>
]]></description><pubDate>Tue, 09 Jun 2026 14:11:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=48461408</link><dc:creator>eithed</dc:creator><comments>https://news.ycombinator.com/item?id=48461408</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48461408</guid></item><item><title><![CDATA[New comment by eithed in "Cleaning up after AI rockstar developers"]]></title><description><![CDATA[
<p>> Please write a manual on how to cleanup after AI rockstar managers who think they can code.<p>Why are you allowing AI rockstar managers to (I assume) push without code review? Why are you cleaning up the fallout? It's not AI issue, it's people issue</p>
]]></description><pubDate>Tue, 09 Jun 2026 12:59:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=48460538</link><dc:creator>eithed</dc:creator><comments>https://news.ycombinator.com/item?id=48460538</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48460538</guid></item><item><title><![CDATA[New comment by eithed in "Domain expertise has always been the real moat"]]></title><description><![CDATA[
<p>Sure, producing code has become cheap. Yet again the taste matters and LLMs do not have taste - they will apply patterns that are unnecessary or not extendible, producing unmaintainable systems that nobody understands. Capturing domain knowledge was the crux of development process, but so was verifying, documenting, ensuring that multiple systems work together, maintaining uniformity. I don't know where the assumptions, done by developers, that they only need to produce code that just works or goes brrr fast comes from.<p>Domain expert can develop working code, but they will not be able to ensure above.</p>
]]></description><pubDate>Sun, 31 May 2026 10:38:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=48344612</link><dc:creator>eithed</dc:creator><comments>https://news.ycombinator.com/item?id=48344612</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48344612</guid></item><item><title><![CDATA[New comment by eithed in "Dynamic Workflows in Claude Code"]]></title><description><![CDATA[
<p>Depends - using Sonnet here and generally it should be as you say: plan would produce the result.<p>Still Claude will sneak things in - in my recent plan, for example I had defined, per acceptance criteria what colours the statuses should be: green for live, blue for sold, grey for anything else; it changed this to: green for live, orange for in progress, blue for sold, red in demolition, etc. When pressed why did it to this, it was unable to explain why. This is with a plan where AC were explicitly provided from the task in Given/When/Then format and were to be adhered to strictly. I've caught this within planning, but I shouldn't need to be doing this.<p>Even in standard prompts where I tell it "Change this label from X to Y", it ended reordering the tabs unrelated to ask. Again I was not able for it to explain why - it was so abrupt. And it was in fresh context, without any pollution on what I expect it to do.<p>I also noticed a different behaviour regarding skill; today and yesterday it would not be following skill guidance at all ie: skill writing skill - I'd have to explicitly tell it to test skills after writing them, when this is a behaviour expected by default. Similarly with other skills - knowing that it should have done something per skill guidelines and it not doing it at all. This is new behaviour that I've not seen a week ago.</p>
]]></description><pubDate>Fri, 29 May 2026 17:52:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=48326751</link><dc:creator>eithed</dc:creator><comments>https://news.ycombinator.com/item?id=48326751</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48326751</guid></item><item><title><![CDATA[New comment by eithed in "Show HN: AISlop, a CLI for catching AI generated code smells"]]></title><description><![CDATA[
<p>Can you explain the benefits of running this over rector / eslint? (and to certain degree phpstan / deptrac)</p>
]]></description><pubDate>Fri, 29 May 2026 14:14:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=48323377</link><dc:creator>eithed</dc:creator><comments>https://news.ycombinator.com/item?id=48323377</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48323377</guid></item><item><title><![CDATA[New comment by eithed in "Dynamic Workflows in Claude Code"]]></title><description><![CDATA[
<p>Write a skill outlining your expectations of the code, put that skill into the pipeline, so that it can be included within your workflow.<p>Webdev here, but currently I have:
- a skill where I outlined how the architecture of the system should look like, with guards (static analysis, architecture tests, linting) confirming that the code it generates adheres to standards<p>- a skill that tells it how tests should look like (use generators, write both feature / unit tests)<p>- a skill that tells it to generate docs from the code in a form of acceptance criteria (Given / When / Then)<p>- a skill that tells it to generate frontend uat tests + accompanying backend seeders given the AC<p>- a skill that tells it to verify that ticket objectives match what was delivered<p>At this point I still need to guide it to move task from one stage to the other (coding, testing, verification that indeed what was coded adheres to what was required), but I believe that these dynamic workflows can automate this work as well.</p>
]]></description><pubDate>Fri, 29 May 2026 13:40:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=48322993</link><dc:creator>eithed</dc:creator><comments>https://news.ycombinator.com/item?id=48322993</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48322993</guid></item><item><title><![CDATA[New comment by eithed in "[dead]"]]></title><description><![CDATA[
<p>Fair enough, I was looking at budget fields on wiki rather than reading the tidbits - thanks for the correction still!</p>
]]></description><pubDate>Fri, 29 May 2026 13:08:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=48322643</link><dc:creator>eithed</dc:creator><comments>https://news.ycombinator.com/item?id=48322643</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48322643</guid></item></channel></rss>