<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: zby</title><link>https://news.ycombinator.com/user?id=zby</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Fri, 26 Jun 2026 22:34:45 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=zby" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by zby in "Show HN: OpenKnowledge – open source AI-first alternative to Obsidian/Notion"]]></title><description><![CDATA[
<p>select text and comment on it - then let the llm read all comments</p>
]]></description><pubDate>Fri, 26 Jun 2026 07:29:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=48683481</link><dc:creator>zby</dc:creator><comments>https://news.ycombinator.com/item?id=48683481</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48683481</guid></item><item><title><![CDATA[New comment by zby in "Show HN: OpenKnowledge – open source AI-first alternative to Obsidian/Notion"]]></title><description><![CDATA[
<p>The feature I am waiting for in all of these editors is integrating 'red lining' as a channel for LLM input. This is the best interface for working on a text. <a href="https://www.roughdraft.md/" rel="nofollow">https://www.roughdraft.md/</a> does the core idea pretty well - but is not well integrated with the rest (browsing, etc).</p>
]]></description><pubDate>Fri, 26 Jun 2026 05:02:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=48682519</link><dc:creator>zby</dc:creator><comments>https://news.ycombinator.com/item?id=48682519</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48682519</guid></item><item><title><![CDATA[New comment by zby in "Prompt Injection as Role Confusion"]]></title><description><![CDATA[
<p>If the action is decided by code based on metadata - then what is really the LLM task? And if you say that it is only the type of action that is decided by code - then this is maybe a  mitigation - but the llm still can do a lot of harm. And also it is very limiting - using the llm to decide the action is very useful. This is different from SQL injection - where the action is determined by the code and the injection is really making a code parsing error.<p>It might still be the way to go - but calling it 'the real solution' is overselling it.</p>
]]></description><pubDate>Mon, 22 Jun 2026 21:02:23 +0000</pubDate><link>https://news.ycombinator.com/item?id=48636147</link><dc:creator>zby</dc:creator><comments>https://news.ycombinator.com/item?id=48636147</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48636147</guid></item><item><title><![CDATA[New comment by zby in "Prompt Injection as Role Confusion"]]></title><description><![CDATA[
<p>They do predict what injections might be effective - so it is a theory. I don't know how novel it is and it is not very deep (as you noted the general mechanism is quite obvious) - but they do it quite systematically so it is useful.</p>
]]></description><pubDate>Mon, 22 Jun 2026 20:49:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=48635967</link><dc:creator>zby</dc:creator><comments>https://news.ycombinator.com/item?id=48635967</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48635967</guid></item><item><title><![CDATA[New comment by zby in "Agentic coding deserves more than a chat box bolted onto VS Code"]]></title><description><![CDATA[
<p>The video seems to be about mushrooms.</p>
]]></description><pubDate>Wed, 17 Jun 2026 16:54:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=48573149</link><dc:creator>zby</dc:creator><comments>https://news.ycombinator.com/item?id=48573149</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48573149</guid></item><item><title><![CDATA[New comment by zby in "But yak shaving is fun (2019)"]]></title><description><![CDATA[
<p>The fact that programmers can be nerd sniped into yak shaving some random libraries is the only thing that keeps Open Source running.</p>
]]></description><pubDate>Tue, 16 Jun 2026 17:26:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=48558722</link><dc:creator>zby</dc:creator><comments>https://news.ycombinator.com/item?id=48558722</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48558722</guid></item><item><title><![CDATA[New comment by zby in "Israeli firm BlackCore suspected of meddling in New York and Scotland votes"]]></title><description><![CDATA[
<p>I would love to hear from someone knowledgeable - is that bad for the company or good?</p>
]]></description><pubDate>Sat, 13 Jun 2026 09:37:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=48515334</link><dc:creator>zby</dc:creator><comments>https://news.ycombinator.com/item?id=48515334</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48515334</guid></item><item><title><![CDATA[New comment by zby in "Show HN: Skill for your agent to visualize your gbrain and Obsidian"]]></title><description><![CDATA[
<p>I am open to changing my mind and I am looking forward for answers to your question - but I think it is like a click bait - looks interesting - but not really useful.</p>
]]></description><pubDate>Sat, 13 Jun 2026 09:35:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=48515315</link><dc:creator>zby</dc:creator><comments>https://news.ycombinator.com/item?id=48515315</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48515315</guid></item><item><title><![CDATA[New comment by zby in "Ask HN: What are tools you have made for yourself since the advent of AI?"]]></title><description><![CDATA[
<p>I am building my self-hosting llm-wiki system (<a href="https://gist.github.com/karpathy/442a6bf555914893e9891c11519de94f" rel="nofollow">https://gist.github.com/karpathy/442a6bf555914893e9891c11519...</a>). My approach is to start with a theory of how such systems could work. Then since llms can interpret theory - this  theory becomes an executable llm-wiki system itself.<p>It's called Commonplace: <a href="https://zby.github.io/commonplace/" rel="nofollow">https://zby.github.io/commonplace/</a></p>
]]></description><pubDate>Mon, 08 Jun 2026 21:39:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=48452557</link><dc:creator>zby</dc:creator><comments>https://news.ycombinator.com/item?id=48452557</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48452557</guid></item><item><title><![CDATA[New comment by zby in "Zerostack – A Unix-inspired coding agent written in pure Rust"]]></title><description><![CDATA[
<p>I think I fixed this in a later revision. Does that persist?</p>
]]></description><pubDate>Sun, 17 May 2026 09:27:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=48167355</link><dc:creator>zby</dc:creator><comments>https://news.ycombinator.com/item?id=48167355</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48167355</guid></item><item><title><![CDATA[New comment by zby in "Zerostack – A Unix-inspired coding agent written in pure Rust"]]></title><description><![CDATA[
<p>There is also <a href="https://github.com/Dicklesworthstone/pi_agent_rust" rel="nofollow">https://github.com/Dicklesworthstone/pi_agent_rust</a><p>I vibed a comparison/review of these two systems using my llm wiki: <a href="https://zby.github.io/commonplace/work/pi-agent-zerostack-comparison/pi-agent-rust-vs-zerostack/" rel="nofollow">https://zby.github.io/commonplace/work/pi-agent-zerostack-co...</a><p>(the prompt is in <a href="https://zby.github.io/commonplace/work/pi-agent-zerostack-comparison/review-instruction/" rel="nofollow">https://zby.github.io/commonplace/work/pi-agent-zerostack-co...</a>)</p>
]]></description><pubDate>Sun, 17 May 2026 09:13:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=48167275</link><dc:creator>zby</dc:creator><comments>https://news.ycombinator.com/item?id=48167275</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48167275</guid></item><item><title><![CDATA[New comment by zby in "Inventing Cyrillic (2024)"]]></title><description><![CDATA[
<p>"In the 890s, having recently converted to Orthodox Christianity, Boris ensured his church would be independent from the Patriarchate of Constantinople." --- I thought Orthodox Christianity was created by the Great Schism in 1054.</p>
]]></description><pubDate>Fri, 08 May 2026 09:13:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=48060621</link><dc:creator>zby</dc:creator><comments>https://news.ycombinator.com/item?id=48060621</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48060621</guid></item><item><title><![CDATA[New comment by zby in "Nobody Reviews Compiler Output"]]></title><description><![CDATA[
<p>And how they are doing? I think this might be a solid research program - but that blog presented it as some kind of practical approach.</p>
]]></description><pubDate>Fri, 08 May 2026 08:53:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=48060469</link><dc:creator>zby</dc:creator><comments>https://news.ycombinator.com/item?id=48060469</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48060469</guid></item><item><title><![CDATA[New comment by zby in "AI slop is killing online communities"]]></title><description><![CDATA[
<p>So no hope for <a href="https://xkcd.com/810/" rel="nofollow">https://xkcd.com/810/</a>?</p>
]]></description><pubDate>Thu, 07 May 2026 20:46:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=48054719</link><dc:creator>zby</dc:creator><comments>https://news.ycombinator.com/item?id=48054719</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48054719</guid></item><item><title><![CDATA[New comment by zby in "Nobody Reviews Compiler Output"]]></title><description><![CDATA[
<p>First you need to write these specifications and if you say just tell the llm to write them - then how would it be different from just tell the llm to write the program?<p>I guess you can argue that these are two independent processes so you can combine them to get something more reliable than both - this might be a viable path. But from what I heard writing formal specifications is just really hard - I haven't seen anything practical in this area.</p>
]]></description><pubDate>Thu, 07 May 2026 20:37:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=48054620</link><dc:creator>zby</dc:creator><comments>https://news.ycombinator.com/item?id=48054620</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48054620</guid></item><item><title><![CDATA[New comment by zby in "Agents need control flow, not more prompts"]]></title><description><![CDATA[
<p>I concur - it does not make sense to do in llm prompts what can be done in code. Code is cheaper, faster, deterministic and we have lots of experience with working with code.<p>Especially all bookkeeping logic should move into the symbolic layer: <a href="https://zby.github.io/commonplace/notes/scheduler-llm-separation-exploits-an-error-correction-asymmetry/" rel="nofollow">https://zby.github.io/commonplace/notes/scheduler-llm-separa...</a></p>
]]></description><pubDate>Thu, 07 May 2026 20:15:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=48054291</link><dc:creator>zby</dc:creator><comments>https://news.ycombinator.com/item?id=48054291</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48054291</guid></item><item><title><![CDATA[New comment by zby in "Nobody Reviews Compiler Output"]]></title><description><![CDATA[
<p>"""
we need to build:<p><pre><code>    Formal specification layers that agents execute against, not just prompts</code></pre>
"""<p>It is probably easier to just write that program.</p>
]]></description><pubDate>Thu, 07 May 2026 19:51:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=48054011</link><dc:creator>zby</dc:creator><comments>https://news.ycombinator.com/item?id=48054011</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48054011</guid></item><item><title><![CDATA[New comment by zby in "Show HN: Auto-Architecture: Karpathy's Loop, pointed at a CPU"]]></title><description><![CDATA[
<p>It is not novel - but with the new models it is just becoming practical.</p>
]]></description><pubDate>Wed, 29 Apr 2026 12:48:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=47947576</link><dc:creator>zby</dc:creator><comments>https://news.ycombinator.com/item?id=47947576</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47947576</guid></item><item><title><![CDATA[New comment by zby in "Simulacrum of Knowledge Work"]]></title><description><![CDATA[
<p>If you have a test that fails 50% times - is that test valuable or not? A 50% failure rate alone looks like a coin toss, but by itself that does not tell us whether the test is noise or whether it is separating bad states from good ones. For a test to be useful it needs to have positive Youden’s statistic (<a href="https://en.wikipedia.org/wiki/Youden%27s_J_statistic" rel="nofollow">https://en.wikipedia.org/wiki/Youden%27s_J_statistic</a>): sensitivity + specificity - 1. A 50% failure rate alone does not let us calculate sensitivity and specificity.<p>I can see a similar problem with this article - the author notices that LLMs produce a lot of errors - then concludes that they are useless and produce only simulacrum of work. The author has an interesting observation about how llms disrupt the way we judge knowledge work. But when he concludes that llms do only simulacrum of work - this is where his arguments fail.</p>
]]></description><pubDate>Sat, 25 Apr 2026 21:32:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=47904703</link><dc:creator>zby</dc:creator><comments>https://news.ycombinator.com/item?id=47904703</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47904703</guid></item><item><title><![CDATA[New comment by zby in "AI agents that argue with each other to improve decisions"]]></title><description><![CDATA[
<p>I don't know - looks like an interesting idea - but ... I am struggling to put that in a polite manner. When I go into the repo and find out that it does stuff like lip syncing of talking avatars then I start to think what percentage of the development effort goes into marketing?</p>
]]></description><pubDate>Sat, 25 Apr 2026 20:29:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=47904315</link><dc:creator>zby</dc:creator><comments>https://news.ycombinator.com/item?id=47904315</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47904315</guid></item></channel></rss>