<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: reedlaw</title><link>https://news.ycombinator.com/user?id=reedlaw</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Mon, 15 Jun 2026 18:44:21 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=reedlaw" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by reedlaw in "GLM 5.2 Is Out"]]></title><description><![CDATA[
<p>Since the Renaissance got started before the printing press, maybe you mean the press fueled it? The idea that the church found printing dangerous seems like a conflation with events that happened during the Protestant Reformation. The Catholic Church did censor works it found heretical, including unauthorized Bible translations.<p>One could argue the opposite conclusion, that technology helps break monopolies, but either view depends on reductionist historical readings. The truth is somewhere in between.</p>
]]></description><pubDate>Sun, 14 Jun 2026 14:05:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=48527326</link><dc:creator>reedlaw</dc:creator><comments>https://news.ycombinator.com/item?id=48527326</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48527326</guid></item><item><title><![CDATA[New comment by reedlaw in "Arch Linux Now Believes Malware Incident Under Control: More Than 1,500 Packages"]]></title><description><![CDATA[
<p>Good instinct. I did both. The script came out later.</p>
]]></description><pubDate>Sat, 13 Jun 2026 18:00:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=48519759</link><dc:creator>reedlaw</dc:creator><comments>https://news.ycombinator.com/item?id=48519759</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48519759</guid></item><item><title><![CDATA[New comment by reedlaw in "Arch Linux Now Believes Malware Incident Under Control: More Than 1,500 Packages"]]></title><description><![CDATA[
<p>I did the malware check using Claude, providing it with the same list (<a href="https://md.archlinux.org/s/SxbqukK6IA" rel="nofollow">https://md.archlinux.org/s/SxbqukK6IA</a>), and it did essentially the same things as this script does to verify. So either way should do the trick.</p>
]]></description><pubDate>Sat, 13 Jun 2026 15:49:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=48518428</link><dc:creator>reedlaw</dc:creator><comments>https://news.ycombinator.com/item?id=48518428</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48518428</guid></item><item><title><![CDATA[New comment by reedlaw in "AUR Packages Compromised with Infostealer and Rootkit"]]></title><description><![CDATA[
<p>I also had an affected package installed. Fortunately, it was from the official repo, before it was dropped and became an AUR package.</p>
]]></description><pubDate>Fri, 12 Jun 2026 12:16:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=48503165</link><dc:creator>reedlaw</dc:creator><comments>https://news.ycombinator.com/item?id=48503165</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48503165</guid></item><item><title><![CDATA[New comment by reedlaw in "What it feels like to work with Mythos"]]></title><description><![CDATA[
<p>Yeah, the README looks like slop to me.</p>
]]></description><pubDate>Wed, 10 Jun 2026 15:21:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=48477706</link><dc:creator>reedlaw</dc:creator><comments>https://news.ycombinator.com/item?id=48477706</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48477706</guid></item><item><title><![CDATA[New comment by reedlaw in "Show HN: Agent-skills-eval – Test whether Agent Skills improve outputs"]]></title><description><![CDATA[
<p>Codex is only slightly better, and that fluctuates so I switch back and forth.</p>
]]></description><pubDate>Fri, 08 May 2026 12:44:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=48062258</link><dc:creator>reedlaw</dc:creator><comments>https://news.ycombinator.com/item?id=48062258</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48062258</guid></item><item><title><![CDATA[New comment by reedlaw in "Show HN: Agent-skills-eval – Test whether Agent Skills improve outputs"]]></title><description><![CDATA[
<p>I tried to create a hook that would detect when token usage was running out and write HANDOFF.md so I could switch to another agent and finish the current task. It never worked reliably. To make a hook for db queries, it would need to run before each bash call, check if it looks like a query, and then exit with a new prompt, e.g.: "Use tidewave's execute_sql_query for DB access". But then it could just ignore the prompt, the same as CLAUDE.md. What if I really wanted to use bash for a specific task? The real issue is that prompts are not tightly coupled with capabilities. If we admit that, then skills are overhyped.</p>
]]></description><pubDate>Thu, 07 May 2026 17:11:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=48051949</link><dc:creator>reedlaw</dc:creator><comments>https://news.ycombinator.com/item?id=48051949</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48051949</guid></item><item><title><![CDATA[New comment by reedlaw in "Show HN: Agent-skills-eval – Test whether Agent Skills improve outputs"]]></title><description><![CDATA[
<p>I'm skeptical skills will outperform training given that Opus 4.7 already ignores a 720-byte CLAUDE.md telling it to use tidewave (a Rails MCP server with 6 tools) for db queries. When I asked a new claude session about a record it called<p>> Bash(DATABASE_URL=$(grep -E '^DATABASE_URL=' .env 2>/dev/null | head -1) echo "ok")<p>even though I have in CLAUDE.md:<p>> For database queries, use tidewave first.<p>I then prompted:<p>> use tidewave as per CLAUDE.md. also diagnose why you failed to heed that<p>> ● Diagnosis first: I defaulted to shell habits (env grep → psql) instead of pausing to recall the CLAUDE.md rule that tidewave is the first-line DB tool. The trigger was "look at this record" — I should have read that as "run a SQL query" and reached for tidewave immediately.<p>If Opus 4.7 doesn't follow simple CLAUDE.md instructions, I'm not sure what benefits other markdown files could bring. I don't trust Opus's own explanation, but it could point to the fact that the system prompt for bash is much longer than CLAUDE.md with tidewave.<p>While LLM judging could be helpful, I think the tool-call assertions (<a href="https://github.com/darkrishabh/agent-skills-eval#what-you-get" rel="nofollow">https://github.com/darkrishabh/agent-skills-eval#what-you-ge...</a>) may be the most useful thing in agent-skills-eval given that they are the only objective measure of compliance.</p>
]]></description><pubDate>Thu, 07 May 2026 15:19:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=48050489</link><dc:creator>reedlaw</dc:creator><comments>https://news.ycombinator.com/item?id=48050489</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48050489</guid></item><item><title><![CDATA[New comment by reedlaw in "Spinel: Ruby AOT Native Compiler"]]></title><description><![CDATA[
<p>Why do you say HashWithIndifferentAccess shows a lack of understanding? Like many Rails features, it's a convenience that abstracts away details that some find unpleasant to work with. Rails sometimes takes "magic" to the extreme through meta-programming. However, looking at the source [1], HashWithIndifferentAccess doesn't use eval, send, method_missing, or define_method. So I'm not sure how it seems weird to someone who works more with plain Ruby.<p>1. <a href="https://github.com/rails/rails/blob/fa8f0812160665bff083a089d2bb2fc1817ea03e/activesupport/lib/active_support/hash_with_indifferent_access.rb#L167" rel="nofollow">https://github.com/rails/rails/blob/fa8f0812160665bff083a089...</a></p>
]]></description><pubDate>Fri, 24 Apr 2026 15:01:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=47891211</link><dc:creator>reedlaw</dc:creator><comments>https://news.ycombinator.com/item?id=47891211</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47891211</guid></item><item><title><![CDATA[New comment by reedlaw in "Caveman: Why use many token when few token do trick"]]></title><description><![CDATA[
<p>Chinese omits articles, verbs aren't conjugated, and individual characters carry more meaning than English letters, but other than those differences I don't have the impression that Chinese communication is inherently more concise. Some forms of official speech are wordy. Writing is denser, but the amount of information conveyed through speech is about the same. There are jokes about ambiguous words or phrases in both Chinese and English. So I was surprised at your take, but I have no objection to your points above. Ancient Chinese, on the other hand, is extremely concise, but so are other ancient languages like Hebrew, although in a different way. So it seems that ancient languages are compressed but challenging and modern languages have unpacked the compression for ease of understanding.</p>
]]></description><pubDate>Sun, 05 Apr 2026 19:28:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=47652959</link><dc:creator>reedlaw</dc:creator><comments>https://news.ycombinator.com/item?id=47652959</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47652959</guid></item><item><title><![CDATA[New comment by reedlaw in "Caveman: Why use many token when few token do trick"]]></title><description><![CDATA[
<p>Are you saying Chinese is more concise than English? Chinese poetry is concise, but that can be true in any language. For LLMs, it depends on the tokenizer. Chinese models are of course more Chinese-friendly and so would encode the same sentence with fewer tokens than Western models.</p>
]]></description><pubDate>Sun, 05 Apr 2026 16:56:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=47651324</link><dc:creator>reedlaw</dc:creator><comments>https://news.ycombinator.com/item?id=47651324</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47651324</guid></item><item><title><![CDATA[New comment by reedlaw in "Show HN: I built a frontpage for personal blogs"]]></title><description><![CDATA[
<p>I've come to the conclusion that Hacker News is the best aggregator out there. Substack knows my interests yet gives terrible recommendations. YouTube constantly recommends the same videos or exaggerates my interest in a topic based on a few views, spamming me with related content until I watch something unrelated. The only downside of Hacker News is that its focus is narrower than other sites. But perhaps because the focus is "Anything that good hackers would find interesting" there is a bias towards things I find interesting with less noise than more commercial offerings.</p>
]]></description><pubDate>Fri, 03 Apr 2026 15:35:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=47627946</link><dc:creator>reedlaw</dc:creator><comments>https://news.ycombinator.com/item?id=47627946</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47627946</guid></item><item><title><![CDATA[New comment by reedlaw in "Universal Claude.md – cut Claude output tokens"]]></title><description><![CDATA[
<p>Claude has trained me on the use of the word 'invariant'. I never used it before, but it makes sense as a term for a rule the system guarantees. I would have used 'validation' for application-side rules or 'constraint' for db rules, but 'invariant' is a nice generic substitute.</p>
]]></description><pubDate>Tue, 31 Mar 2026 14:16:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=47587743</link><dc:creator>reedlaw</dc:creator><comments>https://news.ycombinator.com/item?id=47587743</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47587743</guid></item><item><title><![CDATA[New comment by reedlaw in "A sufficiently detailed spec is code"]]></title><description><![CDATA[
<p>Then I would say this is another proof that LLMs lack intellect or ability to reason about universals. See <a href="https://michaelmangialardi.substack.com/i/186405810/test-4-particular-vs-universal-judgments" rel="nofollow">https://michaelmangialardi.substack.com/i/186405810/test-4-p...</a></p>
]]></description><pubDate>Fri, 20 Mar 2026 11:29:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=47453148</link><dc:creator>reedlaw</dc:creator><comments>https://news.ycombinator.com/item?id=47453148</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47453148</guid></item><item><title><![CDATA[New comment by reedlaw in "A sufficiently detailed spec is code"]]></title><description><![CDATA[
<p>Why is Haskell irrelevant to the argument that LLMs can't reliably permute programming knowledge from one language to another? In fact, the purity of the language and dearth of training data seems like the perfect test case to see whether concepts found in more mainstream languages are actually understood.</p>
]]></description><pubDate>Thu, 19 Mar 2026 17:30:23 +0000</pubDate><link>https://news.ycombinator.com/item?id=47442886</link><dc:creator>reedlaw</dc:creator><comments>https://news.ycombinator.com/item?id=47442886</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47442886</guid></item><item><title><![CDATA[New comment by reedlaw in "Get Shit Done: A meta-prompting, context engineering and spec-driven dev system"]]></title><description><![CDATA[
<p>This is the second endorsement I've seen today. I gave OpenSpec a shot and was dismayed by the Explore prompt. [1] Over 1,000 words with verbose, repetitive instructions which will lead to context drift. The examples refer to specific tools like SQLite and OAuth. That won't help if your project isn't related to those.<p>I do like the basic concept and directory structure, but those are easy enough to adopt without all the cruft.<p>1. <a href="https://github.com/Fission-AI/OpenSpec/blob/main/src/core/templates/workflows/explore.ts#L9" rel="nofollow">https://github.com/Fission-AI/OpenSpec/blob/main/src/core/te...</a></p>
]]></description><pubDate>Wed, 18 Mar 2026 15:35:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=47427078</link><dc:creator>reedlaw</dc:creator><comments>https://news.ycombinator.com/item?id=47427078</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47427078</guid></item><item><title><![CDATA[New comment by reedlaw in "Ask HN: What Are You Working On? (March 2026)"]]></title><description><![CDATA[
<p>Do you have examples of the task maturation cycle? I'm not sure how it would work for tasks like extracting structured data from images. It seems it could only work for tasks that can be scripted and wouldn't work well for tasks that need individual reasoning in every instance.</p>
]]></description><pubDate>Mon, 09 Mar 2026 17:40:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=47312433</link><dc:creator>reedlaw</dc:creator><comments>https://news.ycombinator.com/item?id=47312433</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47312433</guid></item><item><title><![CDATA[New comment by reedlaw in "Tell HN: I'm 60 years old. Claude Code has re-ignited a passion"]]></title><description><![CDATA[
<p>How do you even begin to define objective measurements of software engineering productivity? You could use DORA metrics [1] which are about how effectively software is delivered. Or you could use the SPACE Framework [2] which is more about the developer experience.<p>1. <a href="https://cloud.google.com/blog/products/devops-sre/using-the-four-keys-to-measure-your-devops-performance" rel="nofollow">https://cloud.google.com/blog/products/devops-sre/using-the-...</a><p>2. <a href="https://space-framework.com/" rel="nofollow">https://space-framework.com/</a></p>
]]></description><pubDate>Sat, 07 Mar 2026 15:37:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=47288560</link><dc:creator>reedlaw</dc:creator><comments>https://news.ycombinator.com/item?id=47288560</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47288560</guid></item><item><title><![CDATA[New comment by reedlaw in "Plasma Bigscreen – 10-foot interface for KDE plasma"]]></title><description><![CDATA[
<p>Plasma Bigscreen has been around for 6 years: <a href="https://itsfoss.com/news/plasma-bigscreen-comeback/" rel="nofollow">https://itsfoss.com/news/plasma-bigscreen-comeback/</a></p>
]]></description><pubDate>Sat, 07 Mar 2026 13:59:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=47287690</link><dc:creator>reedlaw</dc:creator><comments>https://news.ycombinator.com/item?id=47287690</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47287690</guid></item><item><title><![CDATA[New comment by reedlaw in "Ki Editor - an editor that operates on the AST"]]></title><description><![CDATA[
<p><a href="https://github.com/mickeynp/combobulate" rel="nofollow">https://github.com/mickeynp/combobulate</a></p>
]]></description><pubDate>Sat, 07 Mar 2026 13:14:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=47287355</link><dc:creator>reedlaw</dc:creator><comments>https://news.ycombinator.com/item?id=47287355</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47287355</guid></item></channel></rss>