<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News - Newest: &#34;Claude Code&#34;</title><link>https://news.ycombinator.com/newest</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Mon, 29 Jun 2026 00:09:20 +0000</lastBuildDate><atom:link href="https://hnrss.org/newest?q=Claude+Code" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[Mux – A tmux overlay for managing Claude Code sessions]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/fashton28/mux">https://github.com/fashton28/mux</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48712990">https://news.ycombinator.com/item?id=48712990</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Sun, 28 Jun 2026 23:42:55 +0000</pubDate><link>https://github.com/fashton28/mux</link><dc:creator>fashton28</dc:creator><comments>https://news.ycombinator.com/item?id=48712990</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48712990</guid></item><item><title><![CDATA[Claude Code now uses dark UI patterns to gain Google account access via MCP]]></title><description><![CDATA[
<p>Article URL: <a href="https://claude.com/docs/connectors/google/gmail">https://claude.com/docs/connectors/google/gmail</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48711306">https://news.ycombinator.com/item?id=48711306</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Sun, 28 Jun 2026 20:30:29 +0000</pubDate><link>https://claude.com/docs/connectors/google/gmail</link><dc:creator>janpeuker</dc:creator><comments>https://news.ycombinator.com/item?id=48711306</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48711306</guid></item><item><title><![CDATA[How to Use Claude Code: A Complete Beginner's Guide (2026)]]></title><description><![CDATA[
<p>Article URL: <a href="https://dest.host/b/how-to-use-claude-code/">https://dest.host/b/how-to-use-claude-code/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48710860">https://news.ycombinator.com/item?id=48710860</a></p>
<p>Points: 5</p>
<p># Comments: 2</p>
]]></description><pubDate>Sun, 28 Jun 2026 19:46:28 +0000</pubDate><link>https://dest.host/b/how-to-use-claude-code/</link><dc:creator>snorbleck</dc:creator><comments>https://news.ycombinator.com/item?id=48710860</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48710860</guid></item><item><title><![CDATA[Show HN: Caliper – pass@k reliability testing for Claude Code and Codex skills]]></title><description><![CDATA[
<p>Skills for Claude Code and Codex are hard to test. By hard I mean there's no standard way to do it. You evaluate the skill once on something and it looks like it works. You publish it. Then a new super model is released (GLM 5.2, anyone?), the skill quietly breaks in some cases, and you won't find out until your users complain.<p>I faced the same problem, so I tried to build something lightweight to stop that from happening: Caliper.<p>It's a local, lightweight harness that runs a skill k times in isolated environments and gives you a pass@k score (how many of those k runs succeeded). Because the technology is non-deterministic, you can't just say "it worked once". You need to say how many times it passed out of k.<p>You define success in a YAML spec. I picked YAML so the spec has a schema but is still readable by a human. You can use an LLM judge, a Python assertion, or both.<p>Here's a simple evaluation example for JSON extraction. You write this in a YAML file:<p><pre><code>  tasks:
    - name: Extracts action items as clean JSON
      prompt: "Read /tmp/transcript.txt and write the
               action items to /tmp/actions.json."
      expect: "A valid JSON array where every item has
               owner, task, due. No markdown fences."
      assert: |
        import json
        items = json.load(open("/tmp/actions.json"))
        assert isinstance(items, list)
        assert all({"owner","task","due"} <= i.keys()
                   for i in items)
</code></pre>
Then you run it with the CLI:<p>caliper run extract-actions.eval.yaml --k 5 --baseline<p>What's cool about the --baseline flag is that it re-runs everything without the skill, so you can see whether the skill is doing the work or the base agent was going to pass anyway:<p><pre><code>  ID      Task                           k(5)  pass@k
  task-1  Extracts action items as JSON  5/5   100%  PASS
  With skill   100%
  No skill      60%
  Delta        +40%
</code></pre>
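<p>As a rough illustration of how the numbers in a table like this relate to each other, here is a minimal Python sketch. It assumes the score is simply the fraction of the k runs that passed, and that the delta is the with-skill rate minus the baseline rate. The RunSummary class, the skill_delta function and the sample outcomes are hypothetical and are not taken from Caliper's code.</p>

```python
from dataclasses import dataclass


@dataclass
class RunSummary:
    """Hypothetical container for the pass/fail outcomes of k runs of one task."""
    outcomes: list[bool]

    @property
    def k(self) -> int:
        return len(self.outcomes)

    @property
    def passes(self) -> int:
        # True counts as 1 and False as 0 when summed.
        return sum(self.outcomes)

    @property
    def pass_rate(self) -> float:
        # Fraction of the k runs that succeeded (0.0 when there were no runs).
        return self.passes / self.k if self.k else 0.0


def skill_delta(with_skill: RunSummary, baseline: RunSummary) -> float:
    """Pass-rate difference in percentage points; negative means the skill hurt."""
    return round((with_skill.pass_rate - baseline.pass_rate) * 100, 1)


# Made-up outcomes for illustration: 5/5 with the skill, 3/5 without it.
with_skill = RunSummary([True, True, True, True, True])
baseline = RunSummary([True, False, True, True, False])

print(f"With skill  {with_skill.pass_rate:.0%}")
print(f"No skill    {baseline.pass_rate:.0%}")
print(f"Delta       {skill_delta(with_skill, baseline):+.0f}%")
```

<p>Keeping the raw outcomes rather than only the percentage makes it easy to report both the k(5) column and the rate from the same data.</p>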
Most models know how to get the JSON right most of the time (JSON extraction was already solved two years ago). But that's the point: "most of the time" is the bug. That delta shows how much the skill actually helped. (It's sometimes 0%, sometimes -100%!)<p>I also created two skills so you can get started right away in your favorite harness, e.g. Claude Code, Codex or Pi:<p>- evaluate-skill: run and manage evals without leaving your workflow<p>- grill-skill: reads your SKILL.md, interviews you about what "good" looks like, writes a 3-task spec (happy path, edge case, adversarial), and runs it<p>You can install the skills with the command: npx skills@latest add edonadei/caliper<p>For now I support claude-code, codex, pi, claude-api, openai-api. You can run the agent and the judge as separate backends, so you can run a skill on one and judge with another.<p>GitHub: <a href="https://github.com/edonadei/caliper" rel="nofollow">https://github.com/edonadei/caliper</a>
PyPI: <a href="https://pypi.org/project/caliper-eval/" rel="nofollow">https://pypi.org/project/caliper-eval/</a><p>Of course, it's a first step. I think the autorater layer can be vastly improved. Other ideas: more handholding for creating and iterating on evaluation specs, support for more harnesses, and why not fold this layer into a bigger self-improvement system?<p>If you're also building agentic evaluations, I'm genuinely interested to hear how you are handling that.</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48709606">https://news.ycombinator.com/item?id=48709606</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Sun, 28 Jun 2026 17:42:41 +0000</pubDate><link>https://github.com/edonadei/caliper</link><dc:creator>edonadei</dc:creator><comments>https://news.ycombinator.com/item?id=48709606</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48709606</guid></item><item><title><![CDATA[I used Claude Code to get a second opinion on my MRI]]></title><description><![CDATA[
<p>Article URL: <a href="https://antoine.fi/mri-analysis-using-claude-code-opus">https://antoine.fi/mri-analysis-using-claude-code-opus</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48708941">https://news.ycombinator.com/item?id=48708941</a></p>
<p>Points: 311</p>
<p># Comments: 420</p>
]]></description><pubDate>Sun, 28 Jun 2026 16:35:19 +0000</pubDate><link>https://antoine.fi/mri-analysis-using-claude-code-opus</link><dc:creator>engmarketer</dc:creator><comments>https://news.ycombinator.com/item?id=48708941</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48708941</guid></item><item><title><![CDATA[Show HN: Argus – Capture, replay and QA every Claude Code session your team runs]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.arguslab.co/">https://www.arguslab.co/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48708829">https://news.ycombinator.com/item?id=48708829</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Sun, 28 Jun 2026 16:26:46 +0000</pubDate><link>https://www.arguslab.co/</link><dc:creator>zamtam</dc:creator><comments>https://news.ycombinator.com/item?id=48708829</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48708829</guid></item><item><title><![CDATA[Claude Code turned every engineer into three. Now companies need more PMs]]></title><description><![CDATA[
<p>Article URL: <a href="https://venturebeat.com/infrastructure/claude-code-turned-every-engineer-into-three-now-companies-need-more-product-thinkers">https://venturebeat.com/infrastructure/claude-code-turned-every-engineer-into-three-now-companies-need-more-product-thinkers</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48707597">https://news.ycombinator.com/item?id=48707597</a></p>
<p>Points: 4</p>
<p># Comments: 0</p>
]]></description><pubDate>Sun, 28 Jun 2026 14:28:01 +0000</pubDate><link>https://venturebeat.com/infrastructure/claude-code-turned-every-engineer-into-three-now-companies-need-more-product-thinkers</link><dc:creator>champagnepapi</dc:creator><comments>https://news.ycombinator.com/item?id=48707597</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48707597</guid></item><item><title><![CDATA[Compete – A Claude Code plugin for interactive competitor intelligence]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/lbj96347/compete">https://github.com/lbj96347/compete</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48707114">https://news.ycombinator.com/item?id=48707114</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Sun, 28 Jun 2026 13:31:24 +0000</pubDate><link>https://github.com/lbj96347/compete</link><dc:creator>lbj96347</dc:creator><comments>https://news.ycombinator.com/item?id=48707114</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48707114</guid></item><item><title><![CDATA[Academic-writing kit for Claude Code]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/josefslerka/study-kit">https://github.com/josefslerka/study-kit</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48706763">https://news.ycombinator.com/item?id=48706763</a></p>
<p>Points: 5</p>
<p># Comments: 0</p>
]]></description><pubDate>Sun, 28 Jun 2026 12:36:52 +0000</pubDate><link>https://github.com/josefslerka/study-kit</link><dc:creator>josefslerka</dc:creator><comments>https://news.ycombinator.com/item?id=48706763</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48706763</guid></item><item><title><![CDATA[Why is Claude Code ending Conversation]]></title><description><![CDATA[
<p>Article URL: <a href="https://medium.com/@thesuperrepemail/calude-code-got-sentimental-6fe965a3acc5">https://medium.com/@thesuperrepemail/calude-code-got-sentimental-6fe965a3acc5</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48706068">https://news.ycombinator.com/item?id=48706068</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Sun, 28 Jun 2026 10:19:52 +0000</pubDate><link>https://medium.com/@thesuperrepemail/calude-code-got-sentimental-6fe965a3acc5</link><dc:creator>rajkverma123</dc:creator><comments>https://news.ycombinator.com/item?id=48706068</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48706068</guid></item><item><title><![CDATA[Show HN: Claudete – Command a legion of Claude Code instances and shells]]></title><description><![CDATA[
<p>This is a weekend project I created out of frustration with classic terminal applications: if I restart my Mac, my Claude Code and shell instances are stopped, and I have to start them again manually.<p>Plus, they are not really optimized for running multiple Claude Code instances, so I created this project to address these issues.</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48705880">https://news.ycombinator.com/item?id=48705880</a></p>
<p>Points: 3</p>
<p># Comments: 4</p>
]]></description><pubDate>Sun, 28 Jun 2026 09:46:53 +0000</pubDate><link>https://claudete.co</link><dc:creator>morion4000</dc:creator><comments>https://news.ycombinator.com/item?id=48705880</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48705880</guid></item><item><title><![CDATA[Show HN: Claude-CLI – Run Claude Code in a throwaway Docker container]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/shirozuki/claude-cli">https://github.com/shirozuki/claude-cli</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48702113">https://news.ycombinator.com/item?id=48702113</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Sat, 27 Jun 2026 21:51:56 +0000</pubDate><link>https://github.com/shirozuki/claude-cli</link><dc:creator>shirozuki</dc:creator><comments>https://news.ycombinator.com/item?id=48702113</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48702113</guid></item><item><title><![CDATA[How Claude Code and Codex Sandbox Untrusted Code]]></title><description><![CDATA[
<p>Article URL: <a href="https://medium.com/@Koukyosyumei/how-claude-code-and-codex-sandbox-untrusted-code-ba39b493046a">https://medium.com/@Koukyosyumei/how-claude-code-and-codex-sandbox-untrusted-code-ba39b493046a</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48701924">https://news.ycombinator.com/item?id=48701924</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Sat, 27 Jun 2026 21:25:57 +0000</pubDate><link>https://medium.com/@Koukyosyumei/how-claude-code-and-codex-sandbox-untrusted-code-ba39b493046a</link><dc:creator>syumei</dc:creator><comments>https://news.ycombinator.com/item?id=48701924</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48701924</guid></item><item><title><![CDATA[Can a robot be a modern artist? I gave Claude Code creative freedom]]></title><description><![CDATA[
<p>Article URL: <a href="https://museum.nal.digital">https://museum.nal.digital</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48700895">https://news.ycombinator.com/item?id=48700895</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Sat, 27 Jun 2026 19:18:11 +0000</pubDate><link>https://museum.nal.digital</link><dc:creator>ArtemParsegov</dc:creator><comments>https://news.ycombinator.com/item?id=48700895</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48700895</guid></item><item><title><![CDATA[How Boris Cherny Uses Claude Code]]></title><description><![CDATA[
<p>Article URL: <a href="https://howborisusesclaudecode.com">https://howborisusesclaudecode.com</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48699921">https://news.ycombinator.com/item?id=48699921</a></p>
<p>Points: 5</p>
<p># Comments: 0</p>
]]></description><pubDate>Sat, 27 Jun 2026 17:10:17 +0000</pubDate><link>https://howborisusesclaudecode.com</link><dc:creator>eustoria</dc:creator><comments>https://news.ycombinator.com/item?id=48699921</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48699921</guid></item><item><title><![CDATA[Show HN: Orchestrate Claude Code agents on a canvas]]></title><description><![CDATA[
<p>Article URL: <a href="https://rondoflow.app/">https://rondoflow.app/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48686649">https://news.ycombinator.com/item?id=48686649</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Fri, 26 Jun 2026 13:51:17 +0000</pubDate><link>https://rondoflow.app/</link><dc:creator>beledev</dc:creator><comments>https://news.ycombinator.com/item?id=48686649</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48686649</guid></item><item><title><![CDATA[Show HN: Novakit.tech – pre-built Claude Code skills, no prompting required]]></title><description><![CDATA[
<p>Article URL: <a href="https://novakit.tech/">https://novakit.tech/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48686082">https://news.ycombinator.com/item?id=48686082</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Fri, 26 Jun 2026 12:56:17 +0000</pubDate><link>https://novakit.tech/</link><dc:creator>snehalpateldev</dc:creator><comments>https://news.ycombinator.com/item?id=48686082</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48686082</guid></item><item><title><![CDATA[Show HN: Verity – self-healing review gate for Claude Code]]></title><description><![CDATA[
<p>Hi folks,<p>We built what we think the next generation of code review looks like. It's an adversarial, self-healing gate for coding agents that repairs the code on every run, so only clean and secure code moves forward.<p>Here's a short demo: <a href="https://www.youtube.com/watch?v=vsq_cHoZCnk" rel="nofollow">https://www.youtube.com/watch?v=vsq_cHoZCnk</a><p>Each agent run also starts smarter than the last thanks to built-in memory, so agent costs start to stabilize.<p>It's completely free while in beta. We're looking for testers doing serious agentic engineering to help shape the roadmap.<p>Install takes about two minutes, and you'll see your first gated review on your next agent run. Give it a spin:<p>npm install -g @codacy/verity-cli && verity init</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48685853">https://news.ycombinator.com/item?id=48685853</a></p>
<p>Points: 4</p>
<p># Comments: 0</p>
]]></description><pubDate>Fri, 26 Jun 2026 12:27:05 +0000</pubDate><link>https://verity.md</link><dc:creator>claudiacsf</dc:creator><comments>https://news.ycombinator.com/item?id=48685853</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48685853</guid></item><item><title><![CDATA[I made a Claude Code session manager for tmux]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.devas.life/i-made-a-claude-code-session-manager-for-tmux/">https://www.devas.life/i-made-a-claude-code-session-manager-for-tmux/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48683595">https://news.ycombinator.com/item?id=48683595</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Fri, 26 Jun 2026 07:44:59 +0000</pubDate><link>https://www.devas.life/i-made-a-claude-code-session-manager-for-tmux/</link><dc:creator>philips</dc:creator><comments>https://news.ycombinator.com/item?id=48683595</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48683595</guid></item><item><title><![CDATA[Terminal Agents in 2026: Goose, Claude Code, OpenCode, and Pi Compared]]></title><description><![CDATA[
<p>Article URL: <a href="https://outofcontext.dev/blog/goose-claude-code-opencode-pi/">https://outofcontext.dev/blog/goose-claude-code-opencode-pi/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48683357">https://news.ycombinator.com/item?id=48683357</a></p>
<p>Points: 4</p>
<p># Comments: 0</p>
]]></description><pubDate>Fri, 26 Jun 2026 07:12:24 +0000</pubDate><link>https://outofcontext.dev/blog/goose-claude-code-opencode-pi/</link><dc:creator>leianixcheese</dc:creator><comments>https://news.ycombinator.com/item?id=48683357</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48683357</guid></item></channel></rss>