<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: esafranchik</title><link>https://news.ycombinator.com/user?id=esafranchik</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Mon, 18 May 2026 08:42:35 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=esafranchik" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by esafranchik in "Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep"]]></title><description><![CDATA[
<p>Wouldn't NDCG/token results vary wildly depending on the agent's query and the number of returned items?<p>e.g. agents often run `grep -m 5 "QUERY"` with different queries, instead of one big grep for all items.</p>
]]></description><pubDate>Sun, 17 May 2026 18:01:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=48171416</link><dc:creator>esafranchik</dc:creator><comments>https://news.ycombinator.com/item?id=48171416</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48171416</guid></item><item><title><![CDATA[New comment by esafranchik in "Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep"]]></title><description><![CDATA[
<p>Two follow-ups:<p>1) How do you compare accuracy? by checking if the answer is in any of the returned grep/bm25/semble snippets?<p>2) How do you measure token use without the agent, prompt, and tools?</p>
]]></description><pubDate>Sun, 17 May 2026 17:49:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=48171250</link><dc:creator>esafranchik</dc:creator><comments>https://news.ycombinator.com/item?id=48171250</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48171250</guid></item><item><title><![CDATA[New comment by esafranchik in "Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep"]]></title><description><![CDATA[
<p>Is the benchmark measuring one-shot retrieval accuracy, or Coding agent response accuracy?</p>
]]></description><pubDate>Sun, 17 May 2026 17:25:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=48170972</link><dc:creator>esafranchik</dc:creator><comments>https://news.ycombinator.com/item?id=48170972</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48170972</guid></item><item><title><![CDATA[Show HN: Cush – curl your shell, an HTTP tunnel for AI agents]]></title><description><![CDATA[
<p>I built cush because coding agents can be helpful to diagnose and troubleshoot server issues.<p>The problem is that getting said agents onto a remote server, especially one you don't control, means dealing with VPNs, bastion hosts, firewall rules, access controls, or audit trails. That's assuming SSH isn't even blocked.<p>cush takes a different approach. Instead of a shell, it opens a temporary, outbound HTTPS tunnel that lets you and your AI agent run constrained CLI commands on the server:<p><pre><code>  $ cush open --allow grep,cat,tail --expiry 2h

    tunnel:   https://abc123.ngrok.io
    token:    a3f9c2d1...
    allowed:  grep, cat, tail
    expires:  in 2h
</code></pre>
Now any agent or HTTP client can execute allowed commands:<p><pre><code>  $ curl -X POST https://abc123.ngrok.io \
    -H "Authorization: Bearer a3f9c2d1..." \
    -H "Content-Type: application/json" \
    -d '{"command": ["grep", "-r", "ERROR", "/var/log/app.log"]}'

  >>> {"stdout":"ERROR database connection refused\n","stderr":"","exit_code":0}
</code></pre>
Point any agent at the tunnel's URL:<p><pre><code>  $ claude "use https://abc123.ngrok.io with token a3f9c2d1... to find what's causing the 500 errors"
</code></pre>
Tunnels are authenticated, constrained, and short-lived. No server-side infrastructure changes required. Just a 7MB Rust binary + ngrok.<p>Looking for feedback, and 2-3 design partners to build audit trails.</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47781028">https://news.ycombinator.com/item?id=47781028</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 15 Apr 2026 16:01:43 +0000</pubDate><link>https://github.com/statespace-tech/cush</link><dc:creator>esafranchik</dc:creator><comments>https://news.ycombinator.com/item?id=47781028</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47781028</guid></item><item><title><![CDATA[New comment by esafranchik in "[dead]"]]></title><description><![CDATA[
<p>Does this work with <i>any</i> tool calls that make an HTTP request? e.g. calling `curl` directly vs writing a script to make the request, then calling it</p>
]]></description><pubDate>Tue, 14 Apr 2026 16:21:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=47767654</link><dc:creator>esafranchik</dc:creator><comments>https://news.ycombinator.com/item?id=47767654</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47767654</guid></item><item><title><![CDATA[New comment by esafranchik in "Show HN: Continual Learning with .md"]]></title><description><![CDATA[
<p>Have you noticed an relationship between recall and the number of files/memories?</p>
]]></description><pubDate>Tue, 14 Apr 2026 16:15:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=47767595</link><dc:creator>esafranchik</dc:creator><comments>https://news.ycombinator.com/item?id=47767595</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47767595</guid></item><item><title><![CDATA[New comment by esafranchik in "Show HN: I built a tool that turns any API into a CLI for agents"]]></title><description><![CDATA[
<p>The new API2MCP</p>
]]></description><pubDate>Sun, 01 Mar 2026 17:30:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=47208735</link><dc:creator>esafranchik</dc:creator><comments>https://news.ycombinator.com/item?id=47208735</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47208735</guid></item><item><title><![CDATA[New comment by esafranchik in "Show HN: I open-sourced the library I use to track ML experiments with GitHub"]]></title><description><![CDATA[
<p>Hello HN! I built Cubyc to manage my ML research in grad school.<p>It lets you store all your experiment metadata with cloud-based repo providers like GitHub, GitLab, and Bitbucket. Plus, you can directly use SQL to dive into your runs.<p>I kept it simple as a headless Python library so it's easy to install and integrate, and doesn't weigh down projects.<p>Feedback is appreciated!</p>
]]></description><pubDate>Wed, 14 Aug 2024 12:56:23 +0000</pubDate><link>https://news.ycombinator.com/item?id=41245545</link><dc:creator>esafranchik</dc:creator><comments>https://news.ycombinator.com/item?id=41245545</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41245545</guid></item><item><title><![CDATA[Show HN: I open-sourced the library I use to track ML experiments with GitHub]]></title><description><![CDATA[
<p>Article URL: <a href="https://docs.cubyc.com/">https://docs.cubyc.com/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=41245544">https://news.ycombinator.com/item?id=41245544</a></p>
<p>Points: 5</p>
<p># Comments: 2</p>
]]></description><pubDate>Wed, 14 Aug 2024 12:56:22 +0000</pubDate><link>https://docs.cubyc.com/</link><dc:creator>esafranchik</dc:creator><comments>https://news.ycombinator.com/item?id=41245544</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41245544</guid></item></channel></rss>