<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: erans</title><link>https://news.ycombinator.com/user?id=erans</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Tue, 28 Apr 2026 08:20:04 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=erans" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[Show HN: I ran every Claude agent turn through the Batch API]]></title><description><![CDATA[
<p>I built a tiny Python REPL to answer a dumb-but-useful question:<p>What happens if every turn in an agent loop goes through Anthropic’s Batch API instead of the normal synchronous endpoint?<p>The motivation was cost. Batch API is 50% off, which sounds very attractive for agent workloads: evals, background research agents, CI agents, unattended subagents, etc.<p>The result: it works, but it is awful for a single interactive agent.<p>In my runs, a one-entry batch usually took ~90–120 seconds to complete. That means a five-turn tool loop becomes a ten-minute interaction. Waiting two minutes for the model to decide it needs to run ls is not a good UX.<p>But that was also the point of the experiment. A single REPL turn is probably the wrong unit to batch.<p>The interesting version is fleet-level batching:<p>- many agents running in parallel
- background subagents
- CI/eval jobs
- multiple harnesses sharing a local proxy
- shared prompt prefixes that may benefit from caching<p>In that world, the batcher should probably sit below the harness as infrastructure. Existing tools keep using the normal API shape, while a proxy decides per request whether it should go sync or async based on latency tolerance.<p>One surprising observation: in my small, non-rigorous testing, Haiku batches often felt slower than Sonnet/Opus batches. I wouldn’t treat that as a benchmark, but it does suggest routers should measure this rather than assuming “cheap model = batch model.”<p>Repo is here:<p><a href="https://github.com/erans/batching-harness" rel="nofollow">https://github.com/erans/batching-harness</a><p>It is intentionally small: one Python file, a basic tool loop, local shell tool, stats panel, and minimal sandboxing.<p>The useful lesson for me was:<p>Batch API is terrible as an interaction pattern for one agent. It might be very useful as a hidden optimization layer for a fleet of agents.</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47925427">https://news.ycombinator.com/item?id=47925427</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 27 Apr 2026 18:34:45 +0000</pubDate><link>https://eran.sandler.co.il/post/2026-04-27-batch-api-is-terrible-for-one-agent/</link><dc:creator>erans</dc:creator><comments>https://news.ycombinator.com/item?id=47925427</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47925427</guid></item><item><title><![CDATA[New comment by erans in "An AI agent deleted our production database. The agent's confession is below"]]></title><description><![CDATA[
<p>Execution layer security must be deterministic. That's why we are working on AgentSH (<a href="https://www.agentsh.org" rel="nofollow">https://www.agentsh.org</a>) which is model, framework and harness agnostic.</p>
]]></description><pubDate>Sun, 26 Apr 2026 20:26:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=47913957</link><dc:creator>erans</dc:creator><comments>https://news.ycombinator.com/item?id=47913957</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47913957</guid></item><item><title><![CDATA[New comment by erans in "Bitwarden CLI compromised in ongoing Checkmarx supply chain campaign"]]></title><description><![CDATA[
<p>The part that seems most important here is that npm install was enough.<p>Once the compromise point is preinstall, the usual "inspect after install" mindset breaks down. By then the payload has already had a chance to run.<p>That gets more interesting with agents / CI / ephemeral sandboxes, because short exposure windows are still enough when installs happen automatically and repeatedly.<p>Another thing I think is worth paying attention to: this payload did not just target secrets, it also targeted AI tooling config, and there is a real possibility that shell-profile tampering becomes a way to poison what the next coding assistant reads into context.<p>I work on AgentSH (<a href="https://www.agentsh.org" rel="nofollow">https://www.agentsh.org</a>), and we wrote up a longer take on that angle here:<p><a href="https://www.canyonroad.ai/blog/the-install-was-the-attack/" rel="nofollow">https://www.canyonroad.ai/blog/the-install-was-the-attack/</a></p>
]]></description><pubDate>Thu, 23 Apr 2026 21:33:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=47882358</link><dc:creator>erans</dc:creator><comments>https://news.ycombinator.com/item?id=47882358</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47882358</guid></item><item><title><![CDATA[New comment by erans in "Cloudflare's AI Platform: an inference layer designed for agents"]]></title><description><![CDATA[
<p>It's great to see more such platform popping up. It's good for the ecosystem. We need more hosting options that are clear, secure and have the ability to help people run as many models as possible.</p>
]]></description><pubDate>Fri, 17 Apr 2026 20:29:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=47810226</link><dc:creator>erans</dc:creator><comments>https://news.ycombinator.com/item?id=47810226</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47810226</guid></item><item><title><![CDATA[New comment by erans in "Leash: Spreadsheet Based PagerDuty Alternative"]]></title><description><![CDATA[
<p>that's great. PagerDuty always felt so expensive and heavy!</p>
]]></description><pubDate>Thu, 23 Oct 2025 20:21:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=45686561</link><dc:creator>erans</dc:creator><comments>https://news.ycombinator.com/item?id=45686561</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45686561</guid></item><item><title><![CDATA[New comment by erans in "Show HN: Gocat – URL shortener using Google Sheets as a database"]]></title><description><![CDATA[
<p>That's awesome. It's always annoying using those 3rd party ones!</p>
]]></description><pubDate>Wed, 22 Oct 2025 20:37:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=45674832</link><dc:creator>erans</dc:creator><comments>https://news.ycombinator.com/item?id=45674832</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45674832</guid></item><item><title><![CDATA[New comment by erans in "Show HN: LunaRoute – a high-performance local proxy for AI coding assistants"]]></title><description><![CDATA[
<p>Summary data is in a sqlite database so its easily queryable by you or your friendly AI agent.<p>You can have a single proxy and have multiple team members use the same one so that you can easily track sessions, token usage, spend etc.</p>
]]></description><pubDate>Tue, 21 Oct 2025 23:51:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=45663261</link><dc:creator>erans</dc:creator><comments>https://news.ycombinator.com/item?id=45663261</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45663261</guid></item><item><title><![CDATA[Show HN: LunaRoute – a high-performance local proxy for AI coding assistants]]></title><description><![CDATA[
<p>LunaRoute is a high-performance local proxy for AI coding assistants like Claude Code, OpenAI Codex CLI, and OpenCode. Get complete visibility into every LLM interaction with zero-overhead passthrough, comprehensive session recording, and powerful debugging capabilities.<p>- See Everything Your AI Does - get full logs (JSONL), summary of sessions including tokens used (input/output) as well as tools usage and success rates.<p>- Privacy & Compliance Built-In - redact or tokenize any sensitive information (regex based).<p>- Speaks OpenAI and Anthropic dialects so you can route (and translate) when needed between models and providers.<p>- High performance - passthrough is 0.1ms - 0.2ms latency. Logging and summary is off loaded to a secondary thread to allow maximum performance.<p>Feedback is always appreciated as well as stars on the repo :)</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=45663045">https://news.ycombinator.com/item?id=45663045</a></p>
<p>Points: 8</p>
<p># Comments: 1</p>
]]></description><pubDate>Tue, 21 Oct 2025 23:23:26 +0000</pubDate><link>https://github.com/erans/lunaroute</link><dc:creator>erans</dc:creator><comments>https://news.ycombinator.com/item?id=45663045</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45663045</guid></item><item><title><![CDATA[New comment by erans in ""Be Different" doesn't work for building products anymore"]]></title><description><![CDATA[
<p>true that there is a some kind of a ceiling of what can or can't be done. But that ceiling is way up there. Also, there are enough examples and articles and code that allows enough combination to be made so that its <i>good enough</i> - and that is a very important bar.<p>There are A LOT of businesses (even big ones managing money and what not) that rely on spreadsheets to do so much. Could this have been an app/service/SaaS/whatever ? probably.<p>What if these orgs can (mostly) internally solidify some of these processes? what if they don't need an insanely expensive salesforce implementor that can add "custom logic" ?<p>A lot of times companies will replace "complex software" with half complex process!<p>What if they don't need Salesforce at all because they need a reasonable simple CRM and don't want to (or shouldn't) pay $10k/seat/year ?<p>There are still going to be very differentiating apps and services here and there, but as time move on these "technological" advantages will erode and with AI they erode way faster.</p>
]]></description><pubDate>Mon, 06 Oct 2025 17:08:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=45493541</link><dc:creator>erans</dc:creator><comments>https://news.ycombinator.com/item?id=45493541</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45493541</guid></item><item><title><![CDATA[New comment by erans in "Show HN: Selfhostllm.org – Plan GPU capacity for self-hosting LLMs"]]></title><description><![CDATA[
<p>I also added a Mac version: <a href="https://selfhostllm.org/mac/" rel="nofollow">https://selfhostllm.org/mac/</a> so you can know which models you can run on your Mac and get an estimated tokens/sec.</p>
]]></description><pubDate>Fri, 08 Aug 2025 18:03:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=44839827</link><dc:creator>erans</dc:creator><comments>https://news.ycombinator.com/item?id=44839827</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44839827</guid></item><item><title><![CDATA[Show HN: Selfhostllm.org – Plan GPU capacity for self-hosting LLMs]]></title><description><![CDATA[
<p>A simple calculator that estimates how many concurrent requests your GPU can handle for a given LLM, with shareable results.</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=44839717">https://news.ycombinator.com/item?id=44839717</a></p>
<p>Points: 7</p>
<p># Comments: 3</p>
]]></description><pubDate>Fri, 08 Aug 2025 17:49:49 +0000</pubDate><link>https://selfhostllm.org/</link><dc:creator>erans</dc:creator><comments>https://news.ycombinator.com/item?id=44839717</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44839717</guid></item><item><title><![CDATA[New comment by erans in "MCPs, Gatekeepers, and the Future of AI"]]></title><description><![CDATA[
<p>I would argue that due to the way MCP servers/tools are added to calls, there will be a pre-step that will figure out which MCPs are even relevant for a request prior to executing it.</p>
]]></description><pubDate>Tue, 22 Apr 2025 18:20:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=43764868</link><dc:creator>erans</dc:creator><comments>https://news.ycombinator.com/item?id=43764868</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43764868</guid></item><item><title><![CDATA[New comment by erans in "Show HN: EFF Dice based digital random passphrase generator"]]></title><description><![CDATA[
<p>Thanks :-) Fixed it.
Don't drink and copy&paste ;-)</p>
]]></description><pubDate>Tue, 13 Sep 2016 20:08:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=12491767</link><dc:creator>erans</dc:creator><comments>https://news.ycombinator.com/item?id=12491767</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=12491767</guid></item><item><title><![CDATA[Show HN: EFF Dice based digital random passphrase generator]]></title><description><![CDATA[
<p>Article URL: <a href="https://dicepass.org">https://dicepass.org</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=12491179">https://news.ycombinator.com/item?id=12491179</a></p>
<p>Points: 3</p>
<p># Comments: 2</p>
]]></description><pubDate>Tue, 13 Sep 2016 18:55:12 +0000</pubDate><link>https://dicepass.org</link><dc:creator>erans</dc:creator><comments>https://news.ycombinator.com/item?id=12491179</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=12491179</guid></item><item><title><![CDATA[New comment by erans in "Best Linux laptop for web development?"]]></title><description><![CDATA[
<p>Most Thinkpads will work just fine out of the box. The hardware is great (I specifically like the keyboard and the little red point mouse).<p>The T series (and W series) are the work horses here with various screen sizes, disk options and RAM.<p>I personally use the X1 Carbon 3rd gen (X series) as it is comparable in specification to a Macbook Pro 13" (I have the one with the 16GB of RAM). It's a little costly but its worth while and will probably serve me for the next 2+ years.</p>
]]></description><pubDate>Sat, 02 Apr 2016 18:12:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=11412546</link><dc:creator>erans</dc:creator><comments>https://news.ycombinator.com/item?id=11412546</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=11412546</guid></item><item><title><![CDATA[Show HN: Go based Snowflake unique Id service. Now with Thrfit support.]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/erans/gosnowflake">https://github.com/erans/gosnowflake</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=7507027">https://news.ycombinator.com/item?id=7507027</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 01 Apr 2014 09:25:37 +0000</pubDate><link>https://github.com/erans/gosnowflake</link><dc:creator>erans</dc:creator><comments>https://news.ycombinator.com/item?id=7507027</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=7507027</guid></item><item><title><![CDATA[Show HN: gosnowflake - Go implementation of Twitter's Snowflake]]></title><description><![CDATA[
<p>A go implementation of Twitter's Snowflake ID generation service.</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=6958916">https://news.ycombinator.com/item?id=6958916</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 24 Dec 2013 10:12:24 +0000</pubDate><link>https://github.com/erans/gosnowflake</link><dc:creator>erans</dc:creator><comments>https://news.ycombinator.com/item?id=6958916</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=6958916</guid></item><item><title><![CDATA[Without your scouts you're sittings ducks]]></title><description><![CDATA[
<p>Article URL: <a href="http://avich.com/blog/2013/07/21/without-your-scouts-youre-sitting-ducks/">http://avich.com/blog/2013/07/21/without-your-scouts-youre-sitting-ducks/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=6082766">https://news.ycombinator.com/item?id=6082766</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 22 Jul 2013 09:48:52 +0000</pubDate><link>http://avich.com/blog/2013/07/21/without-your-scouts-youre-sitting-ducks/</link><dc:creator>erans</dc:creator><comments>https://news.ycombinator.com/item?id=6082766</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=6082766</guid></item><item><title><![CDATA[New comment by erans in "Roller Coaster Shmoller Coaster"]]></title><description><![CDATA[
<p>Very interesting point. Stop talking and start doing :-)</p>
]]></description><pubDate>Thu, 04 Jul 2013 10:59:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=5990060</link><dc:creator>erans</dc:creator><comments>https://news.ycombinator.com/item?id=5990060</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=5990060</guid></item><item><title><![CDATA[New comment by erans in "Show HN: SharingMeter.org - Find out how popular a link is"]]></title><description><![CDATA[
<p>I know this is too much, however we plan on adding the ability to inject sharing data directly to websites, that's why we asked for it in advance.<p>We would never do anything to miss use it and we felt that an optional permissions scenario was rather too complex.<p>How would you fill with an optional permissions scenario?</p>
]]></description><pubDate>Thu, 20 Jun 2013 20:01:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=5914531</link><dc:creator>erans</dc:creator><comments>https://news.ycombinator.com/item?id=5914531</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=5914531</guid></item></channel></rss>