<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: johalmed</title><link>https://news.ycombinator.com/user?id=johalmed</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Mon, 15 Jun 2026 10:40:19 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=johalmed" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[LLM Budget Guard – open-source runtime cutoff for OpenAI/Anthropic]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.llmeter.org/validate/budget-guard">https://www.llmeter.org/validate/budget-guard</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47936677">https://news.ycombinator.com/item?id=47936677</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 28 Apr 2026 16:28:32 +0000</pubDate><link>https://www.llmeter.org/validate/budget-guard</link><dc:creator>johalmed</dc:creator><comments>https://news.ycombinator.com/item?id=47936677</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47936677</guid></item><item><title><![CDATA[Sow HN: LLMeter – Track per-customer LLM costs across OpenAI, Anthropic,and more]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.llmeter.org/">https://www.llmeter.org/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47653709">https://news.ycombinator.com/item?id=47653709</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Sun, 05 Apr 2026 20:48:01 +0000</pubDate><link>https://www.llmeter.org/</link><dc:creator>johalmed</dc:creator><comments>https://news.ycombinator.com/item?id=47653709</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47653709</guid></item><item><title><![CDATA[New comment by johalmed in "My API cost was at $13.19 when my persistent Claude named himself Thales"]]></title><description><![CDATA[
<p>I've been experimenting with routing requests between different models depending on the task complexity to keep costs down. It's surprising how much you can save just by defaulting to a smaller model for simple extraction tasks and reserving the heavy hitters only for complex reasoning. The tooling around this is definitely getting better, but tracking the actual spend per model in real-time is still a bit of a headache for side projects.</p>
]]></description><pubDate>Sun, 15 Mar 2026 00:49:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=47383045</link><dc:creator>johalmed</dc:creator><comments>https://news.ycombinator.com/item?id=47383045</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47383045</guid></item></channel></rss>