<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: 44za12</title><link>https://news.ycombinator.com/user?id=44za12</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Tue, 07 Apr 2026 07:19:08 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=44za12" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by 44za12 in "Qwen3.5 Fine-Tuning Guide"]]></title><description><![CDATA[
<p>Specialised models easily beat SOTA, case in point: <a href="https://nehmeailabs.com/flashcheck" rel="nofollow">https://nehmeailabs.com/flashcheck</a></p>
]]></description><pubDate>Wed, 04 Mar 2026 16:47:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=47250200</link><dc:creator>44za12</dc:creator><comments>https://news.ycombinator.com/item?id=47250200</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47250200</guid></item><item><title><![CDATA[New comment by 44za12 in "Number Research Inc"]]></title><description><![CDATA[
<p>All of us use the same keyboards more or less, maybe us randomly typing a large number is not as random as we would like to think. Just like how “asdf”, “xcyb” are common strings because these keys are together, there has to be some pattern here as well.</p>
]]></description><pubDate>Wed, 04 Mar 2026 05:14:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=47243376</link><dc:creator>44za12</dc:creator><comments>https://news.ycombinator.com/item?id=47243376</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47243376</guid></item><item><title><![CDATA[In Defense of Boring Technology]]></title><description><![CDATA[
<p>Article URL: <a href="https://aazar.me/posts/in-defense-of-boring-technology">https://aazar.me/posts/in-defense-of-boring-technology</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47044412">https://news.ycombinator.com/item?id=47044412</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 17 Feb 2026 06:43:27 +0000</pubDate><link>https://aazar.me/posts/in-defense-of-boring-technology</link><dc:creator>44za12</dc:creator><comments>https://news.ycombinator.com/item?id=47044412</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47044412</guid></item><item><title><![CDATA[Show HN: RightSize CLI, Find the cheapest LLM that works for your prompt]]></title><description><![CDATA[
<p>RightSize runs your prompt against candidate models (Kimi, GLM, Qwen, Gemma etc.) in parallel via OpenRouter. Then it uses a stronger model as a Judge to score accuracy against a baseline model.<p>Happy to answer questions.</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46793694">https://news.ycombinator.com/item?id=46793694</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 28 Jan 2026 10:50:41 +0000</pubDate><link>https://github.com/NehmeAILabs/rightsize-cli</link><dc:creator>44za12</dc:creator><comments>https://news.ycombinator.com/item?id=46793694</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46793694</guid></item><item><title><![CDATA[New comment by 44za12 in "Tell HN: I cut Claude API costs from $70/month to pennies"]]></title><description><![CDATA[
<p>Yes, I included a 'Model Selection Cheat Sheet' in the README (scroll down a bit).<p>I map them by task type:<p>Tiny (<3B): Gemma 3 1B (could try 4B as well), Phi-4-mini (Good for classification).
Small (8B-17B): Qwen 3 8B, Llama 4 Scout (Good for RAG/Extraction).
Frontier: GPT-5, Llama 4 Maverick, GLM, Kimi<p>Is that what you meant?</p>
]]></description><pubDate>Mon, 26 Jan 2026 13:24:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=46765335</link><dc:creator>44za12</dc:creator><comments>https://news.ycombinator.com/item?id=46765335</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46765335</guid></item><item><title><![CDATA[Show HN: LLM Sanity Checks – A practical guide to not over-engineering AI]]></title><description><![CDATA[
<p>I keep seeing teams use frontier models for tasks a regex or a 4B model could do cheaper and faster.<p>This repo is a collection of opinionated patterns and heuristics to help you rethink the architecture of your AI workflows and ease the decision-making process while ensuring maximum efficiency.<p>It covers:<p>- A decision tree for architectural sanity checks.
- Tradeoffs between JSON and delimiter-separated output.
- Patterns for cascading models (verifying small models before calling big ones).<p>Open to feedback on other anti-patterns you've seen in production.</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46765011">https://news.ycombinator.com/item?id=46765011</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 26 Jan 2026 12:49:12 +0000</pubDate><link>https://github.com/NehmeAILabs/llm-sanity-checks</link><dc:creator>44za12</dc:creator><comments>https://news.ycombinator.com/item?id=46765011</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46765011</guid></item><item><title><![CDATA[New comment by 44za12 in "Tell HN: I cut Claude API costs from $70/month to pennies"]]></title><description><![CDATA[
<p>This is the way. I actually mapped out the decision tree for this exact process and more here:<p><a href="https://github.com/NehmeAILabs/llm-sanity-checks" rel="nofollow">https://github.com/NehmeAILabs/llm-sanity-checks</a></p>
]]></description><pubDate>Mon, 26 Jan 2026 07:03:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=46762668</link><dc:creator>44za12</dc:creator><comments>https://news.ycombinator.com/item?id=46762668</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46762668</guid></item><item><title><![CDATA[New comment by 44za12 in "Stop using JSON for LLM structured output"]]></title><description><![CDATA[
<p>For simple extraction tasks, a delimiter-separated string uses 11 tokens vs 35 for JSON. Output tokens are the latency bottleneck.</p>
]]></description><pubDate>Sat, 24 Jan 2026 07:46:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=46741820</link><dc:creator>44za12</dc:creator><comments>https://news.ycombinator.com/item?id=46741820</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46741820</guid></item><item><title><![CDATA[Stop using JSON for LLM structured output]]></title><description><![CDATA[
<p>Article URL: <a href="https://nehmeailabs.com/post/structured-output-overhead">https://nehmeailabs.com/post/structured-output-overhead</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46741819">https://news.ycombinator.com/item?id=46741819</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Sat, 24 Jan 2026 07:46:52 +0000</pubDate><link>https://nehmeailabs.com/post/structured-output-overhead</link><dc:creator>44za12</dc:creator><comments>https://news.ycombinator.com/item?id=46741819</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46741819</guid></item><item><title><![CDATA[FlashCheck-270M: Open weights for fact verification (Apache 2.0, WASM Demo)]]></title><description><![CDATA[
<p>Article URL: <a href="https://huggingface.co/nehmeailabs-org/nehme-flashcheck-270m">https://huggingface.co/nehmeailabs-org/nehme-flashcheck-270m</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46278797">https://news.ycombinator.com/item?id=46278797</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 15 Dec 2025 19:02:08 +0000</pubDate><link>https://huggingface.co/nehmeailabs-org/nehme-flashcheck-270m</link><dc:creator>44za12</dc:creator><comments>https://news.ycombinator.com/item?id=46278797</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46278797</guid></item><item><title><![CDATA[New comment by 44za12 in "De-Googling TOTP Authenticator Codes"]]></title><description><![CDATA[
<p>Love the minimalism.</p>
]]></description><pubDate>Mon, 01 Sep 2025 12:35:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=45092249</link><dc:creator>44za12</dc:creator><comments>https://news.ycombinator.com/item?id=45092249</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45092249</guid></item><item><title><![CDATA[New comment by 44za12 in "De-Googling TOTP Authenticator Codes"]]></title><description><![CDATA[
<p>Shameless plug.<p>I’ve been using a cli tool i had created for over 2 years now, it just works. I had more ideas but never got to incorporate those.<p><a href="https://github.com/44za12/horcrux" rel="nofollow">https://github.com/44za12/horcrux</a></p>
]]></description><pubDate>Mon, 01 Sep 2025 10:21:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=45091377</link><dc:creator>44za12</dc:creator><comments>https://news.ycombinator.com/item?id=45091377</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45091377</guid></item><item><title><![CDATA[New comment by 44za12 in "[dead]"]]></title><description><![CDATA[
<p>Have been using remove.bg for this for years now.</p>
]]></description><pubDate>Sat, 30 Aug 2025 07:16:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=45072582</link><dc:creator>44za12</dc:creator><comments>https://news.ycombinator.com/item?id=45072582</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45072582</guid></item><item><title><![CDATA[New comment by 44za12 in "Show HN: Rust macro utility for batching expensive async operations"]]></title><description><![CDATA[
<p>Like a sempahore?</p>
]]></description><pubDate>Sun, 17 Aug 2025 15:03:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=44932116</link><dc:creator>44za12</dc:creator><comments>https://news.ycombinator.com/item?id=44932116</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44932116</guid></item><item><title><![CDATA[New comment by 44za12 in "Gemma 3 270M: Compact model for hyper-efficient AI"]]></title><description><![CDATA[
<p>I’ve had great luck with all gemma 3 variants, on certain tasks it the 27B quantized version has worked as well as 2.5 flash. Can’t wait to get my hands dirty with this one.</p>
]]></description><pubDate>Thu, 14 Aug 2025 16:55:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=44902789</link><dc:creator>44za12</dc:creator><comments>https://news.ycombinator.com/item?id=44902789</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44902789</guid></item><item><title><![CDATA[New comment by 44za12 in "Benchmarking GPT-5 on 400 real-world code reviews"]]></title><description><![CDATA[
<p>Can you benchmark Kimi K2 and GLM 4.5 as well? Would be interesting to see where they land.</p>
]]></description><pubDate>Fri, 08 Aug 2025 06:20:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=44834069</link><dc:creator>44za12</dc:creator><comments>https://news.ycombinator.com/item?id=44834069</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44834069</guid></item><item><title><![CDATA[New comment by 44za12 in "Vibechart"]]></title><description><![CDATA[
<p>That was quick, vibe coded, I presume?</p>
]]></description><pubDate>Thu, 07 Aug 2025 22:07:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=44830972</link><dc:creator>44za12</dc:creator><comments>https://news.ycombinator.com/item?id=44830972</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44830972</guid></item><item><title><![CDATA[New comment by 44za12 in "C10kday"]]></title><description><![CDATA[
<p>The ability to submit a story using a curl would have been fun.</p>
]]></description><pubDate>Tue, 05 Aug 2025 12:54:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=44797421</link><dc:creator>44za12</dc:creator><comments>https://news.ycombinator.com/item?id=44797421</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44797421</guid></item><item><title><![CDATA[New comment by 44za12 in "My bytecode optimizer beats Copilot by 2x"]]></title><description><![CDATA[
<p>Tried that it’s taking exactly as much time as my program.</p>
]]></description><pubDate>Sun, 03 Aug 2025 20:45:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=44779620</link><dc:creator>44za12</dc:creator><comments>https://news.ycombinator.com/item?id=44779620</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44779620</guid></item><item><title><![CDATA[New comment by 44za12 in "My bytecode optimizer beats Copilot by 2x"]]></title><description><![CDATA[
<p>I have breach parser that i had written to parse through over 3 billion rows of compressed data (by parsing i simply mean searching for a particular substring), I’ve tried multiple LLMs to make it faster (currently it does so in <45 seconds on an M3 pro mac) none have been able to do that yet.<p><a href="https://github.com/44za12/breach-parse-rs">https://github.com/44za12/breach-parse-rs</a><p>Feel free to drop ideas if any.</p>
]]></description><pubDate>Sun, 03 Aug 2025 17:26:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=44778145</link><dc:creator>44za12</dc:creator><comments>https://news.ycombinator.com/item?id=44778145</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44778145</guid></item></channel></rss>