<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: rapatel0</title><link>https://news.ycombinator.com/user?id=rapatel0</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Mon, 15 Jun 2026 08:25:59 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=rapatel0" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by rapatel0 in "Running local models on an M4 with 24GB memory"]]></title><description><![CDATA[
<p>I'm super happy with the performance, I generally run with 2 parallel slots so I only get about 128K context window. My experience with all llms is that they get more forgetful if you use the full window. (256-512K is the sweet spot for frontier models, 128k works for me with this current qwen)</p>
]]></description><pubDate>Sun, 17 May 2026 11:57:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=48168116</link><dc:creator>rapatel0</dc:creator><comments>https://news.ycombinator.com/item?id=48168116</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48168116</guid></item><item><title><![CDATA[New comment by rapatel0 in "Running local models on an M4 with 24GB memory"]]></title><description><![CDATA[
<p>I forked it to also add rotorquant. This is a specific optimization that uses clifford rotors instead of static compile time random purmutation to store the activations. Reduces space and parameter count for the storage.</p>
]]></description><pubDate>Sun, 17 May 2026 11:55:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=48168091</link><dc:creator>rapatel0</dc:creator><comments>https://news.ycombinator.com/item?id=48168091</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48168091</guid></item><item><title><![CDATA[New comment by rapatel0 in "Running local models on an M4 with 24GB memory"]]></title><description><![CDATA[
<p>I got qwen3.6:27B running on my 4090 (24GB) with ~128K context leveraging some of the recent turboquant/rotorquant memory optimizations for activations. Highly suggest going up to that. the q4_xl+rotorquant combo is pretty good.<p>Some reference code if you want to throw your agent at it. 
<a href="https://github.com/rapatel0/rq-models" rel="nofollow">https://github.com/rapatel0/rq-models</a></p>
]]></description><pubDate>Mon, 11 May 2026 01:48:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=48090176</link><dc:creator>rapatel0</dc:creator><comments>https://news.ycombinator.com/item?id=48090176</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48090176</guid></item><item><title><![CDATA[New comment by rapatel0 in "Google Chrome silently installs a 4 GB AI model on your device without consent"]]></title><description><![CDATA[
<p>Do you not have compression / deduplication on your nfs backing server ?</p>
]]></description><pubDate>Wed, 06 May 2026 12:21:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=48035382</link><dc:creator>rapatel0</dc:creator><comments>https://news.ycombinator.com/item?id=48035382</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48035382</guid></item><item><title><![CDATA[New comment by rapatel0 in "Musk Settles SEC Suit for Underpaying Twitter Investors by $150M for Just $1.5M"]]></title><description><![CDATA[
<p>Not to defend Elon, but he spent 40B on a company that dropped in value between the day of his bid and the day of closing by almost half (if i remember correctly). He tried to back out of it but then went forward by twitter at the time sued to enforce the transaction.<p>150M is small in comparison to the 20B he overpaid by. Investors were well compensated.</p>
]]></description><pubDate>Tue, 05 May 2026 04:01:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=48017900</link><dc:creator>rapatel0</dc:creator><comments>https://news.ycombinator.com/item?id=48017900</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48017900</guid></item><item><title><![CDATA[New comment by rapatel0 in "Ask HN: Best Embedding Models?"]]></title><description><![CDATA[
<p>I've liked qwen and embeddinggemma for local search. Qwen because 32K is enough to basically fit a whole page into the context window and embeddiggemma because it's crazy efficient.</p>
]]></description><pubDate>Tue, 05 May 2026 03:57:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=48017877</link><dc:creator>rapatel0</dc:creator><comments>https://news.ycombinator.com/item?id=48017877</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48017877</guid></item><item><title><![CDATA[New comment by rapatel0 in "Let's Buy Spirit Air"]]></title><description><![CDATA[
<p>When I lived in europe, RyanAir made most of it's money in the terminal. This is why every RyanAir terminal has a maze like exit from security (ikea-esque) before you get to the actual terminal.<p>The RyanAir CEO was even quoted that he expected some tickets to be come "zero-fare" 
Link: <a href="https://www.theguardian.com/business/2016/nov/22/ryanair-flights-free-michael-oleary-airports" rel="nofollow">https://www.theguardian.com/business/2016/nov/22/ryanair-fli...</a><p>The point stands, airlines don't make money on flights. Flights are loss leaders.</p>
]]></description><pubDate>Mon, 04 May 2026 16:31:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=48010964</link><dc:creator>rapatel0</dc:creator><comments>https://news.ycombinator.com/item?id=48010964</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48010964</guid></item><item><title><![CDATA[New comment by rapatel0 in "Let's Buy Spirit Air"]]></title><description><![CDATA[
<p>I'm definitely not this type of person. see other comment</p>
]]></description><pubDate>Mon, 04 May 2026 15:31:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=48009963</link><dc:creator>rapatel0</dc:creator><comments>https://news.ycombinator.com/item?id=48009963</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48009963</guid></item><item><title><![CDATA[New comment by rapatel0 in "Let's Buy Spirit Air"]]></title><description><![CDATA[
<p>Fair point and I don't disagree. The more meta point I would make is that airlines are still fairly capex heavy (even if there is no point-to-point infrastructure). Each incremental new route operating during standard hours still requires 90m+ on a new airplane.<p>So if they tend to compete themselves into oblivion, or need to turn into banks to subsidize their product, then it might make sense that they should be regulated monopolies.<p>Still you're probably right, if they can turn into banks and stay profitable, then maybe that's a better market outcome overall.</p>
]]></description><pubDate>Mon, 04 May 2026 15:28:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=48009919</link><dc:creator>rapatel0</dc:creator><comments>https://news.ycombinator.com/item?id=48009919</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48009919</guid></item><item><title><![CDATA[New comment by rapatel0 in "Let's Buy Spirit Air"]]></title><description><![CDATA[
<p>Power companies are the classic example. If power companies were forced to compete, their costs + competition tend to drive them out of business. As a result most power companies are forced to operate in really tight constraints  with very limited but predictable margin.<p>I'm not saying that this a better outcome (power companies have their problems too). I was just commenting that this issue parallels the historical solution that was applied to utility companies.</p>
]]></description><pubDate>Mon, 04 May 2026 15:00:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=48009583</link><dc:creator>rapatel0</dc:creator><comments>https://news.ycombinator.com/item?id=48009583</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48009583</guid></item><item><title><![CDATA[New comment by rapatel0 in "Let's Buy Spirit Air"]]></title><description><![CDATA[
<p>Fundamental problem: Flights don't make money. Airlines actually make all of their money through loyalty programs and credit card payments. They basically should have turned into regulated utilities long ago, but loyalty program revenue saved them.<p>Unless this initiative will turn into a credit card company (which nobody likes or wants to do) it won't go anywhere<p>Private equity will likely sell the company for parts. There is no operational improvements for cash flow that they can do.<p>Useful watch (skip to 2:20): <a href="https://youtu.be/ggUduBmvQ_4?si=cyysP7aH_CIEDZRq" rel="nofollow">https://youtu.be/ggUduBmvQ_4?si=cyysP7aH_CIEDZRq</a></p>
]]></description><pubDate>Mon, 04 May 2026 01:15:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=48003498</link><dc:creator>rapatel0</dc:creator><comments>https://news.ycombinator.com/item?id=48003498</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48003498</guid></item><item><title><![CDATA[New comment by rapatel0 in "Laws of Software Engineering"]]></title><description><![CDATA[
<p>The list is great but the explanation are clearly AI slop.<p>"Before SpaceX, launching rockets was costly because industry practice used expensive materials and discarded rockets after one use. Elon Musk applied first-principles thinking: What is a rocket made of? Mainly aluminum, titanium, copper, and carbon fiber. Raw material costs were a fraction of finished rocket prices. From that insight, SpaceX decided to build rockets from scratch and make them reusable."<p>Everything including humans are made of cheap materials but that doesn't convey the value. The AI got close to the answer with it's first sentence (re-usability) but it clearly missed the mark.</p>
]]></description><pubDate>Tue, 21 Apr 2026 15:22:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=47850103</link><dc:creator>rapatel0</dc:creator><comments>https://news.ycombinator.com/item?id=47850103</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47850103</guid></item><item><title><![CDATA[New comment by rapatel0 in "EFF is leaving X"]]></title><description><![CDATA[
<p>This is the fallacy. These organizations no longer have any ability to “legitimize” as trust is fundamentally eroded. Leaving will simply remove any engagement with the very people they want to influence- people that are unengaged and people that actively disagree</p>
]]></description><pubDate>Sun, 12 Apr 2026 03:54:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=47736011</link><dc:creator>rapatel0</dc:creator><comments>https://news.ycombinator.com/item?id=47736011</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47736011</guid></item><item><title><![CDATA[New comment by rapatel0 in "EFF is leaving X"]]></title><description><![CDATA[
<p>Credibility with who?  We’re so polarized that a single binary label  will shift all credibility.<p>Experience, success, credentials none of it matters anymore. The left thinks everything on the right is stupid and evil, the right does the same, and everyone drinks their own kool aid.<p>We’ve all stopped listening.</p>
]]></description><pubDate>Thu, 09 Apr 2026 21:50:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=47710648</link><dc:creator>rapatel0</dc:creator><comments>https://news.ycombinator.com/item?id=47710648</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47710648</guid></item><item><title><![CDATA[New comment by rapatel0 in "Coordination patterns for multi-model AI systems"]]></title><description><![CDATA[
<p>This is Part 2 of a series on agentic systems — This was especially weird given how eosteric it can be to describe how to work better with agents, but this is my shot at it.<p>The article walks through the coordination patterns to address types of error:<p>- Single-writer: one agent writes, others review read-only. Eliminates oscillation. Maps to
   the single-writer principle from concurrent systems.<p>- Sequential planning: parallel planners cluster even across different models. Sequential
  divergence acts as a covering algorithm — 3 sequential planners explore more than 5
  parallel.<p>- Sequential vs parallel review: parallel voting catches common issues (mean quality).
  Sequential review compounds scrutiny but risks scope creep. Both are useful.<p>- Human interview gating: open-ended questions yield ~5x more useful context than closed
  ones. "REST or GraphQL?" vs "What should we know about how this API will be consumed?"<p>- Adversarial validation: separate environment, separate agent, explicit goal of breaking
  the application. Tests the spec, not the implementation.<p>Hope it's useful reading</p>
]]></description><pubDate>Wed, 01 Apr 2026 20:31:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=47606165</link><dc:creator>rapatel0</dc:creator><comments>https://news.ycombinator.com/item?id=47606165</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47606165</guid></item><item><title><![CDATA[Coordination patterns for multi-model AI systems]]></title><description><![CDATA[
<p>Article URL: <a href="https://datda.substack.com/p/towards-reliable-agentic-systems">https://datda.substack.com/p/towards-reliable-agentic-systems</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47606164">https://news.ycombinator.com/item?id=47606164</a></p>
<p>Points: 1</p>
<p># Comments: 1</p>
]]></description><pubDate>Wed, 01 Apr 2026 20:31:01 +0000</pubDate><link>https://datda.substack.com/p/towards-reliable-agentic-systems</link><dc:creator>rapatel0</dc:creator><comments>https://news.ycombinator.com/item?id=47606164</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47606164</guid></item><item><title><![CDATA[New comment by rapatel0 in "Arm AGI CPU"]]></title><description><![CDATA[
<p>RISC-V will start making more waves now</p>
]]></description><pubDate>Tue, 24 Mar 2026 22:39:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=47510545</link><dc:creator>rapatel0</dc:creator><comments>https://news.ycombinator.com/item?id=47510545</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47510545</guid></item><item><title><![CDATA[New comment by rapatel0 in "Show HN: How I Topped the HuggingFace Open LLM Leaderboard on Two Gaming GPUs"]]></title><description><![CDATA[
<p>Thanks. This is cool</p>
]]></description><pubDate>Tue, 10 Mar 2026 21:47:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=47329182</link><dc:creator>rapatel0</dc:creator><comments>https://news.ycombinator.com/item?id=47329182</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47329182</guid></item><item><title><![CDATA[New comment by rapatel0 in "Show HN: How I topped the HuggingFace open LLM leaderboard on two gaming GPUs"]]></title><description><![CDATA[
<p>Clarification. Duplicating multiple groups of layers in a "reasoning" loop<p>Normal:<p><pre><code>  L1 -> L2 -> L3 -> L4 -> out
</code></pre>
Unrolled (current framing):<p><pre><code>  L1 -> [L2->L3] -> [L2->L3] -> L4 -> out
</code></pre>
Looped (proposed):<p><pre><code>       --<--loop----
       |           |

  L1 -> [L2->L3] x N --> L4 -> out
</code></pre>
"reasoning loop"<p>Note: ascii rendering HN is not trivial</p>
]]></description><pubDate>Tue, 10 Mar 2026 19:29:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=47327766</link><dc:creator>rapatel0</dc:creator><comments>https://news.ycombinator.com/item?id=47327766</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47327766</guid></item><item><title><![CDATA[New comment by rapatel0 in "Show HN: How I topped the HuggingFace open LLM leaderboard on two gaming GPUs"]]></title><description><![CDATA[
<p>I think you may have cracked latent space reasoning. I've had a hunch that something like this would work, but couldn't figure out how the training would back propagate. But you've shown that you just need to duplicate existing layers.<p>Have you tried a simple inline loop over the duplicated layers? Would be interesting to see performance. Also, would be interesting to compare with a MOE model. See if these layers are acting like different agreeing "experts" or if there is reasoning happening in the latent space.</p>
]]></description><pubDate>Tue, 10 Mar 2026 15:04:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=47324257</link><dc:creator>rapatel0</dc:creator><comments>https://news.ycombinator.com/item?id=47324257</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47324257</guid></item></channel></rss>