Hacker News: rapatel0

New comment by rapatel0 in "Running local models on an M4 with 24GB memory"

rapatel0 — Sun, 17 May 2026 11:57:29 +0000

I'm super happy with the performance, I generally run with 2 parallel slots so I only get about 128K context window. My experience with all llms is that they get more forgetful if you use the full window. (256-512K is the sweet spot for frontier models, 128k works for me with this current qwen)

New comment by rapatel0 in "Running local models on an M4 with 24GB memory"

rapatel0 — Sun, 17 May 2026 11:55:09 +0000

I forked it to also add rotorquant. This is a specific optimization that uses clifford rotors instead of static compile time random purmutation to store the activations. Reduces space and parameter count for the storage.

New comment by rapatel0 in "Running local models on an M4 with 24GB memory"

rapatel0 — Mon, 11 May 2026 01:48:52 +0000

I got qwen3.6:27B running on my 4090 (24GB) with ~128K context leveraging some of the recent turboquant/rotorquant memory optimizations for activations. Highly suggest going up to that. the q4_xl+rotorquant combo is pretty good.

Some reference code if you want to throw your agent at it. https://github.com/rapatel0/rq-models

New comment by rapatel0 in "Google Chrome silently installs a 4 GB AI model on your device without consent"

rapatel0 — Wed, 06 May 2026 12:21:29 +0000

Do you not have compression / deduplication on your nfs backing server ?

New comment by rapatel0 in "Musk Settles SEC Suit for Underpaying Twitter Investors by $150M for Just $1.5M"

rapatel0 — Tue, 05 May 2026 04:01:12 +0000

Not to defend Elon, but he spent 40B on a company that dropped in value between the day of his bid and the day of closing by almost half (if i remember correctly). He tried to back out of it but then went forward by twitter at the time sued to enforce the transaction.

150M is small in comparison to the 20B he overpaid by. Investors were well compensated.

New comment by rapatel0 in "Ask HN: Best Embedding Models?"

rapatel0 — Tue, 05 May 2026 03:57:14 +0000

I've liked qwen and embeddinggemma for local search. Qwen because 32K is enough to basically fit a whole page into the context window and embeddiggemma because it's crazy efficient.

New comment by rapatel0 in "Let's Buy Spirit Air"

rapatel0 — Mon, 04 May 2026 16:31:48 +0000

When I lived in europe, RyanAir made most of it's money in the terminal. This is why every RyanAir terminal has a maze like exit from security (ikea-esque) before you get to the actual terminal.

The RyanAir CEO was even quoted that he expected some tickets to be come "zero-fare" Link: https://www.theguardian.com/business/2016/nov/22/ryanair-fli...

The point stands, airlines don't make money on flights. Flights are loss leaders.

New comment by rapatel0 in "Let's Buy Spirit Air"

rapatel0 — Mon, 04 May 2026 15:31:37 +0000

I'm definitely not this type of person. see other comment

New comment by rapatel0 in "Let's Buy Spirit Air"

rapatel0 — Mon, 04 May 2026 15:28:57 +0000

Fair point and I don't disagree. The more meta point I would make is that airlines are still fairly capex heavy (even if there is no point-to-point infrastructure). Each incremental new route operating during standard hours still requires 90m+ on a new airplane.

So if they tend to compete themselves into oblivion, or need to turn into banks to subsidize their product, then it might make sense that they should be regulated monopolies.

Still you're probably right, if they can turn into banks and stay profitable, then maybe that's a better market outcome overall.

New comment by rapatel0 in "Let's Buy Spirit Air"

rapatel0 — Mon, 04 May 2026 15:00:33 +0000

Power companies are the classic example. If power companies were forced to compete, their costs + competition tend to drive them out of business. As a result most power companies are forced to operate in really tight constraints with very limited but predictable margin.

I'm not saying that this a better outcome (power companies have their problems too). I was just commenting that this issue parallels the historical solution that was applied to utility companies.

New comment by rapatel0 in "Let's Buy Spirit Air"

rapatel0 — Mon, 04 May 2026 01:15:22 +0000

Fundamental problem: Flights don't make money. Airlines actually make all of their money through loyalty programs and credit card payments. They basically should have turned into regulated utilities long ago, but loyalty program revenue saved them.

Unless this initiative will turn into a credit card company (which nobody likes or wants to do) it won't go anywhere

Private equity will likely sell the company for parts. There is no operational improvements for cash flow that they can do.

Useful watch (skip to 2:20): https://youtu.be/ggUduBmvQ_4?si=cyysP7aH_CIEDZRq

New comment by rapatel0 in "Laws of Software Engineering"

rapatel0 — Tue, 21 Apr 2026 15:22:42 +0000

The list is great but the explanation are clearly AI slop.

"Before SpaceX, launching rockets was costly because industry practice used expensive materials and discarded rockets after one use. Elon Musk applied first-principles thinking: What is a rocket made of? Mainly aluminum, titanium, copper, and carbon fiber. Raw material costs were a fraction of finished rocket prices. From that insight, SpaceX decided to build rockets from scratch and make them reusable."

Everything including humans are made of cheap materials but that doesn't convey the value. The AI got close to the answer with it's first sentence (re-usability) but it clearly missed the mark.

New comment by rapatel0 in "EFF is leaving X"

rapatel0 — Sun, 12 Apr 2026 03:54:13 +0000

This is the fallacy. These organizations no longer have any ability to “legitimize” as trust is fundamentally eroded. Leaving will simply remove any engagement with the very people they want to influence- people that are unengaged and people that actively disagree

New comment by rapatel0 in "EFF is leaving X"

rapatel0 — Thu, 09 Apr 2026 21:50:00 +0000

Credibility with who? We’re so polarized that a single binary label will shift all credibility.

Experience, success, credentials none of it matters anymore. The left thinks everything on the right is stupid and evil, the right does the same, and everyone drinks their own kool aid.

We’ve all stopped listening.

New comment by rapatel0 in "Coordination patterns for multi-model AI systems"

rapatel0 — Wed, 01 Apr 2026 20:31:01 +0000

This is Part 2 of a series on agentic systems — This was especially weird given how eosteric it can be to describe how to work better with agents, but this is my shot at it.

The article walks through the coordination patterns to address types of error:

- Single-writer: one agent writes, others review read-only. Eliminates oscillation. Maps to the single-writer principle from concurrent systems.

- Sequential planning: parallel planners cluster even across different models. Sequential divergence acts as a covering algorithm — 3 sequential planners explore more than 5 parallel.

- Sequential vs parallel review: parallel voting catches common issues (mean quality). Sequential review compounds scrutiny but risks scope creep. Both are useful.

- Human interview gating: open-ended questions yield ~5x more useful context than closed ones. "REST or GraphQL?" vs "What should we know about how this API will be consumed?"

- Adversarial validation: separate environment, separate agent, explicit goal of breaking the application. Tests the spec, not the implementation.

Hope it's useful reading

Coordination patterns for multi-model AI systems

rapatel0 — Wed, 01 Apr 2026 20:31:01 +0000

Article URL: https://datda.substack.com/p/towards-reliable-agentic-systems

Comments URL: https://news.ycombinator.com/item?id=47606164

Points: 1

# Comments: 1

New comment by rapatel0 in "Arm AGI CPU"

rapatel0 — Tue, 24 Mar 2026 22:39:50 +0000

RISC-V will start making more waves now

New comment by rapatel0 in "Show HN: How I Topped the HuggingFace Open LLM Leaderboard on Two Gaming GPUs"

rapatel0 — Tue, 10 Mar 2026 21:47:15 +0000

Thanks. This is cool

New comment by rapatel0 in "Show HN: How I topped the HuggingFace open LLM leaderboard on two gaming GPUs"

rapatel0 — Tue, 10 Mar 2026 19:29:09 +0000

Clarification. Duplicating multiple groups of layers in a "reasoning" loop

Normal:

  L1 -> L2 -> L3 -> L4 -> out

Unrolled (current framing):

  L1 -> [L2->L3] -> [L2->L3] -> L4 -> out

Looped (proposed):

       --<--loop----
       |           |

  L1 -> [L2->L3] x N --> L4 -> out

"reasoning loop"

Note: ascii rendering HN is not trivial

New comment by rapatel0 in "Show HN: How I topped the HuggingFace open LLM leaderboard on two gaming GPUs"

rapatel0 — Tue, 10 Mar 2026 15:04:46 +0000

I think you may have cracked latent space reasoning. I've had a hunch that something like this would work, but couldn't figure out how the training would back propagate. But you've shown that you just need to duplicate existing layers.

Have you tried a simple inline loop over the duplicated layers? Would be interesting to see performance. Also, would be interesting to compare with a MOE model. See if these layers are acting like different agreeing "experts" or if there is reasoning happening in the latent space.