Hacker News: amitport

New comment by amitport in "Israeli firm BlackCore suspected of meddling in New York and Scotland votes"

amitport — Sun, 14 Jun 2026 07:48:15 +0000

"parents and grandparents,"

You don't have to go into historical events. This is still happening now.

Jews are still fighting for their survival and the moment Israel stops fighting, millions of Jews will die.

New comment by amitport in "Upcoming breaking changes for npm v12"

amitport — Wed, 10 Jun 2026 02:56:59 +0000

To be fair, NPM sucked long before it got acquired by Github/Microsoft.

And to be fair 2: The other package repos also suck.

New comment by amitport in "Show HN: A 4-year-old "TurboQuant" implementation"

amitport — Fri, 15 May 2026 08:56:45 +0000

Hi, thanks! I appreciate your input and generally agree. The TDS article wasn't really aimed at the HN crowd, but it did help a bit with the more general audience.

I do plan to also develop an interactive guide that breaks down post-rotation quantization fundamentals in a more educational, hands-on way.

New comment by amitport in "Show HN: A 4-year-old "TurboQuant" implementation"

amitport — Sun, 03 May 2026 16:20:57 +0000

For context: https://towardsdatascience.com/how-a-2021-quantization-algor...

Show HN: A 4-year-old "TurboQuant" implementation

amitport — Sun, 03 May 2026 15:58:49 +0000

Article URL: https://github.com/amitport/EDEN-Distributed-Mean-Estimation

Comments URL: https://news.ycombinator.com/item?id=47998261

Points: 3

# Comments: 3

A Note on TurboQuant and the Earlier Eden Work

amitport — Sat, 02 May 2026 18:53:06 +0000

Article URL: https://arxiv.org/abs/2604.18555

Comments URL: https://news.ycombinator.com/item?id=47989262

Points: 2

# Comments: 0

New comment by amitport in "TurboQuant: A first-principles walkthrough"

amitport — Sat, 02 May 2026 15:31:31 +0000

I recently wrote a beginner-friendly explanation of this situation in TDS:

https://towardsdatascience.com/how-a-2021-quantization-algor...

New comment by amitport in "[dead]"

amitport — Sat, 02 May 2026 13:39:46 +0000

Hi, I’m the author of the post above.

tl;dr

TurboQuant is a recent paper from Google and NYU that has gained massive traction in mainstream media and the AI community. As implementations of TurboQuant are integrated into various popular projects, it is important to note its relation to EDEN quantization.

TurboQuant is essentially a partial implementation of EDEN quantization (first work published in NeurIPS 2021, extention published on ICML 2022). The few differences that do exist make EDEN significantly better.

We have also published a detailed comparative report here: https://arxiv.org/abs/2604.18555

New comment by amitport in "TurboQuant: A first-principles walkthrough"

amitport — Mon, 27 Apr 2026 15:22:20 +0000

In the vLLM documentation quoted above, TurboQuant (which is a restricted version of EDEN) is referred to as a specific case of HIGGS. Note the symmetry: EDEN acts as a special case of HIGGS; hence, HIGGS functions as a generalization of EDEN.

In any case, the quantizer is indeed an extension, regardless of whether it was explicitly framed that way in the paper. I say this not to diminish their contribution at all, but just to clarify the relationship, as it was also stated in the vLLM doc.

New comment by amitport in "TurboQuant: A first-principles walkthrough"

amitport — Mon, 27 Apr 2026 15:13:56 +0000

Thanks for the pushback, and I appreciate the reference to classical information theory.

While I probably overstated things by using the very general phrase "taking advantage," I want to be very precise about the claim, as I believe these works are foundational to quantization, beyond the scope of deep learning. The mechanism of applying a deterministic biased quantizer, such as Lloyd-Max, to the induced post-rotation distribution, alongside mathematically correcting its inherent bias, is a distinct contribution (which asymptotically improves the worst-case error).

If there is a classical paper that utilizes such a combination, I would genuinely be very eager to review it. But to my knowledge, this was not introduced prior to DRIVE and EDEN.

New comment by amitport in "TurboQuant: A first-principles walkthrough"

amitport — Mon, 27 Apr 2026 14:07:14 +0000

Thanks for that!

It is worth noting that taking advantage of the post-rotation distribution was not actually done until DRIVE (2021), which was made possible via our proper scaling. Furthermore, applying a Lloyd-Max codebook post-rotation was introduced EDEN.

We consider these to be the foundational works in this regard.

New comment by amitport in "TurboQuant: A first-principles walkthrough"

amitport — Mon, 27 Apr 2026 13:27:14 +0000

Thanks for that! Note that the residual chain is empirically and theoretically inferior to our unbiased scale; furthermore, it requires an additional bit in certain cases. Additionally, TurboQuant was not the first to apply EDEN to KV-cache (see for example https://arxiv.org/abs/2411.17525 from 2024).

New comment by amitport in "TurboQuant: A first-principles walkthrough"

amitport — Mon, 27 Apr 2026 12:46:59 +0000

Those works did cite DRIVE/EDEN :)

HIGGS is an extension of EDEN (using the well known method for blockwise Lloyd-Max).

The proper framing of this "TurboQuant" layer in vllm (which does not include JQL) is precisely EDEN 22 without the scale correction.

New comment by amitport in "TurboQuant: A first-principles walkthrough"

amitport — Mon, 27 Apr 2026 11:35:59 +0000

I believe our claim at this point is more fundamental than just lack of citation.

The quantizer in TurboQuant is EDEN quantization (2021) applied to the KV-cache. It is neither a novel quantizer nor an improvement in quantization techniques.

In DRIVE/EDEN, we already introduced the version used in "TurboQuant"'s paper and suggested an optimal scale configurations which are better in both mse-minimizing and unbiased scenarios.

New comment by amitport in "TurboQuant: A first-principles walkthrough"

amitport — Mon, 27 Apr 2026 09:14:41 +0000

Thanks for the quick response and for being willing to update the explainer. I really appreciate the clarification.

New comment by amitport in "TurboQuant: A first-principles walkthrough"

amitport — Mon, 27 Apr 2026 08:24:04 +0000

When you use TurboQuant, you are essentially using the EDEN quantizer under a different name applied to KV-cache.

Both EDEN and its 1-bit variant have been implemented in PyTorch, JAX, and TensorFlow across numerous open-source libraries and are used in various applications. I am currently writing a blog post that will document these in detail.

EDEN defines a scale parameter, S, for which we suggest specific optimal values for both biased and unbiased versions. As shown in the note I shared, these values lead to clear empirical improvements. Consequently, users who rely on the less optimal S value and the unbiasing method popularized by TurboQuant will generally see inferior results compared to those using EDEN with the optimal scale values suggested in our original papers.

New comment by amitport in "TurboQuant: A first-principles walkthrough"

amitport — Mon, 27 Apr 2026 05:26:54 +0000

The note includes extensive experiments and reproduces many of the figures from the TurboQuant paper in our Section 5. Honestly, I think our case is pretty clear-cut as is. I am not sure what the overhead for those specific benchmarks would be, but we will look into it.

(In any case, I want to emphasize that TurboQuant quantizer is a private case of EDEN)

New comment by amitport in "TurboQuant: A first-principles walkthrough"

amitport — Mon, 27 Apr 2026 04:01:24 +0000

TurboQuant is a restricted version of EDEN quantization (NeurIPS 21, ICML 22). It lacks the optimal scale derivations, which makes the TurboQuant variant considerably less accurate than those works. We show this thoroughly in a new note at https://arxiv.org/abs/2604.18555.

We were the first to introduce post-rotation distribution-aware quantization in 2021. This was later implemented in many fields, including federated learning, vector retrieval, databases, inference engines, and KV-cache.

It would be appropriate to receive credit for this. Furthermore, it is baffling to see the name "TurboQuant" repeated in this context, considering the many works published from 2021 onwards.

The blog post mentioned above essentially guides you through EDEN quantization but ultimately settles on a sub-optimal MSE-minimizing version and an unbiasing trick. This trick often costs a full bit more than DRIVE/EDEN requires to achieve the same results using the unbiasing scale shown in the original 2021 paper.

New comment by amitport in "America Has Lost the Arab World"

amitport — Thu, 09 Apr 2026 10:39:55 +0000

They must mean 'The Arab World Has Lost America,' don't they?

New comment by amitport in "TurboQuant: Redefining AI efficiency with extreme compression"

amitport — Thu, 26 Mar 2026 11:42:19 +0000

We do mention and the paper you shared. Please read our paper to see how the rotation-aware bias correction we introduced efficiently fixes the bias and provides a better worst-case error.