<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: amitport</title><link>https://news.ycombinator.com/user?id=amitport</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sun, 14 Jun 2026 12:57:04 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=amitport" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by amitport in "Israeli firm BlackCore suspected of meddling in New York and Scotland votes"]]></title><description><![CDATA[
<p>"parents and grandparents,"<p>You don't have to go into historical events. This is still happening now.<p>Jews are still fighting for their survival and the moment Israel stops fighting, millions of Jews will die.</p>
]]></description><pubDate>Sun, 14 Jun 2026 07:48:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=48525103</link><dc:creator>amitport</dc:creator><comments>https://news.ycombinator.com/item?id=48525103</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48525103</guid></item><item><title><![CDATA[New comment by amitport in "Upcoming breaking changes for npm v12"]]></title><description><![CDATA[
<p>To be fair, NPM sucked long before it got acquired by Github/Microsoft.<p>And to be fair 2: The other package repos also suck.</p>
]]></description><pubDate>Wed, 10 Jun 2026 02:56:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=48470820</link><dc:creator>amitport</dc:creator><comments>https://news.ycombinator.com/item?id=48470820</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48470820</guid></item><item><title><![CDATA[New comment by amitport in "Show HN: A 4-year-old "TurboQuant" implementation"]]></title><description><![CDATA[
<p>Hi, thanks! I appreciate your input and generally agree. The TDS article wasn't really aimed at the HN crowd, but it did help a bit with the more general audience.<p>I do plan to also develop an interactive guide that breaks down post-rotation quantization fundamentals in a more educational, hands-on way.</p>
]]></description><pubDate>Fri, 15 May 2026 08:56:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=48146203</link><dc:creator>amitport</dc:creator><comments>https://news.ycombinator.com/item?id=48146203</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48146203</guid></item><item><title><![CDATA[New comment by amitport in "Show HN: A 4-year-old "TurboQuant" implementation"]]></title><description><![CDATA[
<p>For context: <a href="https://towardsdatascience.com/how-a-2021-quantization-algorithm-quietly-outperforms-its-2026-successor/" rel="nofollow">https://towardsdatascience.com/how-a-2021-quantization-algor...</a></p>
]]></description><pubDate>Sun, 03 May 2026 16:20:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=47998505</link><dc:creator>amitport</dc:creator><comments>https://news.ycombinator.com/item?id=47998505</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47998505</guid></item><item><title><![CDATA[Show HN: A 4-year-old "TurboQuant" implementation]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/amitport/EDEN-Distributed-Mean-Estimation">https://github.com/amitport/EDEN-Distributed-Mean-Estimation</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47998261">https://news.ycombinator.com/item?id=47998261</a></p>
<p>Points: 3</p>
<p># Comments: 3</p>
]]></description><pubDate>Sun, 03 May 2026 15:58:49 +0000</pubDate><link>https://github.com/amitport/EDEN-Distributed-Mean-Estimation</link><dc:creator>amitport</dc:creator><comments>https://news.ycombinator.com/item?id=47998261</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47998261</guid></item><item><title><![CDATA[A Note on TurboQuant and the Earlier Eden Work]]></title><description><![CDATA[
<p>Article URL: <a href="https://arxiv.org/abs/2604.18555">https://arxiv.org/abs/2604.18555</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47989262">https://news.ycombinator.com/item?id=47989262</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Sat, 02 May 2026 18:53:06 +0000</pubDate><link>https://arxiv.org/abs/2604.18555</link><dc:creator>amitport</dc:creator><comments>https://news.ycombinator.com/item?id=47989262</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47989262</guid></item><item><title><![CDATA[New comment by amitport in "TurboQuant: A first-principles walkthrough"]]></title><description><![CDATA[
<p>I recently wrote a beginner-friendly explanation of this situation in TDS:<p><a href="https://towardsdatascience.com/how-a-2021-quantization-algorithm-quietly-outperforms-its-2026-successor/" rel="nofollow">https://towardsdatascience.com/how-a-2021-quantization-algor...</a></p>
]]></description><pubDate>Sat, 02 May 2026 15:31:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=47987284</link><dc:creator>amitport</dc:creator><comments>https://news.ycombinator.com/item?id=47987284</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47987284</guid></item><item><title><![CDATA[New comment by amitport in "[dead]"]]></title><description><![CDATA[
<p>Hi, I’m the author of the post above.<p>tl;dr<p>TurboQuant is a recent paper from Google and NYU that has gained massive traction in mainstream media and the AI community. As implementations of TurboQuant are integrated into various popular projects, it is important to note its relation to EDEN quantization.<p>TurboQuant is essentially a partial implementation of EDEN quantization (first work published in NeurIPS 2021, extention published on ICML 2022). The few differences that do exist make EDEN significantly better.<p>We have also published a detailed comparative report here: <a href="https://arxiv.org/abs/2604.18555" rel="nofollow">https://arxiv.org/abs/2604.18555</a></p>
]]></description><pubDate>Sat, 02 May 2026 13:39:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=47986318</link><dc:creator>amitport</dc:creator><comments>https://news.ycombinator.com/item?id=47986318</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47986318</guid></item><item><title><![CDATA[New comment by amitport in "TurboQuant: A first-principles walkthrough"]]></title><description><![CDATA[
<p>In the vLLM documentation quoted above, TurboQuant (which is a restricted version of EDEN) is referred to as a specific case of HIGGS. Note the symmetry: EDEN acts as a special case of HIGGS; hence, HIGGS functions as a generalization of EDEN.<p>In any case, the quantizer is indeed an extension, regardless of whether it was explicitly framed that way in the paper. I say this not to diminish their contribution at all, but just to clarify the relationship, as it was also stated in the vLLM doc.</p>
]]></description><pubDate>Mon, 27 Apr 2026 15:22:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=47922845</link><dc:creator>amitport</dc:creator><comments>https://news.ycombinator.com/item?id=47922845</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47922845</guid></item><item><title><![CDATA[New comment by amitport in "TurboQuant: A first-principles walkthrough"]]></title><description><![CDATA[
<p>Thanks for the pushback, and I appreciate the reference to classical information theory.<p>While I probably overstated things by using the very general phrase "taking advantage," I want to be very precise about the claim, as I believe these works are foundational to quantization, beyond the scope of deep learning. The mechanism of applying a deterministic biased quantizer, such as Lloyd-Max, to the induced post-rotation distribution, alongside mathematically correcting its inherent bias, is a distinct contribution (which asymptotically improves the worst-case error).<p>If there is a classical paper that utilizes such a combination, I would genuinely be very eager to review it. But to my knowledge, this was not introduced prior to DRIVE and EDEN.</p>
]]></description><pubDate>Mon, 27 Apr 2026 15:13:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=47922731</link><dc:creator>amitport</dc:creator><comments>https://news.ycombinator.com/item?id=47922731</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47922731</guid></item><item><title><![CDATA[New comment by amitport in "TurboQuant: A first-principles walkthrough"]]></title><description><![CDATA[
<p>Thanks for that!<p>It is worth noting that <i>taking advantage</i> of the post-rotation distribution was not actually done until DRIVE (2021), which was made possible via our proper scaling. Furthermore, applying a Lloyd-Max codebook post-rotation was introduced EDEN.<p>We consider these to be the foundational works in this regard.</p>
]]></description><pubDate>Mon, 27 Apr 2026 14:07:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=47921796</link><dc:creator>amitport</dc:creator><comments>https://news.ycombinator.com/item?id=47921796</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47921796</guid></item><item><title><![CDATA[New comment by amitport in "TurboQuant: A first-principles walkthrough"]]></title><description><![CDATA[
<p>Thanks for that!
 Note that the residual chain is empirically and theoretically inferior to our unbiased scale; furthermore, it requires an additional bit in certain cases.
 Additionally, TurboQuant was not the first to apply EDEN to KV-cache (see for example <a href="https://arxiv.org/abs/2411.17525" rel="nofollow">https://arxiv.org/abs/2411.17525</a> from 2024).</p>
]]></description><pubDate>Mon, 27 Apr 2026 13:27:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=47921299</link><dc:creator>amitport</dc:creator><comments>https://news.ycombinator.com/item?id=47921299</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47921299</guid></item><item><title><![CDATA[New comment by amitport in "TurboQuant: A first-principles walkthrough"]]></title><description><![CDATA[
<p>Those works did cite DRIVE/EDEN :)<p>HIGGS is an extension of EDEN (using the well known method for blockwise Lloyd-Max).<p>The proper framing of this "TurboQuant" layer in vllm (which does not include JQL) is precisely EDEN 22 without the scale correction.</p>
]]></description><pubDate>Mon, 27 Apr 2026 12:46:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=47920896</link><dc:creator>amitport</dc:creator><comments>https://news.ycombinator.com/item?id=47920896</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47920896</guid></item><item><title><![CDATA[New comment by amitport in "TurboQuant: A first-principles walkthrough"]]></title><description><![CDATA[
<p>I believe our claim at this point is more fundamental than just lack of citation.<p>The quantizer in TurboQuant <i>is</i> EDEN quantization (2021) applied to the KV-cache. It is neither a novel quantizer nor an improvement in quantization techniques.<p>In DRIVE/EDEN, we already introduced the version used in "TurboQuant"'s paper and suggested an optimal scale configurations which are better in both mse-minimizing and unbiased scenarios.</p>
]]></description><pubDate>Mon, 27 Apr 2026 11:35:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=47920259</link><dc:creator>amitport</dc:creator><comments>https://news.ycombinator.com/item?id=47920259</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47920259</guid></item><item><title><![CDATA[New comment by amitport in "TurboQuant: A first-principles walkthrough"]]></title><description><![CDATA[
<p>Thanks for the quick response and for being willing to update the explainer. I really appreciate the clarification.</p>
]]></description><pubDate>Mon, 27 Apr 2026 09:14:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=47919360</link><dc:creator>amitport</dc:creator><comments>https://news.ycombinator.com/item?id=47919360</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47919360</guid></item><item><title><![CDATA[New comment by amitport in "TurboQuant: A first-principles walkthrough"]]></title><description><![CDATA[
<p>When you use TurboQuant, you are essentially using the EDEN quantizer under a different name applied to KV-cache.<p>Both EDEN and its 1-bit variant have been implemented in PyTorch, JAX, and TensorFlow across numerous open-source libraries and are used in various applications. I am currently writing a blog post that will document these in detail.<p>EDEN defines a scale parameter, S, for which we suggest specific optimal values for both biased and unbiased versions. As shown in the note I shared, these values lead to clear empirical improvements. Consequently, users who rely on the less optimal S value and the unbiasing method popularized by TurboQuant will generally see inferior results compared to those using EDEN with the optimal scale values suggested in our original papers.</p>
]]></description><pubDate>Mon, 27 Apr 2026 08:24:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=47919036</link><dc:creator>amitport</dc:creator><comments>https://news.ycombinator.com/item?id=47919036</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47919036</guid></item><item><title><![CDATA[New comment by amitport in "TurboQuant: A first-principles walkthrough"]]></title><description><![CDATA[
<p>The note includes extensive experiments and reproduces many of the figures from the TurboQuant paper in our Section 5. Honestly, I think our case is pretty clear-cut as is. I am not sure what the overhead for those specific benchmarks would be, but we will look into it.<p>(In any case, I want to emphasize that TurboQuant quantizer is a private case of EDEN)</p>
]]></description><pubDate>Mon, 27 Apr 2026 05:26:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=47917998</link><dc:creator>amitport</dc:creator><comments>https://news.ycombinator.com/item?id=47917998</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47917998</guid></item><item><title><![CDATA[New comment by amitport in "TurboQuant: A first-principles walkthrough"]]></title><description><![CDATA[
<p>TurboQuant is a restricted version of EDEN quantization (NeurIPS 21, ICML 22). It lacks the optimal scale derivations, which makes the TurboQuant variant considerably less accurate than those works. We show this thoroughly in a new note at <a href="https://arxiv.org/abs/2604.18555" rel="nofollow">https://arxiv.org/abs/2604.18555</a>.<p>We were the first to introduce post-rotation distribution-aware quantization in 2021. This was later implemented in many fields, including federated learning, vector retrieval, databases, inference engines, and KV-cache.<p>It would be appropriate to receive credit for this. Furthermore, it is baffling to see the name "TurboQuant" repeated in this context, considering the many works published from 2021 onwards.<p>The blog post mentioned above essentially guides you through EDEN quantization but ultimately settles on a sub-optimal MSE-minimizing version and an unbiasing trick. This trick often costs a full bit more than DRIVE/EDEN requires to achieve the same results using the unbiasing scale shown in the original 2021 paper.</p>
]]></description><pubDate>Mon, 27 Apr 2026 04:01:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=47917577</link><dc:creator>amitport</dc:creator><comments>https://news.ycombinator.com/item?id=47917577</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47917577</guid></item><item><title><![CDATA[New comment by amitport in "America Has Lost the Arab World"]]></title><description><![CDATA[
<p>They must mean 'The Arab World Has Lost America,' don't they?</p>
]]></description><pubDate>Thu, 09 Apr 2026 10:39:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=47701869</link><dc:creator>amitport</dc:creator><comments>https://news.ycombinator.com/item?id=47701869</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47701869</guid></item><item><title><![CDATA[New comment by amitport in "TurboQuant: Redefining AI efficiency with extreme compression"]]></title><description><![CDATA[
<p>We do mention and the paper you shared. Please read our paper to see how the rotation-aware bias correction we introduced efficiently fixes the bias and provides a better worst-case error.</p>
]]></description><pubDate>Thu, 26 Mar 2026 11:42:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=47529258</link><dc:creator>amitport</dc:creator><comments>https://news.ycombinator.com/item?id=47529258</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47529258</guid></item></channel></rss>