<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: oxxoxoxooo</title><link>https://news.ycombinator.com/user?id=oxxoxoxooo</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Fri, 17 Apr 2026 13:07:50 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=oxxoxoxooo" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by oxxoxoxooo in "Optimization of 32-bit Unsigned Division by Constants on 64-bit Targets"]]></title><description><![CDATA[
<p>On x86, there is no vector instruction to get the upper half of integer product (64-bits x 64-bits). ARM SVE2 and RISC-V RVV have one, x86 unfortunately does not (and probably wont for a long time as AVX10 does not add it, either).</p>
]]></description><pubDate>Mon, 13 Apr 2026 10:40:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=47750169</link><dc:creator>oxxoxoxooo</dc:creator><comments>https://news.ycombinator.com/item?id=47750169</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47750169</guid></item><item><title><![CDATA[RISC-V Vector Primer]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/simplex-micro/riscv-vector-primer/blob/main/index.md">https://github.com/simplex-micro/riscv-vector-primer/blob/main/index.md</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46923051">https://news.ycombinator.com/item?id=46923051</a></p>
<p>Points: 69</p>
<p># Comments: 22</p>
]]></description><pubDate>Sat, 07 Feb 2026 11:38:12 +0000</pubDate><link>https://github.com/simplex-micro/riscv-vector-primer/blob/main/index.md</link><dc:creator>oxxoxoxooo</dc:creator><comments>https://news.ycombinator.com/item?id=46923051</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46923051</guid></item><item><title><![CDATA[New comment by oxxoxoxooo in "Milk-V Titan: A $329 8-Core 64-bit RISC-V mini-ITX board with PCIe Gen4x16"]]></title><description><![CDATA[
<p>Do you happen to know how does one access/use those A100 cores?</p>
]]></description><pubDate>Sun, 18 Jan 2026 18:47:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=46670845</link><dc:creator>oxxoxoxooo</dc:creator><comments>https://news.ycombinator.com/item?id=46670845</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46670845</guid></item><item><title><![CDATA[The RISC-V Instruction Tier List [video]]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.youtube.com/watch?v=qLEKOfVQEZI">https://www.youtube.com/watch?v=qLEKOfVQEZI</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=45719633">https://news.ycombinator.com/item?id=45719633</a></p>
<p>Points: 4</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 27 Oct 2025 11:14:47 +0000</pubDate><link>https://www.youtube.com/watch?v=qLEKOfVQEZI</link><dc:creator>oxxoxoxooo</dc:creator><comments>https://news.ycombinator.com/item?id=45719633</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45719633</guid></item><item><title><![CDATA[New comment by oxxoxoxooo in "Qualcomm to acquire Arduino"]]></title><description><![CDATA[
<p>If you ever wondered, how Arduino came about: The Untold History of Arduino (<a href="https://arduinohistory.github.io/" rel="nofollow">https://arduinohistory.github.io/</a>).</p>
]]></description><pubDate>Tue, 07 Oct 2025 13:36:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=45502907</link><dc:creator>oxxoxoxooo</dc:creator><comments>https://news.ycombinator.com/item?id=45502907</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45502907</guid></item><item><title><![CDATA[Crab Nebula (time-lapse movie 2008-2022)]]></title><description><![CDATA[
<p>Article URL: <a href="https://app.astrobin.com:443/u/DetlefHartmann?i=ija7jc">https://app.astrobin.com:443/u/DetlefHartmann?i=ija7jc</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43909032">https://news.ycombinator.com/item?id=43909032</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 06 May 2025 20:01:02 +0000</pubDate><link>https://app.astrobin.com:443/u/DetlefHartmann?i=ija7jc</link><dc:creator>oxxoxoxooo</dc:creator><comments>https://news.ycombinator.com/item?id=43909032</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43909032</guid></item><item><title><![CDATA[New comment by oxxoxoxooo in "Tell HN: GpuOwl/PRPLL, GPU software used to find the largest prime number"]]></title><description><![CDATA[
<p>Thank you very much for the answers, very informative!<p>And congratulations on the discovery!</p>
]]></description><pubDate>Sun, 27 Oct 2024 15:14:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=41963172</link><dc:creator>oxxoxoxooo</dc:creator><comments>https://news.ycombinator.com/item?id=41963172</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41963172</guid></item><item><title><![CDATA[New comment by oxxoxoxooo in "Tell HN: GpuOwl/PRPLL, GPU software used to find the largest prime number"]]></title><description><![CDATA[
<p>Hi! Please, I also have a few questions:<p>1. I guess the most time consuming part is multiplication, right? What kind of FFT do you use? Schönhage-Strassen, multi-prime NTT, ..? Is it implemented via floating-point numbers or integers?<p>2. Not sure if you encountered this, but do you have any advice for small mulmod (multiplication reduced by prime modulus)? By small I mean machine-word size (i.e. preferably 64-bits).<p>3. For larger modulus, what do you use? Is it worth precomputing the inverse by, say, Newton iteration or is it faster to use asymptotically slower algorithms? Do you use Montgomery representation?<p>4. Does the code use any kind of GCD? What algorithm did you choose?<p>5. Now this is a bit broad question, but could you perhaps compare the traditional algorithms implemented sequentially (e.g. GMP) and algorithm suitable to run on GPUs? I mean, does it make sense to use, say, a quadratic algorithm amenable to parallel execution, rather than a asymptotically faster (and sequential) algorithm?</p>
]]></description><pubDate>Sun, 27 Oct 2024 10:12:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=41961295</link><dc:creator>oxxoxoxooo</dc:creator><comments>https://news.ycombinator.com/item?id=41961295</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41961295</guid></item><item><title><![CDATA[New comment by oxxoxoxooo in "Symbolica Computer Algebra System"]]></title><description><![CDATA[
<p>Thanks for the reply! If you don't mind asking more: what do you use for polynomial GCD? Apparently it is quite fast, do you use some standard algorithm implemented well or is there some kind of algorithmic improvement? Is it described somewhere, say a paper or a book? Have you tried to benchmark it against NTL, for example? Thanks again!</p>
]]></description><pubDate>Fri, 10 May 2024 19:35:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=40323009</link><dc:creator>oxxoxoxooo</dc:creator><comments>https://news.ycombinator.com/item?id=40323009</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40323009</guid></item><item><title><![CDATA[New comment by oxxoxoxooo in "Symbolica Computer Algebra System"]]></title><description><![CDATA[
<p>Please, what do you use for bigints? GMP?</p>
]]></description><pubDate>Thu, 09 May 2024 17:29:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=40310570</link><dc:creator>oxxoxoxooo</dc:creator><comments>https://news.ycombinator.com/item?id=40310570</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40310570</guid></item><item><title><![CDATA[New comment by oxxoxoxooo in "The Era of 1-bit LLMs: ternary parameters for cost-effective computing"]]></title><description><![CDATA[
<p>Prior art:<p>Binarized Neural Networks: Training Deep Neural Networks with Weights and Activations Constrained to +1 or -1<p><a href="https://arxiv.org/abs/1602.02830" rel="nofollow">https://arxiv.org/abs/1602.02830</a><p>Ternary Neural Networks for Resource-Efficient AI Applications<p><a href="https://arxiv.org/abs/1609.00222" rel="nofollow">https://arxiv.org/abs/1609.00222</a></p>
]]></description><pubDate>Wed, 28 Feb 2024 19:23:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=39542445</link><dc:creator>oxxoxoxooo</dc:creator><comments>https://news.ycombinator.com/item?id=39542445</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39542445</guid></item><item><title><![CDATA[New comment by oxxoxoxooo in "STB: Single-file public domain libraries for C/C++"]]></title><description><![CDATA[
<p>True. Except, it's trivial to mitigate this: one only needs to wrap the whole library under one giant #ifndef. Like here, for example:
 <a href="https://github.com/sheredom/utf8.h/blob/b7ed0a28eb92803c81d6c3f9f33ec95a1697dc38/utf8.h#L29">https://github.com/sheredom/utf8.h/blob/b7ed0a28eb92803c81d6...</a></p>
]]></description><pubDate>Sun, 07 Jan 2024 09:56:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=38899919</link><dc:creator>oxxoxoxooo</dc:creator><comments>https://news.ycombinator.com/item?id=38899919</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38899919</guid></item><item><title><![CDATA[New comment by oxxoxoxooo in "Passive SSH Key Compromise via Lattices [pdf]"]]></title><description><![CDATA[
<p>Thanks!</p>
]]></description><pubDate>Mon, 06 Nov 2023 18:26:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=38166574</link><dc:creator>oxxoxoxooo</dc:creator><comments>https://news.ycombinator.com/item?id=38166574</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38166574</guid></item><item><title><![CDATA[New comment by oxxoxoxooo in "Passive SSH Key Compromise via Lattices [pdf]"]]></title><description><![CDATA[
<p>> Implementations of big number math can and does contain bugs. (I used to hunt for those via fuzzing, which turned up an amazing number of them.)<p>I'm curious, can you give some examples what kind of bugs did you discover?</p>
]]></description><pubDate>Mon, 06 Nov 2023 18:08:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=38166259</link><dc:creator>oxxoxoxooo</dc:creator><comments>https://news.ycombinator.com/item?id=38166259</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38166259</guid></item><item><title><![CDATA[New comment by oxxoxoxooo in "My favorite prime number generator"]]></title><description><![CDATA[
<p>And another one:<p>The Genuine Sieve of Eratosthenes
<a href="https://www.cs.hmc.edu/~oneill/papers/Sieve-JFP.pdf" rel="nofollow noreferrer">https://www.cs.hmc.edu/~oneill/papers/Sieve-JFP.pdf</a></p>
]]></description><pubDate>Wed, 23 Aug 2023 17:33:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=37239424</link><dc:creator>oxxoxoxooo</dc:creator><comments>https://news.ycombinator.com/item?id=37239424</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=37239424</guid></item><item><title><![CDATA[New comment by oxxoxoxooo in "Zen4's AVX512 Teardown"]]></title><description><![CDATA[
<p>> This instruction is used in some bignum code<p>Could you be more specific? I think for that to work one would also need the upper half of 64x64 multiplication and `vpmullq` provides only the lower half. You could break one 64x64 multiplication into four 32x32 multiplications (i.e. emulate the full 64x64 = 128 bits multiplication) but I was under the impression that this was slow.</p>
]]></description><pubDate>Mon, 26 Sep 2022 20:48:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=32988141</link><dc:creator>oxxoxoxooo</dc:creator><comments>https://news.ycombinator.com/item?id=32988141</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=32988141</guid></item><item><title><![CDATA[What If? 2]]></title><description><![CDATA[
<p>Article URL: <a href="https://xkcd.com/2575/">https://xkcd.com/2575/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=30151367">https://news.ycombinator.com/item?id=30151367</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 31 Jan 2022 17:56:52 +0000</pubDate><link>https://xkcd.com/2575/</link><dc:creator>oxxoxoxooo</dc:creator><comments>https://news.ycombinator.com/item?id=30151367</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=30151367</guid></item><item><title><![CDATA[“Risc V greatly underperforms”]]></title><description><![CDATA[
<p>Article URL: <a href="https://gmplib.org/list-archives/gmp-devel/2021-September/006013.html">https://gmplib.org/list-archives/gmp-devel/2021-September/006013.html</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=29420622">https://news.ycombinator.com/item?id=29420622</a></p>
<p>Points: 310</p>
<p># Comments: 348</p>
]]></description><pubDate>Thu, 02 Dec 2021 18:51:45 +0000</pubDate><link>https://gmplib.org/list-archives/gmp-devel/2021-September/006013.html</link><dc:creator>oxxoxoxooo</dc:creator><comments>https://news.ycombinator.com/item?id=29420622</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=29420622</guid></item><item><title><![CDATA[New comment by oxxoxoxooo in "Benchmarking division and libdivide on Apple M1 and Intel AVX512"]]></title><description><![CDATA[
<p>Hi fish,<p>thanks for very interesting article, again!<p>Do you think the very fast division on M1 has any implications for 128/64 narrowing division as well? Do you know of a faster way than the method by Moller and Granlund? Do you plan on to include 128/64 division in libdivide?<p>And I asked this question before but the parent post got flagged so I'm trying once more: at the very bottom of the Labor of Division (Episode V) post [1], is it really possible for the second `qhat` (i.e. `q0`) to be off by 2? Do you have any examples of that?<p>[1] <a href="https://ridiculousfish.com/blog/posts/labor-of-division-episode-v.html" rel="nofollow">https://ridiculousfish.com/blog/posts/labor-of-division-epis...</a></p>
]]></description><pubDate>Wed, 12 May 2021 23:16:23 +0000</pubDate><link>https://news.ycombinator.com/item?id=27136380</link><dc:creator>oxxoxoxooo</dc:creator><comments>https://news.ycombinator.com/item?id=27136380</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=27136380</guid></item><item><title><![CDATA[New comment by oxxoxoxooo in "Labor of Division: Algorithm D"]]></title><description><![CDATA[
<p>I think that I shall never envision<p>An op unlovely as division<p>An op whose answer must be guessed<p>And then, through multiply, assessed;<p>An op for which we dearly pay,<p>In cycles wasted every day.<p>Division code is often hairy;<p>Long division's downright scary.<p>The proofs can overtax your brain,<p>The ceiling and floor may drive you insane.<p>Good code to divide takes a Knuthian hero<p>But even God can't divide by zero!<p>-- Henry S. Warren Jr.<p>Hacker's Delight</p>
]]></description><pubDate>Wed, 05 May 2021 20:54:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=27055681</link><dc:creator>oxxoxoxooo</dc:creator><comments>https://news.ycombinator.com/item?id=27055681</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=27055681</guid></item></channel></rss>