<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: 37ef_ced3</title><link>https://news.ycombinator.com/user?id=37ef_ced3</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sun, 26 Apr 2026 09:54:41 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=37ef_ced3" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by 37ef_ced3 in "AVX 512 will be the future"]]></title><description><![CDATA[
<p>Linus Torvalds:<p><pre><code>  And I claim that that is the real problem with AVX-512 (and pretty much any vectorization). I personally cannot find a single benchmark that does anything I would ever do - not even remotely close. So if you aren't into some chess engine, if you aren't into parsing (but not using) JSON, if you aren't into software raytracing (as opposed to raytracing in games, which is clearly starting to take off thanks to GPU support), what else is there?
</code></pre>
Answer? Neural net inference, e.g., <a href="https://NN-512.com" rel="nofollow">https://NN-512.com</a><p>If you need a little bit of inference (say, 20 ReNet50s per second per CPU core) as part of a larger system, there's nothing cheaper. If you're doing a small amount of inference, perhaps limited by other parts of the system, you can't keep a GPU fed and the GPU is a huge waste of money.<p>AVX-512, with its masked operations and dual-input permutations, is an expressive and powerful SIMD instruction set. It's a pleasure to write code for, but we need good hardware support (which is literally years overdue).</p>
]]></description><pubDate>Mon, 28 Nov 2022 16:24:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=33775574</link><dc:creator>37ef_ced3</dc:creator><comments>https://news.ycombinator.com/item?id=33775574</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=33775574</guid></item><item><title><![CDATA[New comment by 37ef_ced3 in "Mapping Out the HPC Dependency Chaos"]]></title><description><![CDATA[
<p>There are completely stand-alone systems like:<p><a href="https://NN-512.com" rel="nofollow">https://NN-512.com</a><p>The stand-alone code generator (a statically linked executable written in Go with no dependencies outside the Go standard library) generates stand-alone POSIX C code for the neural net, requiring only gcc to compile.<p>Also see Fabrice Bellard's LibNC:<p><a href="https://bellard.org/libnc/" rel="nofollow">https://bellard.org/libnc/</a><p>C API. Small library, no external dependencies, available for Linux and Windows.</p>
]]></description><pubDate>Fri, 11 Nov 2022 16:52:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=33563206</link><dc:creator>37ef_ced3</dc:creator><comments>https://news.ycombinator.com/item?id=33563206</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=33563206</guid></item><item><title><![CDATA[New comment by 37ef_ced3 in "Tinygrad: A simple and powerful neural network framework"]]></title><description><![CDATA[
<p>And <a href="https://NN-512.com" rel="nofollow">https://NN-512.com</a></p>
]]></description><pubDate>Fri, 04 Nov 2022 14:08:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=33466938</link><dc:creator>37ef_ced3</dc:creator><comments>https://news.ycombinator.com/item?id=33466938</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=33466938</guid></item><item><title><![CDATA[New comment by 37ef_ced3 in "Incremental Parsing in Go"]]></title><description><![CDATA[
<p>You're in for a big surprise. Try using the language.<p>Spend some time using Go, and you will be impressed by its performance.<p>You'll wonder, "Were all those haters on Hacker News misinformed?"</p>
]]></description><pubDate>Sat, 22 Oct 2022 15:31:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=33299099</link><dc:creator>37ef_ced3</dc:creator><comments>https://news.ycombinator.com/item?id=33299099</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=33299099</guid></item><item><title><![CDATA[New comment by 37ef_ced3 in "Decompiling x86 Deep Neural Network Executables"]]></title><description><![CDATA[
<p>Convolutions are fused into convolutions, elementwise operations are fused into convolutions, everything is inlined except where function calls are needed for pthread work units (and those work units are all custom/arbitrary).</p>
]]></description><pubDate>Sun, 09 Oct 2022 22:36:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=33145985</link><dc:creator>37ef_ced3</dc:creator><comments>https://news.ycombinator.com/item?id=33145985</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=33145985</guid></item><item><title><![CDATA[New comment by 37ef_ced3 in "Decompiling x86 Deep Neural Network Executables"]]></title><description><![CDATA[
<p>So...<p>...you were unable to decipher this Hacker News comment thread...<p>...unable find some C code and build it with GCC and make an executable for yourself...<p>...but you think you can reverse engineer the executable?</p>
]]></description><pubDate>Sun, 09 Oct 2022 22:06:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=33145759</link><dc:creator>37ef_ced3</dc:creator><comments>https://news.ycombinator.com/item?id=33145759</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=33145759</guid></item><item><title><![CDATA[New comment by 37ef_ced3 in "Decompiling x86 Deep Neural Network Executables"]]></title><description><![CDATA[
<p>Here is the C code for a DenseNet-121 generated by NN-512:<p><a href="https://nn-512.com/browse/DenseNet121" rel="nofollow">https://nn-512.com/browse/DenseNet121</a><p>Even if you had the C code available to you, you would have a hard time producing the input graph.<p>Good luck reverse engineering it after GCC has compiled it!<p>NN-512 has an incredibly flexible code generator. It can easily be tweaked to produce completely different code for the same convolution, so everyone can apply their own twist to defeat the reverse engineers ("the intellectual property thieves").</p>
]]></description><pubDate>Sun, 09 Oct 2022 21:19:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=33145439</link><dc:creator>37ef_ced3</dc:creator><comments>https://news.ycombinator.com/item?id=33145439</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=33145439</guid></item><item><title><![CDATA[New comment by 37ef_ced3 in "Decompiling x86 Deep Neural Network Executables"]]></title><description><![CDATA[
<p>This will not be able to reverse engineer fully-customized, fully-fused neural networks generated by NN-512:<p><a href="https://NN-512.com" rel="nofollow">https://NN-512.com</a><p>NN-512 generates custom code for all the operations, custom units of work for the threads, custom code around tensor edges, everything is fused and unrolled and customized. If they can deduce the network graph specification from the AVX-512 code, I will be astonished.<p>If you can do it, show me. But I know you can't.<p>Anyone who cares about model privacy will use their own variant of a tool like NN-512. It's security through obscurity, but that's the best you can hope for if you are distributing an executable.</p>
]]></description><pubDate>Sun, 09 Oct 2022 20:07:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=33144742</link><dc:creator>37ef_ced3</dc:creator><comments>https://news.ycombinator.com/item?id=33144742</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=33144742</guid></item><item><title><![CDATA[New comment by 37ef_ced3 in "Go Developer Survey 2022 Q2 Results"]]></title><description><![CDATA[
<p>Are you seriously suggesting that the language should not have notation for allocating zeroed primitive types and receiving the address of the allocation?<p><pre><code>  var (
      p1 = new(int)
      p2 = new(*int)
      p3 = new(**int)
      p4 = new(complex64)
      p5 = new(complex128)
  )
</code></pre>
I feel like you must be joking. Should we say<p><pre><code>  var (
      c complex128
      p = &c
  )
</code></pre>
every time?</p>
]]></description><pubDate>Thu, 08 Sep 2022 19:05:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=32770436</link><dc:creator>37ef_ced3</dc:creator><comments>https://news.ycombinator.com/item?id=32770436</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=32770436</guid></item><item><title><![CDATA[New comment by 37ef_ced3 in "Go Developer Survey 2022 Q2 Results"]]></title><description><![CDATA[
<p>You can think of Go as the modern representative the Plan 9 and Oberon schools of thought -- their answer to (and replacement for) C and C++.<p>The fact that it was developed at Google is incidental. That's where Robert Griesemer (Oberon) and Rob Pike (Plan 9) and Ken Thompson (Unix, Plan 9) and Russ Cox (Plan 9) and Ian Lance Taylor, etc., happened to be employed.</p>
]]></description><pubDate>Thu, 08 Sep 2022 18:37:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=32769995</link><dc:creator>37ef_ced3</dc:creator><comments>https://news.ycombinator.com/item?id=32769995</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=32769995</guid></item><item><title><![CDATA[New comment by 37ef_ced3 in "Go Developer Survey 2022 Q2 Results"]]></title><description><![CDATA[
<p>It's used like this<p><pre><code>  p := new(int)
</code></pre>
since you can't say<p><pre><code>  p := &int{}</code></pre></p>
]]></description><pubDate>Thu, 08 Sep 2022 18:12:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=32769533</link><dc:creator>37ef_ced3</dc:creator><comments>https://news.ycombinator.com/item?id=32769533</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=32769533</guid></item><item><title><![CDATA[New comment by 37ef_ced3 in "Are GPUs Worth It for ML?"]]></title><description><![CDATA[
<p>For small-scale transformer CPU inference you can use, e.g., Fabrice Bellard's <a href="https://bellard.org/libnc/" rel="nofollow">https://bellard.org/libnc/</a><p>Similarly, for small-scale convolutional CPU inference, where you only need to do maybe 20 ResNet-50 (batch size 1) per second per CPU (cloud CPUs cost $0.015 per hour) you can use inference engines designed for this purpose, e.g., <a href="https://NN-512.com" rel="nofollow">https://NN-512.com</a><p>You can expect about 2x the performance of TensorFlow or PyTorch.</p>
]]></description><pubDate>Mon, 29 Aug 2022 18:52:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=32641993</link><dc:creator>37ef_ced3</dc:creator><comments>https://news.ycombinator.com/item?id=32641993</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=32641993</guid></item><item><title><![CDATA[New comment by 37ef_ced3 in "What are the best resources to learn SIMD and intrinsics?"]]></title><description><![CDATA[
<p><a href="https://NN-512.com" rel="nofollow">https://NN-512.com</a></p>
]]></description><pubDate>Thu, 11 Aug 2022 05:09:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=32421803</link><dc:creator>37ef_ced3</dc:creator><comments>https://news.ycombinator.com/item?id=32421803</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=32421803</guid></item><item><title><![CDATA[New comment by 37ef_ced3 in "Programming languages endorsed for server-side use at Meta"]]></title><description><![CDATA[
<p>Go. The obvious right choice.<p>But Meta is in the advertising business and Google is a competitor.</p>
]]></description><pubDate>Wed, 27 Jul 2022 16:59:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=32253020</link><dc:creator>37ef_ced3</dc:creator><comments>https://news.ycombinator.com/item?id=32253020</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=32253020</guid></item><item><title><![CDATA[New comment by 37ef_ced3 in "Learning Go as a Python Developer: The Good and the Bad"]]></title><description><![CDATA[
<p>As an engineer who uses Go and appreciates it as a replacement for most purposes where you might use C or C++, the complaints just seem bizarre.<p>You've got people who argue against explicit error checking and people who don't understand the important role of nil pointers and people who don't see the massive net win of garbage collection in concurrent programs, and so on.<p>Endlessly.<p>It feels like a waste of time defending the language when the people criticizing it seem to hold such a vastly different point of view. It's like trying to convince people that Natural Born Killers is a good movie or that Primus makes good music. There's an unbridgeable chasm between you and the people you're trying to convince.</p>
]]></description><pubDate>Mon, 18 Jul 2022 20:28:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=32143235</link><dc:creator>37ef_ced3</dc:creator><comments>https://news.ycombinator.com/item?id=32143235</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=32143235</guid></item><item><title><![CDATA[New comment by 37ef_ced3 in "Cruise’s Robot Car Outages Are Jamming Up San Francisco"]]></title><description><![CDATA[
<p>From a customer's perspective, it isn't clear that a hardware/software taxi driver is in any way better than a human taxi driver, particularly with (human) driver assist preventing collisions and all the other warnings provided by a modern car.<p>In both cases someone else is driving for the customer. With a human driver (plus driver assist braking and collision warnings) you have the most flexible, sophisticated intelligence on Earth driving. With a robotaxi you have something inferior. But maybe it's a lot cheaper, right?<p>The robotaxi can only compete on price because that's its only advantage. If you own a car that drives itself, that's a different story. Everybody can see the value proposition.<p>But is the robotaxi actually cheaper at all? We would have to look at the cost of the hardware (how often do lidars fail and how much do they cost to replace?) and the cost of the software development and the cost of the fallback human remote operators (fleet monitoring and teleoperation) and the years of huge R&D investment (billions of dollars) to evaluate whether a robotaxi fleet is indeed cheaper. So how much cheaper is it, exactly? 5%? 10%? 15%?<p>As a customer, would you pay a little more to have the most flexible, sophisticated intelligence on Earth (human brain + driver assist) or would you want to save a few dollars and risk having some dumb piece of software strand you in the middle of the road somewhere?<p>We all use Google Maps or Apple Maps when driving and most of us have seen these systems do boneheaded things. Just imagine the dumb things a robotaxi could do.
It's hard for a normal person to be excited about this. I don't know a single person who is excited by robotaxis.</p>
]]></description><pubDate>Fri, 08 Jul 2022 21:29:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=32027419</link><dc:creator>37ef_ced3</dc:creator><comments>https://news.ycombinator.com/item?id=32027419</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=32027419</guid></item><item><title><![CDATA[New comment by 37ef_ced3 in "Elon Musk had twins last year with one of his top executives"]]></title><description><![CDATA[
<p>Yes, but did he buy her a horse?</p>
]]></description><pubDate>Wed, 06 Jul 2022 23:19:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=32008103</link><dc:creator>37ef_ced3</dc:creator><comments>https://news.ycombinator.com/item?id=32008103</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=32008103</guid></item><item><title><![CDATA[New comment by 37ef_ced3 in "Driverless robotaxi fleet paralyzed for hours in San Francisco"]]></title><description><![CDATA[
<p>From a customer's perspective, it isn't clear that a software taxi driver is in any way better than a human taxi driver, particularly with (human) driver assist preventing collisions and all the other warnings provided by a modern car.<p>In both cases someone else is driving for the customer. With a human driver (plus driver assist braking and collision warnings) you have the most flexible, sophisticated intelligence on Earth driving. With a robotaxi you have something inferior. But maybe it's a lot cheaper, right?<p>The robotaxi can only compete on price because that's its only advantage.<p>If you own a car that drives itself, that's a different story. Everybody can see the value proposition.<p>We would have to look at the cost of the hardware and maintenance and fallback remote operators and the R&D investment to evaluate whether a robotaxi fleet is indeed cheaper. How much cheaper is it, exactly? 5%? 10%? 15%?<p>Would you pay a little more to have the most flexible, sophisticated intelligence on Earth (human brain + driver assist) or would you want to save a few dollars and risk having some dumb piece of software strand you in the middle of the road somewhere?<p>We all use Google Maps or Apple Maps when driving and most of us have seen these systems do boneheaded things. Just imagine the dumb things a robotaxi could do.<p>It's hard for a normal person to be excited about this. I don't know a single person who is excited by robotaxis.</p>
]]></description><pubDate>Fri, 01 Jul 2022 01:32:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=31941061</link><dc:creator>37ef_ced3</dc:creator><comments>https://news.ycombinator.com/item?id=31941061</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=31941061</guid></item><item><title><![CDATA[New comment by 37ef_ced3 in "Ask HN: What is a good medium for daily incremental backups?"]]></title><description><![CDATA[
<p>Colin Percival's Tarsnap: <a href="https://www.tarsnap.com/" rel="nofollow">https://www.tarsnap.com/</a><p>Colin is "Security Officer Emeritus, FreeBSD".</p>
]]></description><pubDate>Mon, 20 Jun 2022 20:33:27 +0000</pubDate><link>https://news.ycombinator.com/item?id=31815541</link><dc:creator>37ef_ced3</dc:creator><comments>https://news.ycombinator.com/item?id=31815541</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=31815541</guid></item><item><title><![CDATA[New comment by 37ef_ced3 in "Crypto Platform Celsius Pauses Withdrawals"]]></title><description><![CDATA[
<p>Animats, I was a fan of your Downside site: <a href="http://downside.com/" rel="nofollow">http://downside.com/</a><p><i>Downside's goal is to remind people that we're experiencing a financial bubble, and that financial bubbles always burst. (We originally wrote that in April, 2000. Any questions?)</i><p>And here we are, living through another huge bubble.</p>
]]></description><pubDate>Mon, 13 Jun 2022 05:16:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=31721476</link><dc:creator>37ef_ced3</dc:creator><comments>https://news.ycombinator.com/item?id=31721476</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=31721476</guid></item></channel></rss>