<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: charleshn</title><link>https://news.ycombinator.com/user?id=charleshn</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Thu, 25 Jun 2026 11:21:18 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=charleshn" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[Workload isolation using shuffle-sharding]]></title><description><![CDATA[
<p>Article URL: <a href="https://aws.amazon.com/builders-library/workload-isolation-using-shuffle-sharding/">https://aws.amazon.com/builders-library/workload-isolation-using-shuffle-sharding/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48303772">https://news.ycombinator.com/item?id=48303772</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Thu, 28 May 2026 02:45:28 +0000</pubDate><link>https://aws.amazon.com/builders-library/workload-isolation-using-shuffle-sharding/</link><dc:creator>charleshn</dc:creator><comments>https://news.ycombinator.com/item?id=48303772</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48303772</guid></item><item><title><![CDATA[New comment by charleshn in "A recent experience with ChatGPT 5.5 Pro"]]></title><description><![CDATA[
<p>> Some people like to parrot "next token prediction", "LLMs can only interpolate", and other nonsense<p>Thank you for illustrating my point.</p>
]]></description><pubDate>Sat, 09 May 2026 13:37:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=48074880</link><dc:creator>charleshn</dc:creator><comments>https://news.ycombinator.com/item?id=48074880</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48074880</guid></item><item><title><![CDATA[New comment by charleshn in "A recent experience with ChatGPT 5.5 Pro"]]></title><description><![CDATA[
<p>Yes, they can.<p>Some people like to parrot "next token prediction", "LLMs can only interpolate", and other nonsense, but it is obviously not true for many reasons, in particular since we introduced RL.<p>Humans do not have the monopoly on generating novel ideas, modern AI models using post training, RL etc  can come to them in the same way we do, exploration.<p>See also verifier's law [0]: "The ease of training AI to solve a task is proportional to how verifiable the task is. All tasks that are possible to solve and easy to verify will be solved by AI."<p>This applied to chess, go, strategy games, and we can now see it applying to mathematics, algorithmic problems, etc.<p>It is incredibly humbling to see AI outperform humans at creative cognitive tasks, and realise that the bitter lesson [1] applies so generally, but here we are.<p>[0] <a href="https://www.jasonwei.net/blog/asymmetry-of-verification-and-verifiers-law" rel="nofollow">https://www.jasonwei.net/blog/asymmetry-of-verification-and-...</a><p>[1] <a href="http://www.incompleteideas.net/IncIdeas/BitterLesson.html" rel="nofollow">http://www.incompleteideas.net/IncIdeas/BitterLesson.html</a></p>
]]></description><pubDate>Sat, 09 May 2026 10:58:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=48073900</link><dc:creator>charleshn</dc:creator><comments>https://news.ycombinator.com/item?id=48073900</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48073900</guid></item><item><title><![CDATA[New comment by charleshn in "AlphaEvolve: Gemini-powered coding agent scaling impact across fields"]]></title><description><![CDATA[
<p>Not sure what you mean: OpenEvolve is an open source implementation of AlphaEvolve: <a href="https://huggingface.co/blog/codelion/openevolve" rel="nofollow">https://huggingface.co/blog/codelion/openevolve</a></p>
]]></description><pubDate>Fri, 08 May 2026 02:56:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=48057956</link><dc:creator>charleshn</dc:creator><comments>https://news.ycombinator.com/item?id=48057956</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48057956</guid></item><item><title><![CDATA[New comment by charleshn in "AlphaEvolve: Gemini-powered coding agent scaling impact across fields"]]></title><description><![CDATA[
<p>They'll likely make it available at some point, but for now one can use OpenEvolve [0] which is not quite as good but should be a good start to use the same LLM-driven evolutionary framework.<p>[0] <a href="https://github.com/algorithmicsuperintelligence/openevolve" rel="nofollow">https://github.com/algorithmicsuperintelligence/openevolve</a></p>
]]></description><pubDate>Fri, 08 May 2026 01:24:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=48057368</link><dc:creator>charleshn</dc:creator><comments>https://news.ycombinator.com/item?id=48057368</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48057368</guid></item><item><title><![CDATA[New comment by charleshn in "Open sourcing Dicer: Databricks's auto-sharder"]]></title><description><![CDATA[
<p>> Application pods learn the current assignment through a library called the Slicelet (S for server side). The Slicelet maintains a local cache of the latest assignment by fetching it from the Dicer service and watching for updates. When it receives an updated assignment, the Slicelet notifies the application via a listener API.<p>For a critical control plane component like this, I tend to prefer a constant work pattern [0], to avoid metastable failures [1], e.g. periodically pull the data instead of relying on notifications.<p>[0] <a href="https://aws.amazon.com/builders-library/reliability-and-constant-work/" rel="nofollow">https://aws.amazon.com/builders-library/reliability-and-cons...</a><p>[1] <a href="https://brooker.co.za/blog/2021/05/24/metastable.html" rel="nofollow">https://brooker.co.za/blog/2021/05/24/metastable.html</a></p>
]]></description><pubDate>Wed, 14 Jan 2026 00:01:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=46610387</link><dc:creator>charleshn</dc:creator><comments>https://news.ycombinator.com/item?id=46610387</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46610387</guid></item><item><title><![CDATA[What Does a Database for SSDs Look Like?]]></title><description><![CDATA[
<p>Article URL: <a href="https://brooker.co.za/blog/2025/12/15/database-for-ssd.html">https://brooker.co.za/blog/2025/12/15/database-for-ssd.html</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46334990">https://news.ycombinator.com/item?id=46334990</a></p>
<p>Points: 148</p>
<p># Comments: 121</p>
]]></description><pubDate>Sat, 20 Dec 2025 10:13:33 +0000</pubDate><link>https://brooker.co.za/blog/2025/12/15/database-for-ssd.html</link><dc:creator>charleshn</dc:creator><comments>https://news.ycombinator.com/item?id=46334990</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46334990</guid></item><item><title><![CDATA[New comment by charleshn in "AMD officially confirms fresh next-gen Zen 6 CPU details"]]></title><description><![CDATA[
<p>They should be reintroducing the 3D vcache [0] variants (X) in EPYC, with a higher cache/core ratio, that was present in EPYC4 (e.g. 9684X [1]) they for some reason wasn't available in EPYC5.<p>Makes a massive difference at high density and utilisation, with the standard cache/core performance can really degrade under load.<p>[0] <a href="https://www.amd.com/en/products/processors/technologies/3d-v-cache.html" rel="nofollow">https://www.amd.com/en/products/processors/technologies/3d-v...</a><p>[1] <a href="https://www.amd.com/en/products/processors/server/epyc/4th-generation-9004-and-8004-series/amd-epyc-9684x.html" rel="nofollow">https://www.amd.com/en/products/processors/server/epyc/4th-g...</a></p>
]]></description><pubDate>Fri, 19 Dec 2025 18:39:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=46329286</link><dc:creator>charleshn</dc:creator><comments>https://news.ycombinator.com/item?id=46329286</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46329286</guid></item><item><title><![CDATA[New comment by charleshn in "The highest quality codebase"]]></title><description><![CDATA[
<p>It's fundamentally because of verifier's law [0].<p>Current AI, and in particular RL-based, is already or will soon achieve super human performance on problems that can be - quickly - verified and measured.<p>So maths, algorithms, etc and well defined bugs fall into that category.<p>However architectural decision, design, long-term planning where there is little data, no model allowing synthetic data generation, and long iteration cycles are not so much amenable to it.<p>[0] <a href="https://www.jasonwei.net/blog/asymmetry-of-verification-and-verifiers-law" rel="nofollow">https://www.jasonwei.net/blog/asymmetry-of-verification-and-...</a></p>
]]></description><pubDate>Sat, 13 Dec 2025 09:12:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=46253227</link><dc:creator>charleshn</dc:creator><comments>https://news.ycombinator.com/item?id=46253227</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46253227</guid></item><item><title><![CDATA[New comment by charleshn in "Spinlocks vs. Mutexes: When to Spin and When to Sleep"]]></title><description><![CDATA[
<p>> std::hardware_destructive_interference_size Exists so you don't have to guess, although in practice it'll basically always be 64.<p>Unfortunately it's not quite true, do to e.g. spacial prefetching [0]. See e.g. Folly's definition [1].<p>[0] <a href="https://community.intel.com/t5/Intel-Moderncode-for-Parallel/Hardware-prefetch-and-shared-multi-core-resources-on-Xeon/td-p/1074003" rel="nofollow">https://community.intel.com/t5/Intel-Moderncode-for-Parallel...</a><p>[1] <a href="https://github.com/facebook/folly/blob/d2e6fe65dfd6b30a9d504d0409ac733cbaa73125/folly/lang/Align.h#L172" rel="nofollow">https://github.com/facebook/folly/blob/d2e6fe65dfd6b30a9d504...</a></p>
]]></description><pubDate>Mon, 08 Dec 2025 04:04:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=46188170</link><dc:creator>charleshn</dc:creator><comments>https://news.ycombinator.com/item?id=46188170</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46188170</guid></item><item><title><![CDATA[New comment by charleshn in "TPUs vs. GPUs and why Google is positioned to win AI race in the long term"]]></title><description><![CDATA[
<p>> There's a good reason so much research is done on Nvidia clusters and not TPU clusters.<p>You are aware that Gemini was trained on TPU, and that most research at Deepmind is done on TPU?</p>
]]></description><pubDate>Sat, 29 Nov 2025 00:14:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=46084046</link><dc:creator>charleshn</dc:creator><comments>https://news.ycombinator.com/item?id=46084046</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46084046</guid></item><item><title><![CDATA[Collective Communication for 100k+ GPUs]]></title><description><![CDATA[
<p>Article URL: <a href="https://arxiv.org/abs/2510.20171">https://arxiv.org/abs/2510.20171</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=45766956">https://news.ycombinator.com/item?id=45766956</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Fri, 31 Oct 2025 00:11:03 +0000</pubDate><link>https://arxiv.org/abs/2510.20171</link><dc:creator>charleshn</dc:creator><comments>https://news.ycombinator.com/item?id=45766956</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45766956</guid></item><item><title><![CDATA[New comment by charleshn in "Tinnitus Neuromodulator"]]></title><description><![CDATA[
<p>I can relate.<p>I had tinnitus for over 10 years.
My tinnitus was not the usual ringing type, it was some sort of humming, low frequency noise. The frequency was not constant, it could vary.
It could sometimes stop for 5-10 minutes, e.g. after a hot bath.<p>Went to see many specialists, tried everything, to no avail.<p>One day I started experiencing recurrent tension and light pain in my neck and shoulder blades, so I started doing some neck and shoulder blades stretches several times a day.<p>After a few weeks, the pain was gone, and I realised the tinnitus had stopped. This was maybe 2 years ago (I am still doing those exercises multiple times a day).</p>
]]></description><pubDate>Sat, 18 Oct 2025 18:48:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=45629544</link><dc:creator>charleshn</dc:creator><comments>https://news.ycombinator.com/item?id=45629544</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45629544</guid></item><item><title><![CDATA[New comment by charleshn in "TernFS – An exabyte scale, multi-region distributed filesystem"]]></title><description><![CDATA[
<p>A few questions if the authors are around!<p>> Is hardware agnostic and uses TCP/IP to communicate.<p>So no RDMA?
It's very hard to make effective use of modern NVMe drives bandwidth over TCP/IP.<p>> A logical shard is further split into five physical instances, one leader and four followers, in a typical distributed consensus setup. The distributed consensus engine is provided by a purpose-built Raft-like implementation, which we call LogsDB<p>Raft-like, so not Raft, a custom algorithm?
Implementing distributed consensus correctly from scratch is very hard - why not use some battle-tested implementations?<p>> Read/write access to the block service is provided using a simple TCP API currently implemented by a Go process. This process is hardware agnostic and uses the Go standard library to read and write blocks to a conventional local file system. We originally planned to rewrite the Go process in C++, and possibly write to block devices directly, but the idiomatic Go implementation has proven performant enough for our needs so far.<p>The document mentions it's designed to reach TB/s though. 
Which means that for an IO intensive workload, one would end up wasting a lot of drive bandwidth, and require a huge number of nodes.<p>Modern parallel filesystems can reach 80-90GB/s per node, using RDMA, DPDK etc.<p>> This is in contrast to protocols like NFS, whereby each connection is very stateful, holding resources such as open files, locks, and so on.<p>This is not true for NFSv3 and older, it tends to be stateless (no notion of open file).<p>No mention of the way this was developed and tested - does it use some formal methods, simulator, chaos engineering etc?</p>
]]></description><pubDate>Thu, 18 Sep 2025 22:50:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=45295973</link><dc:creator>charleshn</dc:creator><comments>https://news.ycombinator.com/item?id=45295973</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45295973</guid></item><item><title><![CDATA[New comment by charleshn in "Strong Eventual Consistency – The Big Idea Behind CRDTs"]]></title><description><![CDATA[
<p>Interesting that neither the article nor the comments mention the CALM theorem [0], which gives a framework to explain when coordination-free consistency is possible, and is arguably the big idea behind SEC.<p>[0] <a href="https://arxiv.org/abs/1901.01930" rel="nofollow">https://arxiv.org/abs/1901.01930</a></p>
]]></description><pubDate>Wed, 10 Sep 2025 11:02:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=45195948</link><dc:creator>charleshn</dc:creator><comments>https://news.ycombinator.com/item?id=45195948</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45195948</guid></item><item><title><![CDATA[New comment by charleshn in "The Bitter Lesson Is Misunderstood"]]></title><description><![CDATA[
<p>You can have a look at the DeepSeek paper, in particular section "2.2 DeepSeek-R1-Zero: Reinforcement Learning on the Base Mode".<p>But generally the idea is that it's, you need some notion of reward, verifiers etc.<p>Works really well for maths, algorithms, amd many things actually.<p>See also this very short essay/introduction: <a href="https://www.jasonwei.net/blog/asymmetry-of-verification-and-verifiers-law" rel="nofollow">https://www.jasonwei.net/blog/asymmetry-of-verification-and-...</a><p>That's why we have IMO gold level models now, and I'm pretty confident we'll have superhuman mathematics, algorithmic etc models before long.<p>Now domains which are very hard to verify - think e.g. theoretical physics etc - that's another story.</p>
]]></description><pubDate>Wed, 03 Sep 2025 23:16:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=45121408</link><dc:creator>charleshn</dc:creator><comments>https://news.ycombinator.com/item?id=45121408</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45121408</guid></item><item><title><![CDATA[New comment by charleshn in "The Bitter Lesson Is Misunderstood"]]></title><description><![CDATA[
<p>> We cannot add more compute to a given compute budget C without increasing data D to maintain the relationship.
> We must either (1) discover new architectures with different scaling laws, and/or (2) compute new synthetic data that can contribute to learning (akin to dreams).<p>Of course we can, this is a non issue.<p>See e.g. AlphaZero [0] that's 8 years old at this point, and any modern RL training using synthetic data, e.g. DeepSeek-R1-Zero [1].<p>[0] <a href="https://en.m.wikipedia.org/wiki/AlphaZero" rel="nofollow">https://en.m.wikipedia.org/wiki/AlphaZero</a><p>[1] <a href="https://arxiv.org/abs/2501.12948" rel="nofollow">https://arxiv.org/abs/2501.12948</a></p>
]]></description><pubDate>Wed, 03 Sep 2025 22:53:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=45121239</link><dc:creator>charleshn</dc:creator><comments>https://news.ycombinator.com/item?id=45121239</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45121239</guid></item><item><title><![CDATA[New comment by charleshn in "How to Think About GPUs"]]></title><description><![CDATA[
<p>Yes, 450GB/s is the per GPU bandwidth in the nvlink domain.
3.2Tbps is the per-host bandwidth in the scale out IB/Ethernet domain.</p>
]]></description><pubDate>Wed, 20 Aug 2025 14:31:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=44962320</link><dc:creator>charleshn</dc:creator><comments>https://news.ycombinator.com/item?id=44962320</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44962320</guid></item><item><title><![CDATA[Demystifying NCCL: An In-Depth Analysis of GPU Communication Protocols and Algos]]></title><description><![CDATA[
<p>Article URL: <a href="https://arxiv.org/abs/2507.04786">https://arxiv.org/abs/2507.04786</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=44894413">https://news.ycombinator.com/item?id=44894413</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 13 Aug 2025 22:04:27 +0000</pubDate><link>https://arxiv.org/abs/2507.04786</link><dc:creator>charleshn</dc:creator><comments>https://news.ycombinator.com/item?id=44894413</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44894413</guid></item><item><title><![CDATA[New comment by charleshn in "The Surprising gRPC Client Bottleneck in Low-Latency Networks"]]></title><description><![CDATA[
<p>Could you check the value of your kernel's net.ipv4.tcp_slow_start_after_idle sysctl, and if it's non zero set it to 0?</p>
]]></description><pubDate>Wed, 23 Jul 2025 23:47:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=44665309</link><dc:creator>charleshn</dc:creator><comments>https://news.ycombinator.com/item?id=44665309</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44665309</guid></item></channel></rss>