<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: sluongng</title><link>https://news.ycombinator.com/user?id=sluongng</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Tue, 19 May 2026 01:54:51 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=sluongng" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by sluongng in "Cutting inference cold starts by 40x with LP, FUSE, C/R, and CUDA-checkpoint"]]></title><description><![CDATA[
<p>There are plenty of cool advancements in reducing inference cold start when I was meeting with folks in person at FOSDEM this year. However, I still struggle to understand: why would folks care about this?<p>Major AI Labs all have secured their own compute in the form of hardware, data center, and power generation. That means their resource pool is fixed, and they can do all sorts of tricks to pre-load, pre-allocate, etc... to improve on inference latency.<p>Cold start is usually a solution for "cloud" environment when your pool is flexible, and you only pay for what you use. Its effectiveness lowered in bare-metal settings as folks do not care about scaling up and down as much.<p>So my question is: who is this for? AWS and GCP running Anthropic models?</p>
]]></description><pubDate>Mon, 18 May 2026 20:12:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=48184916</link><dc:creator>sluongng</dc:creator><comments>https://news.ycombinator.com/item?id=48184916</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48184916</guid></item><item><title><![CDATA[New comment by sluongng in "Show HN: Git bayesect – Bayesian Git bisection for non-deterministic bugs"]]></title><description><![CDATA[
<p>You can run bisect with first-parent</p>
]]></description><pubDate>Thu, 02 Apr 2026 03:35:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=47609692</link><dc:creator>sluongng</dc:creator><comments>https://news.ycombinator.com/item?id=47609692</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47609692</guid></item><item><title><![CDATA[New comment by sluongng in "Ninja is a small build system with a focus on speed"]]></title><description><![CDATA[
<p>My teammate has a great time reimplementing Ninja (slop-free) in Go here <a href="https://github.com/buildbuddy-io/reninja" rel="nofollow">https://github.com/buildbuddy-io/reninja</a> to make it even faster with Remote Build Execution.</p>
]]></description><pubDate>Mon, 30 Mar 2026 13:27:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=47574024</link><dc:creator>sluongng</dc:creator><comments>https://news.ycombinator.com/item?id=47574024</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47574024</guid></item><item><title><![CDATA[New comment by sluongng in "Parallel coding agents with tmux and Markdown specs"]]></title><description><![CDATA[
<p>Yeah, I don't disagree with your assessment at all. I think the H2A ratio is still a good metric for the AI adoption rate of an organization. At a higher H2A ratio, you will also start to hear people measuring things using token volumes, which I think is also a similar metric (because most models nowadays run on a relatively fixed Tokens/second speed).<p>All of this is not a direct signal to a productivity boost. I think at higher volumes, you will need to start to account for the "yield" rate of the token volumes above: what are the volumes of tokens that get to the final production deployment? At which stage is it a constraint on the yield? Is it the models, or is it the harness, or something else (i.e. Code Review, CI/CD, Security Scans etc...)? And then it becomes an optimization problem to reduce the Cost of Goods Sold while improving/maintaining Revenues. The "productivity" will then be dissolved into multiple separate but more tangible metrics.</p>
]]></description><pubDate>Tue, 03 Mar 2026 08:32:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=47229755</link><dc:creator>sluongng</dc:creator><comments>https://news.ycombinator.com/item?id=47229755</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47229755</guid></item><item><title><![CDATA[New comment by sluongng in "Parallel coding agents with tmux and Markdown specs"]]></title><description><![CDATA[
<p>I do. The reason why the current generation of agents are good at coding is because the labs have sufficient time and computes to generate synthetic chain-of-thoughts data, feed those data through RL before use them to train the LLMs. These distillation takes time, time which starts from the release of the previous generation of models.<p>So we are just now getting agents which can reliably loop themselves for medium size tasks. This generation opens a new door towards agent-managing-agents chain of thoughts data. I think we would only get multi-agents with high reliability sometimes by the mid to end of 2026, assuming no major geopolitical disruption.</p>
]]></description><pubDate>Mon, 02 Mar 2026 19:43:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=47223060</link><dc:creator>sluongng</dc:creator><comments>https://news.ycombinator.com/item?id=47223060</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47223060</guid></item><item><title><![CDATA[New comment by sluongng in "Parallel coding agents with tmux and Markdown specs"]]></title><description><![CDATA[
<p>Yeah the 8 agents limit aligns well with my conversations with folks in the leading labs<p><a href="https://open.substack.com/pub/sluongng/p/stages-of-coding-agents-adoption" rel="nofollow">https://open.substack.com/pub/sluongng/p/stages-of-coding-ag...</a><p>I think we need much different toolings to go beyond 1 human - 10 agents ratio. And much much different tooling to achieve a higher ratio than that</p>
]]></description><pubDate>Mon, 02 Mar 2026 18:09:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=47221691</link><dc:creator>sluongng</dc:creator><comments>https://news.ycombinator.com/item?id=47221691</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47221691</guid></item><item><title><![CDATA[New comment by sluongng in "Cracking the Python Monorepo"]]></title><description><![CDATA[
<p>Most of the time, the CI resources in a python monorepo is not spent on packaging. It’s spent on running the tests.<p>I would love to read more about how the author is tackling the testing problem in their setup.</p>
]]></description><pubDate>Sun, 01 Mar 2026 08:09:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=47204723</link><dc:creator>sluongng</dc:creator><comments>https://news.ycombinator.com/item?id=47204723</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47204723</guid></item><item><title><![CDATA[New comment by sluongng in "Move tests to closed source repo"]]></title><description><![CDATA[
<p><a href="https://sluongng.substack.com/i/186718212/test-is-king" rel="nofollow">https://sluongng.substack.com/i/186718212/test-is-king</a> I wrote about this less than a month ago. Things are moving pretty fast in this direction.</p>
]]></description><pubDate>Fri, 27 Feb 2026 08:06:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=47177904</link><dc:creator>sluongng</dc:creator><comments>https://news.ycombinator.com/item?id=47177904</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47177904</guid></item><item><title><![CDATA[New comment by sluongng in "Show HN: I ported Tree-sitter to Go"]]></title><description><![CDATA[
<p>Oh this is really neat for the Bazel community, as depending on tree-sitter to build a gazelle language extension, with Gazelle written in Go, requires you to use CGO.<p>Now perhaps we can get rid of the CGO dependency and make it pure Go instead.
I have pinged some folks to take a look at it.</p>
]]></description><pubDate>Wed, 25 Feb 2026 19:01:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=47156142</link><dc:creator>sluongng</dc:creator><comments>https://news.ycombinator.com/item?id=47156142</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47156142</guid></item><item><title><![CDATA[New comment by sluongng in "Putting Gemini to Work in Chrome"]]></title><description><![CDATA[
<p>Not yet in Linux?</p>
]]></description><pubDate>Thu, 29 Jan 2026 09:02:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=46807574</link><dc:creator>sluongng</dc:creator><comments>https://news.ycombinator.com/item?id=46807574</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46807574</guid></item><item><title><![CDATA[New comment by sluongng in "Rust at Scale: An Added Layer of Security for WhatsApp"]]></title><description><![CDATA[
<p>I suspect they just use no_std whenever its applicable<p><a href="https://github.com/facebook/buck2/commit/4a1ccdd36e0de0b69ee185fc9dae8c9d9dfdf3de" rel="nofollow">https://github.com/facebook/buck2/commit/4a1ccdd36e0de0b69ee...</a><p><a href="https://github.com/facebook/buck2/commit/bee72b29bc9b67b59ba775ce65057f122c75aa70" rel="nofollow">https://github.com/facebook/buck2/commit/bee72b29bc9b67b59ba...</a><p>Turn out if you have strong control over the compiler and linker instrumentations, there are a lot of ways to optimize binary size</p>
]]></description><pubDate>Wed, 28 Jan 2026 12:49:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=46794639</link><dc:creator>sluongng</dc:creator><comments>https://news.ycombinator.com/item?id=46794639</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46794639</guid></item><item><title><![CDATA[New comment by sluongng in "I made my own Git"]]></title><description><![CDATA[
<p>Zstd dictionary compression is essentially how Meta's Mercurial fork (Sapling VCS) stores blobs <a href="https://sapling-scm.com/docs/dev/internals/zstdelta" rel="nofollow">https://sapling-scm.com/docs/dev/internals/zstdelta</a>. The source code is available in GitHub if folks want to study the tradeoffs vs git delta-compressed packfiles.<p>I think theoratically, Git delta-compression is still a lot more optimized for smaller repos. But for bigger repos where sharding storaged is required, path-based delta dictionary compression does much better. Git recently (in the last 1 year) got something called "path-walk" which is fairly similar though.</p>
]]></description><pubDate>Tue, 27 Jan 2026 12:24:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=46779073</link><dc:creator>sluongng</dc:creator><comments>https://news.ycombinator.com/item?id=46779073</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46779073</guid></item><item><title><![CDATA[New comment by sluongng in "Transfering Files with gRPC"]]></title><description><![CDATA[
<p>The evolving schema is much more attractive than a bunch of plain text HTTP headers when you want to communicate additional metadata with the file download/upload.<p>For example, there are common metadata such as the digest (hash) of the blob, the compression algorithm, the base compression dictionary, whether Reed-Solomon is applicable or not, etc...<p>And like others have pointed out, having existing grpc infrastructure in place definitely helps using it a lot easier.<p>But yeah, it's a tradeoff.</p>
]]></description><pubDate>Mon, 26 Jan 2026 15:36:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=46766925</link><dc:creator>sluongng</dc:creator><comments>https://news.ycombinator.com/item?id=46766925</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46766925</guid></item><item><title><![CDATA[New comment by sluongng in "Transfering Files with gRPC"]]></title><description><![CDATA[
<p><a href="https://github.com/googleapis/googleapis/blob/master/google/bytestream/bytestream.proto" rel="nofollow">https://github.com/googleapis/googleapis/blob/master/google/...</a> is a more complete version of this. It supports resumable uploads, and the download can start from an offset within a file, allowing you to download only part of the file instead of the whole.<p>Another version of this is to use grpc to communicate the "metadata" of a download file, and then "side" load the file using a side channel with http (or some other light-weight copy methods). Gitlab uses this to transfer Git packfiles and serve git fetch requests iirc <a href="https://gitlab.com/gitlab-org/gitaly/-/blob/master/doc/sidechannel.md?ref_type=heads" rel="nofollow">https://gitlab.com/gitlab-org/gitaly/-/blob/master/doc/sidec...</a></p>
]]></description><pubDate>Mon, 26 Jan 2026 13:56:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=46765663</link><dc:creator>sluongng</dc:creator><comments>https://news.ycombinator.com/item?id=46765663</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46765663</guid></item><item><title><![CDATA[Post-Agentic Code Forges]]></title><description><![CDATA[
<p>Article URL: <a href="https://sluongng.substack.com/p/post-agentic-code-forges">https://sluongng.substack.com/p/post-agentic-code-forges</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46741909">https://news.ycombinator.com/item?id=46741909</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Sat, 24 Jan 2026 08:10:08 +0000</pubDate><link>https://sluongng.substack.com/p/post-agentic-code-forges</link><dc:creator>sluongng</dc:creator><comments>https://news.ycombinator.com/item?id=46741909</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46741909</guid></item><item><title><![CDATA[New comment by sluongng in "A faster path to container images in Bazel"]]></title><description><![CDATA[
<p>The underlying problem is that most container images are not cache efficient. Compressed tarballs arent and that’s what most of container images are. And Bazel relies heavily on caching to stay fast.<p>Most of the hyper scaler actually do not store container images as tarballs at scale. They usually flatten the layers and either cache the entire file system merkle tree, or breaking it down to even smaller blocks to cache them efficiently. See Alibaba Firefly Nydus, AWS Firecracker, etc… There is also various different forms of snapshotters that can lazily materialize the layers like estargz, soci, nix, etc… but none of them are widely adopted.</p>
]]></description><pubDate>Thu, 25 Dec 2025 09:28:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=46383278</link><dc:creator>sluongng</dc:creator><comments>https://news.ycombinator.com/item?id=46383278</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46383278</guid></item><item><title><![CDATA[New comment by sluongng in "Fast trigram based code search"]]></title><description><![CDATA[
<p>> They use Google's web indexing technology adapted for trigrams, which was mostly developed to support their massive internal monorepo<p>Do you have a source for this? I would love to read more about it.<p>In the doc of the Zoekt repo, it says<p>> What does cs.bazel.build run on?<p>> Currently, it runs on a single Google Cloud VM with 16 vCPUs, 60G RAM and an attached physical SSD.<p><a href="https://github.com/sourcegraph/zoekt/blob/main/doc/faq.md#what-does-csbazelbuild-run-on" rel="nofollow">https://github.com/sourcegraph/zoekt/blob/main/doc/faq.md#wh...</a><p>so at least they were using Zoekt up until a certain point in the past.</p>
]]></description><pubDate>Fri, 05 Dec 2025 14:17:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=46161588</link><dc:creator>sluongng</dc:creator><comments>https://news.ycombinator.com/item?id=46161588</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46161588</guid></item><item><title><![CDATA[New comment by sluongng in "Introducing architecture variants"]]></title><description><![CDATA[
<p>Nice. This is one of the main reasons why I picked CachyOS recently.
Now I can fallback to Ubuntu if CachyOS gets me stuck somewhere.</p>
]]></description><pubDate>Fri, 31 Oct 2025 17:08:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=45774290</link><dc:creator>sluongng</dc:creator><comments>https://news.ycombinator.com/item?id=45774290</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45774290</guid></item><item><title><![CDATA[New comment by sluongng in "Modern CI is too complex and misdirected (2021)"]]></title><description><![CDATA[
<p>The most concerning part about modern CI to me is how most of it is running on GitHub Actions, and how GitHub itself has been deprioritizing GitHub Actions maintenance and improvements over AI features.<p>Seriously, take a look at their pinned repo: <a href="https://github.com/actions/starter-workflows" rel="nofollow">https://github.com/actions/starter-workflows</a><p>> Thank you for your interest in this GitHub repo, however, right now we are not taking contributions.<p>> We continue to focus our resources on strategic areas that help our customers be successful while making developers' lives easier. While GitHub Actions remains a key part of this vision, we are allocating resources towards other areas of Actions and are not taking contributions to this repository at this time.</p>
]]></description><pubDate>Wed, 20 Aug 2025 11:59:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=44961044</link><dc:creator>sluongng</dc:creator><comments>https://news.ycombinator.com/item?id=44961044</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44961044</guid></item><item><title><![CDATA[New comment by sluongng in "Claude says “You're absolutely right!” about everything"]]></title><description><![CDATA[
<p>I don't view it as a bug. It's a personality trait of the model that made "user steering" much easier, thus helping the model to handle a wider range of tasks.<p>I also think that there will be no "perfect" personality out there. There will always be folks who view some traits as annoying icks. So, some level of RL-based personality customization down the line will be a must.</p>
]]></description><pubDate>Wed, 13 Aug 2025 12:12:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=44887471</link><dc:creator>sluongng</dc:creator><comments>https://news.ycombinator.com/item?id=44887471</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44887471</guid></item></channel></rss>