<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: bytepoet</title><link>https://news.ycombinator.com/user?id=bytepoet</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Fri, 24 Apr 2026 11:59:25 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=bytepoet" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by bytepoet in "GPUs: Anatomy of high performance matmul kernels"]]></title><description><![CDATA[
<p>Wonderful! Great, detailed explanation. I look forward to reading the vLLM post as well.</p>
]]></description><pubDate>Tue, 30 Sep 2025 14:32:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=45425958</link><dc:creator>bytepoet</dc:creator><comments>https://news.ycombinator.com/item?id=45425958</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45425958</guid></item><item><title><![CDATA[New comment by bytepoet in "Compiling LLMs into a MegaKernel: A path to low-latency inference"]]></title><description><![CDATA[
<p>Thanks for the input. That's very helpful to know.<p>I look forward to following Mirage development.</p>
]]></description><pubDate>Fri, 20 Jun 2025 00:41:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=44323821</link><dc:creator>bytepoet</dc:creator><comments>https://news.ycombinator.com/item?id=44323821</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44323821</guid></item><item><title><![CDATA[New comment by bytepoet in "Compiling LLMs into a MegaKernel: A path to low-latency inference"]]></title><description><![CDATA[
<p>This is very cool. I enjoyed going through the writeup and GitHub README.<p>I was wondering whether these same optimizations could be brought to bear on training as well, rather than only inference. I guess the challenge there is fusing backward computations with gradient communication.<p>I also saw that this currently does not handle dynamic workloads such as MoE. I recently came across a paper that does exactly that:<p>FlashDMoE: Fast Distributed MoE in a Single Kernel - <a href="https://arxiv.org/pdf/2506.04667" rel="nofollow">https://arxiv.org/pdf/2506.04667</a></p>
]]></description><pubDate>Thu, 19 Jun 2025 20:36:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=44322313</link><dc:creator>bytepoet</dc:creator><comments>https://news.ycombinator.com/item?id=44322313</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44322313</guid></item><item><title><![CDATA[New comment by bytepoet in "Writing in the Age of LLMs"]]></title><description><![CDATA[
<p>I really enjoyed reading this, particularly the first part, where the author is specific about why we invariably (and often vaguely) find LLM-generated text slightly off.<p>I cherish writing and find it a wonderful tool for thinking. So far, I've tried to do technical writing without much LLM help. I do run the final draft through a good model to catch factual inaccuracies.</p>
]]></description><pubDate>Tue, 17 Jun 2025 19:13:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=44302764</link><dc:creator>bytepoet</dc:creator><comments>https://news.ycombinator.com/item?id=44302764</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44302764</guid></item><item><title><![CDATA[New comment by bytepoet in "How much do language models memorize?"]]></title><description><![CDATA[
<p>I enjoyed reading this paper. The experiments are well-designed and the writing is clear.<p>Much work on the generalization of ML models deals with asymptotic bounds. Here, there's a precise way of measuring memorization, even for relatively large models with millions of parameters.</p>
]]></description><pubDate>Fri, 06 Jun 2025 08:03:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=44198741</link><dc:creator>bytepoet</dc:creator><comments>https://news.ycombinator.com/item?id=44198741</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44198741</guid></item><item><title><![CDATA[New comment by bytepoet in "The Who Cares Era"]]></title><description><![CDATA[
<p>Such a well-written and thoughtful blog post. Loved it!</p>
]]></description><pubDate>Wed, 28 May 2025 13:55:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=44116075</link><dc:creator>bytepoet</dc:creator><comments>https://news.ycombinator.com/item?id=44116075</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44116075</guid></item><item><title><![CDATA[New comment by bytepoet in "LLMs get lost in multi-turn conversation"]]></title><description><![CDATA[
<p>The inability of LLMs to ask for clarification was exactly the flaw we encountered when testing them on open-ended problems stated somewhat ambiguously. This was in the context of paradoxical situations, tested on DeepSeek-R1 and Claude-3.7-Sonnet. Blog post about our experiments: <a href="https://pankajpansari.github.io/posts/paradoxes/" rel="nofollow">https://pankajpansari.github.io/posts/paradoxes/</a></p>
]]></description><pubDate>Thu, 15 May 2025 05:52:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=43992217</link><dc:creator>bytepoet</dc:creator><comments>https://news.ycombinator.com/item?id=43992217</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43992217</guid></item><item><title><![CDATA[New comment by bytepoet in "GPU Puzzles"]]></title><description><![CDATA[
<p>Thanks a lot, Sasha, for creating these. I found your LLM training puzzles to be excellent as well.</p>
]]></description><pubDate>Mon, 23 Sep 2024 13:27:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=41625866</link><dc:creator>bytepoet</dc:creator><comments>https://news.ycombinator.com/item?id=41625866</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41625866</guid></item></channel></rss>