<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: pavpanchekha</title><link>https://news.ycombinator.com/user?id=pavpanchekha</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sat, 30 May 2026 21:27:18 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=pavpanchekha" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by pavpanchekha in "Notes from the Mistral AI Now Summit"]]></title><description><![CDATA[
<p>OpenAI used to make Codex-specific models, but they stopped. What I've gathered from interviews and similar is that training two models isn't worth the (small) lift from having a coding-specific model. You're pre-training on everything anyway, and coding RL is reasonably useful for general-purpose models too.</p>
]]></description><pubDate>Fri, 29 May 2026 18:54:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=48327680</link><dc:creator>pavpanchekha</dc:creator><comments>https://news.ycombinator.com/item?id=48327680</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48327680</guid></item><item><title><![CDATA[Can LLMs accelerate science? An experiment]]></title><description><![CDATA[
<p>Article URL: <a href="https://pavpanchekha.com/blog/llk.html">https://pavpanchekha.com/blog/llk.html</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47699408">https://news.ycombinator.com/item?id=47699408</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Thu, 09 Apr 2026 04:52:23 +0000</pubDate><link>https://pavpanchekha.com/blog/llk.html</link><dc:creator>pavpanchekha</dc:creator><comments>https://news.ycombinator.com/item?id=47699408</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47699408</guid></item><item><title><![CDATA[New comment by pavpanchekha in "Herbie: Automatically improve imprecise floating point formulas"]]></title><description><![CDATA[
<p>University of Washington Programming Languages and Software Engineering (research group).<p>I'm not at UW any more, I'm now at Utah, but some of the Herbie team is at UW and they provide the infrastructure</p>
]]></description><pubDate>Sat, 04 Apr 2026 18:09:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=47641645</link><dc:creator>pavpanchekha</dc:creator><comments>https://news.ycombinator.com/item?id=47641645</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47641645</guid></item><item><title><![CDATA[New comment by pavpanchekha in "Herbie: Automatically improve imprecise floating point formulas"]]></title><description><![CDATA[
<p>Documented here but yes it's an average, of something similar to but not exactly the same as relative error: <a href="https://herbie.uwplse.org/doc/latest/error.html" rel="nofollow">https://herbie.uwplse.org/doc/latest/error.html</a><p>It's true that averages can be misleading but we encourage users to think about it instead as a percentage of inputs. In practice the error distribution is very bimodal, the two modes being "basically fine" (a few ulps of error) and "garbage" (usually 0 instead of some actual value)</p>
]]></description><pubDate>Sat, 04 Apr 2026 14:04:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=47639172</link><dc:creator>pavpanchekha</dc:creator><comments>https://news.ycombinator.com/item?id=47639172</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47639172</guid></item><item><title><![CDATA[New comment by pavpanchekha in "Herbie: Automatically improve imprecise floating point formulas"]]></title><description><![CDATA[
<p>Author here. The speed up is modeled throughput, though the model is relatively naive. It's possible to disable branches by turning off the regimes flag, see <a href="https://herbie.uwplse.org/doc/1.0/options.html" rel="nofollow">https://herbie.uwplse.org/doc/1.0/options.html</a></p>
]]></description><pubDate>Sat, 04 Apr 2026 14:01:27 +0000</pubDate><link>https://news.ycombinator.com/item?id=47639156</link><dc:creator>pavpanchekha</dc:creator><comments>https://news.ycombinator.com/item?id=47639156</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47639156</guid></item><item><title><![CDATA[New comment by pavpanchekha in "Herbie: Automatically improve imprecise floating point formulas"]]></title><description><![CDATA[
<p>Author here. I've got a few papers about this problem (including one in submission), but it is very very hard to do, especially with acceptable overhead. The state of the art is maybe 100x overhead.</p>
]]></description><pubDate>Sat, 04 Apr 2026 13:59:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=47639140</link><dc:creator>pavpanchekha</dc:creator><comments>https://news.ycombinator.com/item?id=47639140</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47639140</guid></item><item><title><![CDATA[New comment by pavpanchekha in "Herbie: Automatically improve imprecise floating point formulas"]]></title><description><![CDATA[
<p>It is, there's a page in the documentation about how errors are defined. Let me also add: Herbie generally gives the most accurate option it found first, and then the other stuff might be useful for speed (0.5x is way faster than two square roots and a divide!) but it's not as accurate</p>
]]></description><pubDate>Sat, 04 Apr 2026 13:58:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=47639129</link><dc:creator>pavpanchekha</dc:creator><comments>https://news.ycombinator.com/item?id=47639129</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47639129</guid></item><item><title><![CDATA[New comment by pavpanchekha in "Herbie: Automatically improve imprecise floating point formulas"]]></title><description><![CDATA[
<p>Author here! Yes, the float distribution isn't what you want in practice, but distribution selector isn't really the right thing either, because a low probability bad result can still be pretty bad! Hence the range selector; the float distribution is good at picking extreme values that trigger FP error.<p>We usually recommend looking for 90%+ accuracy or carefully examining the accuracy plot</p>
]]></description><pubDate>Sat, 04 Apr 2026 13:56:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=47639112</link><dc:creator>pavpanchekha</dc:creator><comments>https://news.ycombinator.com/item?id=47639112</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47639112</guid></item><item><title><![CDATA[New comment by pavpanchekha in "Herbie: Automatically improve imprecise floating point formulas"]]></title><description><![CDATA[
<p>It was me. Damn it you're right! Will fix!</p>
]]></description><pubDate>Sat, 04 Apr 2026 13:49:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=47639054</link><dc:creator>pavpanchekha</dc:creator><comments>https://news.ycombinator.com/item?id=47639054</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47639054</guid></item><item><title><![CDATA[New comment by pavpanchekha in "In math, rigor is vital, but are digitized proofs taking it too far?"]]></title><description><![CDATA[
<p>In calculus the core issue is that the concept of a "function" was undefined but generally understood to be something like what we'd call today an "expression" in a programming language. So, for example, "x^2 + 1" was widely agreed to be a function, but "if x < 0 then x else 0" was controversial. What's nice about the "function as expression" idea is that generally speaking these functions are continuous, analytic [1], etc and the set of such functions is closed under differentiation and integration [2]. There's a good chance that if you took AP Calculus you basically learned this definition.<p>The formal definition of "function" is totally different! This is typically a big confusion in Calculus 2 or 3! Today, a function is defined as literally any input→output mapping, and the "rule" by which this mapping is defined is irrelevant. This definition is much worse for basic calculus—most mappings are not continuous or differentiable. But it has benefits for more advanced calculus; the initial application was Fourier series. And it is generally much easier to formalize because it is "canonical" in a certain sense, it doesn't depend on questions like "which exact expressions are allowed".<p>This is <i>exactly</i> what the article is complaining about. The non-rigorous intuition preferred for basic calculus and the non-rigorous intuition required for more advanced calculus are different. If you formalize, you'll end up with one rigorous definition, which necessarily will have to incorporate a lot of complexity required for advanced calculus but confusing to beginners.<p>Programming languages are like this too. Compare C and Python. Some things <i>must</i> be written in C, but most things can be more easily written in Python. If the whole development must be one language, the more basic code will suffer. In programming we fix this by developing software as assemblages of different programs written in different languages, but mechanisms for this kind of modularity in formal systems are still under-studied and, today, come with significant untrusted pieces or annoying boilerplate, so this solution isn't yet available.<p>[1] Later it was discovered that in fact this set isn't analytic, but that wasn't known for a long time.<p>[2] I am being imprecise; integrating and solving various differential equations often yields functions that are nice but aren't defined by combinations of named functions. The solution at the time was to name these new discovered functions.</p>
]]></description><pubDate>Mon, 30 Mar 2026 14:56:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=47575159</link><dc:creator>pavpanchekha</dc:creator><comments>https://news.ycombinator.com/item?id=47575159</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47575159</guid></item><item><title><![CDATA[New comment by pavpanchekha in "Python: The Optimization Ladder"]]></title><description><![CDATA[
<p>They're cheap but not free, especially at the front end of the CPU where it's just a lot more instructions to churn through. What the branch predictor gets you is it turns branches, which would normally cause a pipeline bubble, to be executed like straightline code if they're predicted right. It's a bit like a tracing jit. But you will still have a bunch of extra instructions to, like, compute the branch predicate.</p>
]]></description><pubDate>Sat, 14 Mar 2026 17:02:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=47378681</link><dc:creator>pavpanchekha</dc:creator><comments>https://news.ycombinator.com/item?id=47378681</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47378681</guid></item><item><title><![CDATA[New comment by pavpanchekha in "Faster asin() was hiding in plain sight"]]></title><description><![CDATA[
<p>Horner's form is typically also more accurate, or at least, it is not bit-identical, so the compiler won't do it unless you pass -funsafe-math, and maybe not even then.</p>
]]></description><pubDate>Wed, 11 Mar 2026 18:54:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=47339653</link><dc:creator>pavpanchekha</dc:creator><comments>https://news.ycombinator.com/item?id=47339653</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47339653</guid></item><item><title><![CDATA[New comment by pavpanchekha in "Deterministic Programming with LLMs"]]></title><description><![CDATA[
<p>Deterministic output is incompatible with batching, which in turn is critical to high utilization on GPUs, which in turn is necessary to keep costs low.</p>
]]></description><pubDate>Sun, 01 Mar 2026 01:32:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=47202671</link><dc:creator>pavpanchekha</dc:creator><comments>https://news.ycombinator.com/item?id=47202671</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47202671</guid></item><item><title><![CDATA[The Token Production System]]></title><description><![CDATA[
<p>Article URL: <a href="https://pavpanchekha.com/blog/tps.html">https://pavpanchekha.com/blog/tps.html</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47088858">https://news.ycombinator.com/item?id=47088858</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Fri, 20 Feb 2026 15:02:10 +0000</pubDate><link>https://pavpanchekha.com/blog/tps.html</link><dc:creator>pavpanchekha</dc:creator><comments>https://news.ycombinator.com/item?id=47088858</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47088858</guid></item><item><title><![CDATA[New comment by pavpanchekha in "Challenges and Research Directions for Large Language Model Inference Hardware"]]></title><description><![CDATA[
<p>Frontier models are now much bigger than an individual query, hence batching, MoE, etc. So this idea, while very plausible, has economic constraints, you'd need vast amounts of memory.</p>
]]></description><pubDate>Sun, 25 Jan 2026 17:23:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=46755989</link><dc:creator>pavpanchekha</dc:creator><comments>https://news.ycombinator.com/item?id=46755989</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46755989</guid></item><item><title><![CDATA[New comment by pavpanchekha in "Should CSS be constraints?"]]></title><description><![CDATA[
<p>That pretty much <i>is</i> how CSS works! At the most basic level, Flow level is about widths down, heights up. But this basic model doesn't let you do a lot of things some people want to do, like distributing left-over space in a container equally among children (imagine a table). So then CSS added more stuff, like Flex-box, which also fundamentally works like this though adds a second pass.</p>
]]></description><pubDate>Thu, 11 Dec 2025 00:03:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=46225882</link><dc:creator>pavpanchekha</dc:creator><comments>https://news.ycombinator.com/item?id=46225882</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46225882</guid></item><item><title><![CDATA[New comment by pavpanchekha in "Should CSS be constraints?"]]></title><description><![CDATA[
<p>Author here—it is from Tufte CSS. I have a blog post [1] about how floats work. It is a nice example of there being unintuitive and also more-intuitive ways to achieve things in CSS. These days I believe CSS Anchor Positioning provides a simpler way to do this, but I haven't used it yet.<p>[1]: <a href="https://pavpanchekha.com/blog/css-floats.html" rel="nofollow">https://pavpanchekha.com/blog/css-floats.html</a></p>
]]></description><pubDate>Thu, 11 Dec 2025 00:01:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=46225861</link><dc:creator>pavpanchekha</dc:creator><comments>https://news.ycombinator.com/item?id=46225861</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46225861</guid></item><item><title><![CDATA[New comment by pavpanchekha in "Should CSS be constraints?"]]></title><description><![CDATA[
<p>Author here. You're right that a lot of CSS's edge cases and implicit rules stem from other choices and implicit rules that maybe need to be reconsidered. But take this logic a step further. The way text with mixed font sizes is laid out is kinda weird—should we just get rid of that? Mixed Chinese-Latin text is weird (search "idiographic baseline"); should we get rid of that? In fact, variable-size characters are weird, maybe just stick to all-Chinese? I'm joking, of course, but my point isn't that a simpler system is inconceivable, just that it would be inconvenient.</p>
]]></description><pubDate>Wed, 10 Dec 2025 23:58:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=46225839</link><dc:creator>pavpanchekha</dc:creator><comments>https://news.ycombinator.com/item?id=46225839</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46225839</guid></item><item><title><![CDATA[New comment by pavpanchekha in "Should CSS be constraints?"]]></title><description><![CDATA[
<p>Author here. I suppose it depends on what "rely on" means, but... have you ever used CSS to center text? Did you think much at all about what happens if the zoom level is high enough and the screen size small enough that the text doesn't fit? I assume not (I don't think I'd ever thought about that before I read that part of the standard), so in that sense you were relying on this behavior. I do think that in most cases where it activates, the quirk implemented by CSS probably improves the layout.</p>
]]></description><pubDate>Wed, 10 Dec 2025 23:54:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=46225806</link><dc:creator>pavpanchekha</dc:creator><comments>https://news.ycombinator.com/item?id=46225806</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46225806</guid></item><item><title><![CDATA[New comment by pavpanchekha in "Should CSS be constraints?"]]></title><description><![CDATA[
<p>Author here. This specific quirk of CSS is minor, and probably if CSS didn't have this quirk it'd be fine. But I'd guess that you've at least once in your life been on your phone and been browsing a website which used a really long word (or a really long line of code!) in centered text (maybe a heading) and you've scrolled right to read the whole thing. Are you sure <i>your website</i> doesn't have such a thing, if you have centered text somewhere?<p>So, yes, CSS could have <i>fewer</i> edge cases and workarounds---what I refer to in the post as less implicit knowledge---and then it would be simpler. But the resulting layouts would probably be worse. And a radical simplification like a constraint system would probably be even simpler and the results (I assert) would be even worse. It's fine to want a better life for browser developers, but I don't think it's unthinkable for CSS to create new edge cases and sometimes-surprising behavior if it also results in, typically, better outcomes.</p>
]]></description><pubDate>Wed, 10 Dec 2025 23:52:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=46225787</link><dc:creator>pavpanchekha</dc:creator><comments>https://news.ycombinator.com/item?id=46225787</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46225787</guid></item></channel></rss>