<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: andy_xor_andrew</title><link>https://news.ycombinator.com/user?id=andy_xor_andrew</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sun, 12 Apr 2026 20:12:18 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=andy_xor_andrew" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by andy_xor_andrew in "Embarrassingly simple self-distillation improves code generation"]]></title><description><![CDATA[
<p>I've been wondering about adaptive decoding! It seems obvious to me that at some points during decoding (reasoning, "creative thinking") you would want a higher temperature, while at other points (emitting syntactically correct code, following a plan that was already established) you would want lower temperature.</p>
]]></description><pubDate>Sat, 04 Apr 2026 22:05:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=47643989</link><dc:creator>andy_xor_andrew</dc:creator><comments>https://news.ycombinator.com/item?id=47643989</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47643989</guid></item><item><title><![CDATA[New comment by andy_xor_andrew in "Embarrassingly simple self-distillation improves code generation"]]></title><description><![CDATA[
<p>Not really? If you read it, there is no validation, no correctness signal, no verification, none of that. They're just passing in benchmark inputs, collecting the outputs (regardless of their quality), training on those outputs, and then sweeping the decode settings (temp, topk) of the resulting model. Their conclusion is that this results in a better model than the original - even when taking into consideration the same temp/topk sweep of the original.<p>So no, they are not fine-tuning a general purpose model to produce "valid benchmark code results."</p>
]]></description><pubDate>Sat, 04 Apr 2026 22:04:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=47643976</link><dc:creator>andy_xor_andrew</dc:creator><comments>https://news.ycombinator.com/item?id=47643976</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47643976</guid></item><item><title><![CDATA[New comment by andy_xor_andrew in "Show HN: Apfel – The free AI already on your Mac"]]></title><description><![CDATA[
<p>I find the branding to be a little odd. Like, it should be a github page with a README that says "here's how to use this." Like, the full explanation of this project is right there in the HN title: "The free AI already on your Mac."<p>I guess LLMs have made it too simple to instantly build startup landing page slop, which causes this? Like, do we need to see the github star count chart? Do we need all the buzzwords and stuff? You'd think this was a startup trying to get a billion dollar evaluation. It feels disingenuous.<p>Maybe I'm just being a hater.</p>
]]></description><pubDate>Fri, 03 Apr 2026 19:04:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=47630739</link><dc:creator>andy_xor_andrew</dc:creator><comments>https://news.ycombinator.com/item?id=47630739</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47630739</guid></item><item><title><![CDATA[New comment by andy_xor_andrew in "EmDash – A spiritual successor to WordPress that solves plugin security"]]></title><description><![CDATA[
<p>> x402 is an open, neutral standard for Internet-native payments. It lets anyone on the Internet easily charge, and any client pay on-demand, on a pay-per-use basis. A client, such as an agent, sends a HTTP request and receives a HTTP 402 Payment Required status code. In response, the client pays for access on-demand, and the server can let the client through to the requested content.<p>Fascinating. Cloudflare is envisioning a future where agents are given debit cards by their owners, so they can autonomously send microtransactions to website owners to scrape content or possibly purchase goods on the owner's behalf. I don't know how I feel about that but there's no doubt it's a fascinating concept.<p>Brb, setting up a honeypot that always responds with HTTP 402 Payment Required demanding 10cents per visit... That's the next "selling 1 million pixels on my website for $1 each", I guess</p>
]]></description><pubDate>Wed, 01 Apr 2026 17:01:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=47603526</link><dc:creator>andy_xor_andrew</dc:creator><comments>https://news.ycombinator.com/item?id=47603526</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47603526</guid></item><item><title><![CDATA[New comment by andy_xor_andrew in "Wine-Staging 11.1 Adds Patches for Enabling Recent Photoshop Versions on Linux"]]></title><description><![CDATA[
<p>Curious if someone could enlighten me-<p>How much of these sorts of patches are specifically checking if a certain application is running, and then changing behavior to match what that application expects? And how much of it is simply better emulating the Windows API in general?<p>I think there are benefits to both approaches, not criticizing either one. I'm just curious if the implementation of a patch like this is "We fixed an inconsistency between Wine and Windows" vs "We're checking if Photoshop is running and using a different locking primitive" or whatever.</p>
]]></description><pubDate>Sun, 25 Jan 2026 18:24:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=46756631</link><dc:creator>andy_xor_andrew</dc:creator><comments>https://news.ycombinator.com/item?id=46756631</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46756631</guid></item><item><title><![CDATA[New comment by andy_xor_andrew in "Understanding Rust Closures"]]></title><description><![CDATA[
<p>if I'm not mistaken (and I very well may be!) my primary confusion with closures comes from the fact that: the trait they implement (FnOnce / Fn / FnMut) depends entirely upon what happens inside the closure.<p>It will automatically implement the most general, relaxed version (FnMut I think?) and only restrict itself further to FnOnce and Fn based on what you do inside the closure.<p>So, it can be tricky to know what's going on, and making a code change can change the contract of the closure and therefore where and how it can be used.<p>(I invite rust experts to correct me if any of the above is mistaken - I always forget the order of precedence for FnOnce/Fn/FnMut and which implies which)</p>
]]></description><pubDate>Sat, 24 Jan 2026 22:41:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=46748466</link><dc:creator>andy_xor_andrew</dc:creator><comments>https://news.ycombinator.com/item?id=46748466</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46748466</guid></item><item><title><![CDATA[New comment by andy_xor_andrew in "Anthropic Economic Index report: economic primitives"]]></title><description><![CDATA[
<p>no payoff whatsoever? I just asked Claude to do a task that would have previously taken me four days. Then I got up and got lunch, and when I was back, it was done.<p>I would never make the argument that there are no risks. But there's also no way you can make the argument there are no payoffs!</p>
]]></description><pubDate>Thu, 22 Jan 2026 23:14:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=46726341</link><dc:creator>andy_xor_andrew</dc:creator><comments>https://news.ycombinator.com/item?id=46726341</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46726341</guid></item><item><title><![CDATA[New comment by andy_xor_andrew in "GPTZero finds 100 new hallucinations in NeurIPS 2025 accepted papers"]]></title><description><![CDATA[
<p>I guess that makes this "standing on the shoulders of fabrications"</p>
]]></description><pubDate>Thu, 22 Jan 2026 18:35:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=46723342</link><dc:creator>andy_xor_andrew</dc:creator><comments>https://news.ycombinator.com/item?id=46723342</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46723342</guid></item><item><title><![CDATA[New comment by andy_xor_andrew in "Qwen3-Omni-Flash-2025-12-01：a next-generation native multimodal large model"]]></title><description><![CDATA[
<p>> This is a 30B parameter MoE with 3B active parameters<p>Where are you finding that info? Not saying you're wrong; just saying that I didn't see that specified anywhere in the linked page, or on their HF.</p>
]]></description><pubDate>Wed, 10 Dec 2025 18:37:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=46221587</link><dc:creator>andy_xor_andrew</dc:creator><comments>https://news.ycombinator.com/item?id=46221587</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46221587</guid></item><item><title><![CDATA[New comment by andy_xor_andrew in "PRC elites voice AI-skepticism"]]></title><description><![CDATA[
<p>> former Dean of Electronics Engineering and Computer Science at Peking University, has noted that Chinese data makes up only 1.3 percent of global large-model datasets (The Paper, March 24). Reflecting these concerns, the Ministry of State Security (MSS) has issued a stark warning that “poisoned data” (数据投毒) could “mislead public opinion” (误导社会舆论) (Sina Finance, August 5).<p>from a technical point of view, I suppose it's actually not a problem like he suggests. You can use all the pro-democracy, pro-free-speech, anti-PRC data in the world, but the pretraining stages (on the planet's data) are more for instilling core language abilities, and are far less important than the SFT / RL / DPO / etc stages, which require far less data, and can tune a model towards whatever ideology you'd like. Plus, you can do things like selectively identify vectors that encode for certain high-level concepts, and emphasize them during inference, like Golden Gate Claude.</p>
]]></description><pubDate>Tue, 25 Nov 2025 19:08:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=46049454</link><dc:creator>andy_xor_andrew</dc:creator><comments>https://news.ycombinator.com/item?id=46049454</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46049454</guid></item><item><title><![CDATA[New comment by andy_xor_andrew in "How to turn liquid glass into a solid interface"]]></title><description><![CDATA[
<p>I truly, genuinely wanted to like Liquid Glass. I think the default reaction to ANY change in UX, even changes that are generally improvements, is: "I don't like this, it's different!"<p>I thought that'd be the case for ios 26. But after installing it... yeesh. I can barely see anything. It's just awful.</p>
]]></description><pubDate>Tue, 14 Oct 2025 20:40:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=45584556</link><dc:creator>andy_xor_andrew</dc:creator><comments>https://news.ycombinator.com/item?id=45584556</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45584556</guid></item><item><title><![CDATA[New comment by andy_xor_andrew in "Google Safe Browsing incident"]]></title><description><![CDATA[
<p>> In order to limit the impact of similar issues in the future, all sites on statichost.eu are now created with a statichost.page domain instead.<p>This read like a dark twist in a horror novel - the .page tld is controlled by Google!<p><a href="https://get.page/" rel="nofollow">https://get.page/</a></p>
]]></description><pubDate>Fri, 10 Oct 2025 18:26:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=45542158</link><dc:creator>andy_xor_andrew</dc:creator><comments>https://news.ycombinator.com/item?id=45542158</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45542158</guid></item><item><title><![CDATA[New comment by andy_xor_andrew in "F-35 pilot held 50-minute airborne conference call with engineers before crash"]]></title><description><![CDATA[
<p>Sure, of course I will trust the report as the source of truth.<p>But I'm interested in the reporting. There are, you know, journalistic standards, which are considered kinda "journalism 101"! For instance, getting the basic facts of a story correct - especially <i>the facts stated in the headline</i>.<p>So I'm curious, did the reporter do their due diligence, and write the article in a way that is factually correct, but highly misleading? Or did they simply not follow basic reporting protocol?</p>
]]></description><pubDate>Wed, 27 Aug 2025 15:58:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=45041421</link><dc:creator>andy_xor_andrew</dc:creator><comments>https://news.ycombinator.com/item?id=45041421</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45041421</guid></item><item><title><![CDATA[New comment by andy_xor_andrew in "F-35 pilot held 50-minute airborne conference call with engineers before crash"]]></title><description><![CDATA[
<p>I read the article (twice) and I <i>still</i> have the impression the pilot was in fact the one in the conference call<p>Opening line:<p>> A US Air Force F-35 pilot spent 50 minutes on an airborne conference call with Lockheed Martin engineers trying to solve a problem with his fighter jet before he ejected<p>Am I illiterate or misreading it?<p>> After going through system checklists in an attempt to remedy the problem, the pilot got on a conference call with engineers from the plane’s manufacturer, Lockheed Martin, *<i>as the plane flew near the air base. *</i><p>Is this actually some insane weasel-wording by CNN? "We never said <i>the pilot</i> (he is in fact a pilot) was the one flying the jet, we just said 'as the plane flew', not 'as he flew the plane', using passive voice, so we're not wrong - but it was another pilot flying the plane"</p>
]]></description><pubDate>Wed, 27 Aug 2025 15:49:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=45041299</link><dc:creator>andy_xor_andrew</dc:creator><comments>https://news.ycombinator.com/item?id=45041299</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45041299</guid></item><item><title><![CDATA[New comment by andy_xor_andrew in "It’s not wrong that "\u{1F926}\u{1F3FC}\u200D\u2642\uFE0F".length == 7 (2019)"]]></title><description><![CDATA[
<p><a href="https://news.ycombinator.com/item?id=27529697">https://news.ycombinator.com/item?id=27529697</a></p>
]]></description><pubDate>Fri, 22 Aug 2025 18:15:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=44987789</link><dc:creator>andy_xor_andrew</dc:creator><comments>https://news.ycombinator.com/item?id=44987789</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44987789</guid></item><item><title><![CDATA[New comment by andy_xor_andrew in "Home Depot sued for 'secretly' using facial recognition at self-checkouts"]]></title><description><![CDATA[
<p>I was wondering this as well. The green box could simply indicate it detected a face, using something like YOLO, or even a simpler technique like some point-and-shoot cameras use to decide where to focus (on faces, obviously).</p>
]]></description><pubDate>Thu, 21 Aug 2025 16:20:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=44974597</link><dc:creator>andy_xor_andrew</dc:creator><comments>https://news.ycombinator.com/item?id=44974597</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44974597</guid></item><item><title><![CDATA[LLMs Are Magic – Their Applications Should Be, Too]]></title><description><![CDATA[
<p>Article URL: <a href="https://debugti.me/posts/llm-magic/">https://debugti.me/posts/llm-magic/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=44500588">https://news.ycombinator.com/item?id=44500588</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 08 Jul 2025 14:56:17 +0000</pubDate><link>https://debugti.me/posts/llm-magic/</link><dc:creator>andy_xor_andrew</dc:creator><comments>https://news.ycombinator.com/item?id=44500588</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44500588</guid></item><item><title><![CDATA[New comment by andy_xor_andrew in "Xfinity using WiFi signals in your house to detect motion"]]></title><description><![CDATA[
<p>Yeah, it's bizarre.<p>Normally the pathway for this kind of thing would be:<p>1. theorized<p>2. proven in a research lab<p>3. not feasible in real-world use (fizzles and dies)<p>if you're <i>lucky</i> the path is like<p>1. theorized<p>2. proven in a research lab<p>3. actually somewhat feasible in real-world use!<p>4. startups / researchers split off to attempt to market it (fizzles and dies)<p>the fact that this ended up going from research paper to "Comcast can tell if I'm home based on my body's physical interaction with wifi waves" is absolutely wild</p>
]]></description><pubDate>Mon, 30 Jun 2025 20:58:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=44427807</link><dc:creator>andy_xor_andrew</dc:creator><comments>https://news.ycombinator.com/item?id=44427807</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44427807</guid></item><item><title><![CDATA[Hacker News Clone (Microeval)]]></title><description><![CDATA[
<p>Article URL: <a href="https://artificialanalysis.ai/microevals/hackernews-clone-1749859153173">https://artificialanalysis.ai/microevals/hackernews-clone-1749859153173</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=44381246">https://news.ycombinator.com/item?id=44381246</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 25 Jun 2025 19:52:43 +0000</pubDate><link>https://artificialanalysis.ai/microevals/hackernews-clone-1749859153173</link><dc:creator>andy_xor_andrew</dc:creator><comments>https://news.ycombinator.com/item?id=44381246</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44381246</guid></item><item><title><![CDATA[New comment by andy_xor_andrew in "Q-learning is not yet scalable"]]></title><description><![CDATA[
<p>The article mentions AlphaGo/Mu/Zero was not based on Q-Learning - I'm no expert but I thought AlphaGo was based on DeepMind's "Deep Q-Learning"? Is that not right?</p>
]]></description><pubDate>Sun, 15 Jun 2025 03:58:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=44280430</link><dc:creator>andy_xor_andrew</dc:creator><comments>https://news.ycombinator.com/item?id=44280430</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44280430</guid></item></channel></rss>