<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: aesthesia</title><link>https://news.ycombinator.com/user?id=aesthesia</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Mon, 15 Jun 2026 02:40:22 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=aesthesia" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by aesthesia in "There is a shadow hanging over this Fable thing"]]></title><description><![CDATA[
<p>LLM-isms are much less prevalent in base models, which is what GPT-2 was. It had significant problems with maintaining coherence, but GPT-2 generated text did not have the obvious tells of today's LLMs.</p>
]]></description><pubDate>Sat, 13 Jun 2026 16:35:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=48518879</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48518879</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48518879</guid></item><item><title><![CDATA[New comment by aesthesia in "Noise infusion banned from statistical products published by Census Bureau"]]></title><description><![CDATA[
<p>They can certainly enforce that you answer the survey. But it's very difficult to enforce a requirement that people answer questions accurately, particularly when they perceive that doing so will expose them to danger.</p>
]]></description><pubDate>Sat, 13 Jun 2026 16:29:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=48518815</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48518815</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48518815</guid></item><item><title><![CDATA[New comment by aesthesia in "Statement on US government directive to suspend access to Fable 5 and Mythos 5"]]></title><description><![CDATA[
<p>I'm skeptical that you're going to be able to reliably exfiltrate ~10TB of model weights using TEMPEST. Which is not to say weights are secure, just that this isn't the threat model I would be concerned about.</p>
]]></description><pubDate>Sat, 13 Jun 2026 06:10:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=48513900</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48513900</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48513900</guid></item><item><title><![CDATA[New comment by aesthesia in "Statement on US government directive to suspend access to Fable 5 and Mythos 5"]]></title><description><![CDATA[
<p>This is not legislation.</p>
]]></description><pubDate>Sat, 13 Jun 2026 03:15:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=48512512</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48512512</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48512512</guid></item><item><title><![CDATA[New comment by aesthesia in "Statement on US government directive to suspend access to Fable 5 and Mythos 5"]]></title><description><![CDATA[
<p>Come on, no one was worried that GPT-2 would help people engineer viruses. The concern was generating misinformation and spam.</p>
]]></description><pubDate>Sat, 13 Jun 2026 03:09:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=48512459</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48512459</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48512459</guid></item><item><title><![CDATA[New comment by aesthesia in "Statement on US government directive to suspend access to Fable 5 and Mythos 5"]]></title><description><![CDATA[
<p>Moolenaar's quote: "The AI models these companies use are trained by China’s censorship regime and introduce hidden vulnerabilities that put Americans’ data and businesses at risk." That is, Americans using Chinese-trained AI models are exposed to some form of cybersecurity risk.<p>That's not really a threat model described in either of the Anthropic posts you share, which mainly talk about the risks of allowing authoritarian regimes to use powerful US-trained models, and the geopolitical risks of authoritarian countries developing strong AI before democratic/liberal countries do.</p>
]]></description><pubDate>Sat, 13 Jun 2026 03:02:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=48512401</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48512401</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48512401</guid></item><item><title><![CDATA[New comment by aesthesia in "macOS 27 Beta breaks the ability to boot Asahi Linux"]]></title><description><![CDATA[
<p>Word was originally released for the Mac in 1985, so the deal was not that Office would be ported, just that MS would keep developing Office for the Mac.</p>
]]></description><pubDate>Fri, 12 Jun 2026 15:21:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=48505315</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48505315</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48505315</guid></item><item><title><![CDATA[New comment by aesthesia in "Don't let the LLM speak, just probe it"]]></title><description><![CDATA[
<p>This is a neat little trick, but I wonder if you could do substantially the same thing by just prompting/LoRA finetuning the model to produce a single-token output ("yes" or "no"). This only requires a single model forward pass, you can use the same KV caching strategy for shared parts of the prompt, and isotonic regression should work just as well to calibrate the output logits. I guess if you use this method and probe on an internal layer you can skip all the remaining layers, which could be a nice inference speedup.</p>
]]></description><pubDate>Fri, 12 Jun 2026 04:39:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=48499994</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48499994</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48499994</guid></item><item><title><![CDATA[New comment by aesthesia in "Ear Training Practice"]]></title><description><![CDATA[
<p>I appreciate the extremely low fuss interface, but I'm always a little disappointed by chord progression ear training that just plays triads one after another with no thought for voice leading. Generating a nice voice leading for an arbitrary chord progression is a little tricky to do automatically but far from impossible, and might be a fun exercise either for you or your favorite LLM.</p>
]]></description><pubDate>Fri, 12 Jun 2026 04:10:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=48499827</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48499827</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48499827</guid></item><item><title><![CDATA[New comment by aesthesia in "Ear Training Practice"]]></title><description><![CDATA[
<p>Using only 3/2 ratios can sound pretty bad in just intonation as well. Major thirds tuned to 81/64 are off (by a ratio of 81/80) compared with the standard 5/4 tuning, and they don't sound great. This difference is called the syntonic comma and it's been a major issue in the history of tuning.</p>
]]></description><pubDate>Fri, 12 Jun 2026 04:04:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=48499786</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48499786</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48499786</guid></item><item><title><![CDATA[New comment by aesthesia in "Open Reproduction of DeepSeek-R1"]]></title><description><![CDATA[
<p>If you really want to see fully open training pipelines for modern LLMs, Olmo and to a lesser extent Nemotron are what you should look at.<p><a href="https://github.com/allenai/OLMo" rel="nofollow">https://github.com/allenai/OLMo</a><p><a href="https://github.com/NVIDIA-NeMo/Nemotron" rel="nofollow">https://github.com/NVIDIA-NeMo/Nemotron</a></p>
]]></description><pubDate>Thu, 11 Jun 2026 15:09:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=48491420</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48491420</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48491420</guid></item><item><title><![CDATA[New comment by aesthesia in "Policy on the AI Exponential"]]></title><description><![CDATA[
<p>> They are asking for FAA style preclearance and third party audits. That literally means no new AI startup can emerge. Do they not know that audits cost money?<p>Training frontier AI models costs money, orders of magnitude more than third-party audits. If you can afford to build the model, you can afford to have it audited.</p>
]]></description><pubDate>Wed, 10 Jun 2026 21:03:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=48482685</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48482685</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48482685</guid></item><item><title><![CDATA[New comment by aesthesia in "AI profitability is mathematically impossible"]]></title><description><![CDATA[
<p>Yep, in their analysis depreciation meant "get no useful work out of the GPU after this point," though.</p>
]]></description><pubDate>Tue, 09 Jun 2026 21:08:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=48467779</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48467779</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48467779</guid></item><item><title><![CDATA[New comment by aesthesia in "GPT-2: Too Dangerous To Release (2019)"]]></title><description><![CDATA[
<p>One of the main purposes of model cards, from the beginning, has been to outline the ways that a model could be harmful or dangerous, and mitigations that can be or have been taken to reduce those risks. How do you expect labs to publish model cards without talking about this rationale?</p>
]]></description><pubDate>Tue, 09 Jun 2026 20:59:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=48467682</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48467682</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48467682</guid></item><item><title><![CDATA[New comment by aesthesia in "AI profitability is mathematically impossible"]]></title><description><![CDATA[
<p>Oh, just noticed one other very significant error: they evaluate revenue using input token pricing while counting capacity using generated tokens per second. There's a big gap between input and output token pricing, and between prefill TPS and generation TPS.</p>
]]></description><pubDate>Tue, 09 Jun 2026 19:06:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=48465994</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48465994</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48465994</guid></item><item><title><![CDATA[New comment by aesthesia in "AI profitability is mathematically impossible"]]></title><description><![CDATA[
<p>There are some glaring local errors that make this analysis less than trustworthy. For instance, an assumption that corporate income tax applies directly to revenue, or a supposedly generous assumption that GPUs will fully depreciate after 3 years (6-year-old A100s are still in very high demand!). I would love to read a really well thought through investigation of inference costs and how they relate to token pricing, but I have low confidence that this is it.</p>
]]></description><pubDate>Tue, 09 Jun 2026 18:58:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=48465877</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48465877</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48465877</guid></item><item><title><![CDATA[New comment by aesthesia in "Claude Fable 5"]]></title><description><![CDATA[
<p>I mean, they do actually describe what that extra work was, and people elsewhere in this thread are complaining about the effects of those safeguards. So it's not like this is purely empty rhetoric.</p>
]]></description><pubDate>Tue, 09 Jun 2026 18:23:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=48465294</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48465294</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48465294</guid></item><item><title><![CDATA[New comment by aesthesia in "Claude Fable 5"]]></title><description><![CDATA[
<p>Because it's not a user manual? The idea of a model card originated in 2018 (see <a href="https://arxiv.org/abs/1810.03993" rel="nofollow">https://arxiv.org/abs/1810.03993</a>) as a summary of important facts about a model. At the time, this was typically an image classifier or tabular ML model. Model cards became an important concept in AI governance, and they started expanding once models started getting more capable. The point of a model/system card is to document where the model came from and the evaluations that have been run, make a case that the model will be safe and reliable in its intended applications, and warn about any potential dangers from misuse. It's not an explanation of how to use the model.<p>OpenAI also releases system cards; here's GPT-5.5's: <a href="https://deploymentsafety.openai.com/gpt-5-5/safety" rel="nofollow">https://deploymentsafety.openai.com/gpt-5-5/safety</a></p>
]]></description><pubDate>Tue, 09 Jun 2026 18:17:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=48465190</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48465190</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48465190</guid></item><item><title><![CDATA[New comment by aesthesia in "Transformers Are Inherently Succinct"]]></title><description><![CDATA[
<p>Not quite an answer to your question, but you might find this interesting. The Olmo Hybrid paper has some results on relative complexity of problems that can be solved by transformers and RNNs. They don't look at size, just solvability, and find that the sets of problems solvable by the two architectures are incomparable. They actually use these results to inform their architecture design, which includes both attention and state space layers. (Specifically, they choose gated delta-nets with negative eigenvalues, which they show have greater expressivity than those without.)<p><a href="https://arxiv.org/abs/2604.03444" rel="nofollow">https://arxiv.org/abs/2604.03444</a></p>
]]></description><pubDate>Fri, 05 Jun 2026 21:38:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=48418682</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48418682</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48418682</guid></item><item><title><![CDATA[New comment by aesthesia in "Did Claude increase bugs in rsync?"]]></title><description><![CDATA[
<p>Sorry, I should have said this explicitly in the original comment: I think you're likely _correct_ that there isn't a clear increase in the rate of bugs attributable to LLM-authored code in rsync. Your analysis provides evidence in this direction; these are just the things that made me go "hmm". They're not accusations or claims that the conclusion is invalid. But they're definitely things to be curious about.<p>Regarding unlabeled LLM-authored commits, I don't think it's unreasonable in general to think that an open-source project might have had unlabeled LLM-authored commits at some point before 2026. Looking more closely at rsync's recent commit history, I think it's less likely in this case. There's just a low number of commits in general, _until_ large batches of Claude-authored commits start showing up early this year. But this then raises some questions about the bugs-per-commit metric; it does correct for something like "size of release", but also obscures a significant shift in commit velocity that may be downstream of adding LLM development tools to the workflow.<p>Like I said, I don't have a dog in this fight, and I try not to approach sorts of questions from a position of explicit advocacy. I do think it's an interesting question, though, and we should try to understand what the data is actually telling us.</p>
]]></description><pubDate>Fri, 05 Jun 2026 19:10:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=48416865</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48416865</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48416865</guid></item></channel></rss>