<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: aesthesia</title><link>https://news.ycombinator.com/user?id=aesthesia</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Wed, 17 Jun 2026 08:16:20 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=aesthesia" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by aesthesia in "Feds freaked over Fable 5 after 'fix this code', not jailbreak, say researchers"]]></title><description><![CDATA[
<p>You can see their general approach to guardrail classifiers in these posts:<p><a href="https://www.anthropic.com/research/constitutional-classifiers" rel="nofollow">https://www.anthropic.com/research/constitutional-classifier...</a>
<a href="https://www.anthropic.com/research/next-generation-constitutional-classifiers" rel="nofollow">https://www.anthropic.com/research/next-generation-constitut...</a><p>It's not just keyword matching, but I'm sure they tuned the Fable classifiers pretty hard to avoid false negatives.</p>
]]></description><pubDate>Tue, 16 Jun 2026 15:28:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=48556787</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48556787</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48556787</guid></item><item><title><![CDATA[New comment by aesthesia in "SubQ 1.1 Small"]]></title><description><![CDATA[
<p>Disappointing they don't actually say how their sparse attention mechanism works.</p>
]]></description><pubDate>Tue, 16 Jun 2026 15:12:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=48556528</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48556528</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48556528</guid></item><item><title><![CDATA[New comment by aesthesia in "Apple Foundation Models"]]></title><description><![CDATA[
<p>From the linked docs page:<p>> Requests go directly from your app to the Claude API; Apple is not in the request path and does not see prompts or responses. Usage is billed to your Anthropic account at standard API pricing. Your app decides when to use Claude and when to use Apple's on-device model: pass whichever model you want to each session.</p>
]]></description><pubDate>Mon, 15 Jun 2026 16:46:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=48543901</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48543901</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48543901</guid></item><item><title><![CDATA[New comment by aesthesia in "There is a shadow hanging over this Fable thing"]]></title><description><![CDATA[
<p>LLM-isms are much less prevalent in base models, which is what GPT-2 was. It had significant problems with maintaining coherence, but GPT-2 generated text did not have the obvious tells of today's LLMs.</p>
]]></description><pubDate>Sat, 13 Jun 2026 16:35:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=48518879</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48518879</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48518879</guid></item><item><title><![CDATA[New comment by aesthesia in "Noise infusion banned from statistical products published by Census Bureau"]]></title><description><![CDATA[
<p>They can certainly enforce that you answer the survey. But it's very difficult to enforce a requirement that people answer questions accurately, particularly when they perceive that doing so will expose them to danger.</p>
]]></description><pubDate>Sat, 13 Jun 2026 16:29:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=48518815</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48518815</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48518815</guid></item><item><title><![CDATA[New comment by aesthesia in "Statement on US government directive to suspend access to Fable 5 and Mythos 5"]]></title><description><![CDATA[
<p>I'm skeptical that you're going to be able to reliably exfiltrate ~10TB of model weights using TEMPEST. Which is not to say weights are secure, just that this isn't the threat model I would be concerned about.</p>
]]></description><pubDate>Sat, 13 Jun 2026 06:10:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=48513900</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48513900</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48513900</guid></item><item><title><![CDATA[New comment by aesthesia in "Statement on US government directive to suspend access to Fable 5 and Mythos 5"]]></title><description><![CDATA[
<p>This is not legislation.</p>
]]></description><pubDate>Sat, 13 Jun 2026 03:15:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=48512512</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48512512</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48512512</guid></item><item><title><![CDATA[New comment by aesthesia in "Statement on US government directive to suspend access to Fable 5 and Mythos 5"]]></title><description><![CDATA[
<p>Come on, no one was worried that GPT-2 would help people engineer viruses. The concern was generating misinformation and spam.</p>
]]></description><pubDate>Sat, 13 Jun 2026 03:09:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=48512459</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48512459</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48512459</guid></item><item><title><![CDATA[New comment by aesthesia in "Statement on US government directive to suspend access to Fable 5 and Mythos 5"]]></title><description><![CDATA[
<p>Moolenaar's quote: "The AI models these companies use are trained by China’s censorship regime and introduce hidden vulnerabilities that put Americans’ data and businesses at risk." That is, Americans using Chinese-trained AI models are exposed to some form of cybersecurity risk.<p>That's not really a threat model described in either of the Anthropic posts you share, which mainly talk about the risks of allowing authoritarian regimes to use powerful US-trained models, and the geopolitical risks of authoritarian countries developing strong AI before democratic/liberal countries do.</p>
]]></description><pubDate>Sat, 13 Jun 2026 03:02:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=48512401</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48512401</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48512401</guid></item><item><title><![CDATA[New comment by aesthesia in "macOS 27 Beta breaks the ability to boot Asahi Linux"]]></title><description><![CDATA[
<p>Word was originally released for the Mac in 1985, so the deal was not that Office would be ported, just that MS would keep developing Office for the Mac.</p>
]]></description><pubDate>Fri, 12 Jun 2026 15:21:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=48505315</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48505315</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48505315</guid></item><item><title><![CDATA[New comment by aesthesia in "Don't let the LLM speak, just probe it"]]></title><description><![CDATA[
<p>This is a neat little trick, but I wonder if you could do substantially the same thing by just prompting/LoRA finetuning the model to produce a single-token output ("yes" or "no"). This only requires a single model forward pass, you can use the same KV caching strategy for shared parts of the prompt, and isotonic regression should work just as well to calibrate the output logits. I guess if you use this method and probe on an internal layer you can skip all the remaining layers, which could be a nice inference speedup.</p>
]]></description><pubDate>Fri, 12 Jun 2026 04:39:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=48499994</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48499994</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48499994</guid></item><item><title><![CDATA[New comment by aesthesia in "Ear Training Practice"]]></title><description><![CDATA[
<p>I appreciate the extremely low fuss interface, but I'm always a little disappointed by chord progression ear training that just plays triads one after another with no thought for voice leading. Generating a nice voice leading for an arbitrary chord progression is a little tricky to do automatically but far from impossible, and might be a fun exercise either for you or your favorite LLM.</p>
]]></description><pubDate>Fri, 12 Jun 2026 04:10:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=48499827</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48499827</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48499827</guid></item><item><title><![CDATA[New comment by aesthesia in "Ear Training Practice"]]></title><description><![CDATA[
<p>Using only 3/2 ratios can sound pretty bad in just intonation as well. Major thirds tuned to 81/64 are off (by a ratio of 81/80) compared with the standard 5/4 tuning, and they don't sound great. This difference is called the syntonic comma and it's been a major issue in the history of tuning.</p>
]]></description><pubDate>Fri, 12 Jun 2026 04:04:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=48499786</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48499786</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48499786</guid></item><item><title><![CDATA[New comment by aesthesia in "Open Reproduction of DeepSeek-R1"]]></title><description><![CDATA[
<p>If you really want to see fully open training pipelines for modern LLMs, Olmo and to a lesser extent Nemotron are what you should look at.<p><a href="https://github.com/allenai/OLMo" rel="nofollow">https://github.com/allenai/OLMo</a><p><a href="https://github.com/NVIDIA-NeMo/Nemotron" rel="nofollow">https://github.com/NVIDIA-NeMo/Nemotron</a></p>
]]></description><pubDate>Thu, 11 Jun 2026 15:09:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=48491420</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48491420</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48491420</guid></item><item><title><![CDATA[New comment by aesthesia in "Policy on the AI Exponential"]]></title><description><![CDATA[
<p>> They are asking for FAA style preclearance and third party audits. That literally means no new AI startup can emerge. Do they not know that audits cost money?<p>Training frontier AI models costs money, orders of magnitude more than third-party audits. If you can afford to build the model, you can afford to have it audited.</p>
]]></description><pubDate>Wed, 10 Jun 2026 21:03:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=48482685</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48482685</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48482685</guid></item><item><title><![CDATA[New comment by aesthesia in "AI profitability is mathematically impossible"]]></title><description><![CDATA[
<p>Yep, in their analysis depreciation meant "get no useful work out of the GPU after this point," though.</p>
]]></description><pubDate>Tue, 09 Jun 2026 21:08:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=48467779</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48467779</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48467779</guid></item><item><title><![CDATA[New comment by aesthesia in "GPT-2: Too Dangerous To Release (2019)"]]></title><description><![CDATA[
<p>One of the main purposes of model cards, from the beginning, has been to outline the ways that a model could be harmful or dangerous, and mitigations that can be or have been taken to reduce those risks. How do you expect labs to publish model cards without talking about this rationale?</p>
]]></description><pubDate>Tue, 09 Jun 2026 20:59:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=48467682</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48467682</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48467682</guid></item><item><title><![CDATA[New comment by aesthesia in "AI profitability is mathematically impossible"]]></title><description><![CDATA[
<p>Oh, just noticed one other very significant error: they evaluate revenue using input token pricing while counting capacity using generated tokens per second. There's a big gap between input and output token pricing, and between prefill TPS and generation TPS.</p>
]]></description><pubDate>Tue, 09 Jun 2026 19:06:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=48465994</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48465994</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48465994</guid></item><item><title><![CDATA[New comment by aesthesia in "AI profitability is mathematically impossible"]]></title><description><![CDATA[
<p>There are some glaring local errors that make this analysis less than trustworthy. For instance, an assumption that corporate income tax applies directly to revenue, or a supposedly generous assumption that GPUs will fully depreciate after 3 years (6-year-old A100s are still in very high demand!). I would love to read a really well thought through investigation of inference costs and how they relate to token pricing, but I have low confidence that this is it.</p>
]]></description><pubDate>Tue, 09 Jun 2026 18:58:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=48465877</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48465877</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48465877</guid></item><item><title><![CDATA[New comment by aesthesia in "Claude Fable 5"]]></title><description><![CDATA[
<p>I mean, they do actually describe what that extra work was, and people elsewhere in this thread are complaining about the effects of those safeguards. So it's not like this is purely empty rhetoric.</p>
]]></description><pubDate>Tue, 09 Jun 2026 18:23:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=48465294</link><dc:creator>aesthesia</dc:creator><comments>https://news.ycombinator.com/item?id=48465294</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48465294</guid></item></channel></rss>