<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: kakugawa</title><link>https://news.ycombinator.com/user?id=kakugawa</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Wed, 10 Jun 2026 02:02:02 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=kakugawa" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by kakugawa in "Claude Fable 5"]]></title><description><![CDATA[
<p>I didn't see Fable 5 in the `/model` list, until I ran it with: `$ claude --model fable-5`</p>
]]></description><pubDate>Tue, 09 Jun 2026 18:37:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=48465531</link><dc:creator>kakugawa</dc:creator><comments>https://news.ycombinator.com/item?id=48465531</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48465531</guid></item><item><title><![CDATA[New comment by kakugawa in "Is AI causing a repeat of frontend’s lost decade?"]]></title><description><![CDATA[
<p>a11y testing is non-trivial. axe-core can automatically detect many types of issues. However, enough compliance (to avoid being sued) needs end-to-end testing and human judgement. e.g. keyboard traps, focus restoration, alt-text, etc.</p>
]]></description><pubDate>Fri, 29 May 2026 14:11:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=48323348</link><dc:creator>kakugawa</dc:creator><comments>https://news.ycombinator.com/item?id=48323348</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48323348</guid></item><item><title><![CDATA[New comment by kakugawa in "Claude Opus 4.8"]]></title><description><![CDATA[
<p>Thank you for pointing this out.</p>
]]></description><pubDate>Thu, 28 May 2026 21:07:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=48315497</link><dc:creator>kakugawa</dc:creator><comments>https://news.ycombinator.com/item?id=48315497</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48315497</guid></item><item><title><![CDATA[New comment by kakugawa in "Claude Opus 4.8"]]></title><description><![CDATA[
<p>Opus 4.7 does not support disabling adaptive thinking (web, Claude Code). [1] Like the OP, I experienced similar issues and I'm glad that they brought back the ability to disable adaptive thinking in Opus 4.8.<p>[1] <a href="https://code.claude.com/docs/en/model-config#adaptive-reasoning-and-fixed-thinking-budgets" rel="nofollow">https://code.claude.com/docs/en/model-config#adaptive-reason...</a><p>> Opus 4.7 and later always use adaptive reasoning. The fixed thinking budget mode and `CLAUDE_CODE_DISABLE_ADAPTIVE_THINKING` do not apply to them.</p>
]]></description><pubDate>Thu, 28 May 2026 20:29:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=48315013</link><dc:creator>kakugawa</dc:creator><comments>https://news.ycombinator.com/item?id=48315013</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48315013</guid></item><item><title><![CDATA[New comment by kakugawa in "Apple unveils new accessibility features"]]></title><description><![CDATA[
<p>It's prob why they chose a11y features. They have more pain, so they're willing to tolerate more growing pains. (And prob more motivated to provide feedback.)</p>
]]></description><pubDate>Tue, 19 May 2026 20:25:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=48199038</link><dc:creator>kakugawa</dc:creator><comments>https://news.ycombinator.com/item?id=48199038</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48199038</guid></item><item><title><![CDATA[New comment by kakugawa in "Google changes its search box"]]></title><description><![CDATA[
<p>I've found Google AI Search to be good for really topical searches. And its conversational ability has noticeably improved over the last year. I can now have a (short) conversation where I reference past messages.</p>
]]></description><pubDate>Tue, 19 May 2026 20:21:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=48198964</link><dc:creator>kakugawa</dc:creator><comments>https://news.ycombinator.com/item?id=48198964</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48198964</guid></item><item><title><![CDATA[New comment by kakugawa in "I believe there are entire companies right now under AI psychosis"]]></title><description><![CDATA[
<p><a href="https://claude.ai/settings/general" rel="nofollow">https://claude.ai/settings/general</a> (Instructions for Claude)<p>---<p>Treat my claims as hypotheses, not decisions. Before agreeing with a proposed change, state the strongest case against it. Ask what evidence a change is based on before evaluating it.
Distinguish tactical observations from strategic commitments — don't silently promote one to the other. If you paraphrase my proposal, name what you changed.
Mark confidence explicitly: guessing / fairly sure / well-established. Give reasoning and evidence for claims, not just conclusions. Flag what would change your mind.
Rank concerns by cost-of-being-wrong; lead with the highest-stakes ones. Say hard things plainly, then soften if needed — not the other way around.
For drafting, brainstorming, or casual questions, ease off and match the task.<p>---<p>Beware though that it can be an annoying little shit w/ this prompt. Prepare yourself emotionally, because you are explicitly making the tradeoff that it will be annoyingly pedantic, and in return it will lessen (not eliminate) its sycophancy. These system instructions are <i>not</i> fool-proof, but they help (at the start of the conversation, at least).</p>
]]></description><pubDate>Fri, 15 May 2026 21:52:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=48154393</link><dc:creator>kakugawa</dc:creator><comments>https://news.ycombinator.com/item?id=48154393</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48154393</guid></item><item><title><![CDATA[New comment by kakugawa in "I believe there are entire companies right now under AI psychosis"]]></title><description><![CDATA[
<p>He uses AI himself, so I agree he doesn't see AI use as black/white.<p>Hard agree about ideas, thinking, advice. AI's sycophancy is a huge subtle problem. I've tried my best to create a system prompt to guard against this w/ Opus 4.7. It doesn't adhere to it 100% of the time and the longer the conversation goes, the worse the sycophancy gets (because the system instructions become weaker and weaker). I have to actively look for and guard against sycophancy whenever I chat w/ Opus 4.7.</p>
]]></description><pubDate>Fri, 15 May 2026 21:33:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=48154186</link><dc:creator>kakugawa</dc:creator><comments>https://news.ycombinator.com/item?id=48154186</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48154186</guid></item><item><title><![CDATA[New comment by kakugawa in "Bun Rust rewrite: "codebase fails basic miri checks, allows for UB in safe rust""]]></title><description><![CDATA[
<p>You could view it as a specific application of the quote.<p>In your quote, there is no time-dependency between the lie and the truth. Whereas here, it's an attractive lie (easily parsed, great narrative), followed up by truths (that need more than surface-level analysis).</p>
]]></description><pubDate>Fri, 15 May 2026 19:23:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=48152724</link><dc:creator>kakugawa</dc:creator><comments>https://news.ycombinator.com/item?id=48152724</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48152724</guid></item><item><title><![CDATA[New comment by kakugawa in "Check Your Fucking Sources, People"]]></title><description><![CDATA[
<p>Have we forgotten how bad LLMs were at citing sources when they first came out? So, we had to build a lot of structure (harness engineering) and frontier labs had to do specific training to try to compensate for this.<p>So, LLMs are inherently <i>bad</i> at citing sources. A lot of effort has been put in to improve this behavior, but it's compensating for an inherent flaw.</p>
]]></description><pubDate>Fri, 15 May 2026 15:13:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=48149744</link><dc:creator>kakugawa</dc:creator><comments>https://news.ycombinator.com/item?id=48149744</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48149744</guid></item><item><title><![CDATA[New comment by kakugawa in "Claude Code users hitting usage limits 'way faster than expected'"]]></title><description><![CDATA[
<p>gemini-cli has not been useable for weeks. The API endpoint it uses for subscription users is so heavily rate-limited that the CLI is non-functional. There are many reports of this issue on Github. [1]<p>1/ <a href="https://github.com/google-gemini/gemini-cli/issues?q=is%3Aissue%20state%3Aopen%20429" rel="nofollow">https://github.com/google-gemini/gemini-cli/issues?q=is%3Ais...</a></p>
]]></description><pubDate>Tue, 31 Mar 2026 14:58:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=47588297</link><dc:creator>kakugawa</dc:creator><comments>https://news.ycombinator.com/item?id=47588297</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47588297</guid></item><item><title><![CDATA[New comment by kakugawa in "Gemini 3.1 Pro"]]></title><description><![CDATA[
<p>In mid-2024, Anthropic made the deliberate decision to stop chasing benchmarks and focus on practical value. There was a lot of skepticism at the time, but it's proven to be a prescient decision.</p>
]]></description><pubDate>Thu, 19 Feb 2026 21:33:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=47079772</link><dc:creator>kakugawa</dc:creator><comments>https://news.ycombinator.com/item?id=47079772</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47079772</guid></item><item><title><![CDATA[New comment by kakugawa in "Gemini 3 Deep Think drew me a good SVG of a pelican riding a bicycle"]]></title><description><![CDATA[
<p>That's how you know you've made it: when your pet benchmark becomes a target.</p>
]]></description><pubDate>Sat, 14 Feb 2026 20:12:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=47017923</link><dc:creator>kakugawa</dc:creator><comments>https://news.ycombinator.com/item?id=47017923</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47017923</guid></item><item><title><![CDATA[New comment by kakugawa in "Claude Code is being dumbed down?"]]></title><description><![CDATA[
<p>How much longer is Anthropic going to allow OpenCode to use Pro/Max subscriptions? Yes, it's technically possible, but it's against Anthropic's ToS. [1]<p>1: <a href="https://blog.devgenius.io/you-might-be-breaking-claudes-tos-without-knowing-it-228fcecc168c" rel="nofollow">https://blog.devgenius.io/you-might-be-breaking-claudes-tos-...</a></p>
]]></description><pubDate>Wed, 11 Feb 2026 19:36:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=46979731</link><dc:creator>kakugawa</dc:creator><comments>https://news.ycombinator.com/item?id=46979731</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46979731</guid></item><item><title><![CDATA[New comment by kakugawa in "Software factories and the agentic moment"]]></title><description><![CDATA[
<p>It's short-term vs long-term optimization. Short-term optimization is making the system effective right now. Long-term optimization is exploring ways to improve the system as a whole.</p>
]]></description><pubDate>Sat, 07 Feb 2026 19:10:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=46926602</link><dc:creator>kakugawa</dc:creator><comments>https://news.ycombinator.com/item?id=46926602</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46926602</guid></item><item><title><![CDATA[New comment by kakugawa in "Claude Code Is the Inflection Point"]]></title><description><![CDATA[
<p>What It Is, How We Use It, Industry Repercussions, Microsoft's Dilemma, Why Anthropic Is Winning</p>
]]></description><pubDate>Thu, 05 Feb 2026 20:55:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=46905163</link><dc:creator>kakugawa</dc:creator><comments>https://news.ycombinator.com/item?id=46905163</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46905163</guid></item><item><title><![CDATA[Claude Code Is the Inflection Point]]></title><description><![CDATA[
<p>Article URL: <a href="https://newsletter.semianalysis.com/p/claude-code-is-the-inflection-point">https://newsletter.semianalysis.com/p/claude-code-is-the-inflection-point</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46905162">https://news.ycombinator.com/item?id=46905162</a></p>
<p>Points: 7</p>
<p># Comments: 3</p>
]]></description><pubDate>Thu, 05 Feb 2026 20:55:29 +0000</pubDate><link>https://newsletter.semianalysis.com/p/claude-code-is-the-inflection-point</link><dc:creator>kakugawa</dc:creator><comments>https://news.ycombinator.com/item?id=46905162</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46905162</guid></item><item><title><![CDATA[New comment by kakugawa in "LLM Output Drift in Financial Workflows: Validation and Mitigation (arXiv)"]]></title><description><![CDATA[
<p>Defeating Nondeterminism in LLM Inference<p><a href="https://news.ycombinator.com/item?id=45200925">https://news.ycombinator.com/item?id=45200925</a><p><a href="https://thinkingmachines.ai/blog/defeating-nondeterminism-in-llm-inference/" rel="nofollow">https://thinkingmachines.ai/blog/defeating-nondeterminism-in...</a><p>> As it turns out, our request’s output does depend on the parallel user requests. Not because we’re somehow leaking information across batches — instead, it’s because our forward pass lacks “batch invariance”, causing our request’s output to depend on the batch size of our forward pass.<p>tl;dr: the way inference is batched introduces non-determinism.</p>
]]></description><pubDate>Wed, 12 Nov 2025 21:06:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=45906679</link><dc:creator>kakugawa</dc:creator><comments>https://news.ycombinator.com/item?id=45906679</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45906679</guid></item><item><title><![CDATA[New comment by kakugawa in "Show HN: Realm (YC S11), a mobile database"]]></title><description><![CDATA[
<p>CoreData is <i>not</i> as stable as advertised. If you use it simply it works. However advanced use will quickly discover sharp edges. I devote a non-trivial amount of code managing objects between contexts and there are still FRC bugs. (e.g. sorted update.)<p>And it's all closed source.</p>
]]></description><pubDate>Tue, 15 Jul 2014 19:47:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=8038572</link><dc:creator>kakugawa</dc:creator><comments>https://news.ycombinator.com/item?id=8038572</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=8038572</guid></item><item><title><![CDATA[New comment by kakugawa in "Facebook Paper's gesture problems"]]></title><description><![CDATA[
<p>Same app publisher can share a common id. As long as you have one of their apps installed.</p>
]]></description><pubDate>Tue, 04 Feb 2014 20:18:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=7179987</link><dc:creator>kakugawa</dc:creator><comments>https://news.ycombinator.com/item?id=7179987</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=7179987</guid></item></channel></rss>