<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: winwang</title><link>https://news.ycombinator.com/user?id=winwang</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Mon, 22 Jun 2026 15:35:20 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=winwang" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by winwang in "Project Valhalla, Explained: How a Decade of Work Arrives in JDK 28"]]></title><description><![CDATA[
<p>(Yes, not yet, but...) As a Scala afficionado, "free"/freer runtime performance is very welcome :)<p>Fun read.</p>
]]></description><pubDate>Sat, 20 Jun 2026 00:52:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=48605144</link><dc:creator>winwang</dc:creator><comments>https://news.ycombinator.com/item?id=48605144</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48605144</guid></item><item><title><![CDATA[New comment by winwang in "Ask HN: Is anyone using the A2A protocol?"]]></title><description><![CDATA[
<p>(Based off 2-3 month-old recollection, take with a grain of salt)<p>I had wanted to use it for my agent "network". A2A didn't fit the use case of "trusted agent, and was bloated due to "what if rogue actor". Of course, I could have used it, with all of its roughness, but chose to just vibe my own (before Claude Teams, though I haven't really used that, I think). In the process of creating a server to handle this (I already set up a Scala webserver to administrate/orchestrate hooks). Would love to hear others' suggestions for this.</p>
]]></description><pubDate>Thu, 18 Jun 2026 20:58:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=48591477</link><dc:creator>winwang</dc:creator><comments>https://news.ycombinator.com/item?id=48591477</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48591477</guid></item><item><title><![CDATA[New comment by winwang in "US holds off blacklisting DeepSeek, more than 100 firms deemed security risks"]]></title><description><![CDATA[
<p>Not saying you're doing this specifically, but I'd be careful with thinking that "company" in China means the same as "company" in America (or in the West more generally).</p>
]]></description><pubDate>Wed, 17 Jun 2026 21:42:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=48577315</link><dc:creator>winwang</dc:creator><comments>https://news.ycombinator.com/item?id=48577315</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48577315</guid></item><item><title><![CDATA[New comment by winwang in "Formal methods and the future of programming"]]></title><description><![CDATA[
<p>Love this. I've shifted in the past few months to using highly expressive types in Scala 3 to have types carry more and more compile-time proofs (without macros, though a couple are warranted).
Not only does it help with agentic test "sprawl", it seems to prevent agents from falling into lower-quality modes of operation. One of the more annoying things I've been preventing is what I call "noun accretion", where agents try and make a new monomorphic type for everything, instead of clearly genericizing when sensible.
My bet is on formal-method-shaped tooling (including languages with strong type systems) to accelerate <i>decent-quality</i> agentic coding.<p>When I say "highly expressive types", I mean techniques I'd likely not want to ship in a typical codebase, unless the team was already on the typelevel programming bandwagon (i.e. having HKT and type functions being basic blocks already, not weird). Agents are better at "math" than basically most devs (even category-theory-pilled ones), at least in terms of knowledge. Better yet, they are decent at pedagogy, especially when considering they have "infinite" patience for dialogue.<p>In a more personal setting, I've had Codex Lean-ify a couple of my hobby proofs, it was extremely easy. Note: not saying it did this 100% "correctly" (gotta learn more Lean 4 to check more thoroughly), but it also seems to check for classic proof gotchas by default.
Very excited for the future of formal methods.</p>
]]></description><pubDate>Sun, 14 Jun 2026 21:14:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=48532798</link><dc:creator>winwang</dc:creator><comments>https://news.ycombinator.com/item?id=48532798</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48532798</guid></item><item><title><![CDATA[New comment by winwang in "What it feels like to work with Mythos"]]></title><description><![CDATA[
<p>Minor note, 2x $/tok is not 2x cost. Personally, I see Fable being significantly more token-efficient than Opus 4.8. Then, there's also the compounding costs of quality.</p>
]]></description><pubDate>Wed, 10 Jun 2026 06:23:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=48472220</link><dc:creator>winwang</dc:creator><comments>https://news.ycombinator.com/item?id=48472220</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48472220</guid></item><item><title><![CDATA[New comment by winwang in "Claude Opus 4.8"]]></title><description><![CDATA[
<p>My experience has been that 5.4 is slower than 5.5 (confound: I use >512k max context size for 5.4, though it <i>seems</i> slower even below the normal size)</p>
]]></description><pubDate>Thu, 28 May 2026 17:32:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=48312465</link><dc:creator>winwang</dc:creator><comments>https://news.ycombinator.com/item?id=48312465</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48312465</guid></item><item><title><![CDATA[New comment by winwang in "Claude Opus 4.8"]]></title><description><![CDATA[
<p>I typically just launch CC with `--model claude-opus-4-6[1m]`, `4-6[1m]` -> `4-8[1m]` works fine. Still 200k max without the `[1m]`.</p>
]]></description><pubDate>Thu, 28 May 2026 17:30:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=48312438</link><dc:creator>winwang</dc:creator><comments>https://news.ycombinator.com/item?id=48312438</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48312438</guid></item><item><title><![CDATA[New comment by winwang in "Claude Opus 4.8"]]></title><description><![CDATA[
<p>There's the other (orthogonal) possible explanation of using more GPUs for stress-testing before product launch.</p>
]]></description><pubDate>Thu, 28 May 2026 17:28:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=48312400</link><dc:creator>winwang</dc:creator><comments>https://news.ycombinator.com/item?id=48312400</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48312400</guid></item><item><title><![CDATA[New comment by winwang in "Claude Opus 4.8"]]></title><description><![CDATA[
<p>How else would you write this (marketing copy) exactly? "Its output matches better to its CoT which matches to better to our hidden state decoder according to <insert measure here>; see <insert paper ref>"?<p>... Actually, I wouldn't mind that.</p>
]]></description><pubDate>Thu, 28 May 2026 17:25:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=48312353</link><dc:creator>winwang</dc:creator><comments>https://news.ycombinator.com/item?id=48312353</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48312353</guid></item><item><title><![CDATA[New comment by winwang in "Claude Opus 4.8"]]></title><description><![CDATA[
<p>Awesome, thanks for posting because I think I hit a possibly-spurious bug in turning Adaptive off when I switched models (4.6 -> 4.8, extra). Tried again, works as intended (I hope).<p>More importantly for me, though, is how CC will respond to 4.6-"only" flags for thinking. For now, it doesn't seem to clobber my setup.</p>
]]></description><pubDate>Thu, 28 May 2026 17:23:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=48312319</link><dc:creator>winwang</dc:creator><comments>https://news.ycombinator.com/item?id=48312319</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48312319</guid></item><item><title><![CDATA[New comment by winwang in "Claude Opus 4.8"]]></title><description><![CDATA[
<p>Let's hope I don't have to disable it after a day like with 4.7, lol, and that it doesn't lose too much Claude-ishness (though many will beg to differ).</p>
]]></description><pubDate>Thu, 28 May 2026 17:17:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=48312231</link><dc:creator>winwang</dc:creator><comments>https://news.ycombinator.com/item?id=48312231</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48312231</guid></item><item><title><![CDATA[New comment by winwang in "Claude Code as a Daily Driver: Claude.md, Skills, Subagents, Plugins, and MCPs"]]></title><description><![CDATA[
<p>Yes, but that's also a specific luxury I can choose for myself. Definitely a fun and interesting question. At some level of reliance, people would answer "no", but there's the large middle ground (assuming similarly-frontier models are down): having a weaker(?) AI model help you get up to speed ASAP by summarizing code pedagogically, and linearizing the code read order. Basically like an AI-assisted (but manual) code review to reorient yourself.</p>
]]></description><pubDate>Wed, 27 May 2026 19:08:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=48298978</link><dc:creator>winwang</dc:creator><comments>https://news.ycombinator.com/item?id=48298978</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48298978</guid></item><item><title><![CDATA[New comment by winwang in "The sigmoids won't save you"]]></title><description><![CDATA[
<p>Yep. No one bats an eye at eyewitnesses "hallucinating" details, or that I'd rather have Opus as a coworker vs a random middle schooler (err, labor laws notwithstanding).
I think perhaps too much of the dialogue around intelligence has to do with the word (and its connotations) itself.<p>The poster you replied to even used the word "sentient", which is quite interesting (warning: opinionated tangent ahead). Merriam-Webster defines it as "capable of sensing or feeling: conscious of or responsive to the sensations of seeing, hearing, feeling, tasting, or smelling". Feels like qualia. Or if we don't want to go the qualia route... Of course, we wouldn't call Helen Keller non-sentient, so presumably we "really" mean "can it sense or feel" -- well, sense is just "act/feel according to the environment", which you could argue in the case of an LLM would be their context... so we should "really" remove "sense" from the definition, probably. So "do LLMs feel" is probably closer to what "sentient" is being used for here. Since we don't have the obvious symmetry of "you are like me and I feel (therefore you probably feel)", it's way better/easier/feel-good-ier to prefer "LLMs don't feel" rather than "oh shit, it feels and model training is actually just torturing it into the right shape". LLMs as fundamentally non-intelligent also avoids the problems of "what does that say about people" or "we may have made 'AGI' and it wasn't what we thought it would be" or "we're not ready to talk about this yet".</p>
]]></description><pubDate>Sat, 16 May 2026 17:09:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=48161964</link><dc:creator>winwang</dc:creator><comments>https://news.ycombinator.com/item?id=48161964</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48161964</guid></item><item><title><![CDATA[New comment by winwang in "Bun Rust rewrite: "codebase fails basic miri checks, allows for UB in safe rust""]]></title><description><![CDATA[
<p>This is, ironically, a pretty good idea. ...Minus the fact that you're presumably talking about having AI generate it all instead.</p>
]]></description><pubDate>Fri, 15 May 2026 19:03:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=48152516</link><dc:creator>winwang</dc:creator><comments>https://news.ycombinator.com/item?id=48152516</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48152516</guid></item><item><title><![CDATA[New comment by winwang in "If AI writes your code, why use Python?"]]></title><description><![CDATA[
<p>I think most people agree with you -- that's why. Also because I'd say most programmers don't care much about maintainability or quality.<p>I personally find that AI writes better Scala than Python.</p>
]]></description><pubDate>Tue, 12 May 2026 15:46:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=48109973</link><dc:creator>winwang</dc:creator><comments>https://news.ycombinator.com/item?id=48109973</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48109973</guid></item><item><title><![CDATA[New comment by winwang in "I'm going back to writing code by hand"]]></title><description><![CDATA[
<p>Yeah, I pretty much agree. Opus and GPT will both come up with the most "organically-grown" "designs" if you let them. They do slightly better when asked to design first, but they seem to avoid many important questions (and definitely skip asking the user much of anything at all). I can only say it <i>feels</i> they "want" to ship as fast as possible while assuming I'm not going to actually review the PR.</p>
]]></description><pubDate>Mon, 11 May 2026 19:43:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=48099729</link><dc:creator>winwang</dc:creator><comments>https://news.ycombinator.com/item?id=48099729</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48099729</guid></item><item><title><![CDATA[New comment by winwang in "Natural Language Autoencoders: Turning Claude's Thoughts into Text"]]></title><description><![CDATA[
<p>I would presume this is shorthand for something like "generated text which would normally be classified as belief". I guess a more ridiculous response could be "what does it mean for a miserable pile of secrets to believe something?", lol.</p>
]]></description><pubDate>Fri, 08 May 2026 11:02:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=48061394</link><dc:creator>winwang</dc:creator><comments>https://news.ycombinator.com/item?id=48061394</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48061394</guid></item><item><title><![CDATA[New comment by winwang in "Agentic Coding Is a Trap"]]></title><description><![CDATA[
<p>I absolutely feel like a "different" part of my mind is loaded when seriously engineering something myself vs vibecoding+reviewing. Even the reviewing is more annoying in the latter mental context.</p>
]]></description><pubDate>Mon, 04 May 2026 00:09:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=48003014</link><dc:creator>winwang</dc:creator><comments>https://news.ycombinator.com/item?id=48003014</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48003014</guid></item><item><title><![CDATA[New comment by winwang in "Claude.ai and API unavailable [fixed]"]]></title><description><![CDATA[
<p>Honestly, I gotta agree, I find that I get way more frustrated with Claude recently than Codex.</p>
]]></description><pubDate>Thu, 30 Apr 2026 03:05:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=47957583</link><dc:creator>winwang</dc:creator><comments>https://news.ycombinator.com/item?id=47957583</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47957583</guid></item><item><title><![CDATA[New comment by winwang in "Amateur armed with ChatGPT solves an Erdős problem"]]></title><description><![CDATA[
<p>Obviously nowhere near Erdos problem complexity but I've been using GPT (in Codex) to prove a couple theorems (for algos) and I've found it a bit better than Claude (Code) in this aspect.</p>
]]></description><pubDate>Sun, 26 Apr 2026 04:23:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=47907324</link><dc:creator>winwang</dc:creator><comments>https://news.ycombinator.com/item?id=47907324</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47907324</guid></item></channel></rss>