<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: willj</title><link>https://news.ycombinator.com/user?id=willj</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sun, 24 May 2026 22:30:27 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=willj" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by willj in "Microsoft starts canceling Claude Code licenses"]]></title><description><![CDATA[
<p>Fair point. Though I don’t think time without money is really leisure time :)</p>
]]></description><pubDate>Sat, 23 May 2026 17:57:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=48249698</link><dc:creator>willj</dc:creator><comments>https://news.ycombinator.com/item?id=48249698</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48249698</guid></item><item><title><![CDATA[New comment by willj in "Microsoft starts canceling Claude Code licenses"]]></title><description><![CDATA[
<p>It’s too bad that, yet again, instead of the productivity gains leading to shorter work weeks, the benefits accrue to the companies. Just once I’d like to see productivity gains lead to more leisure time, not higher expectation.</p>
]]></description><pubDate>Sat, 23 May 2026 14:21:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=48247976</link><dc:creator>willj</dc:creator><comments>https://news.ycombinator.com/item?id=48247976</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48247976</guid></item><item><title><![CDATA[New comment by willj in "Deterministic Programming with LLMs"]]></title><description><![CDATA[
<p>The temperature parameters largely went away when we moved towards reasoning models, which output lots of reasoning tokens before you get to the actual output tokens. I don’t know if it was found that reasoning works better with a higher temperature, or that having separate temperatures for reasoning vs. output wasn’t practical, but that’s my observation of the timing, anyway. And to the other commenter’s point, even a temperature of 0 is not deterministic if the batches are not invariant, which they’re not in production workloads.</p>
]]></description><pubDate>Sun, 01 Mar 2026 04:05:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=47203582</link><dc:creator>willj</dc:creator><comments>https://news.ycombinator.com/item?id=47203582</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47203582</guid></item><item><title><![CDATA[New comment by willj in "LLMs as the new high level language"]]></title><description><![CDATA[
<p>If you’re using a model from a provider (not one that you’re hosting locally), greedy decoding via temperature = 0 does <i>not</i> guarantee determinism. A temperature of 0 doesn’t result in the same responses every time, in part due to floating-point precision and in part to to lack of batch invariance [1]<p>[1] <a href="https://thinkingmachines.ai/blog/defeating-nondeterminism-in-llm-inference/" rel="nofollow">https://thinkingmachines.ai/blog/defeating-nondeterminism-in...</a></p>
]]></description><pubDate>Sun, 08 Feb 2026 14:28:23 +0000</pubDate><link>https://news.ycombinator.com/item?id=46934473</link><dc:creator>willj</dc:creator><comments>https://news.ycombinator.com/item?id=46934473</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46934473</guid></item><item><title><![CDATA[New comment by willj in "LLMs as the new high level language"]]></title><description><![CDATA[
<p>A temperature of 0 doesn’t result in the same responses every time, in part due to floating-point precision and in part to to lack of batch invariance [1]<p>[1] <a href="https://thinkingmachines.ai/blog/defeating-nondeterminism-in-llm-inference/" rel="nofollow">https://thinkingmachines.ai/blog/defeating-nondeterminism-in...</a></p>
]]></description><pubDate>Sun, 08 Feb 2026 14:25:23 +0000</pubDate><link>https://news.ycombinator.com/item?id=46934458</link><dc:creator>willj</dc:creator><comments>https://news.ycombinator.com/item?id=46934458</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46934458</guid></item><item><title><![CDATA[New comment by willj in "Ask HN: What are the metrics for "AI-generated technical debt"?"]]></title><description><![CDATA[
<p>Thanks! That makes sense. I suppose this requires commit messages or PRs to indicate code was AI-generated vs. not, or to assume that commits after a certain time period were all from AI coding. It’d be an interesting analysis. Maybe there’s already a study out there.<p>In any case, thank you again!</p>
]]></description><pubDate>Fri, 30 Jan 2026 12:07:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=46823503</link><dc:creator>willj</dc:creator><comments>https://news.ycombinator.com/item?id=46823503</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46823503</guid></item><item><title><![CDATA[New comment by willj in "AI code and software craft"]]></title><description><![CDATA[
<p>100%. This is what I posted about on Hacker News ([1] where it got no traction) and Reddit [2] (where it led to a discussion but then got deleted by a mod).<p>[1] <a href="https://news.ycombinator.com/item?id=46705588">https://news.ycombinator.com/item?id=46705588</a><p>[2] <a href="https://www.reddit.com/r/ExperiencedDevs/comments/1qj03gq/what_are_the_metrics_for_aigenerated_technical/" rel="nofollow">https://www.reddit.com/r/ExperiencedDevs/comments/1qj03gq/wh...</a></p>
]]></description><pubDate>Tue, 27 Jan 2026 13:14:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=46779528</link><dc:creator>willj</dc:creator><comments>https://news.ycombinator.com/item?id=46779528</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46779528</guid></item><item><title><![CDATA[Ask HN: What are the metrics for "AI-generated technical debt"?]]></title><description><![CDATA[
<p>Here’s one place where I think proponents and skeptics of agentic coding tools (Claude Code, Codex, etc.) tend to talk past each other:<p>Proponents say things like:<p>- “I shipped feature X in days instead of weeks.”<p>- “I could build this despite not knowing Rust / the framework / the codebase.”<p>- “This unblocked work that would never have been prioritized.”<p>Skeptics say things like:<p>- “This might work for solo projects, but it won’t scale to large codebases with many developers.”<p>- “You’re trading short-term velocity for long-term maintainability, security, and operability.”<p>- “You’re creating tons of technical debt that will surface later.”<p>I’m sympathetic to both sides. But the <i>asymmetry</i>  is interesting: The pro side has quantifiable metrics (time-to-ship, features delivered, scope unlocked). The con side often relies on qualitative warnings (maintainability, architectural erosion, future cost).<p>In most organizations, leadership is structurally biased toward what can be measured: velocity, throughput, roadmap progress. “This codebase is a mess” or “This will be a problem in two years” is a much harder sell than “we shipped this in a week.”<p>My question: Are there concrete, quantitative ways to measure the quality and long-term cost side of agentic coding?. In other words: if agentic coding optimizes for speed, what are the best metrics that can represent the other side of the tradeoff, so this isn’t just a qualitative craftsmanship argument versus a quantitative velocity argument?</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46705588">https://news.ycombinator.com/item?id=46705588</a></p>
<p>Points: 3</p>
<p># Comments: 2</p>
]]></description><pubDate>Wed, 21 Jan 2026 13:41:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=46705588</link><dc:creator>willj</dc:creator><comments>https://news.ycombinator.com/item?id=46705588</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46705588</guid></item><item><title><![CDATA[New comment by willj in "Ask HN: What are you working on? (January 2026)"]]></title><description><![CDATA[
<p>Can you say more about the approach you take for summarization? Are the papers short enough that you just put the whole thing in the context window of the model you’re using, or do you do anything fancy? I’ve tried out various summarization approaches (hierarchical, aspect-based, incremental refinement), and am curious what you found works best for your use case.</p>
]]></description><pubDate>Wed, 14 Jan 2026 01:25:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=46611153</link><dc:creator>willj</dc:creator><comments>https://news.ycombinator.com/item?id=46611153</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46611153</guid></item><item><title><![CDATA[New comment by willj in "Ask HN: What are you working on? (January 2026)"]]></title><description><![CDATA[
<p>Thanks! I got some initial ideas from Nano Banana, actually, but then spent a while iterating on different layouts myself.</p>
]]></description><pubDate>Mon, 12 Jan 2026 11:50:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=46587173</link><dc:creator>willj</dc:creator><comments>https://news.ycombinator.com/item?id=46587173</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46587173</guid></item><item><title><![CDATA[New comment by willj in "Ask HN: What are you working on? (January 2026)"]]></title><description><![CDATA[
<p>This is something I built over the holidays to support people having a hard time with the short days and early sunsets: <a href="https://sunshineoptimist.com" rel="nofollow">https://sunshineoptimist.com</a>.<p>For the past several years I would look up the day lengths and sunset times for my location and identify milestones like “first 5pm sunset”, “1 hour of daylight gained since the winter solstice”, etc. But that manual process also meant I was limited to sharing updates on just my location, and my friends only benefitted when I made a post. I wanted to make a site anyone could come to at any time to get an optimistic message and a milestone to look forward to.<p>Some features this has:<p>- Calculation of several possible optimistic headlines. No LLMs used here.<p>- Offers comparisons to the earliest sunset of the year and shortest day<p>- Careful consideration of optimistic messaging at all times of year, including after the summer solstice when daylight is being lost<p>- Static-only site, no ads or tracking. All calculations happen in the browser.</p>
]]></description><pubDate>Mon, 12 Jan 2026 02:08:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=46583043</link><dc:creator>willj</dc:creator><comments>https://news.ycombinator.com/item?id=46583043</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46583043</guid></item><item><title><![CDATA[Show HN: Sunshine Optimist: Optimistic takes on daylight and sunset times]]></title><description><![CDATA[
<p>This is something I built over the holidays to support people having a hard time with the short days and early sunsets. For the past several years I would look up the day lengths and sunset times for my location and identify milestones like “first 5pm sunset”, “1 hour of daylight gained since the winter solstice”, etc. But that manual process also meant I was limited to sharing updates on just my location, and my friends only benefitted when I made a post. I wanted to make a site anyone could come to at any time to get an optimistic message and a milestone to look forward to.<p>Some features this has:<p>- Calculation of several possible optimistic headlines. No LLMs used here.<p>- Offers comparisons to the earliest sunset of the year and shortest day<p>- Careful consideration of optimistic messaging at all times of year, including after the summer solstice when daylight is being lost<p>- Static-only site, no ads or tracking. All calculations happen in the browser.</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46579370">https://news.ycombinator.com/item?id=46579370</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Sun, 11 Jan 2026 20:00:25 +0000</pubDate><link>https://sunshineoptimist.com</link><dc:creator>willj</dc:creator><comments>https://news.ycombinator.com/item?id=46579370</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46579370</guid></item><item><title><![CDATA[New comment by willj in "AI coding assistants are getting worse?"]]></title><description><![CDATA[
<p>I think the models are so big that they can’t keep many old versions around because they would take away from the available GPUs they use to serve the latest models, and thereby reduce overall throughput. So they phase out older models over time. However, the major providers usually provide a time snapshot for each model, and keep the latest 2-3 available.</p>
]]></description><pubDate>Thu, 08 Jan 2026 15:52:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=46542421</link><dc:creator>willj</dc:creator><comments>https://news.ycombinator.com/item?id=46542421</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46542421</guid></item><item><title><![CDATA[New comment by willj in "Avoid Mini-Frameworks"]]></title><description><![CDATA[
<p>This reminds me a bit of using LLM frameworks like langchain, Haystack, etc., especially if you’re only using them for the chat completions or responses APIs and not doing anything fancy.</p>
]]></description><pubDate>Wed, 24 Dec 2025 22:06:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=46379776</link><dc:creator>willj</dc:creator><comments>https://news.ycombinator.com/item?id=46379776</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46379776</guid></item><item><title><![CDATA[New comment by willj in "Ask HN: What are some impressive vibe coding projects?"]]></title><description><![CDATA[
<p>DOOMscroll[1] for sure! I still play it since hearing about it on HN.<p>[1] <a href="https://ironicsans.ghost.io/doomscrolling-the-game/" rel="nofollow">https://ironicsans.ghost.io/doomscrolling-the-game/</a></p>
]]></description><pubDate>Mon, 20 Oct 2025 11:49:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=45642848</link><dc:creator>willj</dc:creator><comments>https://news.ycombinator.com/item?id=45642848</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45642848</guid></item><item><title><![CDATA[New comment by willj in "Judge mulls sanctions over Google's destruction of internal chats"]]></title><description><![CDATA[
<p>I think this ignores that the monopolies have the power to buy up any new competitors, or to drive them out of business using monopoly power. Regulatory hurdles are only one tool that (can) benefit monopolies.</p>
]]></description><pubDate>Sun, 05 May 2024 12:59:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=40264558</link><dc:creator>willj</dc:creator><comments>https://news.ycombinator.com/item?id=40264558</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40264558</guid></item><item><title><![CDATA[New comment by willj in "OpenAI transcribed over a million hours of YouTube videos to train GPT-4"]]></title><description><![CDATA[
<p>I think that’s different. AlphaGo is using reinforcement learning in a context in which there is a clear evaluation function— did a strategy lead to a win or loss.</p>
]]></description><pubDate>Tue, 09 Apr 2024 10:47:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=39978061</link><dc:creator>willj</dc:creator><comments>https://news.ycombinator.com/item?id=39978061</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39978061</guid></item><item><title><![CDATA[New comment by willj in "Show HN: Beyond text splitting – improved file parsing for LLMs"]]></title><description><![CDATA[
<p>Relatedly, the OCR component relies on PyMuPDF, which has a license that requires releasing source code, which isn’t possible for most commercial applications. Is there any plan to move away from PyMuPDF, or is there a way to use an alternative?</p>
]]></description><pubDate>Mon, 08 Apr 2024 11:18:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=39968547</link><dc:creator>willj</dc:creator><comments>https://news.ycombinator.com/item?id=39968547</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39968547</guid></item><item><title><![CDATA[New comment by willj in "The Obscene Energy Demands of A.I"]]></title><description><![CDATA[
<p>I’d argue Bitcoin is Obscene Energy Demand.</p>
]]></description><pubDate>Sat, 09 Mar 2024 15:29:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=39652224</link><dc:creator>willj</dc:creator><comments>https://news.ycombinator.com/item?id=39652224</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39652224</guid></item><item><title><![CDATA[New comment by willj in "Google's once happy offices feel the chill of layoffs"]]></title><description><![CDATA[
<p>Where are the happy offices these days? Which companies are the new “Google” who people are very eager to work for?</p>
]]></description><pubDate>Mon, 05 Feb 2024 14:09:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=39261352</link><dc:creator>willj</dc:creator><comments>https://news.ycombinator.com/item?id=39261352</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39261352</guid></item></channel></rss>