<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: anthuswilliams</title><link>https://news.ycombinator.com/user?id=anthuswilliams</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Wed, 27 May 2026 20:04:50 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=anthuswilliams" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by anthuswilliams in "When everyone has AI and the company still learns nothing"]]></title><description><![CDATA[
<p>I'd also add that top-heavy engineering organizations are sometimes incapable of delivering anything useful because everyone wants to work on the hard problems, establish the frameworks, define the processes, and so on, and no one wants to operate the damn business. It's good to have a mix of perspectives in a team.</p>
]]></description><pubDate>Wed, 06 May 2026 18:34:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=48039874</link><dc:creator>anthuswilliams</dc:creator><comments>https://news.ycombinator.com/item?id=48039874</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48039874</guid></item><item><title><![CDATA[New comment by anthuswilliams in "AI Self-preferencing in Algorithmic Hiring: Empirical Evidence and Insights"]]></title><description><![CDATA[
<p>At 90 seconds per resume, that would take up a full 8 hour day. Having gone through this myself, I don't think it's possible to do this much faster than that, even if you have an ATS that optimizes for that workflow.<p>I often found myself falling into patterns of poor judgement, e.g. mentally filtering out resumes based on the layout because, to my tired  and bored mind, they looked similar to the resumes I had seen from unqualified candidates. I actually think some automation is helpful in evaluating them more rigorously.</p>
]]></description><pubDate>Sat, 02 May 2026 18:11:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=47988844</link><dc:creator>anthuswilliams</dc:creator><comments>https://news.ycombinator.com/item?id=47988844</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47988844</guid></item><item><title><![CDATA[New comment by anthuswilliams in "Workspace Agents in ChatGPT"]]></title><description><![CDATA[
<p>That sort of messaging has been done for decades with business process orchestration companies, RPA vendors, etc. All the way back to the original business software vendors like Lotus and Excel. It's only big LLM labs that adopt this tone of dismissive trivialization of other people's work.</p>
]]></description><pubDate>Wed, 22 Apr 2026 23:20:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=47870518</link><dc:creator>anthuswilliams</dc:creator><comments>https://news.ycombinator.com/item?id=47870518</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47870518</guid></item><item><title><![CDATA[New comment by anthuswilliams in "Workspace Agents in ChatGPT"]]></title><description><![CDATA[
<p>The issue is not that they are generic. You could still be generic with phraseology that actually acknowledged the contributions and ownership involved in the jobs being done. For example, you could write e.g. "monitor for outages", "manage projects", "arrange community events", "handle logistics", and so on.<p>But the problem is LLMs can't do those things. All they can do is "edit files" and "send messages".</p>
]]></description><pubDate>Wed, 22 Apr 2026 23:17:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=47870493</link><dc:creator>anthuswilliams</dc:creator><comments>https://news.ycombinator.com/item?id=47870493</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47870493</guid></item><item><title><![CDATA[New comment by anthuswilliams in "Workspace Agents in ChatGPT"]]></title><description><![CDATA[
<p>Without commenting on the product itself (I haven't tried it), the marketing copy around this release commits the same sins I have seen from Anthropic and Grok and all the rest of them.<p>I'm so tired of seeing these companies trivializing other people's work! Nobody's job is "edit files" and "respond to messages"! People have jobs like "find and close leads" and "reconcile accounts" and "arrange student field trips" and "make sure the hospital has enough inventory", not "generate reports" and "write code".<p>Editing files, producing reports, even writing code is just a byproduct. This is like the idiotic "lines of code produced" metric, but now they apply it to all of society.</p>
]]></description><pubDate>Wed, 22 Apr 2026 19:39:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=47868274</link><dc:creator>anthuswilliams</dc:creator><comments>https://news.ycombinator.com/item?id=47868274</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47868274</guid></item><item><title><![CDATA[New comment by anthuswilliams in "Stop trying to engineer your way out of listening to people"]]></title><description><![CDATA[
<p>> Why is it always our burden to learn and improve<p>That's what the 400k/yr is for.</p>
]]></description><pubDate>Tue, 21 Apr 2026 15:33:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=47850275</link><dc:creator>anthuswilliams</dc:creator><comments>https://news.ycombinator.com/item?id=47850275</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47850275</guid></item><item><title><![CDATA[New comment by anthuswilliams in "Show HN: Libretto – Making AI browser automations deterministic"]]></title><description><![CDATA[
<p>I literally _just_ put up an announcement on our internal Slack of a tool I had spent a few weeks trying to get right. Strange to post the announcement and, literally the same day, see a better, publicly available toolkit to do enable that very workflow!<p>I'm also using Playwright, to automate a platform that has a maze of iframes, referer links, etc. Hopefully I can replace the internals with a script I get from this project.</p>
]]></description><pubDate>Wed, 15 Apr 2026 21:18:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=47785360</link><dc:creator>anthuswilliams</dc:creator><comments>https://news.ycombinator.com/item?id=47785360</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47785360</guid></item><item><title><![CDATA[New comment by anthuswilliams in "Ask HN: Who is hiring? (April 2026)"]]></title><description><![CDATA[
<p>There is something up with your hiring system. I shared this post with a friend who applied, and they got an instant rejection. This is a data engineer with 4+ years experience in your stack and a degree in nuclear engineering to boot. Maybe not an ultimate hire but clearly not someone you should be auto-rejecting.</p>
]]></description><pubDate>Thu, 02 Apr 2026 00:49:23 +0000</pubDate><link>https://news.ycombinator.com/item?id=47608663</link><dc:creator>anthuswilliams</dc:creator><comments>https://news.ycombinator.com/item?id=47608663</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47608663</guid></item><item><title><![CDATA[New comment by anthuswilliams in "Ensu – Ente’s Local LLM app"]]></title><description><![CDATA[
<p>Claude Cowork (part of the Desktop app) is claude code, running inside a VM.<p>Helpful writeup here: <a href="https://pvieito.com/2026/01/inside-claude-cowork" rel="nofollow">https://pvieito.com/2026/01/inside-claude-cowork</a>
(I am not the author)</p>
]]></description><pubDate>Wed, 25 Mar 2026 23:17:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=47524573</link><dc:creator>anthuswilliams</dc:creator><comments>https://news.ycombinator.com/item?id=47524573</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47524573</guid></item><item><title><![CDATA[New comment by anthuswilliams in "What young workers are doing to AI-proof themselves"]]></title><description><![CDATA[
<p>Summary of article: in an uncertain job market, some young people are going into blue collar trades. Others are starting startups. Others are powering through. Journalist says some words about "AI" being the cause of all this uncertainty.</p>
]]></description><pubDate>Mon, 23 Mar 2026 04:34:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=47485520</link><dc:creator>anthuswilliams</dc:creator><comments>https://news.ycombinator.com/item?id=47485520</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47485520</guid></item><item><title><![CDATA[New comment by anthuswilliams in "Experts Have World Models. LLMs Have Word Models"]]></title><description><![CDATA[
<p>This argument was specifically about LLMs, not about other techniques (RL, multi-armed bandit, etc) that might be better leveraged to accomplish this type of goal.<p>An LLM which makes a tool call to a function called `ride_bike`, where that function is a different sort of model with a different set of feedback mechanisms than those available to the LLM, is NOT the same thing at all. The LLM hasn't "learned" to ride the bike. The best you can say is that the LLM has learned that the bike can be ridden, and that it has a way of asking some other entity to ride on its behalf.<p>Now, could you develop such a model and make it available to an LLM? Sure, probably. But that's not an LLM. Moreover, it involves you, a human, making novel inroads on a different sort of AI/robotics problem. It simply is not possible to accomplish with an LLM.</p>
]]></description><pubDate>Wed, 11 Feb 2026 05:01:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=46971051</link><dc:creator>anthuswilliams</dc:creator><comments>https://news.ycombinator.com/item?id=46971051</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46971051</guid></item><item><title><![CDATA[New comment by anthuswilliams in "Eight more months of agents"]]></title><description><![CDATA[
<p>Maybe. Or maybe services will switch to charging per API call or whatever instead of monthly or per-seat. Who can predict the future?<p>I mean, services _could_ make it harder to use LLMs to interact with them, but if agents are popular enough they might see customers start to revolt over it.</p>
]]></description><pubDate>Tue, 10 Feb 2026 20:52:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=46966733</link><dc:creator>anthuswilliams</dc:creator><comments>https://news.ycombinator.com/item?id=46966733</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46966733</guid></item><item><title><![CDATA[New comment by anthuswilliams in "Eight more months of agents"]]></title><description><![CDATA[
<p>I'm very much looking forward to this shift. It is SO MUCH more pro-consumer than the existing SaaS model. Right now every app feels like a walled garden, with broken UX, constant redesigns, enormous amounts of telemetry and user manipulation. It feels like every time I ask for programmatic access to SaaS tools in order to simplify a workflow, I get stuck in endless meetings with product managers trying to "understand my use case", even for products explicitly marketed to programmers.<p>Using agents that interact with APIs represents people being able to own their user experience more. Why not craft a frontend that behaves exactly the the way YOU want it to, tailor made for YOUR work, abstracting the set of products you are using and focusing only on the actual relevant bits of the work you are doing? Maybe a downside might be that there is more explicit metering of use in these products instead of the per-user licensing that is common today. But the upside is there is so much less scope for engagement-hacking, dark patterns, useless upselling, and so on.</p>
]]></description><pubDate>Mon, 09 Feb 2026 20:12:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=46950497</link><dc:creator>anthuswilliams</dc:creator><comments>https://news.ycombinator.com/item?id=46950497</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46950497</guid></item><item><title><![CDATA[New comment by anthuswilliams in "Experts Have World Models. LLMs Have Word Models"]]></title><description><![CDATA[
<p>> What is in the nature of bike-riding that cannot be reduced to text?<p>You're asking someone to answer this question in a text forum. This is not quite the gotcha you think it is.<p>The distinction between "knowing" and "putting into language" is a rich source of epistemological debate going back to Plato and is still widely regarded to represent a particularly difficult philosophical conundrum. I don't see how you can make this claim with so much certainty.</p>
]]></description><pubDate>Mon, 09 Feb 2026 00:04:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=46939923</link><dc:creator>anthuswilliams</dc:creator><comments>https://news.ycombinator.com/item?id=46939923</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46939923</guid></item><item><title><![CDATA[New comment by anthuswilliams in "2025 Letter"]]></title><description><![CDATA[
<p>I'm not sure I understand your complaint. Is it that he misuses the term Pascal's Wager? Or more generally that he doesn't extend enough credibility to the ideas in AI 2027?</p>
]]></description><pubDate>Fri, 02 Jan 2026 02:48:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=46460851</link><dc:creator>anthuswilliams</dc:creator><comments>https://news.ycombinator.com/item?id=46460851</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46460851</guid></item><item><title><![CDATA[New comment by anthuswilliams in "Drugmakers raise US prices on 350 medicines despite pressure"]]></title><description><![CDATA[
<p>The claim about who subsidizes who was always hyperbole, I'll grant you that. I included the statement to make the point that this is the phenomenon people are referring to when they make that statement.<p>I happen to think there is validity to the statement if you control for other actuarial factors. But if you don't think that makes sense as a lens through which to look at the problem, I won't quibble, even though I disagree. We're also only talking about drug prices here, which is a small portion of overall healthcare spending.<p>In any case, the central point, that insurers benefit from higher prices, still stands.</p>
]]></description><pubDate>Thu, 01 Jan 2026 18:56:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=46456892</link><dc:creator>anthuswilliams</dc:creator><comments>https://news.ycombinator.com/item?id=46456892</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46456892</guid></item><item><title><![CDATA[New comment by anthuswilliams in "Drugmakers raise US prices on 350 medicines despite pressure"]]></title><description><![CDATA[
<p>> And premiums would go up.<p>Yes. As I wrote above, insurers compete on premiums, and they do do so by using rebates to subsidize those premiums by spreading patients' deductibles across the insured population. As far as profits go, I can't speak to regulatory issues since they will vary by state, but in any case the same critique would apply if insurers are pocketing a fixed percentage of a larger amount.<p>Re your second point, it completely twists my point and is largely irrelevant. Yes, older people paying the same premiums as younger people is a counter-argument in that older people are more likely to need healthcare, but the central point is that people who have to USE their insurance (i.e. sick people) subsidize the premiums of people who don't (healthy people), and this critique applies regardless of age. Now, one could argue that the structural factors that control costs across age cohorts counterbalances this phenomenon. And I'd agree with you! But that doesn't negate the original point that insurance companies benefit from, and advocate for, high sticker prices.</p>
]]></description><pubDate>Thu, 01 Jan 2026 08:15:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=46452233</link><dc:creator>anthuswilliams</dc:creator><comments>https://news.ycombinator.com/item?id=46452233</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46452233</guid></item><item><title><![CDATA[New comment by anthuswilliams in "Drugmakers raise US prices on 350 medicines despite pressure"]]></title><description><![CDATA[
<p>Insurance companies absolutely benefit from the higher and opaque prices, because they negotiate rebates with providers. This allows them to maximize patient copays and ensures they hit their deductible, i.e. paying as much as possible under their respective insurance plans. Contrast this with a no-rebate world with cheaper/more transparent pricing. Fewer patients would hit their out of pocket maximum.<p>They can use the rebates they get from the providers to subsidize the insured, allowing them to offer lower premiums and gain market share. This is what people mean when they say "In America, the sick people pay to subsidize the health care of the healthy people".<p>Of course, that above only applies if there is competitive pressure. If there is no competitive pressure (e.g. in states with only one or two insurers), they can keep premiums high and book as profit the difference between what the patient paid out and what the patient would have paid out in a lower-cost no-rebate world.</p>
]]></description><pubDate>Wed, 31 Dec 2025 17:46:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=46446411</link><dc:creator>anthuswilliams</dc:creator><comments>https://news.ycombinator.com/item?id=46446411</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46446411</guid></item><item><title><![CDATA[New comment by anthuswilliams in "Skills for organizations, partners, the ecosystem"]]></title><description><![CDATA[
<p>Agree with this framing. They are like RAG setups that you can compose together without needing to build a dedicated app to do it.</p>
]]></description><pubDate>Sat, 20 Dec 2025 20:58:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=46339588</link><dc:creator>anthuswilliams</dc:creator><comments>https://news.ycombinator.com/item?id=46339588</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46339588</guid></item><item><title><![CDATA[New comment by anthuswilliams in "Skills for organizations, partners, the ecosystem"]]></title><description><![CDATA[
<p>> How do you handle versioning/updates when datasets change?<p>For data MCPs, we use remote MCPs that are served over an stdio bridge. So our configuration is just mcp-proxy[0] pointed at a fixed URL we control. The server has an /mcp endpoint that provides tools and that endpoint is hit whenever the desktop LLM starts up. So adding/removing/altering tools is simply a matter of changing that service and redeploying that API. (Note: There are sometimes complications, e.g. if I change an endpoint that used to return data directly, but now it writes a file to cloud storage and returns a URL (because the result is to large, i.e. to work around the aforementioned broken factor of MCP) we have to sync with our IT team to deploy a configuration change to everyone's machine.)<p>I have seen nicer implementations that use a full MCP gateway that does another proxy step to the upstream MCP servers, which I haven't used myself (though I want to). The added benefit is that you can log/track which MCPs your users are using most often and how they are doing, and you can abstract away a lot of the details of auth, monitor for security issues, etc.  One of the projects I've looked at in that space is Mint MCP, but I haven't used it myself.<p>> What's your hit rate on researchers actually converting LLM explorations into permanent artifacts vs just using it as a one-off?<p>Low. Which in our case is ideal, since most research ideas can be quickly discarded and save us a ton of time and money that would otherwise be spent running doomed lab experiments, etc. As you get later in the drug discovery pipeline you have a larger team built around the program, and then the artifacts are more helpful. There still isn't much of a norm in the biotech industry of having an engineering team support an advanced drug program (a mistake, IMO) so these artifacts go a long way given these teams don't have dedicated resources.<p>>  Do you think this pattern (LLM exploration > traditional tools) generalizes outside domains with high uncertainty?<p>I don't know for sure, as I don't live in that world. My instinct is: I wouldn't necessarily roll something like this out to external customers if you have a well-defined product. (IMO there just isn't that much of a market for uncertain outputs of such products, which is why all of the SaaS companies that have launched their integrated AI tools haven't seen much success with them.) But even within a domain like that, it can be useful to e.g. your customer support team, your engineers, etc. For example, one of the ideas on my "cool projects" list is an SRE toolkit that can query across K8s, Loki/Prometheus, your cloud provider, your git provider and help quickly diagnose production issues. I imagine the result of such an exploration would almost always be a new dashboard/alert/etc.<p>[0] <a href="https://github.com/sparfenyuk/mcp-proxy" rel="nofollow">https://github.com/sparfenyuk/mcp-proxy</a> - don't know much about this repo, but it was our starting point</p>
]]></description><pubDate>Sat, 20 Dec 2025 01:34:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=46332945</link><dc:creator>anthuswilliams</dc:creator><comments>https://news.ycombinator.com/item?id=46332945</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46332945</guid></item></channel></rss>