<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: mchusma</title><link>https://news.ycombinator.com/user?id=mchusma</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Tue, 16 Jun 2026 04:10:29 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=mchusma" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by mchusma in "Salesforce to Acquire Fin (formerly Intercom) for $3.6B"]]></title><description><![CDATA[
<p>You are right. These outcomes also skew heavily towards the easy stuff for LLMs to get. So tickets that take a human 1 min to respond to now cost you $0.99 ($60+/hour) and you are stuck only doing the hard tickets.</p>
]]></description><pubDate>Mon, 15 Jun 2026 20:33:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=48546672</link><dc:creator>mchusma</dc:creator><comments>https://news.ycombinator.com/item?id=48546672</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48546672</guid></item><item><title><![CDATA[New comment by mchusma in "Salesforce to Acquire Fin (formerly Intercom) for $3.6B"]]></title><description><![CDATA[
<p>I agree with you 100%. Fin and products like this simply do NOT solve the hard part of providing support in 2026. Basically, the hard parts are (1) coming up with the tools for agents to use, like searching for data, making updates, etc. (2) reviewing the logs of actual usage and adjusting prompts, docs, tools based on the real feedback. (3) tuning human escalation procedures.<p>This process is an ongoing effort, with an upfront engineering commitment which depends entirely on the product, but can be months of work. But if you have your own backend, I would argue this hard works is made HARDER by implementing something like Salesforce/Fin, because you have to now pipe a bunch of data and structure over to them, which is a pain.<p>LLM models capable of doing this are a commodity, the UI for customers and support teams is pretty trivial, the database/backend is trivial.<p>Outside of some cases, if you have your own app, and you have a given support volume, build your own.</p>
]]></description><pubDate>Mon, 15 Jun 2026 20:29:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=48546640</link><dc:creator>mchusma</dc:creator><comments>https://news.ycombinator.com/item?id=48546640</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48546640</guid></item><item><title><![CDATA[New comment by mchusma in "Statement on US government directive to suspend access to Fable 5 and Mythos 5"]]></title><description><![CDATA[
<p>I’ll just say that AI companies need to be pounding the table more about the necessity of AI. The US (and most other countries) have zero idea how to pay for its deficit spending. The only hope is massive GDP based growth and the only idea how to do that is AI.<p>This is rarely discussed, and while I agree we should be spending non-zero effort on safety, stopping progress is not an option.</p>
]]></description><pubDate>Sat, 13 Jun 2026 05:14:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=48513516</link><dc:creator>mchusma</dc:creator><comments>https://news.ycombinator.com/item?id=48513516</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48513516</guid></item><item><title><![CDATA[New comment by mchusma in "Anthropic's model naming, extrapolated"]]></title><description><![CDATA[
<p>I like "Proverb" as smaller than Haiku too, Aphorism is also good. But seriously I want Anthropic to up its small model game. Haiku is not competitive, Deepseek v4 flash outperforms my uses for about $0.10 / $0.20. Whereas Haiku 4.5 is $1/$5.<p>IMO Anthropic should just play the game at all the price tiers because it otherwise forces people to go elsewhere. I would probably pay for a "Proverb"/"Aphorism" class model that was worse than Deepseek at the same price just to stay in the ecosystem, if given the option.<p>(Note: I also see Google seem to make the same mistake, they actually do have competitive models in Gemma family but they don't make them available via the API. So there may be some reason for this.)</p>
]]></description><pubDate>Wed, 10 Jun 2026 21:48:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=48483193</link><dc:creator>mchusma</dc:creator><comments>https://news.ycombinator.com/item?id=48483193</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48483193</guid></item><item><title><![CDATA[New comment by mchusma in "Gemma 4 12B: A unified, encoder-free multimodal model"]]></title><description><![CDATA[
<p>It’s on openrouter. We just noticed performance was worse in a specific agentic app usecase. It’s possible we made an implementation mistake, my main point though is Google is really silly not hosting their own models.</p>
]]></description><pubDate>Thu, 04 Jun 2026 17:38:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=48401955</link><dc:creator>mchusma</dc:creator><comments>https://news.ycombinator.com/item?id=48401955</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48401955</guid></item><item><title><![CDATA[New comment by mchusma in "Gemma 4 12B: A unified, encoder-free multimodal model"]]></title><description><![CDATA[
<p>Gemma 4 31b outperformed Gemini 3.1 Flash-Lite in our app benchmarks (agentic tool use via api in our application as a part of various workflows). But google won't let you pay to use Gemma models, you have to go elsewhere, I think this may be because it would cannabilize Flash-lite.</p>
]]></description><pubDate>Wed, 03 Jun 2026 18:54:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=48388192</link><dc:creator>mchusma</dc:creator><comments>https://news.ycombinator.com/item?id=48388192</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48388192</guid></item><item><title><![CDATA[New comment by mchusma in "Gemma 4 12B: A unified, encoder-free multimodal model"]]></title><description><![CDATA[
<p>I think its even more puzzling because you can't even run Gemma 31b on google cloud, they only let you test it with a rate limit. No way (I can find) to actually pay them to use it.<p>We saw great results in our usecase using google direct. Moved to Openrouter because google wouldn't let us use it beyond a test.<p>Then Openrouters performance looked worse, not sure if there was a quantized version or something. So we instead looked at Deepseek v4 Flash, and opted to go for that.<p>This model would probably be great for a super low cost cloud model, would love to use it in the cloud, Google makes you go elsewhere.</p>
]]></description><pubDate>Wed, 03 Jun 2026 18:52:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=48388161</link><dc:creator>mchusma</dc:creator><comments>https://news.ycombinator.com/item?id=48388161</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48388161</guid></item><item><title><![CDATA[New comment by mchusma in "I think Anthropic and OpenAI have found product-market fit"]]></title><description><![CDATA[
<p>If we define product market fit as profitable with a trillion dollar valuation, I think the term has lost its helpfulness.<p>I do agree with the author that these companies seem much stronger financially recently though.</p>
]]></description><pubDate>Thu, 28 May 2026 07:11:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=48305663</link><dc:creator>mchusma</dc:creator><comments>https://news.ycombinator.com/item?id=48305663</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48305663</guid></item><item><title><![CDATA[New comment by mchusma in "What it would take to rebuild U.S. manufacturing might"]]></title><description><![CDATA[
<p>I can’t read the full article, but the snippet (6%gdp/$2T) seems not that expensive? And you could read that either way ”cheap so we should do it” or “if we end up needing to do it, we can do it”.</p>
]]></description><pubDate>Thu, 28 May 2026 00:11:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=48302531</link><dc:creator>mchusma</dc:creator><comments>https://news.ycombinator.com/item?id=48302531</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48302531</guid></item><item><title><![CDATA[New comment by mchusma in "Stripe is friendly to “friendly fraud”"]]></title><description><![CDATA[
<p>I am pretty convinced that friendly fraud is about 90% of chargebacks. I have seen some genuine fraud, but dwarfed by friendly fraud over time across 3 companies.</p>
]]></description><pubDate>Wed, 27 May 2026 03:26:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=48289208</link><dc:creator>mchusma</dc:creator><comments>https://news.ycombinator.com/item?id=48289208</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48289208</guid></item><item><title><![CDATA[New comment by mchusma in "Uber, Lyft drivers in Massachusetts form first US ride-share union"]]></title><description><![CDATA[
<p>I think it does bear that out in general, although it is slightly more complicated. What seems to happen
1. Low-wage workers, as a collective group, experience an increase in earnings (Dube & Zipperer, 2024).
2. Total job losses do take place, but are minor and teens/part-time/new entrants workers lose more often (Belman & Wolfson, 2014; Redmond & McGuinness, 2024).
3. Lost hours & increased prices - businesses primarily absorb the cost by slightly reducing weekly hours worked & increasing prices for consumers (Redmond & McGuinness, 2024)<p>I would agree that modest minimum wage increases are far from the worst thing the government does, compared to other government interventions.</p>
]]></description><pubDate>Tue, 26 May 2026 23:00:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=48287191</link><dc:creator>mchusma</dc:creator><comments>https://news.ycombinator.com/item?id=48287191</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48287191</guid></item><item><title><![CDATA[New comment by mchusma in "The real cost of owning a home"]]></title><description><![CDATA[
<p>There are also home warranties or tech solutions/concierges like tidy.com.<p>The issue historically is that these concierge things are expensive (should be solved by tech/ai) and the warranties create their own class of problems (claim frustrations etc).<p>But home ownership is expensive, no way around it. But the work in coordinating etc doesn’t fundamentally need to be.</p>
]]></description><pubDate>Tue, 26 May 2026 21:44:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=48286412</link><dc:creator>mchusma</dc:creator><comments>https://news.ycombinator.com/item?id=48286412</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48286412</guid></item><item><title><![CDATA[New comment by mchusma in "Germany news: Childfree adults to pay more for elder care"]]></title><description><![CDATA[
<p>The poor have dramatically more children than the rich on average, you don't need to be rich to have kids. Kids don't need to have rich parents to have a good live/upbringing.</p>
]]></description><pubDate>Tue, 26 May 2026 16:23:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=48281888</link><dc:creator>mchusma</dc:creator><comments>https://news.ycombinator.com/item?id=48281888</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48281888</guid></item><item><title><![CDATA[New comment by mchusma in "Ninth Circuit Panel Goes Out of Its Way to Question Section 230–DOE vs. Meta"]]></title><description><![CDATA[
<p>I have long wondered why these companies have only one model? Why not many models, letting the user choose? Why not let users tune what they want to see? It seems like a better user experience and safer from this type of (IMO valid) concerns. I’d want nothing to do with picking a users feed if I were them.</p>
]]></description><pubDate>Tue, 26 May 2026 01:16:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=48273869</link><dc:creator>mchusma</dc:creator><comments>https://news.ycombinator.com/item?id=48273869</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48273869</guid></item><item><title><![CDATA[New comment by mchusma in "2026 HIPAA Security Rule Update"]]></title><description><![CDATA[
<p>highly irritating. HIPAA was originally designed to be a "portability" standard (meaning easier to share). It has done the opposite. Health data is important to developing a cure, and privacy is unimportant to many people. The world would be better if there we were zero regulation here at all.</p>
]]></description><pubDate>Mon, 25 May 2026 19:34:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=48270729</link><dc:creator>mchusma</dc:creator><comments>https://news.ycombinator.com/item?id=48270729</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48270729</guid></item><item><title><![CDATA[New comment by mchusma in "Uber’s COO says it’s getting harder to justify money spent on tokenmaxxing"]]></title><description><![CDATA[
<p>I actually do think token maxing is good, but they should have limited it per user. I find it reallly hard to get people to max out the Claude $100 plan, let alone the $200 plan. I understand the enterprise plans are different and more expensive, which is how you get these kinds of issues. But encouraging people to try things with AI is very important, and some amount of token maxing is importsnt.</p>
]]></description><pubDate>Mon, 25 May 2026 17:56:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=48269713</link><dc:creator>mchusma</dc:creator><comments>https://news.ycombinator.com/item?id=48269713</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48269713</guid></item><item><title><![CDATA[New comment by mchusma in "Memory has grown to nearly two-thirds of AI chip component costs"]]></title><description><![CDATA[
<p>Everything I read seems to suggest that RAM capacity is going to grow at 20-25% a year, which just doesn't seem good enough. Even in consumer use cases, phones and laptops would benefit greatly by double the amount of RAM. And then obviously, the AI need is gigantic.<p>I don't see it going away. I mean, it may not grow as fast as now, but I don't see it growing away either. I get why the memory makers do not want to bankrupt themselves, but it feels like there's got to be some way to push that risk off onto model providers and other people in the ecosystem to allow us to grow ram capacity more like 50% per year.</p>
]]></description><pubDate>Sun, 24 May 2026 17:46:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=48259419</link><dc:creator>mchusma</dc:creator><comments>https://news.ycombinator.com/item?id=48259419</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48259419</guid></item><item><title><![CDATA[New comment by mchusma in "Launch HN: Superset (YC P26) – IDE for the agents era"]]></title><description><![CDATA[
<p>I don’t see any specific mention of Conductor, is this confirmed?</p>
]]></description><pubDate>Fri, 22 May 2026 22:34:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=48242482</link><dc:creator>mchusma</dc:creator><comments>https://news.ycombinator.com/item?id=48242482</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48242482</guid></item><item><title><![CDATA[New comment by mchusma in "Antigravity 2.0 Tops the OpenSCAD Architectural 3D LLM Benchmark"]]></title><description><![CDATA[
<p>I just left the google I/O feeling less confident about google's execution here.
- Gemini 3.5 flash is strange. Old cutoff, basically better than 3.1 pro at soem things worse at others, sometimes cheaper, sometimes more expensive than 3.1 pro.
- Antigravity had seemed abandoned, and people speculated them cutting it off, and they kind of did migrating everyone to a new antigravity
- Google "shipped the org chart" and they have so many AI products and none seem best of breed (e.g. the Gemini integration in google docs is worse than claude)<p>I was actually hoping for "Opus level intelligence at Haiku costs" model or "Sonnet level performance in Gemini 3.0 pricing", either of these would have been a workhorse, plus a competitor to Claude/Codex (1 app to do things). I got neither.</p>
]]></description><pubDate>Fri, 22 May 2026 16:43:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=48238303</link><dc:creator>mchusma</dc:creator><comments>https://news.ycombinator.com/item?id=48238303</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48238303</guid></item><item><title><![CDATA[New comment by mchusma in "Qwen3.7-Max: The Agent Frontier"]]></title><description><![CDATA[
<p>I've looked like a dozen places, I don't see anything. :(</p>
]]></description><pubDate>Wed, 20 May 2026 16:26:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=48210251</link><dc:creator>mchusma</dc:creator><comments>https://news.ycombinator.com/item?id=48210251</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48210251</guid></item></channel></rss>