<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: mesmertech</title><link>https://news.ycombinator.com/user?id=mesmertech</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Wed, 10 Jun 2026 10:08:41 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=mesmertech" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by mesmertech in "Fable 5 remotion video benchmark and examples"]]></title><description><![CDATA[
<p>Overall an improvement over Opus 4.8, but I'd still say Gemini 3.1 Pro has more of an artistic vision, even tho it fails tool calls and writes buggy code sometimes.<p>Ik almost everyone is interested just in the SWE stuff, but this has been a good eval for me to think about how big the model is, how "creative" it is at generating new ideas, etc.<p>More results from Fable, with comparisons for Gemini, Opus and some open source models: <a href="https://mesmer.tools/benchmarks/ai-video-generation" rel="nofollow">https://mesmer.tools/benchmarks/ai-video-generation</a></p>
]]></description><pubDate>Tue, 09 Jun 2026 21:36:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=48468096</link><dc:creator>mesmertech</dc:creator><comments>https://news.ycombinator.com/item?id=48468096</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48468096</guid></item><item><title><![CDATA[Fable 5 remotion video benchmark and examples]]></title><description><![CDATA[
<p>Article URL: <a href="https://mesmer.tools/benchmarks/ai-video-generation">https://mesmer.tools/benchmarks/ai-video-generation</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48468095">https://news.ycombinator.com/item?id=48468095</a></p>
<p>Points: 5</p>
<p># Comments: 1</p>
]]></description><pubDate>Tue, 09 Jun 2026 21:36:12 +0000</pubDate><link>https://mesmer.tools/benchmarks/ai-video-generation</link><dc:creator>mesmertech</dc:creator><comments>https://news.ycombinator.com/item?id=48468095</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48468095</guid></item><item><title><![CDATA[Ask HN: Anyone else seeing serious degradation in DX with Opus 4.8?]]></title><description><![CDATA[
<p>As an Anthropic fanboy (check my prev. comments), this is the first Opus release where I feel like the model is just not pleasant to talk to, not to mention untrustworthy.<p>The two cases where I lost confidence in it: once it started with 2 random echo commands: https://snipboard.io/tpqfP2.jpg
Another time I asked it to create a new landing page and it just deleted an existing app page as a replacement.<p>I'm not exactly sure if this is a harness problem with the Claude Code updates (maybe the system prompt changed) or if it's just the model itself that has gotten too "safety-pilled", as I've been seeing similar opinions where devs are complaining that the model seems to distrust the user's intentions.<p>Either way, this is the first model release where I've downgraded to the previous model, since I was already pretty happy with it. That should make it clear whether it's a model problem or the harness.</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48356061">https://news.ycombinator.com/item?id=48356061</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 01 Jun 2026 12:40:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=48356061</link><dc:creator>mesmertech</dc:creator><comments>https://news.ycombinator.com/item?id=48356061</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48356061</guid></item><item><title><![CDATA[New comment by mesmertech in "Claude Opus 4.8"]]></title><description><![CDATA[
<p>I think GPT 5.6 is coming out today so might wanna wait</p>
]]></description><pubDate>Thu, 28 May 2026 17:24:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=48312346</link><dc:creator>mesmertech</dc:creator><comments>https://news.ycombinator.com/item?id=48312346</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48312346</guid></item><item><title><![CDATA[New comment by mesmertech in "Claude Opus 4.8"]]></title><description><![CDATA[
<p>/model claude-opus-4-8<p>seems to work but idk why they never set it so you can see it in the /model list.<p>"what model are you<p>I'm Claude Opus (claude-opus-4-8), running in Claude Code."</p>
]]></description><pubDate>Thu, 28 May 2026 17:23:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=48312333</link><dc:creator>mesmertech</dc:creator><comments>https://news.ycombinator.com/item?id=48312333</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48312333</guid></item><item><title><![CDATA[New comment by mesmertech in "I think Anthropic and OpenAI have found product-market fit"]]></title><description><![CDATA[
<p>Yep sorry was just pulling it out my rear, not like it's a market trend that nearly every enterprise uses Anthropic or OpenAI models for coding, or that Anthropic has had such ridiculous growth that they're 10x-ing year over year</p>
]]></description><pubDate>Wed, 27 May 2026 22:01:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=48301358</link><dc:creator>mesmertech</dc:creator><comments>https://news.ycombinator.com/item?id=48301358</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48301358</guid></item><item><title><![CDATA[New comment by mesmertech in "I think Anthropic and OpenAI have found product-market fit"]]></title><description><![CDATA[
<p>My point was that even OpenRouter, the one place people looking for open source SOTA models go to, doesn't definitively have open source models at the top. Esp considering quite a lot of closed model usage goes through AWS, GCP, Azure etc, probably dwarfing the usage on OpenRouter by a huge factor</p>
]]></description><pubDate>Wed, 27 May 2026 21:52:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=48301262</link><dc:creator>mesmertech</dc:creator><comments>https://news.ycombinator.com/item?id=48301262</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48301262</guid></item><item><title><![CDATA[New comment by mesmertech in "I think Anthropic and OpenAI have found product-market fit"]]></title><description><![CDATA[
<p>As long as closed source is 6 months ahead in terms of current difference. Although this is hard to figure out using simple percent based coding benchmarks, you def. notice it when you're actually trying to do a long task. Even simple things like UI "taste" are enough for me to use Opus instead of 5.5, even though 5.5 is strictly better for anything that doesn't have a UI, i.e. backend, scripts, making agent workflows etc</p>
]]></description><pubDate>Wed, 27 May 2026 21:48:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=48301216</link><dc:creator>mesmertech</dc:creator><comments>https://news.ycombinator.com/item?id=48301216</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48301216</guid></item><item><title><![CDATA[New comment by mesmertech in "I think Anthropic and OpenAI have found product-market fit"]]></title><description><![CDATA[
<p>As long as closed models are 6 months ahead I won't be switching from them to open source models that were SOTA 6 months ago. Maybe it's just a different calculation if you're in a job, but as an indiehacker I'll take any edge I can get<p>Ofc again, I can be convinced to switch if there's a clear speed difference, like 5x+ for an open source model, even if it was SOTA 6 months ago</p>
]]></description><pubDate>Wed, 27 May 2026 21:44:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=48301174</link><dc:creator>mesmertech</dc:creator><comments>https://news.ycombinator.com/item?id=48301174</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48301174</guid></item><item><title><![CDATA[New comment by mesmertech in "I think Anthropic and OpenAI have found product-market fit"]]></title><description><![CDATA[
<p>Based on the current market for LLMs I'd say my use of the general "you" is fine. Even OpenRouter, which doesn't capture all of the SOTA closed model usage but captures nearly all open source model usage, has Opus as 1st (over the last week) in the "Programming" category and 3rd in overall rankings<p><a href="https://openrouter.ai/rankings" rel="nofollow">https://openrouter.ai/rankings</a></p>
]]></description><pubDate>Wed, 27 May 2026 21:40:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=48301125</link><dc:creator>mesmertech</dc:creator><comments>https://news.ycombinator.com/item?id=48301125</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48301125</guid></item><item><title><![CDATA[New comment by mesmertech in "I think Anthropic and OpenAI have found product-market fit"]]></title><description><![CDATA[
<p>Cost for the value delivered. Like if you offered the current SOTA open source models at $0.1/M, I still think I'd be using Opus or 5.5 at $30/M. Or say GPT 5 which was released Aug 25, I don't think I'd use it for coding for even $0.1. I'd def find other uses for it (translations, agentic workflows, prompt guards etc), but for coding I don't think I'd ever completely switch to a SOTA open model<p>Unless ofc there was an actual speed difference: the only reason I'd be willing to go with a model a couple of percent worse than the current best is if the speed was at least 5x higher. Looking forward to kimi k2.6 offered publicly by Cerebras</p>
]]></description><pubDate>Wed, 27 May 2026 18:14:23 +0000</pubDate><link>https://news.ycombinator.com/item?id=48298207</link><dc:creator>mesmertech</dc:creator><comments>https://news.ycombinator.com/item?id=48298207</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48298207</guid></item><item><title><![CDATA[New comment by mesmertech in "I think Anthropic and OpenAI have found product-market fit"]]></title><description><![CDATA[
<p>For coding you always want to go with the best model in the category, not something that would have been the best model 1 year ago, which GLM 5.1 is, and I'm saying that as a big fan of GLM cause I run a translation site where GLM is good enough for the price.<p>Most of the money right now is in coding. OpenAI and Anthropic just have to be 6 months ahead of SOTA open source models and they'll capture most of the enterprise and dev market</p>
]]></description><pubDate>Wed, 27 May 2026 17:29:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=48297544</link><dc:creator>mesmertech</dc:creator><comments>https://news.ycombinator.com/item?id=48297544</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48297544</guid></item><item><title><![CDATA[New comment by mesmertech in "I think Anthropic and OpenAI have found product-market fit"]]></title><description><![CDATA[
<p>If nothing else this blog did give me the idea that I should split my $200 Claude Max plan into a $100 CC Max plan and a $100 Codex plan, esp because Claude is now offering 1.5x weekly limits, so the 5x usage is now more like 7.5x usage.</p>
]]></description><pubDate>Wed, 27 May 2026 17:27:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=48297520</link><dc:creator>mesmertech</dc:creator><comments>https://news.ycombinator.com/item?id=48297520</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48297520</guid></item><item><title><![CDATA[New comment by mesmertech in "Claude Code weekly limits increasing 50% till July 13"]]></title><description><![CDATA[
<p>I'm just hoping they release Mythos soon now that it seems like they have enough compute to do promotions like this</p>
]]></description><pubDate>Wed, 13 May 2026 19:43:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=48126493</link><dc:creator>mesmertech</dc:creator><comments>https://news.ycombinator.com/item?id=48126493</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48126493</guid></item><item><title><![CDATA[Claude Code weekly limits increasing 50% till July 13]]></title><description><![CDATA[
<p>Article URL: <a href="https://twitter.com/ClaudeDevs/status/2054639777685934564">https://twitter.com/ClaudeDevs/status/2054639777685934564</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48126429">https://news.ycombinator.com/item?id=48126429</a></p>
<p>Points: 10</p>
<p># Comments: 9</p>
]]></description><pubDate>Wed, 13 May 2026 19:38:12 +0000</pubDate><link>https://twitter.com/ClaudeDevs/status/2054639777685934564</link><dc:creator>mesmertech</dc:creator><comments>https://news.ycombinator.com/item?id=48126429</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48126429</guid></item><item><title><![CDATA[New comment by mesmertech in "Claude Opus 4.7"]]></title><description><![CDATA[
<p>I think that was a typo on my end, it's "/model claude-opus-4-7" not "/model claude-opus-4.7"</p>
]]></description><pubDate>Thu, 16 Apr 2026 15:19:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=47794516</link><dc:creator>mesmertech</dc:creator><comments>https://news.ycombinator.com/item?id=47794516</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47794516</guid></item><item><title><![CDATA[New comment by mesmertech in "Claude Opus 4.7"]]></title><description><![CDATA[
<p>I'm on the Max $200 plan, so maybe it's that?</p>
]]></description><pubDate>Thu, 16 Apr 2026 15:06:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=47794264</link><dc:creator>mesmertech</dc:creator><comments>https://news.ycombinator.com/item?id=47794264</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47794264</guid></item><item><title><![CDATA[New comment by mesmertech in "Claude Opus 4.7"]]></title><description><![CDATA[
<p>I think it's just a visual/default thing, cause Opus 4.0 isn't offered on Claude Code anymore. And Opus 4.7 is in their official docs as a model you can change to on Claude Code<p>Just ask it what model it is (even in a new chat).<p>what model are you?<p>I'm Claude Opus 4 (model ID: claude-opus-4-7).<p><a href="https://support.claude.com/en/articles/11940350-claude-code-model-configuration" rel="nofollow">https://support.claude.com/en/articles/11940350-claude-code-...</a></p>
]]></description><pubDate>Thu, 16 Apr 2026 15:05:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=47794250</link><dc:creator>mesmertech</dc:creator><comments>https://news.ycombinator.com/item?id=47794250</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47794250</guid></item><item><title><![CDATA[New comment by mesmertech in "Claude Opus 4.7"]]></title><description><![CDATA[
<p>Not showing up in Claude Code by default on the latest version. Apparently this is how to set it:<p>/model claude-opus-4-7<p>Coming from Anthropic's support page, so hopefully they didn't hallucinate the docs, cause the model name on Claude Code says:<p>/model claude-opus-4-7
  ⎿  Set model to Opus 4<p>what model are you?<p>I'm Claude Opus 4 (model ID: claude-opus-4-7).</p>
]]></description><pubDate>Thu, 16 Apr 2026 14:50:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=47793917</link><dc:creator>mesmertech</dc:creator><comments>https://news.ycombinator.com/item?id=47793917</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47793917</guid></item><item><title><![CDATA[New comment by mesmertech in "Elevated errors on Claude.ai, API, Claude Code"]]></title><description><![CDATA[
<p>We went from "Peak hours" meaning 2x usage plus slower responses to now it just returns a 500 error<p><a href="https://mesmer.tools/random/is-it-peak-hours" rel="nofollow">https://mesmer.tools/random/is-it-peak-hours</a></p>
]]></description><pubDate>Wed, 15 Apr 2026 14:49:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=47779834</link><dc:creator>mesmertech</dc:creator><comments>https://news.ycombinator.com/item?id=47779834</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47779834</guid></item></channel></rss>