<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: jameswhitford</title><link>https://news.ycombinator.com/user?id=jameswhitford</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Mon, 22 Jun 2026 21:24:00 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=jameswhitford" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by jameswhitford in "GLM 5.2 vs. Opus"]]></title><description><![CDATA[
<p>That is a great suggestion that I am definitely going to look into, thanks!</p>
]]></description><pubDate>Mon, 22 Jun 2026 10:51:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=48628461</link><dc:creator>jameswhitford</dc:creator><comments>https://news.ycombinator.com/item?id=48628461</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48628461</guid></item><item><title><![CDATA[New comment by jameswhitford in "GLM 5.2 vs. Opus"]]></title><description><![CDATA[
<p>I hear you</p>
]]></description><pubDate>Mon, 22 Jun 2026 10:49:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=48628440</link><dc:creator>jameswhitford</dc:creator><comments>https://news.ycombinator.com/item?id=48628440</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48628440</guid></item><item><title><![CDATA[New comment by jameswhitford in "GLM 5.2 vs. Opus"]]></title><description><![CDATA[
<p>Cool to hear, what kind of tasks have you been using GLM for? And what other models have you found useful through Ollama?</p>
]]></description><pubDate>Mon, 22 Jun 2026 09:41:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=48627942</link><dc:creator>jameswhitford</dc:creator><comments>https://news.ycombinator.com/item?id=48627942</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48627942</guid></item><item><title><![CDATA[New comment by jameswhitford in "GLM 5.2 vs. Opus"]]></title><description><![CDATA[
<p>I see your point. Just the fact that one model does have vision and one does not might be an interesting point of comparison, however.</p>
]]></description><pubDate>Mon, 22 Jun 2026 09:39:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=48627922</link><dc:creator>jameswhitford</dc:creator><comments>https://news.ycombinator.com/item?id=48627922</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48627922</guid></item><item><title><![CDATA[New comment by jameswhitford in "GLM 5.2 vs. Opus"]]></title><description><![CDATA[
<p>This is excellent feedback thank you! These LLMisms in writing are a challenge I am living with currently and trying to improve on. The technical writing industry is taking a huge knock right now with companies demanding more work in less time with a big drop in quality, day to day I get less and less time to work on the quality in the prose of my work. We are working at the frontier of this right now, so we are the most heavily effected, but also get to experiment with the changes first which can be both stimulating and very frustrating.</p>
]]></description><pubDate>Mon, 22 Jun 2026 09:38:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=48627913</link><dc:creator>jameswhitford</dc:creator><comments>https://news.ycombinator.com/item?id=48627913</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48627913</guid></item><item><title><![CDATA[New comment by jameswhitford in "GLM 5.2 vs. Opus"]]></title><description><![CDATA[
<p>Hi, author here, can you link? I would love to read about this.</p>
]]></description><pubDate>Mon, 22 Jun 2026 09:30:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=48627867</link><dc:creator>jameswhitford</dc:creator><comments>https://news.ycombinator.com/item?id=48627867</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48627867</guid></item><item><title><![CDATA[New comment by jameswhitford in "GLM 5.2 vs. Opus"]]></title><description><![CDATA[
<p>Yes I agree 100%. My next guide would do better to use identical harnesses.</p>
]]></description><pubDate>Mon, 22 Jun 2026 09:29:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=48627862</link><dc:creator>jameswhitford</dc:creator><comments>https://news.ycombinator.com/item?id=48627862</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48627862</guid></item><item><title><![CDATA[New comment by jameswhitford in "GLM 5.2 vs. Opus"]]></title><description><![CDATA[
<p>GLM 5.2 is text only, not multi modal. And Opus is multi modal.</p>
]]></description><pubDate>Mon, 22 Jun 2026 09:27:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=48627849</link><dc:creator>jameswhitford</dc:creator><comments>https://news.ycombinator.com/item?id=48627849</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48627849</guid></item><item><title><![CDATA[New comment by jameswhitford in "GLM 5.2 vs. Opus"]]></title><description><![CDATA[
<p>Hi, author here, I cannot give an exact number for how many token the verification step took, but the verification GLM 5.2 ran was very stupid and definitely a waste of time. It read the pixel color data to try and verify the scene rendered properly. Which is really bad. Opus opened the game in a Playwright browser and took screenshots to verify the actual image. Which helped a lot.<p>Pro tip: You could use a multi-modal model to verify images as a subagent spawned by GLM 5.2, to get around this issue.</p>
]]></description><pubDate>Mon, 22 Jun 2026 09:27:27 +0000</pubDate><link>https://news.ycombinator.com/item?id=48627843</link><dc:creator>jameswhitford</dc:creator><comments>https://news.ycombinator.com/item?id=48627843</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48627843</guid></item><item><title><![CDATA[New comment by jameswhitford in "GLM 5.2 vs. Opus"]]></title><description><![CDATA[
<p>Yes I 100% agree. Time-taken can be improved (with harnesses, subagent workflows etc.) and varies based on task.</p>
]]></description><pubDate>Mon, 22 Jun 2026 09:23:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=48627812</link><dc:creator>jameswhitford</dc:creator><comments>https://news.ycombinator.com/item?id=48627812</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48627812</guid></item><item><title><![CDATA[New comment by jameswhitford in "GLM 5.2 vs. Opus"]]></title><description><![CDATA[
<p>Yes, part of the reason I chose the one-shot test was really to test long-running tasks. A lot of people seem to be experimenting with this format, for example in the now trending loop-writing workflows. And really I am interested in diving into the murky waters of these novel workflows.</p>
]]></description><pubDate>Mon, 22 Jun 2026 09:20:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=48627791</link><dc:creator>jameswhitford</dc:creator><comments>https://news.ycombinator.com/item?id=48627791</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48627791</guid></item><item><title><![CDATA[New comment by jameswhitford in "GLM 5.2 vs. Opus"]]></title><description><![CDATA[
<p>I appreciate the feedback!</p>
]]></description><pubDate>Mon, 22 Jun 2026 09:18:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=48627772</link><dc:creator>jameswhitford</dc:creator><comments>https://news.ycombinator.com/item?id=48627772</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48627772</guid></item><item><title><![CDATA[New comment by jameswhitford in "GLM 5.2 vs. Opus"]]></title><description><![CDATA[
<p>Yes this is true. This test was run on a $20 pro Claude subscription. I would definitely love to try use both models on the highest plans for a whole month and compare the two, great format for a future head-to-head comparison.</p>
]]></description><pubDate>Mon, 22 Jun 2026 07:52:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=48627090</link><dc:creator>jameswhitford</dc:creator><comments>https://news.ycombinator.com/item?id=48627090</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48627090</guid></item><item><title><![CDATA[New comment by jameswhitford in "GLM 5.2 vs. Opus"]]></title><description><![CDATA[
<p>Hi, I am the author, I completely agree! I set out to run a vibe test on this one, not a benchmark, the real benchmarks are listed. My test shows what the models can do when both tasked with a long-running, technically difficult, one-shot task.<p>I think your test you describe (collaborative, task delegation, task completion, TTD, steerability) is a great format for a future test that I will definitely try out.</p>
]]></description><pubDate>Mon, 22 Jun 2026 07:46:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=48627043</link><dc:creator>jameswhitford</dc:creator><comments>https://news.ycombinator.com/item?id=48627043</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48627043</guid></item><item><title><![CDATA[New comment by jameswhitford in "Claude is skeptical about OpenClaw"]]></title><description><![CDATA[
<p>I asked Claude Code to research Openclaw. It spawned a subagent, got back detailed results, and then flagged them as unreliable and/or hallucinated before I could read them.<p>TL;DR:<p>Claude isn't trained on openclaw data due to its knowledge cutoff, but this is the first time I have been asked to look at research myself to verify it isn't hallucinated or unreliable.<p>I am not making any claims about Anthropic training their models to perform worse when dealing with information about competitors...<p>But I am worried about this behaviour of flagging certain sources as unreliable for what seem like arbitrary reasons.<p>It could also be a case of prompt poisoning at one of the research URLs.</p>
]]></description><pubDate>Sun, 19 Apr 2026 08:25:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=47822693</link><dc:creator>jameswhitford</dc:creator><comments>https://news.ycombinator.com/item?id=47822693</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47822693</guid></item><item><title><![CDATA[Claude is skeptical about OpenClaw]]></title><description><![CDATA[
<p>Article URL: <a href="https://wecreatethis.com/blog/post?slug=claude-is-skeptical-about-openclaw">https://wecreatethis.com/blog/post?slug=claude-is-skeptical-about-openclaw</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47822692">https://news.ycombinator.com/item?id=47822692</a></p>
<p>Points: 2</p>
<p># Comments: 2</p>
]]></description><pubDate>Sun, 19 Apr 2026 08:25:24 +0000</pubDate><link>https://wecreatethis.com/blog/post?slug=claude-is-skeptical-about-openclaw</link><dc:creator>jameswhitford</dc:creator><comments>https://news.ycombinator.com/item?id=47822692</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47822692</guid></item><item><title><![CDATA[New comment by jameswhitford in "The Case That A.I. Is Thinking"]]></title><description><![CDATA[
<p>Who would not want to say their product is the second coming of Christ if they could.</p>
]]></description><pubDate>Tue, 04 Nov 2025 07:09:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=45808197</link><dc:creator>jameswhitford</dc:creator><comments>https://news.ycombinator.com/item?id=45808197</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45808197</guid></item><item><title><![CDATA[New comment by jameswhitford in "The Case That A.I. Is Thinking"]]></title><description><![CDATA[
<p>This submarine isn’t swimming, it’s us that are submarining!<p>I think I hear my master’s voice..<p>Or is that just a fly trapped in a bottle?</p>
]]></description><pubDate>Tue, 04 Nov 2025 06:58:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=45808132</link><dc:creator>jameswhitford</dc:creator><comments>https://news.ycombinator.com/item?id=45808132</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45808132</guid></item><item><title><![CDATA[New comment by jameswhitford in "How to Migrate from OpenAI to Cerebrium for Cost-Predictable AI Inference"]]></title><description><![CDATA[
<p>It's a demo project using the free tier hardware from Cerebrum, demonstrating how to migrate with a few lines of code from OpenAI. The cost is never going to beat OpenAI on an A10, there are more powerful options available.</p>
]]></description><pubDate>Tue, 22 Jul 2025 14:09:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=44647111</link><dc:creator>jameswhitford</dc:creator><comments>https://news.ycombinator.com/item?id=44647111</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44647111</guid></item><item><title><![CDATA[New comment by jameswhitford in "How to Migrate from OpenAI to Cerebrium for Cost-Predictable AI Inference"]]></title><description><![CDATA[
<p>Serverless setups (like Cerebrium) charge per second the model is running, its not token based.</p>
]]></description><pubDate>Tue, 22 Jul 2025 10:05:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=44645081</link><dc:creator>jameswhitford</dc:creator><comments>https://news.ycombinator.com/item?id=44645081</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44645081</guid></item></channel></rss>