<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: julianlam</title><link>https://news.ycombinator.com/user?id=julianlam</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sun, 24 May 2026 19:42:09 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=julianlam" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by julianlam in "Reflections on Building Forum Software"]]></title><description><![CDATA[
<p>As someone who's built a forum software for 10+ years (NodeBB), I'm glad you found the experience exciting.<p>I find building out forums exceedingly fun too (which is why I've been at it for a decade). Like you, we realized that federation between forums is quite important from a communication POV, though I'm not sure if you went that direction or just used PDSes as your user store.<p>We ended up integrating ActivityPub and its really reinvigorated my passion for building forums again :)<p>Usually when someone on HN talks about building a forum out, I tell them it took me a year (3 devs) before we reached rough feature parity. Perhaps it's possible for AI assisted clones to reach this point in weeks or months rather than years.<p>Good luck! When you get tired of it, just tell your agent to migrate all your data to NodeBB.</p>
]]></description><pubDate>Sat, 23 May 2026 21:12:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=48251584</link><dc:creator>julianlam</dc:creator><comments>https://news.ycombinator.com/item?id=48251584</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48251584</guid></item><item><title><![CDATA[New comment by julianlam in "Qwen3.7-Max: The Agent Frontier"]]></title><description><![CDATA[
<p>Try llama.cpp and Qwen3.6-35B-A3B<p>Good balance of intelligence and speed.</p>
]]></description><pubDate>Wed, 20 May 2026 18:10:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=48211723</link><dc:creator>julianlam</dc:creator><comments>https://news.ycombinator.com/item?id=48211723</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48211723</guid></item><item><title><![CDATA[New comment by julianlam in "Qwen3.7-Max: The Agent Frontier"]]></title><description><![CDATA[
<p>May I ask why the M instead of XL?<p>Obviously bigger != better but I don't know what the differences are.</p>
]]></description><pubDate>Wed, 20 May 2026 18:08:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=48211696</link><dc:creator>julianlam</dc:creator><comments>https://news.ycombinator.com/item?id=48211696</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48211696</guid></item><item><title><![CDATA[New comment by julianlam in "Qwen 3.7 Preview"]]></title><description><![CDATA[
<p>Gemma 4 and Qwen 3.6 were when my local inference experiments graduated from toy challenges with much hand holding to actually full day back and forth with good ability to utilise tool calls to discover how things are glued together.<p>I'm not talking about greenfield dev, I'm talking about interfacing with an existing decade old codebase.</p>
]]></description><pubDate>Mon, 18 May 2026 22:56:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=48186987</link><dc:creator>julianlam</dc:creator><comments>https://news.ycombinator.com/item?id=48186987</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48186987</guid></item><item><title><![CDATA[New comment by julianlam in "OpenAI and Government of Malta partner to roll out ChatGPT Plus to all citizens"]]></title><description><![CDATA[
<p>> for one year<p><i>snort</i></p>
]]></description><pubDate>Sat, 16 May 2026 20:34:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=48163546</link><dc:creator>julianlam</dc:creator><comments>https://news.ycombinator.com/item?id=48163546</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48163546</guid></item><item><title><![CDATA[New comment by julianlam in "Claude for Small Business"]]></title><description><![CDATA[
<p>LLMs are bad at deterministic output.<p>Full stop.</p>
]]></description><pubDate>Thu, 14 May 2026 12:56:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=48134733</link><dc:creator>julianlam</dc:creator><comments>https://news.ycombinator.com/item?id=48134733</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48134733</guid></item><item><title><![CDATA[New comment by julianlam in "Maryland citizens hit with $2B power grid upgrade for out-of-state AI"]]></title><description><![CDATA[
<p>> those aren't going to enable much future growth.<p>What is with this obsessive need for "growth".</p>
]]></description><pubDate>Tue, 12 May 2026 04:35:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=48104220</link><dc:creator>julianlam</dc:creator><comments>https://news.ycombinator.com/item?id=48104220</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48104220</guid></item><item><title><![CDATA[New comment by julianlam in "Local AI needs to be the norm"]]></title><description><![CDATA[
<p>Arguably, some of the things HN readers ask for can be capably completed by a local open weight model for free.</p>
]]></description><pubDate>Mon, 11 May 2026 03:54:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=48090917</link><dc:creator>julianlam</dc:creator><comments>https://news.ycombinator.com/item?id=48090917</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48090917</guid></item><item><title><![CDATA[New comment by julianlam in "Local AI needs to be the norm"]]></title><description><![CDATA[
<p>Not by much, and moving goalposts makes for a bad comparison. Local open weight models are already more powerful than frontier models from only a year back.<p>If you believe what you read here, the gap is closing fast.</p>
]]></description><pubDate>Mon, 11 May 2026 03:48:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=48090883</link><dc:creator>julianlam</dc:creator><comments>https://news.ycombinator.com/item?id=48090883</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48090883</guid></item><item><title><![CDATA[New comment by julianlam in "LLMs corrupt your documents when you delegate"]]></title><description><![CDATA[
<p>Indeed, that's what I do. I inspect the diff, though if it's an indentation change the entire block will be marked changed.<p>Still not an excuse to not read every line of course...<p>Unit tests give me the confidence that at least those tested logic paths are unaffected.<p>Sometimes with older codebases one cannot assume the paths have adequate test coverage.</p>
]]></description><pubDate>Sat, 09 May 2026 21:44:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=48078557</link><dc:creator>julianlam</dc:creator><comments>https://news.ycombinator.com/item?id=48078557</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48078557</guid></item><item><title><![CDATA[New comment by julianlam in "I’ve banned query strings"]]></title><description><![CDATA[
<p>>  After I implemented that feature, a page from one of my favourite websites refused to load in the console... the third URL returns an HTTP 404 error page. <i>The website uses the query string to determine which one of its several font collections to show.</i><p>Yes, let's unilaterally decide that query strings are bad because one website (ab)uses query strings to load different fonts.<p>It's the query strings that are the problem, not the website!<p>jfc.<p>Look, I'm against utm fragments as much as the next guy, but let's not throw away a perfectly good thing because tracking is evil.</p>
]]></description><pubDate>Sat, 09 May 2026 18:23:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=48077044</link><dc:creator>julianlam</dc:creator><comments>https://news.ycombinator.com/item?id=48077044</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48077044</guid></item><item><title><![CDATA[New comment by julianlam in "LLMs corrupt your documents when you delegate"]]></title><description><![CDATA[
<p>I always thought it was a little weird that LLMs aren't sophisticated enough to surgically edit files as needed.<p>For example, if there is a code block that needs to be wrapped within another function call, it'll rewrite the entire function call and you'll just have to pray that the re-written code block wasn't subtly changed.<p>I _think_ so far it hasn't introduced any changes....</p>
]]></description><pubDate>Sat, 09 May 2026 18:17:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=48077002</link><dc:creator>julianlam</dc:creator><comments>https://news.ycombinator.com/item?id=48077002</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48077002</guid></item><item><title><![CDATA[New comment by julianlam in "What Happened on the Hantavirus Cruise, According to a Doctor on Board"]]></title><description><![CDATA[
<p>Reader mode also works well</p>
]]></description><pubDate>Fri, 08 May 2026 04:18:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=48058493</link><dc:creator>julianlam</dc:creator><comments>https://news.ycombinator.com/item?id=48058493</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48058493</guid></item><item><title><![CDATA[New comment by julianlam in "Agents need control flow, not more prompts"]]></title><description><![CDATA[
<p>> This started breaking down after ~30 files. Sometimes it would miss a file. Sometimes it would triple-test a bundle of files and take 10 minutes instead of 3. An error in one file would convince it it needs to re-test four previous files, for no reason. It was very frustrating.<p>Sorry, you thought a prompt was a suitable replacement for a testing suite?</p>
]]></description><pubDate>Thu, 07 May 2026 23:23:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=48056410</link><dc:creator>julianlam</dc:creator><comments>https://news.ycombinator.com/item?id=48056410</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48056410</guid></item><item><title><![CDATA[New comment by julianlam in "Accelerating Gemma 4: faster inference with multi-token prediction drafters"]]></title><description><![CDATA[
<p>So then these models could be used by llama.cpp today with the -md switch?<p>Interesting, must try tomorrow.</p>
]]></description><pubDate>Wed, 06 May 2026 04:29:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=48032171</link><dc:creator>julianlam</dc:creator><comments>https://news.ycombinator.com/item?id=48032171</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48032171</guid></item><item><title><![CDATA[New comment by julianlam in "Accelerating Gemma 4: faster inference with multi-token prediction drafters"]]></title><description><![CDATA[
<p>Does this mean there will be new Gemma 4 models released with MTP, or are they already available in existing models + quants?</p>
]]></description><pubDate>Tue, 05 May 2026 19:22:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=48027244</link><dc:creator>julianlam</dc:creator><comments>https://news.ycombinator.com/item?id=48027244</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48027244</guid></item><item><title><![CDATA[New comment by julianlam in "Accelerating Gemma 4: faster inference with multi-token prediction drafters"]]></title><description><![CDATA[
<p>Really excited to try this once it is merged into llama.cpp.<p>Gemma 4 26B-A4B is much quicker on my setup vs Qwen3.6-35B-A3B (by about 3x), so the thought of a 1.5 speedup is tantalizing.<p>Have tried draft models to limited success (the smaller 3B draft model in addition to a dense 14B Ministral model introduced too much overhead already)</p>
]]></description><pubDate>Tue, 05 May 2026 18:01:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=48026155</link><dc:creator>julianlam</dc:creator><comments>https://news.ycombinator.com/item?id=48026155</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48026155</guid></item><item><title><![CDATA[New comment by julianlam in "Show HN: State of the Art of Coding Models, According to Hacker News Commenters"]]></title><description><![CDATA[
<p>We're all busy doing work instead of incessantly commenting about our models?</p>
]]></description><pubDate>Sun, 03 May 2026 14:26:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=47997273</link><dc:creator>julianlam</dc:creator><comments>https://news.ycombinator.com/item?id=47997273</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47997273</guid></item><item><title><![CDATA[New comment by julianlam in "Show HN: State of the Art of Coding Models, According to Hacker News Commenters"]]></title><description><![CDATA[
<p>I only started playing around with local inference a couple weeks ago. Prior to that I was just using Gemini via web since it came with my Workspace subscription, but I did not want to be reliant on the cloud.<p>Others will have a better idea since they've been messing around with local inference longer than I, but I am quite impressed with the models I have been loading on my laptop with only iGPU. As of this week I no longer feel like I am playing second fiddle with slow inference and small models. Gemma 4 (and maybe Qwen3.5, haven't tried it yet) seem to have changed the game this month!<p>Even with trying some absolutely <i>shiiiiite</i> models (I only had 16GB unified RAM at the start), I was suitably impressed that I splashed the $300 to double my RAM. I am happy that this one time cost was enough to break through to smarter models and faster inference. No ongoing cloud costs!</p>
]]></description><pubDate>Sun, 03 May 2026 13:43:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=47996886</link><dc:creator>julianlam</dc:creator><comments>https://news.ycombinator.com/item?id=47996886</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47996886</guid></item><item><title><![CDATA[New comment by julianlam in "Show HN: State of the Art of Coding Models, According to Hacker News Commenters"]]></title><description><![CDATA[
<p>It's so interesting to see the wild pendulum swings of LLM sentiment here.<p>If one likes a model then it's capable of one-shotting entire apps.<p>Otherwise it's "only suitable for the most trivial tasks".<p>Never in between.</p>
]]></description><pubDate>Sun, 03 May 2026 04:49:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=47993442</link><dc:creator>julianlam</dc:creator><comments>https://news.ycombinator.com/item?id=47993442</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47993442</guid></item></channel></rss>