Hacker News: julianlam

New comment by julianlam in "Reflections on Building Forum Software"

julianlam — Sat, 23 May 2026 21:12:59 +0000

As someone who's built a forum software for 10+ years (NodeBB), I'm glad you found the experience exciting.

I find building out forums exceedingly fun too (which is why I've been at it for a decade). Like you, we realized that federation between forums is quite important from a communication POV, though I'm not sure if you went that direction or just used PDSes as your user store.

We ended up integrating ActivityPub and its really reinvigorated my passion for building forums again :)

Usually when someone on HN talks about building a forum out, I tell them it took me a year (3 devs) before we reached rough feature parity. Perhaps it's possible for AI assisted clones to reach this point in weeks or months rather than years.

Good luck! When you get tired of it, just tell your agent to migrate all your data to NodeBB.

New comment by julianlam in "Qwen3.7-Max: The Agent Frontier"

julianlam — Wed, 20 May 2026 18:10:20 +0000

Try llama.cpp and Qwen3.6-35B-A3B

Good balance of intelligence and speed.

New comment by julianlam in "Qwen3.7-Max: The Agent Frontier"

julianlam — Wed, 20 May 2026 18:08:30 +0000

May I ask why the M instead of XL?

Obviously bigger != better but I don't know what the differences are.

New comment by julianlam in "Qwen 3.7 Preview"

julianlam — Mon, 18 May 2026 22:56:56 +0000

Gemma 4 and Qwen 3.6 were when my local inference experiments graduated from toy challenges with much hand holding to actually full day back and forth with good ability to utilise tool calls to discover how things are glued together.

I'm not talking about greenfield dev, I'm talking about interfacing with an existing decade old codebase.

New comment by julianlam in "OpenAI and Government of Malta partner to roll out ChatGPT Plus to all citizens"

julianlam — Sat, 16 May 2026 20:34:08 +0000

> for one year

snort

New comment by julianlam in "Claude for Small Business"

julianlam — Thu, 14 May 2026 12:56:52 +0000

LLMs are bad at deterministic output.

Full stop.

New comment by julianlam in "Maryland citizens hit with $2B power grid upgrade for out-of-state AI"

julianlam — Tue, 12 May 2026 04:35:59 +0000

> those aren't going to enable much future growth.

What is with this obsessive need for "growth".

New comment by julianlam in "Local AI needs to be the norm"

julianlam — Mon, 11 May 2026 03:54:05 +0000

Arguably, some of the things HN readers ask for can be capably completed by a local open weight model for free.

New comment by julianlam in "Local AI needs to be the norm"

julianlam — Mon, 11 May 2026 03:48:50 +0000

Not by much, and moving goalposts makes for a bad comparison. Local open weight models are already more powerful than frontier models from only a year back.

If you believe what you read here, the gap is closing fast.

New comment by julianlam in "LLMs corrupt your documents when you delegate"

julianlam — Sat, 09 May 2026 21:44:21 +0000

Indeed, that's what I do. I inspect the diff, though if it's an indentation change the entire block will be marked changed.

Still not an excuse to not read every line of course...

Unit tests give me the confidence that at least those tested logic paths are unaffected.

Sometimes with older codebases one cannot assume the paths have adequate test coverage.

New comment by julianlam in "I’ve banned query strings"

julianlam — Sat, 09 May 2026 18:23:29 +0000

> After I implemented that feature, a page from one of my favourite websites refused to load in the console... the third URL returns an HTTP 404 error page. The website uses the query string to determine which one of its several font collections to show.

Yes, let's unilaterally decide that query strings are bad because one website (ab)uses query strings to load different fonts.

It's the query strings that are the problem, not the website!

jfc.

Look, I'm against utm fragments as much as the next guy, but let's not throw away a perfectly good thing because tracking is evil.

New comment by julianlam in "LLMs corrupt your documents when you delegate"

julianlam — Sat, 09 May 2026 18:17:34 +0000

I always thought it was a little weird that LLMs aren't sophisticated enough to surgically edit files as needed.

For example, if there is a code block that needs to be wrapped within another function call, it'll rewrite the entire function call and you'll just have to pray that the re-written code block wasn't subtly changed.

I _think_ so far it hasn't introduced any changes....

New comment by julianlam in "What Happened on the Hantavirus Cruise, According to a Doctor on Board"

julianlam — Fri, 08 May 2026 04:18:21 +0000

Reader mode also works well

New comment by julianlam in "Agents need control flow, not more prompts"

julianlam — Thu, 07 May 2026 23:23:14 +0000

> This started breaking down after ~30 files. Sometimes it would miss a file. Sometimes it would triple-test a bundle of files and take 10 minutes instead of 3. An error in one file would convince it it needs to re-test four previous files, for no reason. It was very frustrating.

Sorry, you thought a prompt was a suitable replacement for a testing suite?

New comment by julianlam in "Accelerating Gemma 4: faster inference with multi-token prediction drafters"

julianlam — Wed, 06 May 2026 04:29:00 +0000

So then these models could be used by llama.cpp today with the -md switch?

Interesting, must try tomorrow.

New comment by julianlam in "Accelerating Gemma 4: faster inference with multi-token prediction drafters"

julianlam — Tue, 05 May 2026 19:22:52 +0000

Does this mean there will be new Gemma 4 models released with MTP, or are they already available in existing models + quants?

New comment by julianlam in "Accelerating Gemma 4: faster inference with multi-token prediction drafters"

julianlam — Tue, 05 May 2026 18:01:06 +0000

Really excited to try this once it is merged into llama.cpp.

Gemma 4 26B-A4B is much quicker on my setup vs Qwen3.6-35B-A3B (by about 3x), so the thought of a 1.5 speedup is tantalizing.

Have tried draft models to limited success (the smaller 3B draft model in addition to a dense 14B Ministral model introduced too much overhead already)

New comment by julianlam in "Show HN: State of the Art of Coding Models, According to Hacker News Commenters"

julianlam — Sun, 03 May 2026 14:26:11 +0000

We're all busy doing work instead of incessantly commenting about our models?

New comment by julianlam in "Show HN: State of the Art of Coding Models, According to Hacker News Commenters"

julianlam — Sun, 03 May 2026 13:43:53 +0000

I only started playing around with local inference a couple weeks ago. Prior to that I was just using Gemini via web since it came with my Workspace subscription, but I did not want to be reliant on the cloud.

Others will have a better idea since they've been messing around with local inference longer than I, but I am quite impressed with the models I have been loading on my laptop with only iGPU. As of this week I no longer feel like I am playing second fiddle with slow inference and small models. Gemma 4 (and maybe Qwen3.5, haven't tried it yet) seem to have changed the game this month!

Even with trying some absolutely shiiiiite models (I only had 16GB unified RAM at the start), I was suitably impressed that I splashed the $300 to double my RAM. I am happy that this one time cost was enough to break through to smarter models and faster inference. No ongoing cloud costs!

New comment by julianlam in "Show HN: State of the Art of Coding Models, According to Hacker News Commenters"

julianlam — Sun, 03 May 2026 04:49:30 +0000

It's so interesting to see the wild pendulum swings of LLM sentiment here.

If one likes a model then it's capable of one-shotting entire apps.

Otherwise it's "only suitable for the most trivial tasks".

Never in between.