Hacker News: zambelli

New comment by zambelli in "Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks"

zambelli — Sun, 24 May 2026 09:33:57 +0000

v0.7.1 is now out - Forge can now sit behind Claude Code! Proxy mode can talk to supported backends and handles format translation. Anthropic > forge > OpenAI; or Anthropic > forge > Anthropic.

Will get this ported over into vLLM work and try to get that released soon.

Thanks to some kind folks who contributed Docker, token counting, and a handful of PRs I haven't gotten to yet.

New comment by zambelli in "Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks"

zambelli — Sun, 24 May 2026 00:59:16 +0000

Thanks for the thoughtful comment! Let me try to unpack some of what's there and what's missing.

Forge is at its core a mechanical reliability layer, whereas a lot of memory/skill management would be more of an orchestration component/element that that consumer would own.

That split that has forge stopping at the mechanical layer was an intentional design decision, but there's no reason it couldn't grow into more. I think a lot of what you're thinking about is a big model/small model split similar to how CC does it - but that's an orchestrator.

Now, where Forge can help with what you're suggesting - I think most of it is there, but needs some wiring from the consumer/orchestrator: - Forge surfaces information about which guardrails fired: InferenceResult.new_messages carries typed MessageMeta.type — RETRY_NUDGE, STEP_NUDGE, PREREQUISITE_NUDGE, CONTEXT_WARNING, SUMMARY. So every nudge that fired during a run is observable per-step. A consumer could capture that and compare to workflow steps to reconstruct what success looked like. - Combined with Guardrails.check() > CheckResult, you would have a lot of the journey the model took to get to the answer. - Forge lets you (actually, requires) you to define the system prompt, any workflow restrictions, and the tools. So if you know something about how your task will behave with a small model, you can include that in system prompt, or a tool that's a required step, etc.

For integrations into MCPs/etc that house memories and skills, those can be surfaced to the model with Forge in place. Prompt the model to search for tools in the MCP/surface an MCP tool, etc. I've built a consumer that follows this pattern: main agent gets task > main agent eyeballs whether it can be solved on its own > if not, sends to a subagent specialized on that topic (that has access to more tools related to that) - which allows me to keep context lean for each agent.

You could do something similar where the model is prompted to use its toolset, but if its unsure or needs a tool it doesn't have, to call the get_mcp() tool or something to look for better options.

Big model v small model now - a couple of ways I think about it. - You could use big models to go through your workflow a few times, see common patterns, and then use those to define prerequisite and required steps in Forge guardrails when using small models. - You could use small models the same way there's the ANTHROPIC_SMALL_FAST_MODEL env var in claude code (this is what Explore subagent uses I think). Big model is effectively an orchestrator, and when it recognizes a task is easy, it dispatches a small model to do it, where Forge might make it viable.

Hoepfully that helps! Forge could certainly elevate some of this to be more native - and I might do that - like a mode that packages up results for you so you don't need to reconstruct the nudge events from hooks firing. But everything should be there to integrate with a memory system with the information required, or with an API/MCP that has more tools or skills for the agent to read.

Would love to see the integration if you do it! You'd just need a consumer that captures the events forge returns and packages them up into whatever your memory system is looking for!

If you're looking for other ways of ingesting those memories/skills that isn't system prompt, message, or tool result, then that's something I can look into.

New comment by zambelli in "Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks"

zambelli — Sat, 23 May 2026 01:55:34 +0000

You're very welcome! I've seen some good PRs come through and merges are starting. I need 24-48 hours to get the conference demo and travel sorted then work can continue at a faster pace! No intention of dropping forge, plenty more to do especially around the proxy.

New comment by zambelli in "Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks"

zambelli — Fri, 22 May 2026 14:59:51 +0000

Thanks to everyone for the great discussion! v0.7.0 is out now. It was in flight when this landed - changes tool error channel based on dogfooding observations with some larger models, eval re-run (numbers shift but within CI), and most importantly docs updated! I hope they're clearer now.

New comment by zambelli in "Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks"

zambelli — Thu, 21 May 2026 18:49:30 +0000

This is a really neat writeup, and the empirical data for coding agents is super useful. Will take a closer read and see if there's anything I easily lift into my harness!

New comment by zambelli in "Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks"

zambelli — Thu, 21 May 2026 04:13:58 +0000

Merged! Thanks for that catch. I'll try to sequence the in-flight work ASAP to get the vllm branch merged in as a whole.

New comment by zambelli in "Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks"

zambelli — Wed, 20 May 2026 22:05:37 +0000

Oh that's a good find, I'll book ark this for a GitHub issue.

Glad to hear it's working!

New comment by zambelli in "Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks"

zambelli — Wed, 20 May 2026 20:16:05 +0000

Yeah I got it working as a quick test run to confirm a model issue vs backend issue on a consumer app. It worked on my dual-5070 Ti rig, but I didn't have time to formalize all the way and merge it in. Thanks for linking it!

New comment by zambelli in "Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks"

zambelli — Wed, 20 May 2026 19:04:04 +0000

Nice symmetry with tool call failures being sent to LLM that made the call without bugging the user. The artifact-generating entity gets the error back, effectively.

100% correct, and stackable. Could have topic refusal in LLM training itself, forge in tool call alter, and sdlc gates at the workflow level.

New comment by zambelli in "Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks"

zambelli — Wed, 20 May 2026 17:59:29 +0000

Oh, interesting - thanks for the link. I really haven't explored this but it should slot in fairly easily I think? Gotta dig into it more.

New comment by zambelli in "Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks"

zambelli — Wed, 20 May 2026 17:58:16 +0000

Very cool! I'll try to get an issue open on lmstudio support and add it to the backlog.

New comment by zambelli in "Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks"

zambelli — Wed, 20 May 2026 16:55:00 +0000

Interesting, catching the problem upstream, effectively. How did you enforce the grammar?

New comment by zambelli in "Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks"

zambelli — Wed, 20 May 2026 15:57:49 +0000

Retry nudges do generate an extra LLM call, and those average extra calls time impacts are captured in the eval data.

But that's the difference between the call failing and succeeding (eventually).

On successful calls the presence of forge should be unnoticeable.

New comment by zambelli in "Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks"

zambelli — Wed, 20 May 2026 15:35:01 +0000

Oh, awesome! I'll take a look.

New comment by zambelli in "Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks"

zambelli — Wed, 20 May 2026 14:50:18 +0000

Thank you! I've been trying to catch those replies and redirect people, but hopefully your comment be upvoted for others. Very embarrassing to put up the post with the wrong link lol.

New comment by zambelli in "Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks"

zambelli — Wed, 20 May 2026 13:28:05 +0000

Ohhhh, that's much more interesting. I haven't looked into that at all, but now I'm curious. I'd need to think way more about how to layer that into forge, but the principle could likely be applied somewhere. I get it now.

New comment by zambelli in "Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks"

zambelli — Wed, 20 May 2026 13:26:48 +0000

This is not an agentic coding harness. It's a generic tool-calling guardrail stack. I have built a coding harness built on Forge since, but that's not what this is.

New comment by zambelli in "Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks"

zambelli — Wed, 20 May 2026 13:25:50 +0000

Nice ;). I'll take a closer read of it, that's on me - I am definitely seeing more people looking in this direction as agents start to ramp in production at the enterprise level, which I suspect is highlighting some of these failure modes at higher stakes. And also the cloud frontier API bills.

New comment by zambelli in "Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks"

zambelli — Wed, 20 May 2026 13:24:23 +0000

I know :( - I posted the wrong link and now it's there forever.

Dashboard is in here: https://github.com/antoinezambelli/forge/tree/main/docs/resu...

New comment by zambelli in "Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks"

zambelli — Wed, 20 May 2026 13:23:45 +0000

Not stupid at all!

Some of the older models did do this (like 3.5-era ish I think), and the harness would parse the results.

The newer way frontier has setup is structured tool calls. `tool_use` or `tool_calls`. The response is then received as a different tool_result rather than a regular message. That's a bit of the newer way of doing it.

The failure mode in question is more the model mixing the two: "Sure, I'll read the file: {"tool": "read", "args": {"path": "foo"}}" - that'll break stuff. Other failure modes are the json not parsing when sent it as a structured call, and in some cases the model just emitting text and forgetting the tool call.