Hacker News: Nizoss

New comment by Nizoss in "Agents need control flow, not more prompts"

Nizoss — Thu, 07 May 2026 21:39:47 +0000

Exactly! I have said this a couple of times but it was taken literally as in no capital letters or strong language. Glad to see someone else who shares this perspective.

New comment by Nizoss in "Agents need control flow, not more prompts"

Nizoss — Thu, 07 May 2026 21:37:03 +0000

Not throwing shade at anyone here but the thought has definitely crossed my mind that we are recreating SAFe but for agents when looking at some of the orchestration setups out there. I think that it is better to not force the same hierarchical processes that worked for humans in large organizations onto agents and instead look at what they need to give better results and what their failure modes look like.

New comment by Nizoss in "Agents need control flow, not more prompts"

Nizoss — Thu, 07 May 2026 21:33:30 +0000

I fully agree. I also started with husky before expanding further and creating my own hooks. I can’t imagine using agents today without them; it would require a lot of babysitting.

New comment by Nizoss in "Agents need control flow, not more prompts"

Nizoss — Thu, 07 May 2026 21:30:49 +0000

Sounds interesting, can you elaborate on your thinking? Got me curious.

New comment by Nizoss in "Agents need control flow, not more prompts"

Nizoss — Thu, 07 May 2026 21:15:39 +0000

Exactly! I don’t babysit TDD anymore. I have another agent that does that for me, and honestly it sometimes catches things I would have missed if I were the one babysitting.

Hooks do wonders here. The payload contains a lot of information about the pending action the agent wants to make. Combine that with the most recent n events from the agent’s session history and you have a rich enough context to pass to another agent to validate the action through the SDK.

This way the validation uses the same subscription you’re logged in to, whether you’re using Claude Code, Codex, or Copilot. The validation agent responds in JSON that you can easily parse and return, letting you allow the action through or block it with direction and guidance. I’m genuinely impressed by how well this works considering how simple it is.
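A minimal sketch of that allow/block step, in Python. The verdict shape (`decision`, `reason`), the payload fields, and the helper names `build_context` and `parse_verdict` are assumptions made for illustration; they are not Probity's actual schema or any vendor's hook format. The call to the validation agent is left out, and its reply stands in as a plain JSON string.

```python
import json


def build_context(payload: dict, history: list, n: int = 5) -> dict:
    """Bundle the pending action with the n most recent session events."""
    recent = history[-n:] if n > 0 else []
    return {"pending_action": payload, "recent_events": recent}


def parse_verdict(raw: str) -> tuple:
    """Return (allowed, reason) from the validator's JSON reply.

    Hypothetical verdict shape: {"decision": "allow" | "block", "reason": "..."}.
    Anything unparseable or unexpected is treated as a block.
    """
    try:
        verdict = json.loads(raw)
    except json.JSONDecodeError:
        return False, "validator returned invalid JSON"
    if not isinstance(verdict, dict):
        return False, "validator returned an unexpected JSON type"
    decision = verdict.get("decision")
    if decision == "allow":
        return True, ""
    if decision == "block":
        return False, str(verdict.get("reason", "blocked without a reason"))
    return False, f"unknown decision: {decision!r}"
```

Blocking on malformed output is a design choice in this sketch, not something the approach requires; letting such cases through would be the more permissive alternative.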

You can find my approach here: https://github.com/nizos/probity

New comment by Nizoss in "Agents need control flow, not more prompts"

Nizoss — Thu, 07 May 2026 20:49:06 +0000

If you’re interested in such deterministic scaffolding/control flow, check out Probity.

I created it to address this exact issue. It is a vendor-neutral ESLint-style policy engine and currently supports Claude Code, Codex, and Copilot.

It uses the agents' hook payloads and session history to enforce the policies. It can be set up to block commits if a file has been modified since the checks were last run, to disallow content or commands using string or regex matching, and to enforce TDD without any extra reporter setup. It works with any language.
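To make the first two kinds of policy concrete, here is a rough Python sketch. `DenyRule`, `check_command`, and `stale_files` are hypothetical names invented for this example, and the inputs are plain values rather than real hook payloads; Probity's actual configuration and API may look quite different.

```python
import re
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class DenyRule:
    pattern: str  # regular expression matched against the command
    message: str  # guidance returned to the agent when the rule matches


def check_command(command: str, rules: list) -> Optional[str]:
    """Return the first matching rule's message, or None if the command is allowed."""
    for rule in rules:
        if re.search(rule.pattern, command):
            return rule.message
    return None


def stale_files(modified_at: dict, checks_ran_at: float) -> list:
    """Files modified after the checks last ran.

    A non-empty result would be the signal to block a commit.
    """
    return sorted(path for path, ts in modified_at.items() if ts > checks_ran_at)
```

For example, a rule with the pattern `\bgit\s+push\b.*--force` would return its message for `git push origin main --force` and let `git status` through.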

Feedback welcome: https://github.com/nizos/probity

New comment by Nizoss in "EvanFlow – A TDD driven feedback loop for Claude Code"

Nizoss — Mon, 27 Apr 2026 13:44:26 +0000

Creator of TDD Guard here, thanks for the mention!

TDD Guard was built when Claude Code was the only one to offer hooks. Plugins didn't exist and the models were weaker, so the validation context and instructions took more work to get right. This is why it ended up requiring test reporters for different languages.

I have started a new project that does the same TDD enforcement, also through hooks, but without reporters. It works with any test runner and is vendor-agnostic: it works with Claude Code, Codex, and GitHub Copilot. The validator also sees recent session history, which helps it handle cases like refactoring better.

The TDD instructions are still pretty basic compared to TDD Guard's, which have been dogfooded for a year. One thing I noticed while testing across agents is that some follow TDD a lot better than others; Codex struggled the most with the basic instructions.

Feedback welcome:

https://github.com/nizos/conduct

New comment by Nizoss in "Your job is to deliver code you have proven to work"

Nizoss — Thu, 18 Dec 2025 18:24:42 +0000

This is how I would also love to work but not all teams prefer this way. How many are you in your team? Was it easy to switch?

New comment by Nizoss in "Your job is to deliver code you have proven to work"

Nizoss — Thu, 18 Dec 2025 18:22:20 +0000

I would love to hear your thoughts on TDD-Guard, an open source plugin I created to enforce Test-Driven Development practices on agents:

https://github.com/nizos/tdd-guard

New comment by Nizoss in "Your job is to deliver code you have proven to work"

Nizoss — Thu, 18 Dec 2025 18:05:44 +0000

Yes! This is something that I also value. Having demo gifs of before and after helps a lot. I have encountered situations where what I thought was a minor finishing clean up had an effect that I didn't anticipate. By including demos in the PR it becomes a kind of guardrail against those situations for me. I also think it is neat and generally helpful for everyone.

New comment by Nizoss in "Your job is to deliver code you have proven to work"

Nizoss — Thu, 18 Dec 2025 17:51:11 +0000

If you write your tests the Test-Driven Development way, so that they first fail before production changes are introduced, you will be able to trust them a lot more. This is especially true if they are well-written tests that test behavior or contracts, not implementation details. I find that dependency injection helps a lot with this. I try to avoid mocking and complex dependencies as much as possible. This also allows me to easily refactor the code without having to worry about breaking anything, as long as all the tests still pass.
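A small Python illustration of what I mean by dependency injection instead of mocking. The `greeting` function is a made-up example, not taken from any real project: the clock is passed in as a plain callable, so the test supplies a fixed time rather than patching anything.

```python
from datetime import datetime, timezone
from typing import Callable


def greeting(now: Callable[[], datetime]) -> str:
    """Pick a greeting from the injected clock instead of calling datetime.now() directly."""
    return "Good morning" if now().hour < 12 else "Good afternoon"


def test_greets_by_time_of_day() -> None:
    # Hand-written fakes: plain functions returning fixed times, no mocking library.
    def morning() -> datetime:
        return datetime(2025, 1, 1, 9, 0, tzinfo=timezone.utc)

    def afternoon() -> datetime:
        return datetime(2025, 1, 1, 15, 0, tzinfo=timezone.utc)

    assert greeting(morning) == "Good morning"
    assert greeting(afternoon) == "Good afternoon"
```

In a TDD flow the test would be written first and seen to fail before `greeting` exists. Because it only checks behavior, the body of `greeting` can be refactored freely as long as the test still passes.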

When it comes to agentic coding, I created an open source tool that enforces those practices. The agent gets blocked by a hook if it tries to do anything that violates those principles. I think it helps a lot, if I may say so myself.

https://github.com/nizos/tdd-guard

Edit: I realize now that I misunderstood your comment. I was quick to respond.

New comment by Nizoss in "Show HN: Next AI Draw.io – Interactive Diagrams Creating with LLMs"

Nizoss — Mon, 01 Dec 2025 12:21:38 +0000

This looks really useful! Great work!

New comment by Nizoss in "Claude Code 2.0"

Nizoss — Tue, 30 Sep 2025 15:22:13 +0000

If you're on Windows and using VS Code, add this to keybindings.json:

  [
    {
      "key": "shift+enter",
      "command": "workbench.action.terminal.sendSequence",
      "args": { "text": "\u001b\n" },
      "when": "terminalFocus"
    }
  ]

It will allow you to get new lines without any strange output.

New comment by Nizoss in "ChatGPT Developer Mode: Full MCP client access"

Nizoss — Wed, 10 Sep 2025 19:08:13 +0000

And here I am still waiting for some kind of hooks support for ChatGPT/Codex.

New comment by Nizoss in "Claude for Chrome"

Nizoss — Tue, 26 Aug 2025 19:36:01 +0000

Same issue here, dark mode on mobile.

New comment by Nizoss in "My experience creating software with LLM coding agents – Part 2 (Tips)"

Nizoss — Sat, 23 Aug 2025 11:12:13 +0000

Yeah, this is a no-brainer for certain use cases.

New comment by Nizoss in "Show HN: TDD-Guard – Test-Driven Development for Claude Code"

Nizoss — Wed, 20 Aug 2025 16:32:26 +0000

Not that I'm aware of. I'll check what possibilities exist with Opencode on their Discord.

New comment by Nizoss in "Show HN: TDD-Guard – Test-Driven Development for Claude Code"

Nizoss — Wed, 20 Aug 2025 16:18:22 +0000

That sounds amazing! Thanks for the heads up!

I deliberately picked a vendor-agnostic name. Adding support for other clients mainly means extending IClient:

https://github.com/nizos/tdd-guard/blob/main/src%2Fcontracts...

https://github.com/nizos/tdd-guard/tree/main/src%2Fvalidatio...

I'll take a closer look into adding this support. I'd also welcome a contribution if that's something you would be interested in!

The question is, do other agent platforms support hooks or similar functionality?

New comment by Nizoss in "Show HN: TDD-Guard – Test-Driven Development for Claude Code"

Nizoss — Wed, 20 Aug 2025 16:04:07 +0000

Thanks for the TL;DR!

Go support was recently added, and TDD-Guard also works with these frameworks:

- JavaScript/TypeScript: Vitest and Jest
- Python: Pytest
- PHP: PHPUnit
- Go: Native go test

Adding a new language or framework just means creating a reporter that outputs test results in a format that TDD-Guard can consume.

I'm not familiar with Opencode. Is there something particular that interests you in it?

New comment by Nizoss in "Show HN: TDD-Guard – Test-Driven Development for Claude Code"

Nizoss — Wed, 20 Aug 2025 15:39:03 +0000

Hi HN,

I believe that using guardrails with agentic coding is far more effective than simply using instructions. This plugin demonstrates it using Test-Driven Development.

TDD-Guard can now be used with Go. Other supported languages are: JS/TS, Python, PHP, with dotnet in the works. Next are Ruby and Rust. I'd love community help adding support for more test frameworks and programming languages.

Here's feedback from an early user:

> This plugin is absolutely phenomenal and has become an indispensable part of my toolkit.

> It might sound strange, but I'm moving significantly faster on both new features and refactoring tasks now. The way it works in tandem with my strict ESLint setup is brilliant! It iterates through issues and consistently produces clean, working code. It's not an exaggeration to say you've completely changed how I think about TDD and AI in my coding process due to this plugin.

Happy to answer questions!