Hacker News: amavashev

New comment by amavashev in "Your Website Is Not for You"

amavashev — Fri, 01 May 2026 13:45:18 +0000

True website is not for you and in the age for AI is not even for people. Its for AI agents reading your website and deciding what to do with it: recommend it, skip it, integrate with it, etc.

New comment by amavashev in "Show HN: Stop over-budget AI API calls per customer/feature (no proxy)"

amavashev — Mon, 23 Mar 2026 16:45:21 +0000

The no proxy approach makes sense for LLMs calls. The gap is non LLM calls.

Often times damage is done by non LLM calls -- tool calls like sending email, add records, files, placing order, etc. Budget enforcement at the LLM layer wont work for those.

built an open protocol + reference implementation, handles both any tool calls, LLM calls, or any other call: https://runcycles.io, open sourced under Apache 2.0

New comment by amavashev in "Be intentional about how AI changes your codebase"

amavashev — Fri, 20 Mar 2026 12:21:56 +0000

Agree, you need to your own code review, although as AI gets better, this problem will most likely be solved.

New comment by amavashev in "[dead]"

amavashev — Fri, 20 Mar 2026 12:19:12 +0000

The core argument here is that autonomous agents will need a economic envelope enforced before execution not after.

This means agents must follow this flow: reserve, use, commit or release.

Interesting how others are handling runaway agents, loops, etc, especially under concurrency.

New comment by amavashev in "Show HN: Cycles – hard limits on agent actions before execution"

amavashev — Wed, 18 Mar 2026 16:15:08 +0000

technical breakdown: https://dev.to/amavashev/i-burned-153-in-30-minutes-with-an-...

New comment by amavashev in "Show HN: Cycles – hard limits on agent actions before execution"

amavashev — Tue, 17 Mar 2026 20:07:32 +0000

Demo (no API key needed): https://github.com/runcycles/cycles-runaway-demo

Show HN: Cycles – hard limits on agent actions before execution

amavashev — Tue, 17 Mar 2026 19:28:43 +0000

Rate limits control velocity. They say nothing about what an agent is allowed to do next. An agent can pass every rate-limit check and still delete 400 records, send 200 emails, or place orders before anyone notices. The damage isn't always in the bill — it's in the consequence.

Cycles is an open protocol for pre-execution enforcement. The core mechanism: reserve exposure before the action runs, commit actual usage after, release the remainder if it fails. Every reservation is idempotent so retries don't double-count.

Atomic operations mean concurrent agents can't both see "enough budget" and both proceed. It's not a rate limiter. It's not an observability tool. It's a runtime authority that answers one question before every instrumented action: is this still allowed to proceed?

Three clients: Python (PyPI), TypeScript (npm), Spring Boot (Maven Central). Self-hostable server, Apache 2.0.

Integrations: OpenClaw, LangChain, Vercel, FastAPI

The demo shows the failure mode in 60 seconds — same agent, same bug, two outcomes: without Cycles it burns $6, with Cycles it stops at $1.

Happy to answer questions about the protocol design, the idempotency semantics, or the concurrency model.

Comments URL: https://news.ycombinator.com/item?id=47417096

Points: 1

# Comments: 2

New comment by amavashev in "Show HN: OpenClaw plugin – hard budget limits for agent tool calls"

amavashev — Sun, 15 Mar 2026 16:03:44 +0000

I'm the author. A few design decisions worth explaining:

The model downgrade is the most useful feature for daily use — when budget drops below a threshold, the plugin silently swaps claude-opus to claude-sonnet, gpt-4o to gpt-4o-mini. The agent keeps running, just cheaper. Only at full exhaustion does it stop.

The prompt hint (before_prompt_build hook) tells the model its remaining budget in the system prompt. Models self-regulate when they know the constraint exists — fewer unnecessary tool calls, shorter responses.

The underlying Cycles protocol handles idempotency under retries so concurrent tool calls don't double-spend against the same budget.

Happy to answer questions about the plugin or the protocol.

Show HN: OpenClaw plugin – hard budget limits for agent tool calls

amavashev — Sun, 15 Mar 2026 16:03:15 +0000

OpenClaw agents can loop, pick expensive models, and burn budget before anyone notices. This plugin stops that.

Install:

    openclaw plugins install @runcycles/openclaw-budget-guard

Add one config block with your tenant and optional model fallbacks (e.g. claude-opus → claude-sonnet when budget is low). The plugin handles the rest: balance checks before model selection, reservations before tool calls, commits after, and cleanup at session end.

Built on the Cycles protocol — reserve budget before execution, commit actual spend after, release the remainder.

Plugin: https://github.com/runcycles/cycles-openclaw-budget-guard

npm: https://www.npmjs.com/package/@runcycles/openclaw-budget-gua...

Comments URL: https://news.ycombinator.com/item?id=47388702

Points: 1

# Comments: 1

Show HN: RunCycles – pre-execution budget enforcement for autonomous agents

amavashev — Sun, 15 Mar 2026 00:08:39 +0000

I built this after reading too many incident reports of agent loops spending $200 in 4 minutes because a quality threshold was never met.

The pattern is always the same: an agent retries, fans out, or loops. Each iteration passes individual rate-limit checks. Observability fires an alert after the money is gone. Provider caps are per-provider, not cross-provider. None of these stop the spend before it happens.

RunCycles takes a different approach: reserve budget before the call, commit actual spend after, release the remainder if the work is cancelled. The reservation is atomic across all affected budget scopes — tenant, workspace, agent — using Redis Lua scripts so concurrent agents sharing a budget can't collectively overrun it.

The integration surface is small:

    @cycles(estimate=50_000, action_kind="llm.completion", action_name="gpt-4o")
    def call_llm(prompt: str) -> str:
        return openai.complete(prompt)

When budget is exhausted, the next reservation attempt gets a 409 BUDGET_EXCEEDED before the downstream call is made.

The architecture is three pieces:

- Cycles Protocol: an open OpenAPI spec defining the reservation lifecycle, idempotency semantics, scope hierarchy, and overage policies. - RunCycles Server: Spring Boot + Redis, implements the spec. Runs in Docker. - Clients: Python, TypeScript, Java/Spring Boot.

The hardest part was idempotency under retries — if a commit fails transiently and retries with the same key, it should get the original response back, not double-charge. The Lua scripts handle this atomically.

What it's not: a billing system, observability dashboard, or agent framework. It's the layer that decides whether an action may proceed before it proceeds.

Org: https://github.com/runcycles Docs: https://runcycles.github.io/docs

Comments URL: https://news.ycombinator.com/item?id=47382742

Points: 2

# Comments: 0

New comment by amavashev in "Show HN: I lost $200 from an agent loop, so I built per-tool AI budget controls"

amavashev — Wed, 18 Feb 2026 12:10:51 +0000

Per-key isolation + model locking is a solid baseline — especially for multi-tool stacks where one shared key hides everything.

One thing we’ve noticed though: spend caps stop damage, but they don’t prevent pathological behavior. By the time the cap trips, the agent has already drifted.

We’ve been experimenting with pre-authorization per action (reserve → commit style) rather than just per-key ceilings. It lets you detect anomalous patterns before the burn accumulates — especially in looping or tool-chaining scenarios.

Curious — have you seen most overruns come from loops, retries, or just high-token completions?

New comment by amavashev in "Ask HN: What are the biggest limitations of agentic AI in real-world workflows?"

amavashev — Wed, 18 Feb 2026 12:06:45 +0000

Drift correlating more with constraint tension than raw step count matches what we’ve observed.

Your external gate instinct is right, but the gate has to be structurally external, not just logically external. If the agent can reason about the gate, it can learn to route around it.

We’ve been experimenting with pre-authorization before high-impact actions (rather than post-hoc validation) - I've drafted Cycles Protocol v0 spec to deal with this problem.

What’s interesting is that anomalous reservation patterns often show up before output quality visibly degrades — which makes drift detectable earlier.

Still early work, but happy to compare notes if that’s useful.

Show HN: Scalerx.ai – Create, train and deploy AI agents

amavashev — Sun, 03 Nov 2024 11:50:19 +0000

Article URL: https://scalerx.ai/

Comments URL: https://news.ycombinator.com/item?id=42032528

Points: 1

# Comments: 0

Create, train and launch personalized AI Agents

amavashev — Sat, 02 Nov 2024 16:35:49 +0000

Article URL: https://scalerx.ai/

Comments URL: https://news.ycombinator.com/item?id=42027409

Points: 2

# Comments: 0