<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: rjpruitt16</title><link>https://news.ycombinator.com/user?id=rjpruitt16</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Fri, 01 May 2026 23:36:02 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=rjpruitt16" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by rjpruitt16 in "Ask HN: Who is hiring? (May 2026)"]]></title><description><![CDATA[
<p>I’m curious if you are looking into Gleam at all. I have written an Elixir/Gleam interop OTP library. It would probably help a lot with safety in finance. I could help turn certain components into type-safe Gleam code.</p>
]]></description><pubDate>Fri, 01 May 2026 18:09:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=47978013</link><dc:creator>rjpruitt16</dc:creator><comments>https://news.ycombinator.com/item?id=47978013</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47978013</guid></item><item><title><![CDATA[New comment by rjpruitt16 in "Layer 8: The coordination protocol AI agents and embedded devices don't have yet"]]></title><description><![CDATA[
<p>Yeah, thanks for checking it out. I know it’s a lot to read. Keep me in mind if you run into retry storms with agents.</p>
]]></description><pubDate>Wed, 22 Apr 2026 04:38:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=47859071</link><dc:creator>rjpruitt16</dc:creator><comments>https://news.ycombinator.com/item?id=47859071</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47859071</guid></item><item><title><![CDATA[Layer 8: The coordination protocol AI agents and embedded devices don't have yet]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.ezthrottle.network/blog/operationless-network-for-a-new-world-of-devices">https://www.ezthrottle.network/blog/operationless-network-for-a-new-world-of-devices</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47857889">https://news.ycombinator.com/item?id=47857889</a></p>
<p>Points: 2</p>
<p># Comments: 2</p>
]]></description><pubDate>Wed, 22 Apr 2026 02:07:38 +0000</pubDate><link>https://www.ezthrottle.network/blog/operationless-network-for-a-new-world-of-devices</link><dc:creator>rjpruitt16</dc:creator><comments>https://news.ycombinator.com/item?id=47857889</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47857889</guid></item><item><title><![CDATA[New comment by rjpruitt16 in "Show HN: Self-healing browser harness via direct CDP"]]></title><description><![CDATA[
<p>Where do you think AI web agents will be by 2027? How are people creating value today?<p>Where will browser use be at the end of the year?</p>
]]></description><pubDate>Mon, 20 Apr 2026 05:37:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=47830731</link><dc:creator>rjpruitt16</dc:creator><comments>https://news.ycombinator.com/item?id=47830731</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47830731</guid></item><item><title><![CDATA[New comment by rjpruitt16 in "Ask HN: What Are You Working On? (April 2026)"]]></title><description><![CDATA[
<p>An API rate limit router. It gives each user their own queue to solve noisy neighbors, it lets
the user reroute 500s to other regions automatically, and the API provider can tune request speed with headers. It should be a big speed boost, because non-BEAM servers waste capacity on sleeping threads. It works by my edge servers doing all the retrying across different regions, and it can send webhooks to you across regions, which means we deliver as long as you and all your downstream dependencies are up in at least one region. It’s actually ready already, but of course no one wants to trust me with their traffic yet. Nobody wants to even read the design doc or the SDK. I’m hoping the upcoming wave of agents brings so much traffic that people start looking for this category of software. Until then I guess I am Cloudflare before the world got hit by bot scrapers. (A rough sketch of the queue-per-user idea is below.)<p>Modern orchestrators are reactive; they don’t handle spiky traffic. Your favorite retry library will cause retry storms for your downstream dependencies and your public APIs. Remember the EZThrottle blog posts.<p>EZThrottle.network</p>
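<p>To make the queue-per-user idea concrete, here is a rough Python sketch (the real router is Gleam on the BEAM; every name below is illustrative): each user gets their own queue, and a round-robin drain loop forwards at a fixed pace, so one noisy user can't starve the rest.</p>
<pre><code>import asyncio
from collections import defaultdict

class PerUserRouter:
    """Round-robin over per-user queues so one noisy user can't starve the rest."""

    def __init__(self, forward, rate_per_sec: float = 2.0):
        self.queues = defaultdict(asyncio.Queue)   # user_id -> queued requests
        self.forward = forward                     # coroutine that sends one request
        self.interval = 1.0 / rate_per_sec

    async def submit(self, user_id, request):
        await self.queues[user_id].put(request)

    async def drain(self):
        while True:
            busy = False
            for user_id, queue in list(self.queues.items()):
                if not queue.empty():
                    busy = True
                    await self.forward(user_id, queue.get_nowait())
                    await asyncio.sleep(self.interval)   # pace the downstream API
            if not busy:
                await asyncio.sleep(0.01)                # idle: don't busy-spin
</code></pre>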
]]></description><pubDate>Mon, 13 Apr 2026 15:17:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=47753251</link><dc:creator>rjpruitt16</dc:creator><comments>https://news.ycombinator.com/item?id=47753251</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47753251</guid></item><item><title><![CDATA[New comment by rjpruitt16 in "Show HN: Pitstop-check – finds the retry bug that turns 429s into request storms"]]></title><description><![CDATA[
<p>EZThrottle works by sending the request and, depending on which error codes the user wants to reroute on, resending it to another region. It gives the user a chance to say something different in case the API misclassifies the error. The user has to tune it.</p>
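<p>Roughly this shape on the client side (a hypothetical Python sketch; the names and parameters are illustrative, not the actual SDK surface):</p>
<pre><code># Hypothetical sketch of reroute-on-error-code -- illustrative, not the real SDK.
REROUTE_ON = {429, 500, 502, 503}    # codes the user chose to treat as "try elsewhere"

def send_with_reroute(send, request, regions, reroute_on=REROUTE_ON):
    """Try regions in order, rerouting only on the configured status codes."""
    response = None
    for region in regions:
        response = send(region, request)
        if response.status_code not in reroute_on:
            return response          # success, or an error the user opted to keep
    return response                  # every region returned a reroutable error
</code></pre>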
]]></description><pubDate>Mon, 06 Apr 2026 01:03:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=47655698</link><dc:creator>rjpruitt16</dc:creator><comments>https://news.ycombinator.com/item?id=47655698</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47655698</guid></item><item><title><![CDATA[New comment by rjpruitt16 in "Show HN: Pitstop-check – finds the retry bug that turns 429s into request storms"]]></title><description><![CDATA[
<p>Cool stuff.</p>
]]></description><pubDate>Tue, 24 Mar 2026 05:06:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=47498787</link><dc:creator>rjpruitt16</dc:creator><comments>https://news.ycombinator.com/item?id=47498787</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47498787</guid></item><item><title><![CDATA[New comment by rjpruitt16 in "Ask HN: How do you deal with people who trust LLMs?"]]></title><description><![CDATA[
<p>Thanks for this. I was in the camp of trusting the LLM, but y’all have made valid points. After discussing it with ChatGPT, it agreed there are some areas where it should not be trusted to be accurate, but it said that for historical facts like the Holocaust the trust level should be high. Idk, perhaps we need labs to produce a chart of the level of trust it deserves on certain topics.</p>
]]></description><pubDate>Thu, 19 Mar 2026 03:30:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=47434561</link><dc:creator>rjpruitt16</dc:creator><comments>https://news.ycombinator.com/item?id=47434561</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47434561</guid></item><item><title><![CDATA[New comment by rjpruitt16 in "Ask HN: How to scale agent systems when Layer 7 is unreliable?"]]></title><description><![CDATA[
<p>What do you use?</p>
]]></description><pubDate>Wed, 11 Mar 2026 16:02:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=47337359</link><dc:creator>rjpruitt16</dc:creator><comments>https://news.ycombinator.com/item?id=47337359</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47337359</guid></item><item><title><![CDATA[New comment by rjpruitt16 in "Launch HN: Terminal Use (YC W26) – Vercel for filesystem-based agents"]]></title><description><![CDATA[
<p>I’m curious if you guys are seeing rate limiting issues. Agents sharing API keys tend to be retry storm monsters. I wonder how agent companies will address this.</p>
]]></description><pubDate>Tue, 10 Mar 2026 01:11:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=47317981</link><dc:creator>rjpruitt16</dc:creator><comments>https://news.ycombinator.com/item?id=47317981</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47317981</guid></item><item><title><![CDATA[Ask HN: How to scale agent systems when Layer 7 is unreliable?]]></title><description><![CDATA[
<p>Agent workflows often involve 10+ API calls to different services 
(LLMs, data APIs, web scraping). Layer 7 being unreliable = 
workflows fail or cause retry storms.<p>Common failure modes I'm thinking about:
- 429 rate limits → agents retry → hammer API worse
- Partial outages → synchronized retries across customers
- LangGraph workflows fail mid-execution → how to resume?<p>For those running agent systems at scale:
- How do you handle Layer 7 failures?
- Retry coordination? Circuit breakers? (minimal breaker sketch below)
- How do you prevent retry storms to downstream dependencies?
- Do LangGraph workflows gracefully handle API failures?<p>Curious what the production reality looks like.</p>
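<p>For concreteness, the breaker I have in mind is the textbook one (a minimal Python sketch, nothing framework-specific): after N consecutive failures, stop calling the dependency for a cooldown window instead of retrying into it.</p>
<pre><code>import time

class CircuitBreaker:
    """Open after `threshold` consecutive failures; probe again after `cooldown` seconds."""

    def __init__(self, threshold: int = 5, cooldown: float = 30.0):
        self.threshold, self.cooldown = threshold, cooldown
        self.failures, self.opened_at = 0, None

    def call(self, fn, *args, **kwargs):
        if self.opened_at is not None:
            if time.monotonic() - self.opened_at < self.cooldown:
                raise RuntimeError("circuit open: skipping call")
            self.opened_at = None          # half-open: let one probe through
        try:
            result = fn(*args, **kwargs)
        except Exception:
            self.failures += 1
            if self.failures >= self.threshold:
                self.opened_at = time.monotonic()   # (re)open the circuit
            raise
        self.failures = 0                  # success resets the streak
        return result
</code></pre>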
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47292281">https://news.ycombinator.com/item?id=47292281</a></p>
<p>Points: 1</p>
<p># Comments: 2</p>
]]></description><pubDate>Sat, 07 Mar 2026 22:55:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=47292281</link><dc:creator>rjpruitt16</dc:creator><comments>https://news.ycombinator.com/item?id=47292281</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47292281</guid></item><item><title><![CDATA[New comment by rjpruitt16 in "Ask HN: How do you handle API rate limits in production?"]]></title><description><![CDATA[
<p>interesting. What type of features did this enable. How was it maintaining redis. How many queues did you have.</p>
]]></description><pubDate>Wed, 25 Feb 2026 04:19:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=47147315</link><dc:creator>rjpruitt16</dc:creator><comments>https://news.ycombinator.com/item?id=47147315</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47147315</guid></item><item><title><![CDATA[Ask HN: How do you handle API rate limits in production?]]></title><description><![CDATA[
<p>I'm building data pipelines that sync data from various third-party APIs. We constantly hit 429 rate limits, and our janky retry system fails regularly.
For those running production data syncs or microservices calling external APIs heavily:<p>How do you handle rate limiting across multiple workers?
Do you use circuit breakers, retry libraries, or something custom?
How do you prevent retry storms when 100 workers all hit the same rate limit?<p>Curious what's working at scale.</p>
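<p>For reference, the client-side baseline I know is exponential backoff with full jitter, which decorrelates the 100 workers even though it still doesn't coordinate them. A minimal Python sketch:</p>
<pre><code>import random
import time

def full_jitter(attempt: int, base: float = 0.5, cap: float = 30.0) -> float:
    """Sleep for retry `attempt`: uniform in [0, min(cap, base * 2**attempt))."""
    return random.uniform(0, min(cap, base * (2 ** attempt)))

def call_with_retries(fn, max_attempts: int = 5):
    for attempt in range(max_attempts):
        response = fn()
        if response.status_code != 429:
            return response
        time.sleep(full_jitter(attempt))   # spread the workers apart in time
    return response                        # still rate limited after all attempts
</code></pre>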
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47131730">https://news.ycombinator.com/item?id=47131730</a></p>
<p>Points: 3</p>
<p># Comments: 2</p>
]]></description><pubDate>Tue, 24 Feb 2026 01:35:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=47131730</link><dc:creator>rjpruitt16</dc:creator><comments>https://news.ycombinator.com/item?id=47131730</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47131730</guid></item><item><title><![CDATA[Show HN: Stop Losing LangGraph Progress to 429 Errors]]></title><description><![CDATA[
<p>Hey HN, I built this because I kept losing progress in LangGraph workflows when OpenRouter or OpenAI returned 429s.
The problem: You're 7 steps into an agent workflow. Step 7 hits a rate limit. Everything crashes. Restart from step 1.
Client-side retries don't help at scale:<p>100 workers all retry independently → retry storm
Sequential fallbacks are slow (try OpenRouter, wait 5s, try Anthropic, wait 5s)
No coordination across instances<p>So I built a coordination layer that:<p>Races multiple providers simultaneously (OpenRouter + Anthropic + OpenAI)
Coordinates retries across all workers (no retry storms)
Resumes workflows via webhooks (idempotent keys = checkpoints; rough sketch below)<p>It runs on Fly.io's anycast network + BEAM for distributed coordination.
Architecture deep dive: <a href="https://www.ezthrottle.network/blog/making-failure-boring-again" rel="nofollow">https://www.ezthrottle.network/blog/making-failure-boring-ag...</a>
Happy to answer questions about the approach or why BEAM made this possible when other languages would struggle.</p>
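<p>The checkpoint idea is simpler than it sounds: each step gets a deterministic idempotency key, finished results are stored under that key, and a resumed run skips anything already done. A rough Python sketch of the idea (illustrative names, not the SDK):</p>
<pre><code># Sketch of idempotency-key checkpoints -- illustrative, not the actual SDK.
completed: dict[str, object] = {}    # in production this is durable storage

def run_step(workflow_id: str, step: int, fn):
    key = f"{workflow_id}:step-{step}"   # deterministic key = natural checkpoint
    if key in completed:
        return completed[key]            # resumed run: skip work already done
    result = fn()                        # may raise on a 429; a rerun resumes here
    completed[key] = result
    return result

def run_workflow(workflow_id: str, steps):
    return [run_step(workflow_id, i, fn) for i, fn in enumerate(steps)]
</code></pre>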
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47043197">https://news.ycombinator.com/item?id=47043197</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 17 Feb 2026 03:01:42 +0000</pubDate><link>https://www.ezthrottle.network/blog/stop-losing-langgraph-progress</link><dc:creator>rjpruitt16</dc:creator><comments>https://news.ycombinator.com/item?id=47043197</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47043197</guid></item><item><title><![CDATA[New comment by rjpruitt16 in "Ask HN: Is anyone losing sleep over retry storms or partial API outages?"]]></title><description><![CDATA[
<p>Fair pushback — to clarify, I’m not assuming incompetence or suggesting infra should paper over bad architecture.<p>By “losing sleep” I really mean on-call fatigue during partial outages — the class of incidents where backoff, shedding, and breakers exist, but retry amplification, shared rate limits, or degraded dependencies still cause noisy pages and prolonged recovery.<p>I’m trying to understand how teams coordinate retries and backpressure across many independent clients/services when refactors aren’t immediately available, not replace good architecture or take ownership of someone else’s system.<p>If you’ve seen patterns that consistently avoid that on-call pain at scale, I’d genuinely love to learn from them.</p>
]]></description><pubDate>Tue, 03 Feb 2026 15:32:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=46872248</link><dc:creator>rjpruitt16</dc:creator><comments>https://news.ycombinator.com/item?id=46872248</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46872248</guid></item><item><title><![CDATA[New comment by rjpruitt16 in "Ask HN: Is anyone losing sleep over retry storms or partial API outages?"]]></title><description><![CDATA[
<p>Agree backoff+jitter is table stakes, and load shedding/backpressure is necessary under sustained overload. The tricky cases I’m digging into are shared rate limits (429s) and many concurrent clients/agents where local backoff isn’t coordinated and you still get herds after partial outages. Curious what patterns you’ve seen work well for coordinating retries/fairness across tenants or API keys?</p>
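<p>One pattern I keep sketching is moving the retry budget out of the client entirely: a shared counter (Redis here, but anything with atomic increments works), so every worker sees the same remaining budget instead of guessing locally. A minimal fixed-window sketch in Python, assuming a Redis instance all workers can reach:</p>
<pre><code>import time
import redis

r = redis.Redis()

def acquire(api_key: str, limit: int = 100, window: int = 1) -> bool:
    """Shared fixed-window limiter: all workers draw from one budget per API key."""
    bucket = f"rl:{api_key}:{int(time.time()) // window}"
    count = r.incr(bucket)               # atomic across every worker/agent
    if count == 1:
        r.expire(bucket, window * 2)     # let old windows fall out of Redis
    return count <= limit

def guarded_call(api_key: str, fn):
    while not acquire(api_key):
        time.sleep(0.05)                 # wait for budget instead of hammering
    return fn()
</code></pre>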
]]></description><pubDate>Tue, 03 Feb 2026 15:17:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=46872032</link><dc:creator>rjpruitt16</dc:creator><comments>https://news.ycombinator.com/item?id=46872032</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46872032</guid></item><item><title><![CDATA[Ask HN: Is anyone losing sleep over retry storms or partial API outages?]]></title><description><![CDATA[
<p>I’m working on infrastructure to solve retry storms and outages. Before I go further, I want to understand what people are actually doing today, compare solutions, and maybe help someone see a potential approach.
The problems:<p>Retry storms - API fails, your entire fleet retries independently, thundering herd makes it worse.<p>Partial outages - API is “up” but degraded (slow, intermittent 500s). Health checks pass, requests suffer.<p>What I’m curious about:
 ∙ What’s your current solution? (circuit breakers, queues, custom coordination, service mesh, something else?)
 ∙ How well does it work? What are the gaps?
 ∙ What scale are you at? (company size, # of instances, requests/sec)<p>I’d love to hear what’s working, what isn’t, and what you wish existed.</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46866428">https://news.ycombinator.com/item?id=46866428</a></p>
<p>Points: 2</p>
<p># Comments: 4</p>
]]></description><pubDate>Tue, 03 Feb 2026 04:19:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=46866428</link><dc:creator>rjpruitt16</dc:creator><comments>https://news.ycombinator.com/item?id=46866428</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46866428</guid></item><item><title><![CDATA[Ask HN: Are retries the wrong abstraction under rate limits?]]></title><description><![CDATA[
<p>Over the last few years, I’ve watched a lot of production systems fail in ways that feel… strangely predictable.<p>When services hit 429s or timeouts, the standard response is almost always the same: retries with backoff, sleep loops, jitter, etc. This is treated as a best practice across languages and platforms.<p>But in systems with high concurrency, fan-out, or shared downstream dependencies, retries often seem to amplify load instead of smoothing it. What starts as localized failure can turn into retry storms, thundering herds, and cascading outages.<p>It’s made me wonder whether retries are solving the wrong problem at the wrong layer — treating a coordination issue as an application-level error-handling concern.<p>I wrote up a longer piece exploring this idea and arguing for making failure boring again by handling it at a different layer:
https://www.ezthrottle.network/blog/making-failure-boring-again<p>Curious how this matches others’ experience:<p>Have retries actually improved stability for you under sustained rate limiting?<p>Have you seen cases where they clearly made things worse?<p>If retries aren’t the right abstraction, what is?<p>Interested in war stories, counterexamples, and alternative approaches.</p>
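<p>To put a number on "amplify": if a dependency goes hard down and each of N clients retries k times, the offered load during the outage is (k+1)·N, at exactly the moment the dependency can serve the least. A toy illustration in Python:</p>
<pre><code>clients, retries_per_client = 1000, 3

normal_load = clients                               # 1,000 requests in a window
outage_load = clients * (1 + retries_per_client)    # 4,000 requests in that window

print(f"load multiplier during outage: {outage_load / normal_load:.0f}x")
# -> load multiplier during outage: 4x; with synchronized backoff those
#    4,000 requests also arrive in bursts rather than spread out.
</code></pre>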
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46841137">https://news.ycombinator.com/item?id=46841137</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Sat, 31 Jan 2026 21:39:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=46841137</link><dc:creator>rjpruitt16</dc:creator><comments>https://news.ycombinator.com/item?id=46841137</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46841137</guid></item><item><title><![CDATA[Show HN: EZThrottle – Coordinated retries and region racing for APIs Gleam/BEAM]]></title><description><![CDATA[
<p>Hi HN,<p>I built EZThrottle because I was tired of copy-pasting exponential backoff into every codebase.<p>The core idea: 
retries shouldn't be independent. When thousands of machines all hit a 429 and retry independently, you get retry storms that cascade into outages. EZThrottle coordinates failure in one place.<p>What it does:
  - Rate limiting per destination (default 2 RPS – conservative on purpose)
  - Region racing: send to multiple regions, accept the first success, cancel the rest (rough sketch at the end of this post). If one
region goes down, your request still completes with just a latency bump instead of a full outage
  - Adds event-driven architecture to your stack via webhooks – fire and forget, get results delivered<p>Current scale (setting expectations):
  Running 4 machines across 2 US regions (Dallas + Washington DC) on Fly.io. Not massive yet –
  we'll expand to more regions with demand. Early days.<p>SDKs:
  - Python: <a href="https://github.com/rjpruitt16/ezthrottle-python" rel="nofollow">https://github.com/rjpruitt16/ezthrottle-python</a>
  - Node: <a href="https://github.com/rjpruitt16/ezthrottle-node" rel="nofollow">https://github.com/rjpruitt16/ezthrottle-node</a>
  - Go: <a href="https://github.com/rjpruitt16/ezthrottle-go" rel="nofollow">https://github.com/rjpruitt16/ezthrottle-go</a><p>Written in Gleam, runs on the BEAM. This is part of a larger vision (L8-OS – local-first AI stack), but EZThrottle works standalone.<p>Blog post with architecture diagrams:
<a href="https://ezthrottle.network/blog/making-failure-boring-again" rel="nofollow">https://ezthrottle.network/blog/making-failure-boring-again</a><p>Free tier: 
1M requests/month. Happy to answer questions.<p>@RahmiPruitt on Twitter</p>
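<p>As promised above, the region-racing core is small enough to sketch (Python here for readability; the service itself is Gleam on the BEAM, and the names are illustrative):</p>
<pre><code>import asyncio

async def race_regions(send, request, regions):
    """Send to every region at once; return the first success, cancel the rest."""
    tasks = [asyncio.create_task(send(region, request)) for region in regions]
    last_error = None
    try:
        for future in asyncio.as_completed(tasks):
            try:
                response = await future
                if response.ok:          # first healthy region wins
                    return response
                last_error = response
            except Exception as exc:     # one region down shouldn't kill the race
                last_error = exc
        raise RuntimeError(f"all regions failed: {last_error!r}")
    finally:
        for t in tasks:
            t.cancel()                   # cancel the losers; don't waste work
</code></pre>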
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46804549">https://news.ycombinator.com/item?id=46804549</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Thu, 29 Jan 2026 01:36:35 +0000</pubDate><link>https://www.ezthrottle.network/</link><dc:creator>rjpruitt16</dc:creator><comments>https://news.ycombinator.com/item?id=46804549</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46804549</guid></item><item><title><![CDATA[New comment by rjpruitt16 in "Glixir: A safe(ish) OTP library for gleam"]]></title><description><![CDATA[
<p>I hear a lot of people complain that Gleam is “not OTP.” They’re right. At the time of writing, Gleam still doesn’t have a registry or pubsub. If you ask the Gleam team, they’ll tell you to use external calls — which works, but is a pain to reproduce consistently across projects.<p>So I wrote Glixir, a library that wraps common OTP patterns in Gleam using Elixir’s battle-tested OTP library. It’s not as type-safe as Gleam, but I needed something production-ready today, not next year.<p>This lets you:<p>Use supervisors, GenServers, and pubsub patterns from Gleam.<p>Stay in Gleam where it makes sense, but drop down to Elixir OTP when you need reliability.<p>Ship code faster without reinventing OTP on the Gleam side.<p>Obviously there are trade-offs — you give up some type safety, and it’s not “pure Gleam.” But if you’re trying to build something real, you may find this a useful middle ground.<p>It's MIT licensed, so if you disagree with the direction you can fork it and do whatever you want.</p>
]]></description><pubDate>Tue, 16 Sep 2025 21:24:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=45268276</link><dc:creator>rjpruitt16</dc:creator><comments>https://news.ycombinator.com/item?id=45268276</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45268276</guid></item></channel></rss>