<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: robrenaud</title><link>https://news.ycombinator.com/user?id=robrenaud</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Tue, 07 Apr 2026 22:27:02 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=robrenaud" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by robrenaud in "TinyLoRA – Learning to Reason in 13 Parameters"]]></title><description><![CDATA[
<p>Yeah, my big problem with the paper is that the result might just be an artifact of Qwen's training process.</p>
]]></description><pubDate>Wed, 01 Apr 2026 05:57:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=47597327</link><dc:creator>robrenaud</dc:creator><comments>https://news.ycombinator.com/item?id=47597327</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47597327</guid></item><item><title><![CDATA[New comment by robrenaud in "Yann LeCun's AI startup raises $1B in Europe's largest ever seed round"]]></title><description><![CDATA[
<p>Was AlphaGo's move 37 original?<p>In the last step of training LLMs, reinforcement learning from verifiable rewards, LLMs are trained to maximize the probability of solving problems using their own output, driven by a reward signal akin to winning in Go. It's not just imitating human-written text.<p>Fwiw, I agree that world models, and some kind of learning from interacting with physical reality rather than massive amounts of digitized gym environments, are likely necessary for a breakthrough toward AGI.</p>
]]></description><pubDate>Tue, 10 Mar 2026 15:36:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=47324708</link><dc:creator>robrenaud</dc:creator><comments>https://news.ycombinator.com/item?id=47324708</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47324708</guid></item><item><title><![CDATA[New comment by robrenaud in "Yann LeCun raises $1B to build AI that understands the physical world"]]></title><description><![CDATA[
<p>Recursive self-improvement: it's when AI speeds up the development of the next AI.</p>
]]></description><pubDate>Tue, 10 Mar 2026 15:24:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=47324548</link><dc:creator>robrenaud</dc:creator><comments>https://news.ycombinator.com/item?id=47324548</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47324548</guid></item><item><title><![CDATA[New comment by robrenaud in "Ask HN: Who wants to be hired? (March 2026)"]]></title><description><![CDATA[
<p>Location: SF (current). NYC/Philly general area acceptable. Remote okay.
email: rrenaud@gmail.com
Resume: 16-year SWE -> MLE @ Google, MS from NYU with a focus on ML. Retired. Now I hack on data analysis for video game projects for fun, and I love it. I'd take crazy low compensation to work with interesting game data sets, e.g., for game balance, strategic analysis, or to improve/augment game video content.</p>
]]></description><pubDate>Tue, 03 Mar 2026 20:13:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=47238259</link><dc:creator>robrenaud</dc:creator><comments>https://news.ycombinator.com/item?id=47238259</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47238259</guid></item><item><title><![CDATA[New comment by robrenaud in "Ask HN: Who is hiring? (March 2026)"]]></title><description><![CDATA[
<p>What do y'all think about the latency/quality tradeoff with LLMs?<p>Humans don't take 30 seconds to think, retrieve, research, and summarize a high-quality answer. Humans are calibrated in their knowledge: they know what they understand and what they don't. They can converse in real time without bullshitting.<p>Frontier real-time-ish LLM-generated voice systems are still plagued by 2024-era LLM nonsense, like the inability to count Rs in strawberry. [1]<p>I'd personally love a voice interface that, constrained by the technology of today, takes the latency hit to deliver quality.<p>[1] <a href="https://www.instagram.com/reel/DTYBpa7AHSJ/?igsh=MzRlODBiNWFlZA==" rel="nofollow">https://www.instagram.com/reel/DTYBpa7AHSJ/?igsh=MzRlODBiNWF...</a></p>
]]></description><pubDate>Mon, 02 Mar 2026 17:17:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=47220912</link><dc:creator>robrenaud</dc:creator><comments>https://news.ycombinator.com/item?id=47220912</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47220912</guid></item><item><title><![CDATA[Ask HN: Is there something like Google style guide for AI-only coded apps?]]></title><description><![CDATA[
<p>The Google style guides for C++/Java/Python are opinionated, hard-fought, and wise; they eliminate a large source of errors while minimizing harmful, unneeded inconsistencies. They picked a style well matched to the best tooling and human cognition of the time.<p>The intent is still great, but now we should think about writing good general rules for building programs that are essentially all AI-generated. What generic wisdom leads to flexible, auditable, composable, and robust apps and systems?</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47204331">https://news.ycombinator.com/item?id=47204331</a></p>
<p>Points: 1</p>
<p># Comments: 2</p>
]]></description><pubDate>Sun, 01 Mar 2026 06:51:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=47204331</link><dc:creator>robrenaud</dc:creator><comments>https://news.ycombinator.com/item?id=47204331</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47204331</guid></item><item><title><![CDATA[New comment by robrenaud in "How an inference provider can prove they're not serving a quantized model"]]></title><description><![CDATA[
<p>Please serve well-quantized models.<p>If you can get 99 percent of the quality for 50 percent of the cost, that is usually a good tradeoff.</p>
]]></description><pubDate>Sat, 21 Feb 2026 22:36:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=47105602</link><dc:creator>robrenaud</dc:creator><comments>https://news.ycombinator.com/item?id=47105602</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47105602</guid></item><item><title><![CDATA[New comment by robrenaud in "Consistency diffusion language models: Up to 14x faster, no quality loss"]]></title><description><![CDATA[
<p>Cite a source. Your concrete claim is that, on average, for every $1 of monthly subscription revenue, OpenAI and Anthropic were losing $11.50?<p>That seems completely implausible.<p>I could believe that a $20 sub that used every possible token granted might cost $250 to serve, which is roughly what the 11.5:1 ratio implies ($20 in, about $230 lost). But certainly almost no one was completely milking their subscription, in the same way that no one streams Netflix literally 24/7.</p>
]]></description><pubDate>Fri, 20 Feb 2026 18:22:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=47091699</link><dc:creator>robrenaud</dc:creator><comments>https://news.ycombinator.com/item?id=47091699</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47091699</guid></item><item><title><![CDATA[New comment by robrenaud in "Ask HN: What are you working on? (February 2026)"]]></title><description><![CDATA[
<p>I used to play very competitively, but I've been more chill recently.  I just think it's a nice problem/dataset to work with, because of the depth of my understanding of the game.</p>
]]></description><pubDate>Mon, 09 Feb 2026 06:09:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=46942071</link><dc:creator>robrenaud</dc:creator><comments>https://news.ycombinator.com/item?id=46942071</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46942071</guid></item><item><title><![CDATA[New comment by robrenaud in "Ask HN: What are you working on? (February 2026)"]]></title><description><![CDATA[
<p>I’ve been experimenting with a live win probability predictor for the 10-player arcade game Killer Queen. The goal is to predict the winner in a causal, event-by-event fashion.<p>Right now I’m struggling to beat a baseline LightGBM model trained on hand-engineered expert features. My attempts at using a win probability head on top of nanoGPT, treating events as tokens, have been significantly worse. I am seeing about 65% accuracy compared to the LightGBM’s 70%. That 5-point gap is huge given how stochastic the early game is, and the Transformer is easily 4 OOM more expensive to train.<p>To bridge the gap, I’m moving to a hybrid approach. I’m feeding those expert features back in as additional tokens or auxiliary loss heads, and I am using the LightGBM model as a teacher for knowledge distillation to provide smoother gradients.<p>The main priority here is personalized post-game feedback. By tracking sharp swings in win probability (ΔWP), you can automatically generate highlight or lowlight reels right after a match. It helps players see the exact moment a play was either effective or catastrophic.<p>There is also a clear application for automated content creation. You can use ΔWP as a heuristic to identify the actual turning points of a match for YouTube summaries without needing to manually scrub through hours of Twitch footage.</p>
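A minimal sketch of the distillation idea, assuming a binary win/loss label and a teacher that outputs win probabilities. All names here are illustrative, not from the actual project:

```python
import numpy as np

def distill_loss(student_logits, hard_labels, teacher_probs, alpha=0.5):
    """Blend the usual win/loss cross-entropy with a term that pulls
    the student toward the (smoother) teacher win probabilities.
    alpha=0 is pure hard-label training; alpha=1 is pure distillation."""
    eps = 1e-7
    # Student win probability via a sigmoid over its logits.
    p = 1.0 / (1.0 + np.exp(-np.asarray(student_logits, dtype=float)))
    p = np.clip(p, eps, 1 - eps)
    # Binary cross-entropy against the 0/1 outcome...
    hard = -(hard_labels * np.log(p) + (1 - hard_labels) * np.log(1 - p))
    # ...and against the teacher's soft probabilities.
    soft = -(teacher_probs * np.log(p) + (1 - teacher_probs) * np.log(1 - p))
    return float(np.mean((1 - alpha) * hard + alpha * soft))
```

The soft term is what smooths the gradient early in a match, where the true outcome is nearly uninformative but the teacher's probability is well calibrated.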
]]></description><pubDate>Mon, 09 Feb 2026 03:57:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=46941412</link><dc:creator>robrenaud</dc:creator><comments>https://news.ycombinator.com/item?id=46941412</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46941412</guid></item><item><title><![CDATA[New comment by robrenaud in "LLMs as the new high level language"]]></title><description><![CDATA[
<p>A compiler that can turn cash into improved code without round-tripping a human is very cool, though. As those steps get longer and succeed more often in more difficult circumstances, what it means to be a software engineer changes a lot.</p>
]]></description><pubDate>Sun, 08 Feb 2026 04:05:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=46931241</link><dc:creator>robrenaud</dc:creator><comments>https://news.ycombinator.com/item?id=46931241</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46931241</guid></item><item><title><![CDATA[New comment by robrenaud in "Ask HN: What are you working on? (January 2026)"]]></title><description><![CDATA[
<p>Previously, I made a live win probability model for the 5v5 arcade game Killer Queen Arcade from their game events API.<p>Now I am trying to use that model to make:<p>1. A post-game instant replay that shows the most important/pivotal moments from the most recently finished game. Some arcades have a separate display for observers; it could work well there, or as good filler between matches on Twitch streams.<p>2. A personalized per-tournament/yearly highlights recap.<p>If it works well, it might be the kind of tool that generalizes well to summarizing long Twitch streams for YouTube.<p><a href="https://github.com/rrenaud/kq_stream_highlights" rel="nofollow">https://github.com/rrenaud/kq_stream_highlights</a></p>
]]></description><pubDate>Sun, 11 Jan 2026 23:37:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=46581736</link><dc:creator>robrenaud</dc:creator><comments>https://news.ycombinator.com/item?id=46581736</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46581736</guid></item><item><title><![CDATA[New comment by robrenaud in "ChatGPT Health"]]></title><description><![CDATA[
<p>Here is research about doctors interpreting test results. It seems to favor GP's view that many doctors struggle to weigh test specificity and sensitivity vs disease base rate.<p><a href="https://bmjopen.bmj.com/content/bmjopen/5/7/e008155.full.pdf" rel="nofollow">https://bmjopen.bmj.com/content/bmjopen/5/7/e008155.full.pdf</a></p>
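The base-rate effect is easy to make concrete. A sketch with illustrative numbers (90% sensitivity, 90% specificity, 1% prevalence; none of these are taken from the linked study):

```python
def positive_predictive_value(sensitivity, specificity, base_rate):
    """P(disease | positive test), via Bayes' rule."""
    true_pos = sensitivity * base_rate
    false_pos = (1.0 - specificity) * (1.0 - base_rate)
    return true_pos / (true_pos + false_pos)

# With a 1% base rate, even a 90%-sensitive, 90%-specific test
# leaves only ~8% odds that a positive result means disease.
print(round(positive_predictive_value(0.9, 0.9, 0.01), 3))  # 0.083
```

The common intuition is to answer "about 90%", which is off by an order of magnitude at low prevalence.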
]]></description><pubDate>Thu, 08 Jan 2026 17:12:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=46543539</link><dc:creator>robrenaud</dc:creator><comments>https://news.ycombinator.com/item?id=46543539</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46543539</guid></item><item><title><![CDATA[New comment by robrenaud in "AI sycophancy panic"]]></title><description><![CDATA[
<p>I suspect the models would be more useful but perhaps less popular if the semantic content of their answers depended less on the expectations of the prompter.</p>
]]></description><pubDate>Sun, 04 Jan 2026 17:24:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=46490066</link><dc:creator>robrenaud</dc:creator><comments>https://news.ycombinator.com/item?id=46490066</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46490066</guid></item><item><title><![CDATA[New comment by robrenaud in "Using Vectorize to build an unreasonably good search engine in 160 lines of code"]]></title><description><![CDATA[
<p>What are you embedding? And are you restricting to a geographic area (a small universe)?</p>
]]></description><pubDate>Thu, 25 Dec 2025 08:24:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=46382989</link><dc:creator>robrenaud</dc:creator><comments>https://news.ycombinator.com/item?id=46382989</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46382989</guid></item><item><title><![CDATA[New comment by robrenaud in "Ryanair fined €256M over ‘abusive strategy’ to limit ticket sales by OTAs"]]></title><description><![CDATA[
<p>Paying in the local currency with your own cards seems simple and works?</p>
]]></description><pubDate>Tue, 23 Dec 2025 15:57:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=46366342</link><dc:creator>robrenaud</dc:creator><comments>https://news.ycombinator.com/item?id=46366342</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46366342</guid></item><item><title><![CDATA[New comment by robrenaud in "Ask HN: Why Did Python Win?"]]></title><description><![CDATA[
<p>I learned Python circa 2000 as a 17 year old.<p>It felt easy to read and write, had minimal surprises, and made writing simple programs easy. The batteries-includedness was great.<p>It felt like it was designed by a smart guy for practical programming, rather than by a brilliant academic wed to purity and maximum elegance. It accepted some warts, but put them in mostly ergonomic places.<p>It was shaped by people willing to relegate reduce from a builtin to the functools library (and who seriously considered doing the same to map and filter). As much as I personally liked map (I am not super anti functional programming), it's nice that the designers had the taste to prefer longer but more explicit programs.<p><pre><code>    ys = [f(x) for x in xs]
    ys = list(map(f, xs))
</code></pre>
As much as I disliked that particular decision, there is no doubt in my mind that the language is designed for lower cognitive burden when doing simple/common things.<p>If you show both of those lines of code to a person in CS101 who has studied Java for a couple of months but hasn't seen Python, I am pretty sure they are gonna understand the first line way quicker. Mostly consistent decision making like that leads to ergonomic, practical languages. And they win.<p>It fits in people's brains better. See also: PyTorch vs TensorFlow.</p>
]]></description><pubDate>Mon, 22 Dec 2025 16:28:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=46355503</link><dc:creator>robrenaud</dc:creator><comments>https://news.ycombinator.com/item?id=46355503</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46355503</guid></item><item><title><![CDATA[New comment by robrenaud in "Gemini 3 Flash: Frontier intelligence built for speed"]]></title><description><![CDATA[
<p>Omg, it was so frustrating to say:<p>Summarize [a recent, working arxiv URL]<p>and then have it tell me the date is in the future and simply refuse to fetch the URL.</p>
]]></description><pubDate>Wed, 17 Dec 2025 17:14:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=46302367</link><dc:creator>robrenaud</dc:creator><comments>https://news.ycombinator.com/item?id=46302367</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46302367</guid></item><item><title><![CDATA[New comment by robrenaud in "Erdős Problem #1026"]]></title><description><![CDATA[
<p>What is harder, beating Lee Sedol at Go, or physically placing stones on a Go board? Which is closer to AGI?<p>Because AlphaGo can only do one.<p>AI could very well be better at formal theorem proving than Fields medalists pretty soon. It will not have taste, the ability to see the beauty in math, or the ability to pick problems and set directions for the field. But given a well-specified problem, it can brute-force search through Lean tactic space at an extremely superhuman pace. What it lacks in intuition and brilliance, it will make up for by being able to explore in parallel.<p>There is a quality/quantity tradeoff in search with a verifier. A superhuman artificial theorem prover can be generating much worse ideas on average than a top mathematician, and make up for it by trying many more of them.<p>It's Kasparov vs Deep Blue and Sedol vs AlphaGo all over again.<p>It's also nowhere near AGI. Embodiment and the real world are super messy. See Moravec's paradox.<p>Practical programs deal with the outside world; they are underspecified, and their utility depends on the changing whims of people. The formal specification of a math problem is self-contained.</p>
]]></description><pubDate>Tue, 16 Dec 2025 21:35:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=46294856</link><dc:creator>robrenaud</dc:creator><comments>https://news.ycombinator.com/item?id=46294856</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46294856</guid></item><item><title><![CDATA[New comment by robrenaud in "Erdős Problem #1026"]]></title><description><![CDATA[
<p>I think you underestimate how powerful Lean is, and how close it is to handling the tedious part of formal math. A theorem prover needs to consult no outside resource. A formal-math LLM-like generator need only consult the theorem prover to get rid of hallucinations. This is why it's actually much easier than SWE to optimize/hill climb on.<p>Low-level, automated theorem proving is going to fall way quicker than most expected, like AlphaGo, precisely because an MCTS++ search over Lean proofs is scalable, amenable to self-play, and relevant to a significant chunk of professional math.<p>Legit, I almost wish the US and China would sign a Formal Mathematics Proliferation Treaty, as a sign of good will between very powerful parties who have much to gain from each other. When your theorem prover is sufficiently better than most Fields medalists alive, you share your arch/algorithms/process with the world. That way mathematics stays in the shared realm of human culture, and doesn't just happen to belong to DeepMind, OpenAI, or DeepSeek.</p>
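The verifier-only loop described above bottoms out in checks like this one. A trivial, purely illustrative Lean 4 example: the kernel either accepts the proof term or rejects it, and that binary verdict is the entire reward signal the search needs.

```lean
-- Lean 4: a generator proposes a proof term; the kernel checks it.
-- Acceptance or rejection is the only feedback required -- no oracle,
-- no outside resource, which is what makes the search hill-climbable.
theorem add_comm_example (a b : Nat) : a + b = b + a :=
  Nat.add_comm a b
```

A search over tactic or term space just needs to maximize the rate at which proposals pass this check.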
]]></description><pubDate>Tue, 16 Dec 2025 07:52:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=46285890</link><dc:creator>robrenaud</dc:creator><comments>https://news.ycombinator.com/item?id=46285890</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46285890</guid></item></channel></rss>