<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: Juvination</title><link>https://news.ycombinator.com/user?id=Juvination</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Fri, 08 May 2026 13:45:46 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=Juvination" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by Juvination in "DeepSeek 4 Flash local inference engine for Metal"]]></title><description><![CDATA[
<p>I loved scrolling through this, thinking: yeah, of course Python is slower than Java; oh wow, Rust is pretty much on par, I wonder what the Java devs did. Then you hit the asm entry and your jaw drops.</p>
]]></description><pubDate>Thu, 07 May 2026 20:46:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=48054716</link><dc:creator>Juvination</dc:creator><comments>https://news.ycombinator.com/item?id=48054716</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48054716</guid></item><item><title><![CDATA[New comment by Juvination in "Nano-vLLM: How a vLLM-style inference engine works"]]></title><description><![CDATA[
<p>Nobody ever said they were nonexistent before LLMs. When you're investigating whether something is AI-generated, they are the number one indicator.<p>So if you're being accused of just spewing AI output, you double down and spew something that looks EVEN MORE like AI? What are you even doing?</p>
]]></description><pubDate>Mon, 02 Feb 2026 16:22:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=46857819</link><dc:creator>Juvination</dc:creator><comments>https://news.ycombinator.com/item?id=46857819</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46857819</guid></item><item><title><![CDATA[New comment by Juvination in "Nano-vLLM: How a vLLM-style inference engine works"]]></title><description><![CDATA[
<p>The em dashes really aren't helping their case.</p>
]]></description><pubDate>Mon, 02 Feb 2026 15:51:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=46857365</link><dc:creator>Juvination</dc:creator><comments>https://news.ycombinator.com/item?id=46857365</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46857365</guid></item><item><title><![CDATA[New comment by Juvination in "A Brief History of Ralph"]]></title><description><![CDATA[
<p>I've been working with the "Ralphosophy" for iterative behavior in my workflow, and it seems pretty promising for cutting out a few manual steps.<p>I still have one manual part, which is breaking the design document down into multiple small gh issues after a review, but I think that's fine for now.<p>Using codex exec, we start working on a GitHub issue with a supplied design document, creating a PR on completion. Then we run a review using a made-up review skill, which is effectively just a "cite your sources" skill applied to the review, along with a list of Open Questions.<p>Then we iterate through the open questions, doing a minimum of 3 reviews (somewhat arbitrary, but multiple reviews sometimes catch things). Finally, I have a step that checks SonarCloud, fixes the reported issues, and pushes the changes. Realistically this step should be broken out into multiple iterations to avoid heavy context rot.<p>What I miss the most is output: seeing what's going on in either Codex or Claude in real time. I can print the last response, but it just gets messy until I build something a bit more formal.</p>
]]></description><pubDate>Mon, 19 Jan 2026 18:25:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=46682636</link><dc:creator>Juvination</dc:creator><comments>https://news.ycombinator.com/item?id=46682636</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46682636</guid></item></channel></rss>