<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: deoxykev</title><link>https://news.ycombinator.com/user?id=deoxykev</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Wed, 08 Apr 2026 00:18:51 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=deoxykev" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[Solving Automata Cam Profiling with Grasshopper]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.youtube.com/watch?v=grgIhw1YbHw">https://www.youtube.com/watch?v=grgIhw1YbHw</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46969632">https://news.ycombinator.com/item?id=46969632</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 11 Feb 2026 01:31:27 +0000</pubDate><link>https://www.youtube.com/watch?v=grgIhw1YbHw</link><dc:creator>deoxykev</dc:creator><comments>https://news.ycombinator.com/item?id=46969632</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46969632</guid></item><item><title><![CDATA[New comment by deoxykev in "Iowa City made its buses free. Traffic cleared, and so did the air"]]></title><description><![CDATA[
<p>I live in that city. There are hardly any homeless people here, at least compared to other cities. I could see it being a major problem in other places.</p>
]]></description><pubDate>Mon, 24 Nov 2025 00:25:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=46028887</link><dc:creator>deoxykev</dc:creator><comments>https://news.ycombinator.com/item?id=46028887</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46028887</guid></item><item><title><![CDATA[New comment by deoxykev in "Images over DNS"]]></title><description><![CDATA[
<p>How about LLM chat over DNS? <a href="https://github.com/accupham/llm-dns-proxy" rel="nofollow">https://github.com/accupham/llm-dns-proxy</a></p>
]]></description><pubDate>Sat, 20 Sep 2025 17:02:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=45315093</link><dc:creator>deoxykev</dc:creator><comments>https://news.ycombinator.com/item?id=45315093</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45315093</guid></item><item><title><![CDATA[New comment by deoxykev in "Images over DNS"]]></title><description><![CDATA[
<p>And it typically works on captive portals too before payment.</p>
]]></description><pubDate>Sat, 20 Sep 2025 16:59:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=45315070</link><dc:creator>deoxykev</dc:creator><comments>https://news.ycombinator.com/item?id=45315070</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45315070</guid></item><item><title><![CDATA[New comment by deoxykev in "Perfume reviews"]]></title><description><![CDATA[
<p>Meta-commentary always leans nerdier.</p>
]]></description><pubDate>Fri, 18 Jul 2025 18:52:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=44608419</link><dc:creator>deoxykev</dc:creator><comments>https://news.ycombinator.com/item?id=44608419</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44608419</guid></item><item><title><![CDATA[New comment by deoxykev in "It's time to become an ML engineer (2022)"]]></title><description><![CDATA[
<p>Curious to hear what kind of work you do. Because there are definitely fields where productivity has 10x'd because of AI tools.</p>
]]></description><pubDate>Thu, 20 Feb 2025 14:47:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=43115308</link><dc:creator>deoxykev</dc:creator><comments>https://news.ycombinator.com/item?id=43115308</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43115308</guid></item><item><title><![CDATA[New comment by deoxykev in "Web awesome: "Shoelace 3.0" open source web components"]]></title><description><![CDATA[
<p>HTMX and Shoelace are an awesome combo. Super fast to prototype things and tweak as needed. Being able to copy-paste snippets and directly inject data in a straightforward way is a nice way of working. It limits cognitive overhead so you can focus on the domain logic rather than fighting JavaScript dependencies.</p>
]]></description><pubDate>Tue, 18 Feb 2025 18:35:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=43093373</link><dc:creator>deoxykev</dc:creator><comments>https://news.ycombinator.com/item?id=43093373</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43093373</guid></item><item><title><![CDATA[New comment by deoxykev in "DeepRAG: Thinking to retrieval step by step for large language models"]]></title><description><![CDATA[
<p>Don't forget to finetune the reranker too if you end up doing the embedding model. That tends to have outsized effects on performance for out-of-distribution content.</p>
]]></description><pubDate>Tue, 04 Feb 2025 18:55:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=42936906</link><dc:creator>deoxykev</dc:creator><comments>https://news.ycombinator.com/item?id=42936906</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42936906</guid></item><item><title><![CDATA[New comment by deoxykev in "Show HN: Klarity – Open-source tool to analyze uncertainty/entropy in LLM output"]]></title><description><![CDATA[
<p>Interesting, I had never heard about min-p until now. From what I understand, it's like a low-pass filter for the token sampling pool which boosts semantic coherence. Like removing static from the radio.<p>Do you have any benchmarks of min-p sampling with the new reasoning models, such as QwQ and R1?</p>
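<p>A minimal sketch of the min-p idea described above (an illustrative standalone function, not any particular library's implementation): keep only tokens whose probability is at least min_p times the top token's probability, then renormalize, so low-probability "static" is filtered while the shape of the surviving distribution is kept.</p>

```python
import math

def min_p_filter(logprobs, min_p=0.1):
    """Apply min-p filtering to a list of token logprobs.

    Tokens with probability below min_p * max_prob are zeroed out,
    and the remaining probabilities are renormalized to sum to 1.
    """
    probs = [math.exp(lp) for lp in logprobs]
    threshold = min_p * max(probs)
    kept = [p if p >= threshold else 0.0 for p in probs]
    total = sum(kept)
    return [p / total for p in kept]
```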
]]></description><pubDate>Mon, 03 Feb 2025 20:01:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=42922200</link><dc:creator>deoxykev</dc:creator><comments>https://news.ycombinator.com/item?id=42922200</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42922200</guid></item><item><title><![CDATA[New comment by deoxykev in "How to Run DeepSeek R1 671B Locally on a $2000 EPYC Server"]]></title><description><![CDATA[
<p>Yeah, there is a clear bottleneck somewhere in llama.cpp. Even high end hardware is struggling to get good numbers. The theoretical limit should be higher, but it's not there yet.<p>Benchmarks:
<a href="https://github.com/ggerganov/llama.cpp/issues/11474#issuecomment-2629590835">https://github.com/ggerganov/llama.cpp/issues/11474#issuecom...</a></p>
]]></description><pubDate>Mon, 03 Feb 2025 17:45:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=42920777</link><dc:creator>deoxykev</dc:creator><comments>https://news.ycombinator.com/item?id=42920777</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42920777</guid></item><item><title><![CDATA[New comment by deoxykev in "Efficient Reasoning with Hidden Thinking"]]></title><description><![CDATA[
<p>I don't think autoregressive models have a fundamental difference in terms of reasoning capability in latent space vs token space. Latent space enables abstract reasoning and pattern recognition, while token space acts as both the discrete interface for communication and as an interaction medium to extend, refine, and synthesize higher-order reasoning over latent space.<p>Intuitively speaking, most people think of writing as a communication tool. But actually it's also a thinking tool that helps create deeper connections over discrete thoughts, which can only occupy a fixed slice of our attention at any given time. Attentional capacity is the primary limitation-- for humans and LLMs. So use the token space as extended working memory. Besides, even the Coconut paper got mediocre results. I don't think this is the way.</p>
]]></description><pubDate>Mon, 03 Feb 2025 17:43:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=42920748</link><dc:creator>deoxykev</dc:creator><comments>https://news.ycombinator.com/item?id=42920748</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42920748</guid></item><item><title><![CDATA[New comment by deoxykev in "Show HN: Klarity – Open-source tool to analyze uncertainty/entropy in LLM output"]]></title><description><![CDATA[
<p>The fundamental challenge of using log probabilities to measure LLM certainty is the mismatch between how language models process information and how semantic meaning actually works. The current models analyze text token by token-- fragments that don't necessarily align with complete words, let alone complex concepts or ideas.<p>This creates a gap between the mechanical measurement of certainty and true understanding, much like mistaking the map for the territory or confusing the finger pointing at the moon with the moon itself.<p>I've done some work before in this space, trying to come up with different useful measures from the logprobs, such as measuring Shannon entropy over a sliding window, or even bzip compression ratio as a proxy for information density. But I didn't find anything semantically useful or reliable to exploit.<p>The best approach I found was just multiple choice questions: "Does X entail Y? Please output [A] True or [B] False." Then measure the logprobs of the next token, which should be `[A` (90%) or `[B` (10%). Then we might make a statement like: the LLM thinks there is a 90% probability that X entails Y.</p>
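<p>The multiple-choice trick above can be sketched as follows (a hypothetical helper, not tied to any specific inference API): take the logprobs the model assigned to just the candidate answer tokens, renormalize over those choices, and read off the implied probability of each answer.</p>

```python
import math

def choice_probability(token_logprobs):
    """Convert answer-token logprobs into a probability per choice.

    token_logprobs maps each answer token (e.g. "[A", "[B") to the
    logprob the model assigned it; the result renormalizes over only
    those choices, ignoring the rest of the vocabulary.
    """
    probs = {tok: math.exp(lp) for tok, lp in token_logprobs.items()}
    total = sum(probs.values())
    return {tok: p / total for tok, p in probs.items()}
```

<p>With logprobs corresponding to 90%/10% mass on the two tokens, this yields the "90% probability that X entails Y" reading from the comment.</p>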
]]></description><pubDate>Mon, 03 Feb 2025 17:34:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=42920656</link><dc:creator>deoxykev</dc:creator><comments>https://news.ycombinator.com/item?id=42920656</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42920656</guid></item><item><title><![CDATA[New comment by deoxykev in "DeepSeek gives Europe's tech firms a chance to catch up"]]></title><description><![CDATA[
<p>My take: the distills under 32B aren’t worth running. Quants seem to impact quality much more than other models. 32B and 70B unquantized are very good. 671B is SOTA.</p>
]]></description><pubDate>Mon, 03 Feb 2025 11:38:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=42917208</link><dc:creator>deoxykev</dc:creator><comments>https://news.ycombinator.com/item?id=42917208</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42917208</guid></item><item><title><![CDATA[New comment by deoxykev in "How to Run DeepSeek R1 671B Locally on a $2000 EPYC Server"]]></title><description><![CDATA[
<p>8x 3090 will net you around 10-12 tok/s</p>
]]></description><pubDate>Sat, 01 Feb 2025 16:16:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=42899469</link><dc:creator>deoxykev</dc:creator><comments>https://news.ycombinator.com/item?id=42899469</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42899469</guid></item><item><title><![CDATA[New comment by deoxykev in "Show HN: Flow – A dynamic task engine for building AI agents"]]></title><description><![CDATA[
<p>Have you hit any non-determinism errors keeping workflow state outside temporal?</p>
]]></description><pubDate>Tue, 03 Dec 2024 13:40:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=42305919</link><dc:creator>deoxykev</dc:creator><comments>https://news.ycombinator.com/item?id=42305919</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42305919</guid></item><item><title><![CDATA[New comment by deoxykev in "Show HN: Flow – A dynamic task engine for building AI agents"]]></title><description><![CDATA[
<p>Hey, I’m building agents on top of temporal as well. One of the main limitations is child workflows cannot spawn other child workflows. Are you doing an activity for every prompt execution and passing those through other activities? Or something more framework-y?</p>
]]></description><pubDate>Tue, 03 Dec 2024 02:10:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=42302391</link><dc:creator>deoxykev</dc:creator><comments>https://news.ycombinator.com/item?id=42302391</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42302391</guid></item><item><title><![CDATA[New comment by deoxykev in "Capstone Disassembler Framework"]]></title><description><![CDATA[
<p>ImHex is a really great frontend for Capstone.
<a href="https://github.com/WerWolv/ImHex">https://github.com/WerWolv/ImHex</a></p>
]]></description><pubDate>Wed, 25 Sep 2024 17:22:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=41649687</link><dc:creator>deoxykev</dc:creator><comments>https://news.ycombinator.com/item?id=41649687</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41649687</guid></item><item><title><![CDATA[New comment by deoxykev in "Serving AI from the Basement – 192GB of VRAM Setup"]]></title><description><![CDATA[
<p>Are you able to run 405B? 4-bit quant VRAM requirements are just shy of 192GB.</p>
]]></description><pubDate>Mon, 09 Sep 2024 14:17:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=41488792</link><dc:creator>deoxykev</dc:creator><comments>https://news.ycombinator.com/item?id=41488792</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41488792</guid></item><item><title><![CDATA[New comment by deoxykev in "Mistral AI Launches New 8x22B MOE Model"]]></title><description><![CDATA[
<p>4-bit quants should require 85GB VRAM, so this will fit nicely on 4x 24GB consumer GPUs, plus some leftover for KV cache optimization.</p>
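<p>A rough back-of-the-envelope for that figure (the ~141B total parameter count for the 8x22B MoE and the 1.2x overhead factor are assumptions, not vendor numbers): parameter count times bytes per parameter at the quantized bit width, times an overhead factor for quantization scales and runtime buffers.</p>

```python
def quant_vram_gb(n_params_billion, bits=4, overhead=1.2):
    """Rough VRAM estimate in GB for a quantized model.

    n_params_billion: total parameter count in billions.
    bits: quantization bit width (4 -> 0.5 bytes per parameter).
    overhead: fudge factor for quant scales/zeros and buffers.
    """
    param_bytes = n_params_billion * 1e9 * bits / 8
    return param_bytes * overhead / 1e9

# ~141B params at 4-bit with 1.2x overhead lands near the 85GB figure
print(round(quant_vram_gb(141), 1))
```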
]]></description><pubDate>Wed, 10 Apr 2024 03:09:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=39986596</link><dc:creator>deoxykev</dc:creator><comments>https://news.ycombinator.com/item?id=39986596</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39986596</guid></item><item><title><![CDATA[New comment by deoxykev in "Show HN: Beyond text splitting – improved file parsing for LLMs"]]></title><description><![CDATA[
<p>How does this compare to LayoutLMv3? Was it trained on forms at all?</p>
]]></description><pubDate>Mon, 08 Apr 2024 14:22:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=39970025</link><dc:creator>deoxykev</dc:creator><comments>https://news.ycombinator.com/item?id=39970025</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39970025</guid></item></channel></rss>