Hacker News: WASDx

New comment by WASDx in "GLM-5.2 is the new leading open weights model on Artificial Analysis"

WASDx — Wed, 17 Jun 2026 16:19:07 +0000

Are you suggesting it should summarize the image in text or generate it in HTML or something else?

New comment by WASDx in "Running local models is good now"

WASDx — Tue, 16 Jun 2026 17:30:01 +0000

Looking at some benchmarks, the latest ~30B Gemma/Qwen score similar as Claude or GPT versions that were released just one year earlier. That's crazy progress. I can't imagine how it will be in a few years.

New comment by WASDx in "Open source AI must win"

WASDx — Sat, 13 Jun 2026 19:45:42 +0000

I think this is inevitable. Sooner or later, model-specific ASIC's will make economical sense. We're already seeing it happening with Taalas/Cerebras so I think it's sooner than 5 years. And inference is order of magnitude faster which is amazing.

New comment by WASDx in "Open source AI must win"

WASDx — Sat, 13 Jun 2026 19:25:22 +0000

> distributed LLM inference

This seems extremely inefficient considering data transfer between model layers if the model is distributed. I found this project called Petals that claim up to 4 tok/s for a 180B model although its repository hasn't been updated in two years.

https://petals.dev/

New comment by WASDx in "Claude Fable 5"

WASDx — Tue, 09 Jun 2026 20:15:44 +0000

I like this one, although its data seem to overlap with ECI.

https://artificialanalysis.ai/trends

New comment by WASDx in "Real-time LLM Inference on Standard GPUs: 3k tokens/s per request"

WASDx — Fri, 29 May 2026 16:44:18 +0000

https://chatjimmy.ai/ from Taalas also feels like that.

New comment by WASDx in "Claude Opus 4.8"

WASDx — Thu, 28 May 2026 22:02:16 +0000

I think their "code" ranking is biased towards visual aesthetics more than raw coding as the voters are just asked which generated website they prefer.

New comment by WASDx in "A Claude Code and Codex Skill for Deliberate Skill Development"

WASDx — Thu, 14 May 2026 11:21:32 +0000

I've had mostly problem-free experiences with intellij (ultimate-only feature I think). One click finds declarations both in business code and buried deep in libraries.

New comment by WASDx in "Accelerating Gemma 4: faster inference with multi-token prediction drafters"

WASDx — Wed, 06 May 2026 19:04:48 +0000

gemma-4-31B-it-assistant is a 0.5B model. So it's performance would likely be comparable to other models of such size.

New comment by WASDx in "Accelerating Gemma 4: faster inference with multi-token prediction drafters"

WASDx — Wed, 06 May 2026 18:20:39 +0000

I think this is the future. When models start converging at "really good" (which I think is already happening) then burning them into ASIC silicon is the natural next step.

Harnesses can keep improving with a fixed model and the throughput opens up new possibilities like doing 10x more "thinking" or exploring parallel paths and picking the best.

New comment by WASDx in "GitHub RCE Vulnerability: CVE-2026-3854 Breakdown"

WASDx — Tue, 28 Apr 2026 18:55:09 +0000

I was impressed enough by AI finding vulnerabilities in source code, but doing it in binary executables is just amazing. This has so much potential, good and bad.

And yet another lesson to not treat data as instructions. Sanitize all user input!

New comment by WASDx in "We replaced Node.js with Bun for 5x throughput"

WASDx — Mon, 06 Apr 2026 08:26:21 +0000

Creating a custom tuple class to use as key could be faster though. Nested map lookups have less efficient memory access patterns.

New comment by WASDx in "Show HN: I made a YouTube search form with advanced filters"

WASDx — Mon, 06 Apr 2026 08:06:40 +0000

Similar site with same features: https://xn--1-zfa.com/

New comment by WASDx in "Kotlin creator's new language: talk to LLMs in specs, not English"

WASDx — Thu, 12 Mar 2026 21:11:17 +0000

I think these limitations could be addressed by allowing trivial manual adjustments to the generated code before committing. And/or allowing for trivial code changes without a spec change. The judgement of "trivial" being that it still follows the spec and does not add functionality mandating a spec change. I haven't checked if they support any of this but I would be frustrated not being allowed to make such a small code change, say to fix an off-by-one error that I recently got from LLM output. The code change would be smaller than the spec change.

Cool idea overall, an incremental psuedocode compiler. Interesting to see how well it scales.

I can also see a hybrid solution with non-specced code files for things where the size of code and spec would be the same, like for enums or mapping tables.

New comment by WASDx in "Elasticsearch was never a database"

WASDx — Fri, 16 Jan 2026 19:44:12 +0000

I've managed a 100+ node cluster for years without seeing any corruption. Where are you getting this from?

New comment by WASDx in "Antislop: A framework for eliminating repetitive patterns in language models"

WASDx — Thu, 23 Oct 2025 19:29:40 +0000

You can customize it to get rid of all that. I set it to the "Robot" personality and a custom instruction to "No fluff and politeness. Be short and get straight to the point. Don't overuse bold font for emphasis."

New comment by WASDx in "Ask HN: Does anyone else notice YouTube causing 100% CPU usage and stattering?"

WASDx — Fri, 19 Sep 2025 14:59:24 +0000

Same. I recall the "stable volume" setting also eating cpu.

New comment by WASDx in "Show HN: Engineering.fyi – Search across tech engineering blogs in one place"

WASDx — Sun, 10 Aug 2025 15:40:29 +0000

FYI here is a list of hundreds of engineering blogs: https://github.com/kilimchoi/engineering-blogs

New comment by WASDx in "How we replaced Elasticsearch and MongoDB with Rust and RocksDB"

WASDx — Sat, 09 Aug 2025 13:58:36 +0000

The `/_cluster/reroute` endpoint lets you do that with a curl. We have aliases for common operations so I've never felt that I lack a CLI. I'm happy with Elasticsearch overall having a few years of experience.

New comment by WASDx in "QUIC for the kernel"

WASDx — Thu, 31 Jul 2025 16:49:29 +0000

I recall this article on QUIC disadvantages: https://www.reddit.com/r/programming/comments/1g7vv66/quic_i...

Seems like this is a step in the right direction to resole some of those issues. I suppose nothing is preventing it from getting hardware support in future network cards as well.