<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: WASDx</title><link>https://news.ycombinator.com/user?id=WASDx</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sun, 21 Jun 2026 09:26:24 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=WASDx" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by WASDx in "GLM-5.2 is the new leading open weights model on Artificial Analysis"]]></title><description><![CDATA[
<p>Are you suggesting it should summarize the image in text or generate it in HTML or something else?</p>
]]></description><pubDate>Wed, 17 Jun 2026 16:19:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=48572627</link><dc:creator>WASDx</dc:creator><comments>https://news.ycombinator.com/item?id=48572627</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48572627</guid></item><item><title><![CDATA[New comment by WASDx in "Running local models is good now"]]></title><description><![CDATA[
<p>Looking at some benchmarks, the latest ~30B Gemma/Qwen score similar as Claude or GPT versions that were released just <i>one year earlier</i>. That's crazy progress. I can't imagine how it will be in a few years.</p>
]]></description><pubDate>Tue, 16 Jun 2026 17:30:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=48558759</link><dc:creator>WASDx</dc:creator><comments>https://news.ycombinator.com/item?id=48558759</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48558759</guid></item><item><title><![CDATA[New comment by WASDx in "Open source AI must win"]]></title><description><![CDATA[
<p>I think this is inevitable. Sooner or later, model-specific ASIC's will make economical sense. We're already seeing it happening with Taalas/Cerebras so I think it's sooner than 5 years. And inference is order of magnitude faster which is amazing.</p>
]]></description><pubDate>Sat, 13 Jun 2026 19:45:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=48520735</link><dc:creator>WASDx</dc:creator><comments>https://news.ycombinator.com/item?id=48520735</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48520735</guid></item><item><title><![CDATA[New comment by WASDx in "Open source AI must win"]]></title><description><![CDATA[
<p>> distributed LLM inference<p>This seems extremely inefficient considering data transfer between model layers if the model is distributed. I found this project called Petals that claim up to 4 tok/s for a 180B model although its repository hasn't been updated in two years.<p><a href="https://petals.dev/" rel="nofollow">https://petals.dev/</a></p>
]]></description><pubDate>Sat, 13 Jun 2026 19:25:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=48520543</link><dc:creator>WASDx</dc:creator><comments>https://news.ycombinator.com/item?id=48520543</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48520543</guid></item><item><title><![CDATA[New comment by WASDx in "Claude Fable 5"]]></title><description><![CDATA[
<p>I like this one, although its data seem to overlap with ECI.<p><a href="https://artificialanalysis.ai/trends" rel="nofollow">https://artificialanalysis.ai/trends</a></p>
]]></description><pubDate>Tue, 09 Jun 2026 20:15:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=48467049</link><dc:creator>WASDx</dc:creator><comments>https://news.ycombinator.com/item?id=48467049</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48467049</guid></item><item><title><![CDATA[New comment by WASDx in "Real-time LLM Inference on Standard GPUs: 3k tokens/s per request"]]></title><description><![CDATA[
<p><a href="https://chatjimmy.ai/" rel="nofollow">https://chatjimmy.ai/</a> from Taalas also feels like that.</p>
]]></description><pubDate>Fri, 29 May 2026 16:44:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=48325702</link><dc:creator>WASDx</dc:creator><comments>https://news.ycombinator.com/item?id=48325702</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48325702</guid></item><item><title><![CDATA[New comment by WASDx in "Claude Opus 4.8"]]></title><description><![CDATA[
<p>I think their "code" ranking is biased towards visual aesthetics more than raw coding as the voters are just asked which generated website they prefer.</p>
]]></description><pubDate>Thu, 28 May 2026 22:02:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=48316137</link><dc:creator>WASDx</dc:creator><comments>https://news.ycombinator.com/item?id=48316137</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48316137</guid></item><item><title><![CDATA[New comment by WASDx in "A Claude Code and Codex Skill for Deliberate Skill Development"]]></title><description><![CDATA[
<p>I've had mostly problem-free experiences with intellij (ultimate-only feature I think). One click finds declarations both in business code and buried deep in libraries.</p>
]]></description><pubDate>Thu, 14 May 2026 11:21:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=48133868</link><dc:creator>WASDx</dc:creator><comments>https://news.ycombinator.com/item?id=48133868</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48133868</guid></item><item><title><![CDATA[New comment by WASDx in "Accelerating Gemma 4: faster inference with multi-token prediction drafters"]]></title><description><![CDATA[
<p>gemma-4-31B-it-assistant is a 0.5B model. So it's performance would likely be comparable to other models of such size.</p>
]]></description><pubDate>Wed, 06 May 2026 19:04:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=48040247</link><dc:creator>WASDx</dc:creator><comments>https://news.ycombinator.com/item?id=48040247</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48040247</guid></item><item><title><![CDATA[New comment by WASDx in "Accelerating Gemma 4: faster inference with multi-token prediction drafters"]]></title><description><![CDATA[
<p>I think this is the future. When models start converging at "really good" (which I think is already happening) then burning them into ASIC silicon is the natural next step.<p>Harnesses can keep improving with a fixed model and the throughput opens up new possibilities like doing 10x more "thinking" or exploring parallel paths and picking the best.</p>
]]></description><pubDate>Wed, 06 May 2026 18:20:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=48039683</link><dc:creator>WASDx</dc:creator><comments>https://news.ycombinator.com/item?id=48039683</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48039683</guid></item><item><title><![CDATA[New comment by WASDx in "GitHub RCE Vulnerability: CVE-2026-3854 Breakdown"]]></title><description><![CDATA[
<p>I was impressed enough by AI finding vulnerabilities in source code, but doing it in binary executables is just amazing. This has so much potential, good and bad.<p>And yet another lesson to not treat data as instructions. Sanitize all user input!</p>
]]></description><pubDate>Tue, 28 Apr 2026 18:55:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=47938879</link><dc:creator>WASDx</dc:creator><comments>https://news.ycombinator.com/item?id=47938879</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47938879</guid></item><item><title><![CDATA[New comment by WASDx in "We replaced Node.js with Bun for 5x throughput"]]></title><description><![CDATA[
<p>Creating a custom tuple class to use as key could be faster though. Nested map lookups have less efficient memory access patterns.</p>
]]></description><pubDate>Mon, 06 Apr 2026 08:26:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=47658259</link><dc:creator>WASDx</dc:creator><comments>https://news.ycombinator.com/item?id=47658259</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47658259</guid></item><item><title><![CDATA[New comment by WASDx in "Show HN: I made a YouTube search form with advanced filters"]]></title><description><![CDATA[
<p>Similar site with same features: <a href="https://ä1.com/" rel="nofollow">https://xn--1-zfa.com/</a></p>
]]></description><pubDate>Mon, 06 Apr 2026 08:06:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=47658168</link><dc:creator>WASDx</dc:creator><comments>https://news.ycombinator.com/item?id=47658168</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47658168</guid></item><item><title><![CDATA[New comment by WASDx in "Kotlin creator's new language: talk to LLMs in specs, not English"]]></title><description><![CDATA[
<p>I think these limitations could be addressed by allowing trivial manual adjustments to the generated code before committing. And/or allowing for trivial code changes without a spec change. The judgement of "trivial" being that it still follows the spec and does not add functionality mandating a spec change. I haven't checked if they support any of this but I would be frustrated not being allowed to make such a small code change, say to fix an off-by-one error that I recently got from LLM output. The code change would be smaller than the spec change.<p>Cool idea overall, an incremental psuedocode compiler. Interesting to see how well it scales.<p>I can also see a hybrid solution with non-specced code files for things where the size of code and spec would be the same, like for enums or mapping tables.</p>
]]></description><pubDate>Thu, 12 Mar 2026 21:11:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=47357164</link><dc:creator>WASDx</dc:creator><comments>https://news.ycombinator.com/item?id=47357164</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47357164</guid></item><item><title><![CDATA[New comment by WASDx in "Elasticsearch was never a database"]]></title><description><![CDATA[
<p>I've managed a 100+ node cluster for years without seeing any corruption. Where are you getting this from?</p>
]]></description><pubDate>Fri, 16 Jan 2026 19:44:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=46651179</link><dc:creator>WASDx</dc:creator><comments>https://news.ycombinator.com/item?id=46651179</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46651179</guid></item><item><title><![CDATA[New comment by WASDx in "Antislop: A framework for eliminating repetitive patterns in language models"]]></title><description><![CDATA[
<p>You can customize it to get rid of all that. I set it to the "Robot" personality and a custom instruction to "No fluff and politeness. Be short and get straight to the point. Don't overuse bold font for emphasis."</p>
]]></description><pubDate>Thu, 23 Oct 2025 19:29:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=45685874</link><dc:creator>WASDx</dc:creator><comments>https://news.ycombinator.com/item?id=45685874</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45685874</guid></item><item><title><![CDATA[New comment by WASDx in "Ask HN: Does anyone else notice YouTube causing 100% CPU usage and stattering?"]]></title><description><![CDATA[
<p>Same. I recall the "stable volume" setting also eating cpu.</p>
]]></description><pubDate>Fri, 19 Sep 2025 14:59:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=45302397</link><dc:creator>WASDx</dc:creator><comments>https://news.ycombinator.com/item?id=45302397</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45302397</guid></item><item><title><![CDATA[New comment by WASDx in "Show HN: Engineering.fyi – Search across tech engineering blogs in one place"]]></title><description><![CDATA[
<p>FYI here is a list of hundreds of engineering blogs: <a href="https://github.com/kilimchoi/engineering-blogs" rel="nofollow">https://github.com/kilimchoi/engineering-blogs</a></p>
]]></description><pubDate>Sun, 10 Aug 2025 15:40:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=44855931</link><dc:creator>WASDx</dc:creator><comments>https://news.ycombinator.com/item?id=44855931</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44855931</guid></item><item><title><![CDATA[New comment by WASDx in "How we replaced Elasticsearch and MongoDB with Rust and RocksDB"]]></title><description><![CDATA[
<p>The `/_cluster/reroute` endpoint lets you do that with a curl. We have aliases for common operations so I've never felt that I lack a CLI. I'm happy with Elasticsearch overall having a few years of experience.</p>
]]></description><pubDate>Sat, 09 Aug 2025 13:58:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=44846501</link><dc:creator>WASDx</dc:creator><comments>https://news.ycombinator.com/item?id=44846501</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44846501</guid></item><item><title><![CDATA[New comment by WASDx in "QUIC for the kernel"]]></title><description><![CDATA[
<p>I recall this article on QUIC disadvantages: <a href="https://www.reddit.com/r/programming/comments/1g7vv66/quic_is_not_quick_enough_over_fast_internet/" rel="nofollow">https://www.reddit.com/r/programming/comments/1g7vv66/quic_i...</a><p>Seems like this is a step in the right direction to resole some of those issues. I suppose nothing is preventing it from getting hardware support in future network cards as well.</p>
]]></description><pubDate>Thu, 31 Jul 2025 16:49:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=44747484</link><dc:creator>WASDx</dc:creator><comments>https://news.ycombinator.com/item?id=44747484</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44747484</guid></item></channel></rss>