<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: electroglyph</title><link>https://news.ycombinator.com/user?id=electroglyph</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Tue, 23 Jun 2026 23:45:35 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=electroglyph" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by electroglyph in "Good results fine tuning a local LLM like Qwen 3:0.6B to categorize questions"]]></title><description><![CDATA[
<p>existing embedding models like alibaba's modernbert tune or one of the jina v5s would probably map query to category automatically. (i.e. store embeddings of each category and calculate cosine sim for each incoming query vs. categories and pick the closest)<p>also, you could stick a classifier head on a BERT model as another option.</p>
]]></description><pubDate>Mon, 22 Jun 2026 04:04:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=48625512</link><dc:creator>electroglyph</dc:creator><comments>https://news.ycombinator.com/item?id=48625512</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48625512</guid></item><item><title><![CDATA[New comment by electroglyph in "The time the x86 emulator team found code so bad they fixed it during emulation"]]></title><description><![CDATA[
<p>heh, when Raymond Chen dunks on the MSVC team =)</p>
]]></description><pubDate>Tue, 16 Jun 2026 05:58:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=48551117</link><dc:creator>electroglyph</dc:creator><comments>https://news.ycombinator.com/item?id=48551117</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48551117</guid></item><item><title><![CDATA[New comment by electroglyph in "AI is code – and can't be prompted into being smarter"]]></title><description><![CDATA[
<p>it's probably a lie</p>
]]></description><pubDate>Mon, 15 Jun 2026 01:37:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=48535459</link><dc:creator>electroglyph</dc:creator><comments>https://news.ycombinator.com/item?id=48535459</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48535459</guid></item><item><title><![CDATA[New comment by electroglyph in "DeepSeek V4 Pro beats GPT-5.5 Pro on precision"]]></title><description><![CDATA[
<p>deepseek 4 pro is insanely good for the price</p>
]]></description><pubDate>Mon, 08 Jun 2026 03:18:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=48440963</link><dc:creator>electroglyph</dc:creator><comments>https://news.ycombinator.com/item?id=48440963</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48440963</guid></item><item><title><![CDATA[New comment by electroglyph in "Nvidia is proposing a beast of a CPU system for Windows PCs"]]></title><description><![CDATA[
<p>that link actually recommends <i>not</i> doing it from UEFI and doing it via software</p>
]]></description><pubDate>Sat, 06 Jun 2026 22:49:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=48429819</link><dc:creator>electroglyph</dc:creator><comments>https://news.ycombinator.com/item?id=48429819</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48429819</guid></item><item><title><![CDATA[New comment by electroglyph in "KVarN: Native vLLM backend for KV-cache quantization by Huawei"]]></title><description><![CDATA[
<p>any divergence (even if the benchmark is better) from full precision is error</p>
]]></description><pubDate>Thu, 04 Jun 2026 21:33:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=48404928</link><dc:creator>electroglyph</dc:creator><comments>https://news.ycombinator.com/item?id=48404928</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48404928</guid></item><item><title><![CDATA[New comment by electroglyph in "Gustav Klimt and Egon Schiele in Conversation (2018)"]]></title><description><![CDATA[
<p>this is better than TFA</p>
]]></description><pubDate>Sun, 31 May 2026 05:40:23 +0000</pubDate><link>https://news.ycombinator.com/item?id=48343320</link><dc:creator>electroglyph</dc:creator><comments>https://news.ycombinator.com/item?id=48343320</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48343320</guid></item><item><title><![CDATA[New comment by electroglyph in "FBI Arrests CIA Official with $40M in Gold Bars in His Home"]]></title><description><![CDATA[
<p>sometimes i wonder if the left hand knows what the right is doing. it looks like we arrested our own spy in this case: <a href="https://www.politico.com/news/2026/05/25/american-journalist-unregistered-agent-china-00935518" rel="nofollow">https://www.politico.com/news/2026/05/25/american-journalist...</a></p>
]]></description><pubDate>Thu, 28 May 2026 01:45:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=48303315</link><dc:creator>electroglyph</dc:creator><comments>https://news.ycombinator.com/item?id=48303315</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48303315</guid></item><item><title><![CDATA[New comment by electroglyph in "Norway's 2 petabytes of Huawei flash storage and LLM training"]]></title><description><![CDATA[
<p>absolutely. somebody online was wanting an LLM with Georgian language support, and that's exactly what i suggested: start digitizing Georgian text.</p>
]]></description><pubDate>Mon, 25 May 2026 23:16:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=48272978</link><dc:creator>electroglyph</dc:creator><comments>https://news.ycombinator.com/item?id=48272978</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48272978</guid></item><item><title><![CDATA[New comment by electroglyph in "Wake up! 16b"]]></title><description><![CDATA[
<p>i'll upvote this each time it's submitted</p>
]]></description><pubDate>Sun, 24 May 2026 04:39:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=48254425</link><dc:creator>electroglyph</dc:creator><comments>https://news.ycombinator.com/item?id=48254425</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48254425</guid></item><item><title><![CDATA[New comment by electroglyph in "Qwen3.7-Max: The Agent Frontier"]]></title><description><![CDATA[
<p>you should be using dflash with that model, look it up</p>
]]></description><pubDate>Wed, 20 May 2026 23:09:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=48215590</link><dc:creator>electroglyph</dc:creator><comments>https://news.ycombinator.com/item?id=48215590</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48215590</guid></item><item><title><![CDATA[New comment by electroglyph in "DeepSeek-V4-Flash means LLM steering is interesting again"]]></title><description><![CDATA[
<p>heretic maintainer: <a href="https://github.com/p-e-w/heretic" rel="nofollow">https://github.com/p-e-w/heretic</a><p>the fun bits are in another branch or PRs</p>
]]></description><pubDate>Sat, 16 May 2026 22:30:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=48164329</link><dc:creator>electroglyph</dc:creator><comments>https://news.ycombinator.com/item?id=48164329</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48164329</guid></item><item><title><![CDATA[New comment by electroglyph in "DeepSeek-V4-Flash means LLM steering is interesting again"]]></title><description><![CDATA[
<p>p-e-w was just talking about this the other day in his Discord. seems doing the one neuron method is quite bad for KLD and that's why the newer techniques have stuck.</p>
]]></description><pubDate>Sat, 16 May 2026 20:56:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=48163727</link><dc:creator>electroglyph</dc:creator><comments>https://news.ycombinator.com/item?id=48163727</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48163727</guid></item><item><title><![CDATA[New comment by electroglyph in "Seeing Birdsong"]]></title><description><![CDATA[
<p>site has so little information there doesn't seem to be much to discuss</p>
]]></description><pubDate>Mon, 11 May 2026 07:21:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=48091995</link><dc:creator>electroglyph</dc:creator><comments>https://news.ycombinator.com/item?id=48091995</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48091995</guid></item><item><title><![CDATA[New comment by electroglyph in "A polynomial autoencoder beats PCA on transformer embeddings"]]></title><description><![CDATA[
<p>this looks awesome. i've been struggling with vector compression, and have been trying PCA + all sorts of rotations. looking forward to trying this out</p>
]]></description><pubDate>Fri, 08 May 2026 10:22:27 +0000</pubDate><link>https://news.ycombinator.com/item?id=48061122</link><dc:creator>electroglyph</dc:creator><comments>https://news.ycombinator.com/item?id=48061122</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48061122</guid></item><item><title><![CDATA[New comment by electroglyph in "Making LLM Training Faster with Unsloth and NVIDIA"]]></title><description><![CDATA[
<p>nice writeup! looking forward to doing some more training as soon as i get some more data sorted. it'll be a custom arch, but i'll probably shoehorn it into unsloth for a speed boost.</p>
]]></description><pubDate>Thu, 07 May 2026 09:19:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=48047238</link><dc:creator>electroglyph</dc:creator><comments>https://news.ycombinator.com/item?id=48047238</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48047238</guid></item><item><title><![CDATA[New comment by electroglyph in "Train Your Own LLM from Scratch"]]></title><description><![CDATA[
<p>you can train it, but not fully</p>
]]></description><pubDate>Tue, 05 May 2026 06:41:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=48018889</link><dc:creator>electroglyph</dc:creator><comments>https://news.ycombinator.com/item?id=48018889</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48018889</guid></item><item><title><![CDATA[New comment by electroglyph in "Opus 4.7 knows the real Kelsey"]]></title><description><![CDATA[
<p>that's in the ideal scenario where it's only seen a single copy of it tho</p>
]]></description><pubDate>Fri, 01 May 2026 08:24:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=47972415</link><dc:creator>electroglyph</dc:creator><comments>https://news.ycombinator.com/item?id=47972415</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47972415</guid></item><item><title><![CDATA[New comment by electroglyph in "New copy of earliest poem in English, written 1,3k years ago, discovered in Rome"]]></title><description><![CDATA[
<p>it was 1.3e-6 billion years ago!</p>
]]></description><pubDate>Fri, 01 May 2026 08:21:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=47972401</link><dc:creator>electroglyph</dc:creator><comments>https://news.ycombinator.com/item?id=47972401</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47972401</guid></item><item><title><![CDATA[New comment by electroglyph in "Microsoft and OpenAI end their exclusive and revenue-sharing deal"]]></title><description><![CDATA[
<p>i'm doing inference on a free mi300x instance from AMD right now. not sure if the software stack is just old or what, but here's what i've observed: stuck on an old version of vllm pre-Transformers 5 support. it lacks MoE support for qwen3 models. oss-120b is faaaar slower than it should be.<p>int8 quantization seems like it's almost supported, but not quite. speeds drop to a fraction of full precision speed and the server seems like it intermittently hangs. int4 quantization not supported. fp8 quantization not supported.<p>again, maybe AMD is just being lazy with what they've provided, but it's not a great look.<p>right now the fastest smart model i can run is full precision qwen3-32b. with 120 parallel requests (short context) i'm getting PP @ 4500 tokens/sec and TG @ 1300 tokens/sec</p>
]]></description><pubDate>Tue, 28 Apr 2026 05:31:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=47930715</link><dc:creator>electroglyph</dc:creator><comments>https://news.ycombinator.com/item?id=47930715</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47930715</guid></item></channel></rss>