<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: ColonelPhantom</title><link>https://news.ycombinator.com/user?id=ColonelPhantom</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Mon, 18 May 2026 08:45:46 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=ColonelPhantom" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by ColonelPhantom in "Show HN: Rust but Lisp"]]></title><description><![CDATA[
<p>Carp is memory safe via linear types + references, similar to Rust, so I would not describe it as C-like but rather Rust-like.</p>
]]></description><pubDate>Sun, 10 May 2026 08:32:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=48082073</link><dc:creator>ColonelPhantom</dc:creator><comments>https://news.ycombinator.com/item?id=48082073</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48082073</guid></item><item><title><![CDATA[New comment by ColonelPhantom in "The text mode lie: why modern TUIs are a nightmare for accessibility"]]></title><description><![CDATA[
<p>But what _is_ a "Text User Interface"? Google Images just returns what is being discussed here: "GUIs" that run in some kind of text mode. And to me, that's also what a TUI is.<p>A more textually oriented environment (like a normal Unix shell) is, in my experience, usually referred to as a CLI: Command Line Interface.<p>I did find an interesting hybrid in the Pi coding agent: it seems to leverage the normal terminal scrollback, while still enhancing it with things like transient input fields and status lines, so that it can display those without cluttering scrollback.</p>
]]></description><pubDate>Mon, 04 May 2026 05:40:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=48005032</link><dc:creator>ColonelPhantom</dc:creator><comments>https://news.ycombinator.com/item?id=48005032</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48005032</guid></item><item><title><![CDATA[New comment by ColonelPhantom in "Framework Laptop 13 Pro"]]></title><description><![CDATA[
<p>You mentioned Strix Halo, which also has off-die memory. Strix Halo does have a real advantage from its wider memory bus (four channels for 256 bit instead of 128 bit), but Strix Point is equivalent-ish to Intel's platforms like Panther Lake or Arrow Lake in terms of memory setup.<p>In fact, Intel also had Lunar Lake, which had on-package memory. However, it was still limited to 128-bit dual-channel, so there weren't really many performance benefits; it did however help with power efficiency.</p>
]]></description><pubDate>Tue, 21 Apr 2026 19:34:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=47853451</link><dc:creator>ColonelPhantom</dc:creator><comments>https://news.ycombinator.com/item?id=47853451</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47853451</guid></item><item><title><![CDATA[New comment by ColonelPhantom in "Framework Laptop 13 Pro"]]></title><description><![CDATA[
<p>Hilariously, those AMD chips are way behind the Intels in terms of memory.<p>First off, I believe that Intel has its memory far more "unified". AMD typically has a stricter VRAM/RAM 'tradeoff' setting that does not exist on Intel in the same way to my knowledge. (See how on Strix Halo systems, there is a thing about "allocating" 96 GB to the GPU, which seems to be needed sometimes but prevents the CPU from accessing that memory.)<p>Secondly, the Panther Lake board has LPDDR5X LPCAMM2 memory at 7467 MT/s, while the AMD boards are stuck with DDR5 SODIMMs at a meagre 5600 MT/s. In other words, the Intel board gets a third more memory bandwidth!</p>
]]></description><pubDate>Tue, 21 Apr 2026 19:30:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=47853407</link><dc:creator>ColonelPhantom</dc:creator><comments>https://news.ycombinator.com/item?id=47853407</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47853407</guid></item><item><title><![CDATA[New comment by ColonelPhantom in "Every GPU That Mattered"]]></title><description><![CDATA[
<p>Nvidia Turing (RTX 20) definitely marked a major shift IMO.<p>- It was the first card to enable real-time ray-traced effects. 
- Mesh shaders are a significant overhaul of the geometry pipeline that's only recently getting real traction.
- Its tensor cores enabled a new generation of AI-driven upscaling/antialiasing. DLSS 2, FSR 4 and XeSS are all some variation of "TAA + neural networks", and these all rely on specialized matrix hardware to get optimal performance.<p>Obviously all of these features are supported across all vendors. Intel Arc Alchemist has all of these features as well, and AMD got RT and mesh shader support with RDNA2 along with slowly building up to tensor cores with RDNA3/4. But Turing clearly debuted these feature which have majorly changed the landscape of realtime 3D graphics.</p>
]]></description><pubDate>Wed, 08 Apr 2026 02:34:23 +0000</pubDate><link>https://news.ycombinator.com/item?id=47684251</link><dc:creator>ColonelPhantom</dc:creator><comments>https://news.ycombinator.com/item?id=47684251</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47684251</guid></item><item><title><![CDATA[New comment by ColonelPhantom in "Intel Announces Arc Pro B70 and Arc Pro B65 GPUs"]]></title><description><![CDATA[
<p>838 seems to be the real INT8 TOPS number for the 5090; going from 800 to 3400 takes an x2 speedup for sparsity (so skipping ops) and another x2 speedup for FP4 over INT8.<p>So it's closer to half the speed than a tenth. Intel also seems to be positioning this card against the RTX PRO 4000 Blackwell, not the 5090, and that one gets more like 300 INT8 TOPS. It also has less memory but at a slightly higher bandwidth. The 5090 is much faster and IIRC priced similarly to the PRO 4000, but is also decidedly a consumer product which, especially for Nvidia, comes with limitations (e.g. no server-friendly form factor cards available, and there are or used to be driver license restrictions that prevented using a consumer card in a data center setup).</p>
]]></description><pubDate>Thu, 26 Mar 2026 20:44:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=47535468</link><dc:creator>ColonelPhantom</dc:creator><comments>https://news.ycombinator.com/item?id=47535468</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47535468</guid></item><item><title><![CDATA[New comment by ColonelPhantom in "ARM AGI CPU: Specs and SKUs"]]></title><description><![CDATA[
<p>Aren't Intel Xeon Rapids and Intel Xeon Forest just different target markets? Rapids has fewer but faster cores in general, and more special-purpose accelerators (e.g. AMX, QAT), while Forest is focused on maximum compute density (just pack in as many fast-enough cores as you can).<p>IIRC Granite Rapids is also not _that_ old, and either current or a single generation behind. (Has its successor landed yet? IIRC GNR is the same generation as Sierra Forest).</p>
]]></description><pubDate>Wed, 25 Mar 2026 19:01:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=47521690</link><dc:creator>ColonelPhantom</dc:creator><comments>https://news.ycombinator.com/item?id=47521690</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47521690</guid></item><item><title><![CDATA[New comment by ColonelPhantom in "Building an FPGA 3dfx Voodoo with Modern RTL Tools"]]></title><description><![CDATA[
<p>Very cool! I am wondering one thing: how fast is it? Much of the "secret sauce" of the Voodoo is its high speed: a first-gen Verite or (God forbid) any ViRGE takes many more cycles for common operations like, say, Z-buffered pixels.<p>I'm guessing this isn't fully cycle-accurate, but is it at least somewhat "IPC-accurate"? I'm guessing yes? But much of that was also derived from Voodoo's (for the time) crazy high memory bandwidth AFAIK.</p>
]]></description><pubDate>Mon, 23 Mar 2026 12:49:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=47488764</link><dc:creator>ColonelPhantom</dc:creator><comments>https://news.ycombinator.com/item?id=47488764</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47488764</guid></item><item><title><![CDATA[New comment by ColonelPhantom in "Tinybox – A powerful computer for deep learning"]]></title><description><![CDATA[
<p>GPT-OSS is tailored to be extremely memory efficient. Not only is it natively using the 4.25 bit per token MXFP4 format, but it also uses sliding window attention for half of its layers. It also doesn't have that many layers, only 36 for the 120B version and 24 for the 120B version. (The 120B is also much much sparser than the 20B.)<p>I found a Reddit comment claiming only 36 KiB per token. With that, half a million tokens fits in 18 GB, which is less than one GPU. And three GPUs fit the parameters with room to spare (64 out of 72 GB).</p>
]]></description><pubDate>Mon, 23 Mar 2026 00:14:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=47483825</link><dc:creator>ColonelPhantom</dc:creator><comments>https://news.ycombinator.com/item?id=47483825</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47483825</guid></item><item><title><![CDATA[New comment by ColonelPhantom in "NanoGPT Slowrun: 10x Data Efficiency with Infinite Compute"]]></title><description><![CDATA[
<p>Interesting; I was not aware of those "universal synthetics" but they make sense: a stronger reasoning base would make modeling tasks easier. Thanks for the link!<p>Again, though, if those work I assume they will be used for the slowrun. Surely a few hundred LoC to generate data would not be considered cheating :)</p>
]]></description><pubDate>Fri, 20 Mar 2026 13:28:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=47454206</link><dc:creator>ColonelPhantom</dc:creator><comments>https://news.ycombinator.com/item?id=47454206</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47454206</guid></item><item><title><![CDATA[New comment by ColonelPhantom in "NanoGPT Slowrun: 10x Data Efficiency with Infinite Compute"]]></title><description><![CDATA[
<p>If generating synthetic data is such a great way to improve performance, why would it not be applied to the slowrun? Especially for the unlimited compute track, you should have plenty of time to generate as much synthetic data as your heart desires.<p>Intuitively, I would expect the synthetic data to mostly just "regurgitate" the existing data, and not add much. But I could be wrong of course, and perhaps doing reinforcement learning somewhere could solve that issue as well (though I don't know if there is much hidden in FineWeb that you could RL on; at best you can do self-verification probably?)</p>
]]></description><pubDate>Fri, 20 Mar 2026 11:49:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=47453286</link><dc:creator>ColonelPhantom</dc:creator><comments>https://news.ycombinator.com/item?id=47453286</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47453286</guid></item><item><title><![CDATA[New comment by ColonelPhantom in "Show HN: Axe – A 12MB binary that replaces your AI framework"]]></title><description><![CDATA[
<p>I like the idea of LLM-calling as an automation-friendly CLI tool! However, putting all my agents in ~/.config feels antithetical to this. My Bash scripts do not live there either, but rather in a separate script collection, or preferably, at their place of use (e.g. in a repo).<p>For example, let's say I want to add commit message generation (which I don't think is a great use of LLMs, but it is a practical example) to a repo. I would add the appropriate hook to /.git, but I would also want the agent with its instructions to live inside the repo (perhaps in an `axe` or `agents` directory).<p>Can Axe load agents from the current folder? Or can that be added?</p>
]]></description><pubDate>Fri, 13 Mar 2026 09:26:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=47362226</link><dc:creator>ColonelPhantom</dc:creator><comments>https://news.ycombinator.com/item?id=47362226</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47362226</guid></item><item><title><![CDATA[New comment by ColonelPhantom in "Forcing Flash Attention onto a TPU and Learning the Hard Way"]]></title><description><![CDATA[
<p>Interesting read! One remark though: I'm not too familiar with the architecture of a Google TPU, but comparing the TPU's VMEM with Nvidia's shared memory feels wrong to me.<p>Looking at the size, and its shared nature, it feels far more natural to compare with the L2 cache, which is also shared across the entire GPU and is in the same order of size (40MB on the listed A100).</p>
]]></description><pubDate>Fri, 13 Mar 2026 00:46:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=47359360</link><dc:creator>ColonelPhantom</dc:creator><comments>https://news.ycombinator.com/item?id=47359360</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47359360</guid></item><item><title><![CDATA[New comment by ColonelPhantom in "MacBook Pro with M5 Pro and M5 Max"]]></title><description><![CDATA[
<p>The reason for that is that most memory bandwidth bumps come with new memory generations. For example an early DDR4 platform (e.g. Intel Skylake/Core iX-6000) and a late one (e.g. AMD Zen3/Ryzen 5000) only differ by 1.5x as well, typically.<p>The same trend is visible in GPUs: for example, my RTX 2070 (GDDR6) has the same memory bandwidth as a 3070 and only a little bit less than a 4070 (GDDR6X). However, a 5070 does get significantly more bandwidth due to the jump to GDDR7. Lower-end cards like the 4060 even stuck to GDDR6, which gave them a bandwidth deficit compared to a 3060 due to the narrower memory buses on the 40 series.</p>
]]></description><pubDate>Wed, 04 Mar 2026 12:09:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=47246345</link><dc:creator>ColonelPhantom</dc:creator><comments>https://news.ycombinator.com/item?id=47246345</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47246345</guid></item><item><title><![CDATA[New comment by ColonelPhantom in "Qwen 3.5 small models out"]]></title><description><![CDATA[
<p>It's not just Qwen; we also recently had GLM-4.7-Flash in the same roughly 30B-A3 range. Seems to me like there's no shortage of competition for good old GPT-OSS 20B (not just Qwen3.5-35B and GLM-4.7-Flash, but also Qwen3(-Coder)-30B or Granite 4 Small).</p>
]]></description><pubDate>Wed, 25 Feb 2026 07:39:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=47148560</link><dc:creator>ColonelPhantom</dc:creator><comments>https://news.ycombinator.com/item?id=47148560</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47148560</guid></item><item><title><![CDATA[New comment by ColonelPhantom in "Porsche sold more electrified cars in Europe in 2025 than pure gas-powered cars"]]></title><description><![CDATA[
<p>> Meanwhile, European makers are stuck not knowing what to do, make Americans happy or compete with the Chinese.<p>Huh? This comment sounds extremely America-centric to me. Porsche sells more cars into Europe than North America, despite taking a bigger there (-13-16% vs 0%)!<p>In general I don't think Porsche is representative of the car market as a whole, given their cars are all premium sports cars to at least some degree.<p>If you want more representative numbers look at more mass-market manufacturers. Notably, the Volkswagen group has a huge 20%+ market share in the EU, while it is below 5% in the USA. Renault is another example of a strong EU-centric brand and manufacturer with over 10% market share, even over 25% at home in France. Ford is a good example of the opposite, having 13% market share in the USA and only 2-3% here. Stellantis is strong in both markets, but has significant differentiation, even having different brands in both markets.</p>
]]></description><pubDate>Tue, 20 Jan 2026 10:50:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=46690421</link><dc:creator>ColonelPhantom</dc:creator><comments>https://news.ycombinator.com/item?id=46690421</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46690421</guid></item><item><title><![CDATA[New comment by ColonelPhantom in "Show HN: Z80-μLM, a 'Conversational AI' That Fits in 40KB"]]></title><description><![CDATA[
<p>Each layer of the LM is also at most 16 KiB, so if you want to minimize bank switching, I think making sure each layer is in one bank would be enough? Bank switching shouldn't give much overhead anyway unless it complicates an inner loop, which would be avoided if no layers are split across banks.</p>
]]></description><pubDate>Mon, 29 Dec 2025 23:18:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=46427254</link><dc:creator>ColonelPhantom</dc:creator><comments>https://news.ycombinator.com/item?id=46427254</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46427254</guid></item><item><title><![CDATA[New comment by ColonelPhantom in "Bikeshedding, or why I want to build a laptop"]]></title><description><![CDATA[
<p>Touchegg kinda sucks (gestures are not 1:1 but rather just "triggered"), and you also don't need it. KDE and Gnome (as well as some WMs like Niri) have native touchpad gesture support on Wayland. Using my touchpad for history navigation also works flawlessly on Firefox with two fingers (essentially just horizontal scrolling).</p>
]]></description><pubDate>Sun, 07 Dec 2025 14:51:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=46182063</link><dc:creator>ColonelPhantom</dc:creator><comments>https://news.ycombinator.com/item?id=46182063</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46182063</guid></item><item><title><![CDATA[New comment by ColonelPhantom in "GNOME 50 completes the migration to Wayland, dropping X11 backend code"]]></title><description><![CDATA[
<p>I have no idea what you are saying (with "is"??), but I don't think this is true: KDE Dolphin is very full-featured and runs natively on Wayland.</p>
]]></description><pubDate>Sun, 16 Nov 2025 13:49:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=45945136</link><dc:creator>ColonelPhantom</dc:creator><comments>https://news.ycombinator.com/item?id=45945136</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45945136</guid></item><item><title><![CDATA[New comment by ColonelPhantom in "GNOME 50 completes the migration to Wayland, dropping X11 backend code"]]></title><description><![CDATA[
<p>That's fair! I <i>believe</i> that window positioning also works on XWayland, though, so running your file manager that way should still work with the rest of the system being Wayland (and Gnome has no plans to drop XWayland afaik).<p>I believe the main holdup is a desire for Wayland to be usable with e.g. VR interfaces where there is no simple 2d grid.<p>Out of curiosity, how do you want the file manager to behave? And did you write your own or are you using an existing one that works that way?</p>
]]></description><pubDate>Fri, 14 Nov 2025 15:37:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=45927840</link><dc:creator>ColonelPhantom</dc:creator><comments>https://news.ycombinator.com/item?id=45927840</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45927840</guid></item></channel></rss>