<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: zepearl</title><link>https://news.ycombinator.com/user?id=zepearl</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Tue, 30 Jun 2026 22:38:06 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=zepearl" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by zepearl in "Looking Ahead to Postgres 19"]]></title><description><![CDATA[
<p>> <i>Synchronously updated materialized views...</i><p>Oh yes, I'd love them too (if you're referring to, in Oracle slang, "...update on commit") - and it would also be cool to have the option of a lazy update ("on demand", taking into consideration only the records that have changed since the last refresh, to handle multiple updates in a single pass - not sure how Oracle achieves that technically...). In my opinion this would be a fantastic added functionality compared to basically all other (OLTP?) open-source DBs.<p>And: I'm really curious about the "OrioleDB" project... ( <a href="https://github.com/orioledb/orioledb/releases" rel="nofollow">https://github.com/orioledb/orioledb/releases</a> ) as a few years ago I was struggling a lot with "vacuum" of a kind-of-temporary table that had a fairly high volume of continuous random inserts & deletes (problem solved by accumulating more changes in RAM before flushing them to the table, therefore increasing the number of rows changed per "page", but I had to sweat a lot to find a good balance...).</p>
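<p>A minimal sketch of the lazy "on demand" refresh idea described above, assuming a toy in-memory table and a SUM-per-key view. The class and method names are made up for illustration; this is not how Oracle or Postgres implement it. It only shows how several changes to the same row can collapse into a single adjustment at refresh time.</p>

```python
from collections import defaultdict


class LazySumView:
    """Toy 'materialized view': SUM(amount) GROUP BY key, refreshed on demand."""

    def __init__(self):
        self.base = {}                # row_id -> (key, amount), the "table"
        self.view = defaultdict(int)  # key -> sum(amount), the "view"
        self.pending = {}             # row_id -> row as of the last refresh (None = absent)

    def _remember(self, row_id):
        # Keep only the FIRST pre-change state of a row since the last refresh,
        # so repeated changes to one row cost a single adjustment later.
        self.pending.setdefault(row_id, self.base.get(row_id))

    def upsert(self, row_id, key, amount):
        self._remember(row_id)
        self.base[row_id] = (key, amount)

    def delete(self, row_id):
        self._remember(row_id)
        self.base.pop(row_id, None)

    def refresh(self):
        """Apply the net effect of all pending changes in one pass."""
        touched = len(self.pending)
        for row_id, old in self.pending.items():
            if old is not None:
                self.view[old[0]] -= old[1]   # undo the stale contribution
            new = self.base.get(row_id)
            if new is not None:
                self.view[new[0]] += new[1]   # add the current contribution
        self.pending.clear()
        return touched
```

<p>In this sketch two updates to the same row followed by one refresh touch that row once, which is the "multiple updates in a single pass" behaviour; a real database would additionally need a durable change log and has to cope with concurrent transactions.</p>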
]]></description><pubDate>Tue, 30 Jun 2026 17:40:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=48736287</link><dc:creator>zepearl</dc:creator><comments>https://news.ycombinator.com/item?id=48736287</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48736287</guid></item><item><title><![CDATA[New comment by zepearl in "Ask HN: MacBook vs. Dedicated GPU for LLM"]]></title><description><![CDATA[
<p>In my case it depends a lot on the task being performed.<p>E.g. when doing text transcription/OCR from images (Qwen 3.6 27B Q4_K_M by Bartowski) with a context size of ~50k I get a pp (prompt processing) of ~460 tokens per second and a generation speed ranging from 35 to 45 tokens per second (currently using "--spec-type draft-mtp --spec-draft-n-max 2" with llama.cpp b6548).<p>On the other hand, when handling code (Qwen 3.6 27B Q5_K_M by Bartowski) with a context size of 128k I get a pp ranging from 500 to 1500 tokens per second and a generation speed between 25 and 40 tokens per second (in this case as well using "--spec-type draft-mtp --spec-draft-n-max 2" with llama.cpp b6548).<p>In theory, with "--split-mode layer" I think it's the slowest card that drives the overall performance anyway (I do see in "nvtop" that usually the 5070 is ~25% active, the 5060 ~50% and the 3060 ~75%).</p>
]]></description><pubDate>Sun, 28 Jun 2026 16:59:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=48709200</link><dc:creator>zepearl</dc:creator><comments>https://news.ycombinator.com/item?id=48709200</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48709200</guid></item><item><title><![CDATA[New comment by zepearl in "Ask HN: MacBook vs. Dedicated GPU for LLM"]]></title><description><![CDATA[
<p>I always run just one model at a time and switch between them depending on what I do (e.g. Qwen3.6 27B Q5_K_M by Bartowski when programming with "OpenHands" and when doing OCR text transcription and transformation, Gemma4 31B-it Q5_K_M by Bartowski when chatting in Open WebUI doing general tasks).</p>
]]></description><pubDate>Sun, 28 Jun 2026 15:48:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=48708426</link><dc:creator>zepearl</dc:creator><comments>https://news.ycombinator.com/item?id=48708426</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48708426</guid></item><item><title><![CDATA[New comment by zepearl in "Ask HN: MacBook vs. Dedicated GPU for LLM"]]></title><description><![CDATA[
<p>I agree. To run an acceptable model (e.g. Qwen/Qwen3.6-27B or google/gemma-4-31B) with a good quantization (minimum Q5) and a good context size (min 64k) you could buy 2 or even 3 RTX 5060 16GiB VRAM for ~550$ each. FYI the much faster MoE models were useless for my use cases - e.g. not able to correctly identify me/I/you, endless thinking loops, etc.<p>I'm currently running those models using an RTX 5070 12GiB + RTX 5060 16GiB + RTX 3060 12GiB with a 96k context size with MTP/speculative decoding and I'm quite happy (the 5070 is about 4x faster than the 3060, the 5060 is in between them, so about 2x faster than a 3060).</p>
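<p>A rough back-of-the-envelope sketch of the sizing above, covering only the quantized weights. The 5.5 bits per weight used in the usage note is an assumed round figure for a 5-bit quantization, not a measured one, and the function names are made up for illustration. The KV cache for the context comes on top and depends on the model architecture, so it is left out here.</p>

```python
GIB = 1024 ** 3


def weights_gib(params_billion, bits_per_weight):
    """Approximate size of the quantized weights alone, in GiB."""
    return params_billion * 1e9 * bits_per_weight / 8 / GIB


def headroom_gib(card_vram_gib, params_billion, bits_per_weight):
    """VRAM left over for KV cache and buffers once the weights are loaded."""
    return sum(card_vram_gib) - weights_gib(params_billion, bits_per_weight)
```

<p>For example, headroom_gib([12, 16, 12], 27, 5.5) estimates what remains on a 12 + 16 + 12 GiB setup for a 27B model under that assumption. The size of the actual GGUF file is the more reliable number.</p>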
]]></description><pubDate>Sat, 27 Jun 2026 04:55:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=48695260</link><dc:creator>zepearl</dc:creator><comments>https://news.ycombinator.com/item?id=48695260</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48695260</guid></item><item><title><![CDATA[New comment by zepearl in "Ask HN: Why Ask HN has only 14 questions now?"]]></title><description><![CDATA[
<p>It's the AI. Everything gets answered (somehow), no direct questions need to be asked anymore.<p>(you're right - I was wondering the same thing 1h ago :o) )</p>
]]></description><pubDate>Wed, 03 Jun 2026 21:19:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=48390229</link><dc:creator>zepearl</dc:creator><comments>https://news.ycombinator.com/item?id=48390229</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48390229</guid></item><item><title><![CDATA[New comment by zepearl in "32GB of DDR5 now costs $375 – AI shortage continues to squeeze PC building"]]></title><description><![CDATA[
<p>This...<p><i>Right now the biggest threat to their IPO's is that people realize that local models are good enough for whatever they're peddling...</i><p>...plus the recent price increases by AI companies, made me actually think the opposite: that there might be another additional "run" for memory and/or GPUs.<p>Therefore, yesterday I decided to order an additional RTX 5060 with 16 GiB VRAM for the ~500$ that I saved during the last months (to be added to the RTX 5070 12 GiB that I bought last year to play games in 4k + my old RTX 3060 12 GiB which I recycled a few months ago after noticing how nice it is to run llama.cpp locally without having to worry about subscription costs).<p>The original 24 GiB VRAM were actually quite enough for some of the stuff that I do (e.g. transcribe text of image scans of old magazines, coding with Aider, etc - I usually use Q5_K_M quantizations of Qwen & Gemma by Bartowski as lower ones delivered sometimes weird results and/or looped forever in "thinking"-mode), but I guess that with 40 GiB I should be bullet-proof for my pessimistic view of our future :o)</p>
]]></description><pubDate>Wed, 03 Jun 2026 18:13:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=48387594</link><dc:creator>zepearl</dc:creator><comments>https://news.ycombinator.com/item?id=48387594</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48387594</guid></item><item><title><![CDATA[New comment by zepearl in "Dirtyfrag: Universal Linux LPE"]]></title><description><![CDATA[
<p>damn you're right, thx</p>
]]></description><pubDate>Thu, 07 May 2026 21:58:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=48055679</link><dc:creator>zepearl</dc:creator><comments>https://news.ycombinator.com/item?id=48055679</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48055679</guid></item><item><title><![CDATA[New comment by zepearl in "Dirtyfrag: Universal Linux LPE"]]></title><description><![CDATA[
<p>So if I understand correctly 3 modules are involved:<p>- esp4 (kernel config "CONFIG_INET_ESP")<p>- esp6 (kernel config "CONFIG_INET6_ESP")<p>- rxrpc (kernel config "CONFIG_AF_RXRPC")<p>Is this correct?</p>
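<p>A small sketch for checking whether the three kernel options named above are enabled, assuming the kernel config is available as plain text. The function name and the sample text are made up for illustration.</p>

```python
def enabled_options(config_text, wanted):
    """Return {option: 'y' | 'm' | None} for each wanted CONFIG_* name.

    'y' = built in, 'm' = built as a module, None = not set or absent.
    """
    state = {name: None for name in wanted}
    for line in config_text.splitlines():
        line = line.strip()
        # Disabled options appear as comments ("# CONFIG_X is not set").
        if not line or line.startswith("#"):
            continue
        name, _, value = line.partition("=")
        if name in state and value in ("y", "m"):
            state[name] = value
    return state


# Hypothetical sample, not taken from a real system.
SAMPLE = """\
CONFIG_INET_ESP=m
# CONFIG_INET6_ESP is not set
CONFIG_AF_RXRPC=y
"""

WANTED = ["CONFIG_INET_ESP", "CONFIG_INET6_ESP", "CONFIG_AF_RXRPC"]
```

<p>On a real machine the text would come from wherever the distribution keeps the config, for example /boot/config-&lt;version&gt;, a decompressed /proc/config.gz, or /usr/src/linux/.config. Which of these exists depends on the system.</p>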
]]></description><pubDate>Thu, 07 May 2026 20:36:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=48054598</link><dc:creator>zepearl</dc:creator><comments>https://news.ycombinator.com/item?id=48054598</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48054598</guid></item><item><title><![CDATA[New comment by zepearl in "Copy Fail"]]></title><description><![CDATA[
<p>Thanks a lot!!!<p>I was running kernel "6.18.18" on Gentoo (amd64) and the exploit worked (and all other shells which I had PREVIOUSLY opened could then just execute "su -" without a password to become "root") -> temporarily doing a "modprobe -r algif_aead" on-the-fly did not fix it, as I was still able to switch to "root" from the unprivileged user by just executing "su -".<p>"6.18.25" fixed it (with the module "algif_aead" still loaded).<p>- Maybe older kernel versions that don't contain the fix should be blacklisted?<p>- FYI in Gentoo I had to recompile "sys-fs/zfs-kmod" after the minor kernel upgrade (I initially skipped it, but after rebooting with the new kernel I could not mount my raidz1) -> the same might be needed for other external modules.</p>
]]></description><pubDate>Thu, 30 Apr 2026 18:50:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=47966683</link><dc:creator>zepearl</dc:creator><comments>https://news.ycombinator.com/item?id=47966683</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47966683</guid></item><item><title><![CDATA[New comment by zepearl in "We do not think Anthropic should be designated as a supply chain risk"]]></title><description><![CDATA[
<p>Using X (at least in this context?) is weird.</p>
]]></description><pubDate>Sat, 28 Feb 2026 23:47:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=47201846</link><dc:creator>zepearl</dc:creator><comments>https://news.ycombinator.com/item?id=47201846</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47201846</guid></item><item><title><![CDATA[New comment by zepearl in "How do I cancel my ChatGPT subscription?"]]></title><description><![CDATA[
<p>I downloaded Ollama ( <a href="https://github.com/ollama/ollama/releases" rel="nofollow">https://github.com/ollama/ollama/releases</a> ) and experimented with a few Qwen models ( <a href="https://huggingface.co/Qwen/collections" rel="nofollow">https://huggingface.co/Qwen/collections</a> ).<p>My performance when using an RTX 5070 12GiB VRAM, Ryzen 7 9700X 8 cores CPU, 32GiB DDR5 6000MT (2 sticks):<p><pre><code>  - "qwen2.5:7b": ~128 tokens/second (this model fits 100% in the VRAM).
  - "qwen2.5:32b": ~4.6 tokens/second.
  - "qwen3:30b-a3b": ~42 tokens/second (this is a MoE model with multiple specialized "brains") (this uses all 12GiB VRAM + 9GiB system RAM, but the GPU usage during tests is only ~25%).
  - "qwen3.5:35b-a3b": ~17 tokens/second, but it's highly unstable and crashes -> currently not usable for me.
</code></pre>
So currently my sweet spot is "qwen3:30b-a3b" - even though the model doesn't completely fit on the GPU it's still fast enough. "qwen3.5" was disappointing so far, but maybe things will change in the future (maybe Ollama needs some special optimizations for the 3.5-series?).<p>I would therefore deduce that the most important thing is the amount of VRAM and that performance would be similar even when using an older GPU (e.g. an RTX 3060, which also has 12GiB VRAM)?<p>Performance without a GPU, tested by using a Ryzen 9 5950X 16 cores CPU, 128GiB DDR4 3200 MT:<p><pre><code>  - "qwen2.5:7b": ~9 tokens/second
  - "qwen3:32b": ~2 tokens/second
  - "qwen3:30b-a3b": ~16 tokens/second</code></pre></p>
]]></description><pubDate>Sat, 28 Feb 2026 09:44:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=47192943</link><dc:creator>zepearl</dc:creator><comments>https://news.ycombinator.com/item?id=47192943</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47192943</guid></item><item><title><![CDATA[How I launched 3 consoles and found true love at Babbage's store no. 9 (2013)]]></title><description><![CDATA[
<p>Article URL: <a href="https://arstechnica.com/gadgets/2013/01/how-i-launched-3-consoles-and-found-true-love-at-babbages-store-no-9/">https://arstechnica.com/gadgets/2013/01/how-i-launched-3-consoles-and-found-true-love-at-babbages-store-no-9/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47074671">https://news.ycombinator.com/item?id=47074671</a></p>
<p>Points: 79</p>
<p># Comments: 33</p>
]]></description><pubDate>Thu, 19 Feb 2026 15:14:18 +0000</pubDate><link>https://arstechnica.com/gadgets/2013/01/how-i-launched-3-consoles-and-found-true-love-at-babbages-store-no-9/</link><dc:creator>zepearl</dc:creator><comments>https://news.ycombinator.com/item?id=47074671</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47074671</guid></item><item><title><![CDATA[New comment by zepearl in "Ferrari vs. Markets"]]></title><description><![CDATA[
<p>What about pre-December_2022? I cannot imagine that just a handful were imported.</p>
]]></description><pubDate>Sun, 01 Feb 2026 01:11:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=46842651</link><dc:creator>zepearl</dc:creator><comments>https://news.ycombinator.com/item?id=46842651</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46842651</guid></item><item><title><![CDATA[New comment by zepearl in "Kidnapped by Deutsche Bahn"]]></title><description><![CDATA[
<p>> <i>The main reason for this is lack of competition for DB in Germany</i><p>Cannot be - there is no competition in Switzerland, but things run pretty smoothly -> in the case of Germany I'd rather say: "lack of oversight, controls, 'konsequent zu sein'" -> in the case of Germany's DB I think that nobody at all levels gives a *hit about its problems.</p>
]]></description><pubDate>Mon, 29 Dec 2025 23:54:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=46427619</link><dc:creator>zepearl</dc:creator><comments>https://news.ycombinator.com/item?id=46427619</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46427619</guid></item><item><title><![CDATA[New comment by zepearl in "Nvidia takes $5B stake in Intel under September agreement"]]></title><description><![CDATA[
<p>I interpreted your post like what "krupan" posted in the separate sub-thread ("This is a much tighter circle than any of us should be comfortable with"), but maybe others interpreted it differently (the words of your post are quite generic...). Cheers :o)</p>
]]></description><pubDate>Mon, 29 Dec 2025 22:35:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=46426687</link><dc:creator>zepearl</dc:creator><comments>https://news.ycombinator.com/item?id=46426687</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46426687</guid></item><item><title><![CDATA[New comment by zepearl in "Nearly 90% of Windows Games Now Run on Linux"]]></title><description><![CDATA[
<p>To fix stuttering I had to disable compositing in the window manager (Xfce on Linux Mint, NVIDIA proprietary driver with an AMD CPU).</p>
]]></description><pubDate>Wed, 29 Oct 2025 12:29:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=45745950</link><dc:creator>zepearl</dc:creator><comments>https://news.ycombinator.com/item?id=45745950</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45745950</guid></item><item><title><![CDATA[New comment by zepearl in "Robert Redford has died"]]></title><description><![CDATA[
<p>Fully agree - such a great movie: absolutely flowing, entertaining, fantastic characters, nice colors.
That together with "Three Days of the Condor" is what I immediately thought of when I heard the news, but so far only one of the newspapers/sites I read has mentioned both of them... weird :o|
Am I just getting too old (or are articles being written by people who are too junior)?</p>
]]></description><pubDate>Wed, 17 Sep 2025 22:37:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=45282262</link><dc:creator>zepearl</dc:creator><comments>https://news.ycombinator.com/item?id=45282262</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45282262</guid></item><item><title><![CDATA[Ask HN: Tesla switching from "Godot" to "Unreal": is this ~informative?]]></title><description><![CDATA[
<p>I read on many sites, e.g....<p>* English: https://www.theverge.com/news/759016/tesla-in-car-graphics-unreal-engine<p>* German: https://www.golem.de/news/fahrzeugvisualisierung-tesla-soll-wechsel-zur-unreal-engine-planen-2508-199152.html<p>...that Tesla might be switching from the "Godot"- to the "Unreal"-engine.<p>Are there any REAL reasons known for such a switch? (e.g. about bad_support/bad_performance/bad_adhoc_customizations/bad_overall_approach/whatever...)<p>Reason: just this week I started looking for a 3D-engine (candidates so far are Godot & Bevy, other options tbd) and this news made me wonder if I should be "more careful" about Godot... ?</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=44918453">https://news.ycombinator.com/item?id=44918453</a></p>
<p>Points: 10</p>
<p># Comments: 5</p>
]]></description><pubDate>Fri, 15 Aug 2025 23:28:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=44918453</link><dc:creator>zepearl</dc:creator><comments>https://news.ycombinator.com/item?id=44918453</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44918453</guid></item><item><title><![CDATA[New comment by zepearl in "Tell HN: Cloudflare is blocking Pale Moon and other non-mainstream browsers"]]></title><description><![CDATA[
<p>Same here - fail2ban then adds the IP to my nftables fw</p>
]]></description><pubDate>Thu, 06 Feb 2025 12:48:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=42961837</link><dc:creator>zepearl</dc:creator><comments>https://news.ycombinator.com/item?id=42961837</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42961837</guid></item><item><title><![CDATA[New comment by zepearl in "Google Cache is fully dead"]]></title><description><![CDATA[
<p>> <i>I would presume Google still has all this data. ...</i><p>Maybe - I guess that they must have served that "cached" content from DB-records that had it all saved directly (URL X has contents Y => basically a "mirror" of the terms that they indexed) => not having to store that "mirror" (only the search index) might save quite a lot of storage space (and I/O and CPU to decompress it, as users won't be requesting it anymore) => all in all that might save quite a lot of infrastructure costs $$$.<p>> <i>Could this be an advantage that Google can use to train their models on but others won't have access?</i><p><i>Maybe</i> (if they decided to just get rid of the I/O related to the user requests), but on the other hand I don't know if previously any "Google-consumer" was ever able to perform mass-downloads of Google's "cached" data - could that be done without being banned by Google's webpage (or API)?</p>
]]></description><pubDate>Wed, 25 Sep 2024 21:06:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=41651876</link><dc:creator>zepearl</dc:creator><comments>https://news.ycombinator.com/item?id=41651876</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41651876</guid></item></channel></rss>