<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: rft</title><link>https://news.ycombinator.com/user?id=rft</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Mon, 15 Jun 2026 11:28:17 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=rft" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by rft in "Gemma 4 12B: A unified, encoder-free multimodal model"]]></title><description><![CDATA[
<p>Those numbers are wrong for most use cases. Likely the LLM did not take SWA (Sliding Window Attention) of G4 into account. Without SWA those numbers could be correct, I can't load a q8 without SWA on a 24GB card.<p>I tested this with the at the time newest llama.cpp master on a Linux system with 2 3090 24GB, only one was used for testing. q8 without any KV quant, 256k context, mmproj loaded takes less than 20GB VRAM. This runs at about 1.5k to 2k tok/s pp and 40-50 tok/s gen (slightly lowered power limits & undervolted). q8 with 64k non-quant context and mmproj takes just under 16GB VRAM. Drop down to the q6k model, no mmproj, 64k non-quant context and it fits in 12GB VRAM. All the way down to q4km and some batch size tweaking and it barely fits into 8GB VRAM.<p>64k context is the minimum for Hermes agent, so a vision capable "agentic" model fits into a 16GB card. This is very impressive. I am currently testing how smart the model is and it does decently so far, had one looping issue it recovered after a lot of tokens, did some basic tool calling.</p>
]]></description><pubDate>Thu, 04 Jun 2026 21:37:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=48404967</link><dc:creator>rft</dc:creator><comments>https://news.ycombinator.com/item?id=48404967</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48404967</guid></item><item><title><![CDATA[New comment by rft in "I put a datacenter GPU in my gaming PC"]]></title><description><![CDATA[
<p>Ah, ok. I took a look at the 3090 numbers and they list 400 tok/s prefill, so if I normalize my expectations to that base line the numbers you posted do make sense. I haven't dug deep into that site's methodology, but their estimates seems way off. Especially since they don't take into account cache quant when deciding whether or not you can run a model. Overall I found that website a bit confusing, but maybe the UX just didn't click with me.</p>
]]></description><pubDate>Sun, 31 May 2026 20:34:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=48349475</link><dc:creator>rft</dc:creator><comments>https://news.ycombinator.com/item?id=48349475</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48349475</guid></item><item><title><![CDATA[New comment by rft in "I put a datacenter GPU in my gaming PC"]]></title><description><![CDATA[
<p>> The MI100 is roughly double the performance on Qwen 3.5 35B A3B Q5_K_M to the R9700 (462 token/s prefill vs 239 tokens/s, 217 tokens/s vs 118 token/s for inference)<p>Those prefill numbers look really low to me. I can run nearly that same model (qwen 3.6) at q4km with q6 cache on a single 3090 and get 2.3k-4.4k prefill and 100-170 generation. Just based on raw numbers I would expect the R9700 to land around 70-90 generation (about 2/3 of memory bandwidth of a 3090) and at least the same or higher prefill (nearly 3x FP16 TOPS on the R9700). That means the numbers really don't add up. Is the benchmark done with some special settings, e.g. parallel requests or with very low prompt length?</p>
]]></description><pubDate>Sun, 31 May 2026 19:59:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=48349183</link><dc:creator>rft</dc:creator><comments>https://news.ycombinator.com/item?id=48349183</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48349183</guid></item><item><title><![CDATA[New comment by rft in "Anthropic acquires Stainless"]]></title><description><![CDATA[
<p>For those who, like me, had to do a double take on that number: <a href="https://www.businessinsider.com/openclaw-peter-steinberger-ai-token-bill-2026-5" rel="nofollow">https://www.businessinsider.com/openclaw-peter-steinberger-a...</a><p>Yes, $1.3M in token cost in less than 30 days and some days were even off-peak, if you can call it that with that insane scale that likely hides quite a lot of tokens in the lower bars.<p>HN thread: <a href="https://news.ycombinator.com/item?id=48159227">https://news.ycombinator.com/item?id=48159227</a></p>
]]></description><pubDate>Tue, 19 May 2026 00:00:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=48187556</link><dc:creator>rft</dc:creator><comments>https://news.ycombinator.com/item?id=48187556</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48187556</guid></item><item><title><![CDATA[New comment by rft in "Poland is now among the 20 largest economies"]]></title><description><![CDATA[
<p>Without having firm data, I can see a few factors that are different. After the collapse of the GDR, it was easier for eastern Germans to move to west Germany than for Polish to move to a different country in the west. Mostly younger and educated people would have made that move, hampering future generations. With the Reunification also came the whole Treuhand issue which essentially sold off a good chunk of eastern Germany for pennies to western investors, because eastern investors had no capital. That meant the east lost out on the profits from its economy as they would accumulate in the west instead. Even today a large part of east German rentals are owned by western landlords or corporations. Then the industrial base of west Germany was setup far more for competing on the open world market with automotive companies in the NW (VW), SW (Daimler) and SE (BMW) plus the big industrial area Ruhrgebiet. So you naturally got an economic focus even after Reunification on the old BRD with the previous GDR requiring decades to hopefully catch up to the rest of the new country.</p>
]]></description><pubDate>Fri, 08 May 2026 13:25:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=48062792</link><dc:creator>rft</dc:creator><comments>https://news.ycombinator.com/item?id=48062792</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48062792</guid></item><item><title><![CDATA[New comment by rft in "Edit store price tags using Flipper Zero"]]></title><description><![CDATA[
<p>And a minority of those kids will get curious about the How and Why. Those are the security nerds of the future securing the networks against both the kids they were themselves and actual malicious actors.<p>Source: Early interest in wifi security, including in other people's networks, lead me down an education and career in security</p>
]]></description><pubDate>Tue, 21 Apr 2026 11:17:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=47847254</link><dc:creator>rft</dc:creator><comments>https://news.ycombinator.com/item?id=47847254</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47847254</guid></item><item><title><![CDATA[New comment by rft in "Bet on German Train Delays"]]></title><description><![CDATA[
<p>In 2019 there was a talk about data mining the DB arrival data [1] (yes, this problem is nothing new). One of the takeaways was that on some connections you can actually buy a "Sparticket" (cheaper, but only valid for a specific train), but get it upgraded to a "Flexticket" (more expensive, can take any train on the route) for free. This works because a delay of more than X minutes removes the specific train requirement and some routes are nearly always delayed by at least that threshold.<p>[1] <a href="https://media.ccc.de/v/36c3-10652-bahnmining_-_punktlichkeit_ist_eine_zier#t=2601" rel="nofollow">https://media.ccc.de/v/36c3-10652-bahnmining_-_punktlichkeit...</a> (German)</p>
]]></description><pubDate>Wed, 04 Mar 2026 15:04:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=47248535</link><dc:creator>rft</dc:creator><comments>https://news.ycombinator.com/item?id=47248535</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47248535</guid></item><item><title><![CDATA[New comment by rft in "Go Home, Windows EXE, You're Drunk"]]></title><description><![CDATA[
<p>Related, there is a stable way to detect whether your .exe is running under Wine and even which version by looking up the address of the export wine_get_version in ntdll [1]. Years ago I actually had to do this to work around some weird path bug when we were testing our Windows build under Wine (easier to setup Wine than a full Windows CI).<p>[1] <a href="https://www.winehq.org/pipermail/wine-devel/2008-September/069387.html" rel="nofollow">https://www.winehq.org/pipermail/wine-devel/2008-September/0...</a></p>
]]></description><pubDate>Fri, 16 Jan 2026 13:06:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=46645994</link><dc:creator>rft</dc:creator><comments>https://news.ycombinator.com/item?id=46645994</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46645994</guid></item><item><title><![CDATA[New comment by rft in "39th Chaos Communication Congress Videos"]]></title><description><![CDATA[
<p>I still have to go through my watch list, the age old issue of not having my slides done before congress...<p>The 10 year of Dieselgate is interesting just from a "how bad is it really?" PoV, I saw the part about curves and other defeat devices already [1].<p>The Rowhammer talk is likely going to be great as well, I like Daniel's work [2].<p>The practical Cross-VM Spectre was interesting to show this is still a problem [3].<p>The opensource secure element was good for trying such a thing, but I wasn't that impressed with the content [4].<p>[1] <a href="https://cfp.cccv.de/39c3/talk/7MSRA7/" rel="nofollow">https://cfp.cccv.de/39c3/talk/7MSRA7/</a> <a href="https://media.ccc.de/v/39c3-10-years-of-dieselgate" rel="nofollow">https://media.ccc.de/v/39c3-10-years-of-dieselgate</a><p>[2] <a href="https://cfp.cccv.de/39c3/talk/3JXAJJ/" rel="nofollow">https://cfp.cccv.de/39c3/talk/3JXAJJ/</a> <a href="https://media.ccc.de/v/39c3-rowhammer-in-the-wild-large-scale-insights-from-flippyr-am" rel="nofollow">https://media.ccc.de/v/39c3-rowhammer-in-the-wild-large-scal...</a><p>[3] <a href="https://cfp.cccv.de/39c3/talk/ATYLN9/" rel="nofollow">https://cfp.cccv.de/39c3/talk/ATYLN9/</a> <a href="https://media.ccc.de/v/39c3-spectre-in-the-real-world-leaking-your-private-data-from-the-cloud-with-cpu-vulnerabilities" rel="nofollow">https://media.ccc.de/v/39c3-spectre-in-the-real-world-leakin...</a><p>[4] <a href="https://cfp.cccv.de/39c3/talk/9DYZXG/" rel="nofollow">https://cfp.cccv.de/39c3/talk/9DYZXG/</a> <a href="https://media.ccc.de/v/39c3-lessons-from-building-an-open-architecture-secure-element" rel="nofollow">https://media.ccc.de/v/39c3-lessons-from-building-an-open-ar...</a></p>
]]></description><pubDate>Fri, 02 Jan 2026 20:11:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=46468851</link><dc:creator>rft</dc:creator><comments>https://news.ycombinator.com/item?id=46468851</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46468851</guid></item><item><title><![CDATA[New comment by rft in "The Angry Path to Zen: AMD Zen Microcode Tools and Insights [video]"]]></title><description><![CDATA[
<p>Author here, happy to answer any questions you might have.</p>
]]></description><pubDate>Fri, 02 Jan 2026 09:18:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=46462983</link><dc:creator>rft</dc:creator><comments>https://news.ycombinator.com/item?id=46462983</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46462983</guid></item><item><title><![CDATA[New comment by rft in "WinApps: Run Windows apps as if they were a part of the native Linux OS"]]></title><description><![CDATA[
<p>SteamVR works ok, but last I checked it still performs worse than on Windows. If you are feeling adventurous, you can try a FOSS VR stack [1]. It works for Steam games running Proton and when it works it provides better performance. I had some troubles with it, sometimes you need to switch versions or you get some artifacts in games, or some games just don't work at all. Good thing is, switching between FOSS and SteamVR is as simple as launching either first before starting the VR game in Steam.<p>I guess the Linux VR stack might get a bit of love from Valve for the Steam Frame, so things might improve in the near future.<p>[1] <a href="https://lvra.gitlab.io/docs/fossvr/envision/" rel="nofollow">https://lvra.gitlab.io/docs/fossvr/envision/</a></p>
]]></description><pubDate>Sat, 29 Nov 2025 11:26:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=46086748</link><dc:creator>rft</dc:creator><comments>https://news.ycombinator.com/item?id=46086748</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46086748</guid></item><item><title><![CDATA[New comment by rft in "A monopoly ISP refuses to fix upstream infrastructure"]]></title><description><![CDATA[
<p>At this point escalating, or threatening to, might be the better option. But I can't help trying to figure out how to solve a people/organizational problem with a technological solution.<p>Github is still famously IPv4 only. I don't know if there is a split between the SSH (if you use SSH to access the repos) and HTTPS (the tarballs) setup on their end, so maybe you get full speed on IPv6 and limited on IPv4 (or the other way around). Try disabling IPv6 on your end, if the speeds match then this might be it. If IPv6 is fast using an IPv4 gateway that tunnels via an IPv6 VPN might be a workaround.<p>I also had a similar problem a while back. Some speedtests showed more bandwidth than I could get in regular HTTPS downloads. I could get multiple downloads running at the same time that in total added up to the expected speed. In my case the line was just lossy enough (TCP retransmits in Wireshark) for TCP to never scale up its window size properly beyond a certain limit per connection. I verified this by running iperf in TCP and UDP against a gigabit server, UDP reached near full speed because it didn't care about a few lost packages. Working around that issue might be a bit harder, maybe [1] via [2] can provide some ideas to look into.<p>[1] <a href="https://github.com/apernet/tcp-brutal" rel="nofollow">https://github.com/apernet/tcp-brutal</a><p>[2] <a href="https://news.ycombinator.com/item?id=38164574">https://news.ycombinator.com/item?id=38164574</a></p>
]]></description><pubDate>Sun, 23 Nov 2025 09:36:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=46022058</link><dc:creator>rft</dc:creator><comments>https://news.ycombinator.com/item?id=46022058</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46022058</guid></item><item><title><![CDATA[New comment by rft in "Germany: States Pass Porn Filters for Operating Systems"]]></title><description><![CDATA[
<p>As much as I hate where this might lead, I can't help but amuse myself over a scene like this: You walk up to a bearded fellow in a dark alley and quietly whisper "Stallman..." only for him to quietly respond "... was right.". He then opens his coat for you to choose the install USB of your favorite distro (Arch of course!). He dutifully hands you the stick and a printed copy of the GPLv5 for you to hide in your coat as you walk past the telescreens back to your home.</p>
]]></description><pubDate>Fri, 21 Nov 2025 12:18:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=46003811</link><dc:creator>rft</dc:creator><comments>https://news.ycombinator.com/item?id=46003811</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46003811</guid></item><item><title><![CDATA[New comment by rft in "Germany: States Pass Porn Filters for Operating Systems"]]></title><description><![CDATA[
<p>That part also caused my tin foil hat to heat up. At least they get the credit of including it directly instead of adding it in a later revision that gets even less news coverage. It is hard not to grow cynical when you see this.<p>I am also worried about another detail:<p>> The states also want to prevent the circumvention of blocking orders by erotic portals ... using so-called mirror domains – i.e., the distribution of identical content under a minimally changed web address. For a page to be treated as a mirror page and quickly blocked without a new procedure, it must essentially have the same content as the already blocked original.<p>Note the part "quickly blocked without a new procedure" so there is a way to block sites with even less process and oversight. That just invites overblocking without accountability.</p>
]]></description><pubDate>Fri, 21 Nov 2025 11:58:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=46003675</link><dc:creator>rft</dc:creator><comments>https://news.ycombinator.com/item?id=46003675</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46003675</guid></item><item><title><![CDATA[New comment by rft in "NTSB Preliminary Report – UPS Boeing MD-11F Crash [pdf]"]]></title><description><![CDATA[
<p>I managed to find some statistics on hull losses per million departures [1, p. 13]. Seems like indeed MD-11s have a highish rate of incidents by that metric compared to other types, even if they are not catastrophically less safe than other planes. That metric stacks the statistics a bit against cargo planes, which most (all?) MD-11s are now. These planes tend to fly longer haul instead of short hop, so you get more flight time/miles but less departures. There are also likely some other confounding factors like mostly night operations (visibility and crew fatigue) and the tendency to write off older planes instead of returning them to service after an incident. Plus these aircraft have been in operation long enough that improvements in procedures and training would impact them less than more modern types, as in they already had more accidents before these improvements.<p>[1] <a href="https://www.boeing.com/content/dam/boeing/boeingdotcom/company/about_bca/pdf/statsum.pdf" rel="nofollow">https://www.boeing.com/content/dam/boeing/boeingdotcom/compa...</a></p>
]]></description><pubDate>Thu, 20 Nov 2025 20:50:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=45997518</link><dc:creator>rft</dc:creator><comments>https://news.ycombinator.com/item?id=45997518</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45997518</guid></item><item><title><![CDATA[New comment by rft in "NTSB Preliminary Report – UPS Boeing MD-11F Crash [pdf]"]]></title><description><![CDATA[
<p>At some point it comes down to probabilities. With so many flights going on, one in a million incidents become a certainty. For example UA232 [1] suffered failure in all 3 redundant hydraulic systems due to an uncontained engine failure. Any of the 3 systems would have been enough to retain control of the aircraft. Of course this lead to some investigations on why all 3 systems could be impacted at the same time and what can be done to limit failures.<p>Besides the technical aspects that flight is an impressive example of resilience and skill. Bringing that plane down to the ground in nearly one piece was essentially impossible and a one in a million chance in itself.<p>[1] <a href="https://en.wikipedia.org/wiki/United_Airlines_Flight_232" rel="nofollow">https://en.wikipedia.org/wiki/United_Airlines_Flight_232</a></p>
]]></description><pubDate>Thu, 20 Nov 2025 20:06:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=45997003</link><dc:creator>rft</dc:creator><comments>https://news.ycombinator.com/item?id=45997003</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45997003</guid></item><item><title><![CDATA[New comment by rft in "NTSB Preliminary Report – UPS Boeing MD-11F Crash [pdf]"]]></title><description><![CDATA[
<p>Given that the report only mentioned a single other seemingly related accident in 1979 I am not sure that objectively this is a reason to discontinue flying these planes. The fact that these planes have been in service since the early 70s is a testament to their safety and reliability in itself. Of course public perception, especially with the videos of huge fireballs from hitting one of the worst possible locations, might put enough pressure on airlines to retire the planes anyway.<p>I agree on the end of an era. Hearing something else besides just Airbus- or Boeing-something always gives me a bit of joy. Even though MDs and DCs are of course Boeings in a sense now as well.</p>
]]></description><pubDate>Thu, 20 Nov 2025 19:46:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=45996792</link><dc:creator>rft</dc:creator><comments>https://news.ycombinator.com/item?id=45996792</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45996792</guid></item><item><title><![CDATA[New comment by rft in "NTSB Preliminary Report – UPS Boeing MD-11F Crash [pdf]"]]></title><description><![CDATA[
<p>I haven't seen that one, this video [1] includes a different angle taken from a vehicle on the airport.<p>[1] <a href="https://youtu.be/POKJUJk_2xs?t=342" rel="nofollow">https://youtu.be/POKJUJk_2xs?t=342</a></p>
]]></description><pubDate>Thu, 20 Nov 2025 19:32:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=45996651</link><dc:creator>rft</dc:creator><comments>https://news.ycombinator.com/item?id=45996651</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45996651</guid></item><item><title><![CDATA[New comment by rft in "NTSB Preliminary Report – UPS Boeing MD-11F Crash [pdf]"]]></title><description><![CDATA[
<p>Grounding all MD-11s and DC-10s is a major move. I guess it makes sense as a big factor was the fatigue cracks on the pylon (lugs), despite the pylon not being behind on inspections. I am wondering what the inspections of pylons in other planes will yield, likely that will determine whether the grounding will continue.<p>But beyond figuring out why the engine mount failed, I am very interested in what caused the actual crash. "Just" losing thrust in a single engine is usually not enough to cause a crash, the remaining engine(s) have enough margin to get the plane airborne. Of course this was a major structural failure and might have caused additional damage.<p>EDIT: It seems there was damage to the engine in the tail, even though this was not specified in the preliminary report, likely because it has not been sufficiently confirmed yet.</p>
]]></description><pubDate>Thu, 20 Nov 2025 19:23:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=45996558</link><dc:creator>rft</dc:creator><comments>https://news.ycombinator.com/item?id=45996558</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45996558</guid></item><item><title><![CDATA[New comment by rft in "Show HN: I made a down detector for down detector"]]></title><description><![CDATA[
<p>Had to check, but that is actually beyond what DNS allows. Labels (the part between dots) are limited to 63 characters. We could sneakily drop an s somewhere in there and then it would fit.<p><a href="https://datatracker.ietf.org/doc/html/rfc1035" rel="nofollow">https://datatracker.ietf.org/doc/html/rfc1035</a><p>Also I think I triggered a nice error log in domaintools just now. <a href="https://whois.domaintools.com/downdetectorsdowndetectorsdowndetectorsdowndetectorsdowndetector.com" rel="nofollow">https://whois.domaintools.com/downdetectorsdowndetectorsdown...</a></p>
]]></description><pubDate>Wed, 19 Nov 2025 14:00:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=45979581</link><dc:creator>rft</dc:creator><comments>https://news.ycombinator.com/item?id=45979581</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45979581</guid></item></channel></rss>