<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: xmddmx</title><link>https://news.ycombinator.com/user?id=xmddmx</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sun, 21 Jun 2026 07:03:43 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=xmddmx" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by xmddmx in "Microsoft new Outlook takes 10 seconds to do what Outlook Classic does instantly"]]></title><description><![CDATA[
<p>Another way (which happens to be correct) to say tenth of a second is 100 000 000 (one hundred million) nanoseconds.  You were off by a factor of 1000!</p>
]]></description><pubDate>Thu, 18 Jun 2026 13:31:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=48584993</link><dc:creator>xmddmx</dc:creator><comments>https://news.ycombinator.com/item?id=48584993</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48584993</guid></item><item><title><![CDATA[New comment by xmddmx in "Did Claude increase bugs in rsync?"]]></title><description><![CDATA[
<p>There's a meta-level of irony here that's important to note.<p>TFA is defending the use of AI, and it very clearly (to me) used AI to analyze the data and present the results.<p>In doing so, the author used statistics in a way they do not appear to understand, and ended up making numerous false claims (you can see the thread discussing these here <a href="https://news.ycombinator.com/item?id=48417626">https://news.ycombinator.com/item?id=48417626</a> )<p>In short, the study doesn't have sufficient statistical power, and is making "no difference" claims that aren't justified.<p>The meta-irony is this:  the author used an LLM to interpret data in this study, and seems to have made the same category of mistake (confidently asserting falsehoods) that the study was supposed to be investigating (confidently submitting bad commits to the rsync project).</p>
]]></description><pubDate>Sat, 06 Jun 2026 02:09:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=48420707</link><dc:creator>xmddmx</dc:creator><comments>https://news.ycombinator.com/item?id=48420707</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48420707</guid></item><item><title><![CDATA[New comment by xmddmx in "Did Claude increase bugs in rsync?"]]></title><description><![CDATA[
<p>You say "the underpowered study did not find evidence".  Not true, it found quite a bit of evidence - many statistics were presented.  There is no absence of evidence.  The author wrote about the evidence, presenting P values and other statistics.<p>Of course the critical part is not the numbers, but what they mean.<p>So, what does the evidence mean?<p>The author interprets it to mean that there is no difference.  They state this several times:<p>"46%  EXACT PERMUTATION TEST P-VALUE (ONE-SIDED, H₁: CLAUDE MEAN > HISTORICAL)[...] What this p-value tells us is There's nothing unusual about the Claude group."<p>"74% ONE-SIDED P-VALUE (H₁: CLAUDE MORE LIKELY ABOVE MEDIAN)
Fisher's exact test asks: if we split all releases at the historical median (0.74 sev/10c), are these Claude releases significantly buggy than previous releases (more likely to land above the median)? With a p-value of 74%, the answer is a decisive no. "<p>In an under-powered study, when a P value is above your alpha level cutoff (.05, .01, whatever was chosen) you can't distinguish between "no effect" and "could be an effect, but I didn't see one".</p>
]]></description><pubDate>Sat, 06 Jun 2026 01:50:23 +0000</pubDate><link>https://news.ycombinator.com/item?id=48420597</link><dc:creator>xmddmx</dc:creator><comments>https://news.ycombinator.com/item?id=48420597</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48420597</guid></item><item><title><![CDATA[New comment by xmddmx in "Did Claude increase bugs in rsync?"]]></title><description><![CDATA[
<p>Perhaps in an “everyday language” way, but not in the technical, statistical sense.<p>In an underpowered statistical study, a claim that two experimental conditions did not differ are not persuasive.</p>
]]></description><pubDate>Sat, 06 Jun 2026 01:08:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=48420377</link><dc:creator>xmddmx</dc:creator><comments>https://news.ycombinator.com/item?id=48420377</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48420377</guid></item><item><title><![CDATA[New comment by xmddmx in "Did Claude increase bugs in rsync?"]]></title><description><![CDATA[
<p>The concept you need here is "Statistical Power".<p>The ELI5 version is that there are two mistakes you can make when looking at a P value:<p>Type I error, where your P value is falsely low.  In the experiment being discussed here, it would lead one to conclude that AI code is worse.  Otherwise known as a false positive.<p>Type II error, where your P value is falsely high, leading you to conclude that AI code is no different.  Otherwise known as a false negative.<p><a href="https://en.wikipedia.org/wiki/Power_(statistics)" rel="nofollow">https://en.wikipedia.org/wiki/Power_(statistics)</a><p>One can calculate statistical power for a given experimental protocol.<p>My hunch is that if you did this, you would find this experiment is grossly under-powered.<p>This means you can't make the "absence of evidence" claim.</p>
]]></description><pubDate>Fri, 05 Jun 2026 21:28:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=48418576</link><dc:creator>xmddmx</dc:creator><comments>https://news.ycombinator.com/item?id=48418576</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48418576</guid></item><item><title><![CDATA[New comment by xmddmx in "Ollama is now powered by MLX on Apple Silicon in preview"]]></title><description><![CDATA[
<p>On a M4 Pro MacBook Pro with 48GB RAM I did this test:<p>ollama run $model "calculate fibonacci numbers in a one-line bash script" --verbose<p><pre><code>  Model                         PromptEvalRate EvalRate
  ------------------------------------------------------
  qwen3.5:35b-a3b-q4_K_M         6.6            30.0
  qwen3.5:35b-a3b-nvfp4         13.2            66.5
  qwen3.5:35b-a3b-int4          59.4            84.4

</code></pre>
I can't comment on the quality differences (if any) between these three.</p>
]]></description><pubDate>Tue, 31 Mar 2026 14:51:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=47588208</link><dc:creator>xmddmx</dc:creator><comments>https://news.ycombinator.com/item?id=47588208</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47588208</guid></item><item><title><![CDATA[What's the deal with distributed SYN DOS attacks]]></title><description><![CDATA[
<p>I run some boutique web servers (Apache on macOS, https on 443) and most of them were being weird - occasional slowdowns.    Apache logs were mostly blank.  CPU and RAM were close to 0% usage.<p>But even a curl test on localhost was showing 1,2, or more seconds longer slowdown.s<p>After a lot of digging, realized that I was victim of some sort of DOS attack, which appeared to be a SYN flood attack.<p>In a normal SYN flood attack the SYN packets are sent from one IP address then never reply, leaving the server in a state with multiple connections stuck in the "SYN_RECVD" state.<p>In this attack, the SYN packets are actually being "sent" from multiple IPs - and one theory is that these are forged IPs, so the attack is really a reflection attack, where the attacker sends a forged IP SYN to my server, which replies (several times with SYN ACK)<p>I blocked the IP block /16 at the firewall and all was well, but this made me wonder:<p>How are modern OS's so vulnerable to this?  On my macOS server, I could run  netstat -anp tcp | grep '\.443 ' and see about 128 entries stuck in "SYN_RCVD" at which point the server just went to pieces.<p>In other words, if my server received 128 "SYN" packets, it would die for about 75 seconds.<p>Is this a macOS only problem? Are other OS's susceptible to this?<p>for this question, please consider "Use CloudFlare" off limits.  I'm more interested in why modern OSs can be so fragile to a few (forged) TCP packets.</p>
<hr>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47293006">https://news.ycombinator.com/item?id=47293006</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Sun, 08 Mar 2026 00:28:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=47293006</link><dc:creator>xmddmx</dc:creator><comments>https://news.ycombinator.com/item?id=47293006</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47293006</guid></item><item><title><![CDATA[New comment by xmddmx in "Qwen3.5 122B and 35B models offer Sonnet 4.5 performance on local computers"]]></title><description><![CDATA[
<p>See my other note [1] about bugs in Ollama with Qwen3.5.<p>I just tried this (Ollama macOS 0.17.4, qwen3.5:35b-a3b-q4_K_M) on a M4 Pro, and it did fine:<p>[Thought for 50.0 seconds]<p>1. potato
2. potato
[...]
100. potato<p>In other words, it did great.<p>I think 50 seconds of thinking beforehand was perhaps excessive?<p>[1] <a href="https://news.ycombinator.com/item?id=47202082">https://news.ycombinator.com/item?id=47202082</a></p>
]]></description><pubDate>Sun, 01 Mar 2026 00:38:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=47202297</link><dc:creator>xmddmx</dc:creator><comments>https://news.ycombinator.com/item?id=47202297</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47202297</guid></item><item><title><![CDATA[New comment by xmddmx in "Qwen3.5 122B and 35B models offer Sonnet 4.5 performance on local computers"]]></title><description><![CDATA[
<p>See my other note about bugs in Ollama with Qwen3.5.<p>I just tried this (Ollama macOS 0.17.4, qwen3.5:35b-a3b-q4_K_M) on a M4 Pro, and it did fine:<p>[Thought for 50.0 seconds]<p>1. potato
2. potato
[...]
100. potato<p>In other words, it did great.<p>I think 50 seconds of thinking beforehand was perhaps excessive?</p>
]]></description><pubDate>Sun, 01 Mar 2026 00:37:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=47202290</link><dc:creator>xmddmx</dc:creator><comments>https://news.ycombinator.com/item?id=47202290</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47202290</guid></item><item><title><![CDATA[New comment by xmddmx in "Qwen3.5 122B and 35B models offer Sonnet 4.5 performance on local computers"]]></title><description><![CDATA[
<p>Ollama users: there are notable bugs with ollama and Qwen3.5 so don't let your first impression be the last.<p>Theory is that some of the model parameters aren't set properly and this encourages endless looping behavior when run under ollama:<p><a href="https://github.com/ollama/ollama/issues?q=is%3Aissue%20state%3Aopen%20qwen%203.5" rel="nofollow">https://github.com/ollama/ollama/issues?q=is%3Aissue%20state...</a>  (a bunch of them)</p>
]]></description><pubDate>Sun, 01 Mar 2026 00:13:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=47202082</link><dc:creator>xmddmx</dc:creator><comments>https://news.ycombinator.com/item?id=47202082</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47202082</guid></item><item><title><![CDATA[New comment by xmddmx in "Sheldon Brown's Bicycle Technical Info"]]></title><description><![CDATA[
<p>> Sheldon Brown, a beloved iconoclast bicycle tech guru, died Sunday from a heart attack. He was 68 63.<p>Curious, what does "He was 68 63" mean.  Is it a bicycle gear joke about his age at death?</p>
]]></description><pubDate>Sat, 07 Feb 2026 00:43:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=46920119</link><dc:creator>xmddmx</dc:creator><comments>https://news.ycombinator.com/item?id=46920119</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46920119</guid></item><item><title><![CDATA[New comment by xmddmx in "Show HN: An interactive map of US lighthouses and navigational aids"]]></title><description><![CDATA[
<p>On Mac Safari, holding shift and using the magic mouse to scroll up or down reverses the zoom direction.<p>This is both right (Shift-X is the reverse of X due to convention) 
But is also wrong (Shift-Scroll is the macOS gesture for scrolling on maps where Scroll alone doesn't zoom in or out).<p>TLDR: I really wish Apple would adopt the "scroll up to zoom in" convention used by the rest of the free world.</p>
]]></description><pubDate>Sun, 25 Jan 2026 23:54:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=46759986</link><dc:creator>xmddmx</dc:creator><comments>https://news.ycombinator.com/item?id=46759986</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46759986</guid></item><item><title><![CDATA[New comment by xmddmx in "The struggle of resizing windows on macOS Tahoe"]]></title><description><![CDATA[
<p>My memory is that it was named "Aero Glass" which heightens the irony of "Liquid Glass" sucking.<p>But I see many references to it being called just "Aero", but some call it "Aero Glass" [1]<p>Does anyone know the truth?<p>[1] <a href="https://www.pcmag.com/archive/rip-aero-glass-windows-8-sticks-a-fork-in-familiar-ui-298095" rel="nofollow">https://www.pcmag.com/archive/rip-aero-glass-windows-8-stick...</a></p>
]]></description><pubDate>Sun, 11 Jan 2026 22:48:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=46581263</link><dc:creator>xmddmx</dc:creator><comments>https://news.ycombinator.com/item?id=46581263</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46581263</guid></item><item><title><![CDATA[New comment by xmddmx in "Lessons from 14 years at Google"]]></title><description><![CDATA[
<p>Right, this annoyed me too - it was stated w/o attribution as if novel.<p>What is the name of the law when someone writes a think piece of "stuff I've learned" and fails to cite any of it to existing knowledge?<p>Makes me wonder if (A) they do know it's not their idea, but they are just cool with plagiarism or (B) they don't know it's not their idea.</p>
]]></description><pubDate>Sun, 04 Jan 2026 23:20:23 +0000</pubDate><link>https://news.ycombinator.com/item?id=46493448</link><dc:creator>xmddmx</dc:creator><comments>https://news.ycombinator.com/item?id=46493448</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46493448</guid></item><item><title><![CDATA[New comment by xmddmx in "Microsoft kills official way to activate Windows 11/10 without internet"]]></title><description><![CDATA[
<p>You mean 1990. Someone graduating college in 1990 would have been about 21.  That was 35 years ago,  so they would be about 56 in 2025.<p>Math is hard.</p>
]]></description><pubDate>Sat, 03 Jan 2026 03:25:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=46472513</link><dc:creator>xmddmx</dc:creator><comments>https://news.ycombinator.com/item?id=46472513</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46472513</guid></item><item><title><![CDATA[New comment by xmddmx in "Fighting Fire with Fire: Scalable Oral Exams"]]></title><description><![CDATA[
<p>The CFR is pretty clear, and I have experience with this (being both an IRB reviewer, faculty member, and researcher).  When it says "is exempt" it means "is exempt".<p>Imagine otherwise: a teacher who wants change their final exam from a 50 item Scantron using A-D choices, to a 50 item Scantron using A-E choices, because they think having 5 choices per item is better than 4, would need to ask for IRB approval.  That's not feasible, and is not what happens in the real world of academia.<p>It is true that local IRBs may try to add additional rules, but the NU policy you quote talks about "studies".    Most IRBs would disagree that "professor playing around with grading procedures and policies" constitutes a "study".<p>It would be presumed exempted.<p>Are you a teacher or a student?  If you are a teacher, you have wide latitude that a student researcher does not.<p>Also, if you are a teacher, doing "research about your teaching style", that's exempted.<p>By contrast, if you are a student, or a teacher "doing research" that's probably not exempt and must go through IRB.</p>
]]></description><pubDate>Sat, 03 Jan 2026 03:04:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=46472394</link><dc:creator>xmddmx</dc:creator><comments>https://news.ycombinator.com/item?id=46472394</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46472394</guid></item><item><title><![CDATA[New comment by xmddmx in "Fighting Fire with Fire: Scalable Oral Exams"]]></title><description><![CDATA[
<p>Turns out that under the USA Code of Federal Regulations, there's a pretty big exemption to IRB for research on pedagogy:<p>CFR 46.104 (Exempt Research):<p>46.104.d.1
"Research, conducted in established or commonly accepted educational settings, that specifically involves normal educational practices that are not likely to adversely impact students' opportunity to learn required educational content or the assessment of educators who provide instruction. This includes most research on regular and special education instructional strategies, and research on the effectiveness of or the comparison among instructional techniques, curricula, or classroom management methods."<p><a href="https://www.ecfr.gov/current/title-45/subtitle-A/subchapter-A/part-46/subpart-A/section-46.104" rel="nofollow">https://www.ecfr.gov/current/title-45/subtitle-A/subchapter-...</a><p>So while this may have been a dick move by the instructors, it was probably legal.</p>
]]></description><pubDate>Fri, 02 Jan 2026 21:56:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=46469889</link><dc:creator>xmddmx</dc:creator><comments>https://news.ycombinator.com/item?id=46469889</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46469889</guid></item><item><title><![CDATA[New comment by xmddmx in "1.5 TB of VRAM on Mac Studio – RDMA over Thunderbolt 5"]]></title><description><![CDATA[
<p>I was impressed by the lack of dominance of Thunderbolt:<p>"Next I tested llama.cpp running AI models over 2.5 gigabit Ethernet versus Thunderbolt 5"<p>Results from that graph showed only a ~10% benefit from TB5 vs. Ethernet.<p>Note: The M3 studios support 10Gbps ethernet, but that wasn't tested.  Instead it was tested using 2.5Gbps ethernet.<p>If 2.5G ethernet was only 10% slower than TB, how would 10G Ethernet have fared?<p>Also, TB5 has to be wired so that every CPU is connected to every other over TB, limiting you to 4 macs.<p>By comparison, with Ethernet, you could use a hub & spoke configuration with a Ethernet switch, theoretically letting you use more than 4 CPUs.</p>
]]></description><pubDate>Fri, 19 Dec 2025 00:32:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=46320851</link><dc:creator>xmddmx</dc:creator><comments>https://news.ycombinator.com/item?id=46320851</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46320851</guid></item><item><title><![CDATA[New comment by xmddmx in "Claude CLI deleted my home directory and wiped my Mac"]]></title><description><![CDATA[
<p>I really hope the user was running Time Machine - in default settings, Time Machine does hourly snapshot backups of your whole Mac.  Restoring is super easy.</p>
]]></description><pubDate>Mon, 15 Dec 2025 00:13:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=46268669</link><dc:creator>xmddmx</dc:creator><comments>https://news.ycombinator.com/item?id=46268669</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46268669</guid></item><item><title><![CDATA[New comment by xmddmx in "IQ differences of identical twins reared apart are influenced by education"]]></title><description><![CDATA[
<p>There is something wrong with this article, possibly just copyediting mistakes but it makes me question the whole thing.<p>For example, check out this mess:<p>> “Unfortunately, there is one significant issue with the aforementioned data: schooling. Seeing as the majority of work to date includes only aggregate data, it is impossible to account. The first concerns small N: seeing as most publish studies only include a handful of TRA data, there is a lot of room for error and over.<p>Unfortunately, there is a largely unaccounted for confound in this aggregate data which may make generalized analysis questionable: schooling.”</p>
]]></description><pubDate>Tue, 25 Nov 2025 20:51:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=46050625</link><dc:creator>xmddmx</dc:creator><comments>https://news.ycombinator.com/item?id=46050625</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46050625</guid></item></channel></rss>