<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: dmw_ng</title><link>https://news.ycombinator.com/user?id=dmw_ng</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sun, 12 Apr 2026 15:23:25 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=dmw_ng" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by dmw_ng in "Restic: Backups done right"]]></title><description><![CDATA[
<p>I'm a restic user, but have resisted the urge to attempt a bikeshed for a long time, mostly due to perf. It's index format seems to be slow and terrible and the chunking algorithm it uses (rabin fingerprints) is very slow compared to more recent alternatives (like FastCDC). Drives me nuts to watch it chugging along backing up or listing snapshots at nowhere close to the IO rate of the system while still making the fans run. Despite that it still seems to be the best free software option around</p>
]]></description><pubDate>Sun, 13 Oct 2024 19:02:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=41830608</link><dc:creator>dmw_ng</dc:creator><comments>https://news.ycombinator.com/item?id=41830608</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41830608</guid></item><item><title><![CDATA[New comment by dmw_ng in "Hetzner Object Storage"]]></title><description><![CDATA[
<p>That's been a feature of S3 for quite a long time now, called S3 Select <a href="https://docs.aws.amazon.com/AmazonS3/latest/userguide/selecting-content-from-objects.html" rel="nofollow">https://docs.aws.amazon.com/AmazonS3/latest/userguide/select...</a><p>Despite it being an awesome feature I've been itching to use, I've never actually found a use for it beyond messing around. Most places where S3 Select might make sense seems to be subsumed (for my uses) by Athena. Athena has a rather large amount of conceptual and actual boilerplate to get up and running with, though, S3 Select requires no upfront planning beyond building a fancy query string (or using their SDK wrappers)<p>Where S3 Select is likely to become fiddly is anywhere multiple files are involved. Athena makes querying large collections of CSVs (etc) straightforward, and handles all the scheduling and results merging for you.</p>
]]></description><pubDate>Mon, 07 Oct 2024 17:19:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=41768371</link><dc:creator>dmw_ng</dc:creator><comments>https://news.ycombinator.com/item?id=41768371</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41768371</guid></item><item><title><![CDATA[New comment by dmw_ng in "Microsoft donates the Mono Project to the Wine team"]]></title><description><![CDATA[
<p>Modern .net on Linux is lovely, you can initialize a project, pull in the S3 client and write a 1-3 line C# program that AOT compiles to a single binary with none of the perf issues or GIL hand-wringing that plagues life in Python.<p>Given modern Python means type annotations everywhere, the convenience edge between it and modern C# (which dispenses with much of the javaesque boilerplate) is surprisingly thin, and the capabilities of the .net runtime far superior in many ways, making it quite an appealing alternative especially for perf sensitive stuff.</p>
]]></description><pubDate>Tue, 27 Aug 2024 22:14:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=41373743</link><dc:creator>dmw_ng</dc:creator><comments>https://news.ycombinator.com/item?id=41373743</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41373743</guid></item><item><title><![CDATA[New comment by dmw_ng in "Intel N100 Radxa X4 First Thoughts"]]></title><description><![CDATA[
<p>I recently bought an n100 and within a matter of days got buyer's remorse and impulse-purchased an n305 to go right beside it, which is currently sitting with a wildly overpriced 48 GB stick installed and 2TB SN850X, it's an absolute joy perfwise and the absence of heat it generates.<p>The only thing I'd reserve judgement on is the tendency to throttle. I haven't got far enough to characterize it, but it's not clear how much value those extra cores will add over the n100 with TDP settings tweaked down in the BIOS, and if leaving the n305 to run at max TDP, heat/noise/cost/temperature-related instability may start to become an issue, especially when packing other hot components like a decent SSD into the tiny cases they come in.</p>
]]></description><pubDate>Sun, 28 Jul 2024 02:19:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=41090792</link><dc:creator>dmw_ng</dc:creator><comments>https://news.ycombinator.com/item?id=41090792</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41090792</guid></item><item><title><![CDATA[New comment by dmw_ng in "Proton launches its own version of Google Docs"]]></title><description><![CDATA[
<p>Seems like a massive distraction from their offering for a small company, wonder why they didn't consider something like tight integration with OnlyOffice or similar. Setting out to build a new office suite feels about as sensible as building a new web browser from scratch. Except at least with a browser, you have open specs helping you through most of the endless supply of compatibility problems.</p>
]]></description><pubDate>Wed, 03 Jul 2024 13:46:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=40865923</link><dc:creator>dmw_ng</dc:creator><comments>https://news.ycombinator.com/item?id=40865923</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40865923</guid></item><item><title><![CDATA[New comment by dmw_ng in "120ms to 30ms: Python to Rust"]]></title><description><![CDATA[
<p>> converting huge amount of xml files<p>> pickling<p>Sounds like if this is the tooling and the task at hand, about the most complex things that should be passing through the pickler are partitioned lists of filenames rather than raw data. E.g. you can have each partition generate a parquet for combining in a final step (pyarrow.concat_tables() looks useful), or if it were some other format you were working with, potentially sending flat arrays back to the parent process as giant bytestrings or similar<p>This is not to say the limitations don't suck, just that very often there are simple approaches to avoid most of the pain</p>
]]></description><pubDate>Fri, 28 Jun 2024 00:33:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=40816687</link><dc:creator>dmw_ng</dc:creator><comments>https://news.ycombinator.com/item?id=40816687</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40816687</guid></item><item><title><![CDATA[New comment by dmw_ng in "SSH as a Sudo Replacement"]]></title><description><![CDATA[
<p>It's comical to see the sudo codebase mentioned in the same breath as increasing auditability here</p>
]]></description><pubDate>Sun, 23 Jun 2024 08:29:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=40765708</link><dc:creator>dmw_ng</dc:creator><comments>https://news.ycombinator.com/item?id=40765708</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40765708</guid></item><item><title><![CDATA[New comment by dmw_ng in "Scan HTML faster with SIMD instructions – Chrome edition"]]></title><description><![CDATA[
<p>Sufficiently fast software often allows leaving out whole layers of crap and needless indirection, the most common being caching. Fixing an algorithm so you can omit a dedicated database of intermediate results can be a huge maintainability/usability improvement. The same principle appears all over the place, e.g. immediate mode UIs, better networking (e.g. CSS image tiling vs. just fixing small request overhead in HTTP1 vs. QUIC), importing giant CSV files via some elaborate ETL process vs. just having a lightning fast parser+query engine, etc.<p>Depending on how you look at it, you could view large chunks of DOM state through this lens, as intermediate data that only exists to cache HTML parsing results. What's the point of allocating a hash table to represent element attributes if they are unchanged from the source document, and reparsing them from the source is just as fast as keeping around the parsed form? etc.  These kinds of tricks only tend to present themselves after optimization work is done, which is annoying, because it's usually so damn hard to justify optimization work in the first place.</p>
]]></description><pubDate>Fri, 14 Jun 2024 11:34:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=40679790</link><dc:creator>dmw_ng</dc:creator><comments>https://news.ycombinator.com/item?id=40679790</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40679790</guid></item><item><title><![CDATA[New comment by dmw_ng in "DuckDB Isn't Just Fast"]]></title><description><![CDATA[
<p>I've had similar times with DuckDB, it feels nicer to use on the surface but in terms of perf and actual function I've had a better experience with clickhouse-local.</p>
]]></description><pubDate>Tue, 11 Jun 2024 10:41:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=40644671</link><dc:creator>dmw_ng</dc:creator><comments>https://news.ycombinator.com/item?id=40644671</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40644671</guid></item><item><title><![CDATA[New comment by dmw_ng in "Spam blocklist SORBS closed by its owner, Proofpoint"]]></title><description><![CDATA[
<p>SPF+DKIM+DMARC are a classic case of Goodhart's law, the amount of spam they stop these days (at least anecdotally) is minimal. Most spam I get seems to come via SalesForce infrastructure, and a variety of similar bulk email marketing providers</p>
]]></description><pubDate>Sun, 09 Jun 2024 15:59:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=40625340</link><dc:creator>dmw_ng</dc:creator><comments>https://news.ycombinator.com/item?id=40625340</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40625340</guid></item><item><title><![CDATA[New comment by dmw_ng in "NSA Ghidra open-source reverse engineering framework"]]></title><description><![CDATA[
<p>> Linux with HiDPI, is such a chore<p>There's a simple broad-spectrum fix for that. For the odd occasion I want to view high definition photos in Linux I just switch screen mode temporarily, for the rest it is sooo not worth the hassle</p>
]]></description><pubDate>Wed, 29 May 2024 13:02:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=40511542</link><dc:creator>dmw_ng</dc:creator><comments>https://news.ycombinator.com/item?id=40511542</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40511542</guid></item><item><title><![CDATA[New comment by dmw_ng in "Cloudflare took down our website"]]></title><description><![CDATA[
<p>CloudFlare does not provide unmetered anything, at best they provide services on a discretionary basis while trying as hard as possible to make it appear this is not the case. It's better to think of their product line as a CRM system with some CDN features on the side</p>
]]></description><pubDate>Sun, 26 May 2024 13:54:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=40482208</link><dc:creator>dmw_ng</dc:creator><comments>https://news.ycombinator.com/item?id=40482208</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40482208</guid></item><item><title><![CDATA[New comment by dmw_ng in "Amazon S3 will no longer charge for several HTTP error codes"]]></title><description><![CDATA[
<p>> do lots of useful work for free<p>Have often wondered about this in terms of some of their control plane APIs, a read-only IAM key used as part of C&C infrastructure for a botnet might be interesting, you get DNS/ClientHello signature to a legitimate and reputable service for free, while stuffing "DDoS this blog" e.g. in some tags of a free resource. Even better if the AWS account belonged to someone else.<p>But certainly, ability to serve an unlimited URL space from an account with only positive hits being billed seems ripe for abuse. Would guess there's already some ticket for a "top 404ers" internal report or similar</p>
]]></description><pubDate>Mon, 13 May 2024 20:22:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=40347945</link><dc:creator>dmw_ng</dc:creator><comments>https://news.ycombinator.com/item?id=40347945</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40347945</guid></item><item><title><![CDATA[New comment by dmw_ng in "Amazon S3 will no longer charge for several HTTP error codes"]]></title><description><![CDATA[
<p>I prefer your version: Barr replies to a tweet before gatecrashing the next S3 planning session. "A customer is <i>hurting</i>, folks!". The call immediately falls silent with only occasional gasps heard from stunned engineers, and the gentle weeping of a PM. I wonder if Amazon offers free therapy following an incident like this</p>
]]></description><pubDate>Mon, 13 May 2024 20:10:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=40347802</link><dc:creator>dmw_ng</dc:creator><comments>https://news.ycombinator.com/item?id=40347802</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40347802</guid></item><item><title><![CDATA[New comment by dmw_ng in "Amazon S3 will no longer charge for several HTTP error codes"]]></title><description><![CDATA[
<p>Can't imagine a change like this would be made without some analysis.. would love an internal view into a decision like this, I wonder if they already have log data to compute financial loss from the change, or if they have sampling instrumentation fancy enough to write/deploy custom reports like this quickly.<p>In any case 2 weeks seems like an impressive turnaround for such a large service, unless they'd been internally preparing to acknowledge the problem for longer</p>
]]></description><pubDate>Mon, 13 May 2024 19:13:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=40347108</link><dc:creator>dmw_ng</dc:creator><comments>https://news.ycombinator.com/item?id=40347108</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40347108</guid></item><item><title><![CDATA[New comment by dmw_ng in "Jon Pretty wins in court against sexual harassment claims by Scala community"]]></title><description><![CDATA[
<p>He did not "win in court", a consent order means both parties agreed to a mutual resolution prior to a judgement occurring. If I had to guess it was settled this way because of the time, misery and tremendous cost that would otherwise be involved for all parties. "Wins in court" suggests some thorough process leading up to a final evidential determination made by a judge. That is not what happened here, it'd probably be more accurate to say the parties were motivated to cooperate via their solicitors under looming threat of enduring that process.<p>No skin in this game, but looking at it from the respondents' perspective, 20k split between 4 is incredibly attractive: 5k and an apology to put an end to any threat relating to the issue, given the alternative of potentially unbounded costs to demonstrate innocence with a tortuous and uncertain outcome. From the claimant's perspective it is more confusing, triggering such a heavyweight process then settling for so little given the claimed harms suggests all kinds of things, not least including the kind of advice he may have received given the strength of the case as his solicitors understood it.</p>
]]></description><pubDate>Fri, 26 Apr 2024 20:25:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=40173829</link><dc:creator>dmw_ng</dc:creator><comments>https://news.ycombinator.com/item?id=40173829</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40173829</guid></item><item><title><![CDATA[New comment by dmw_ng in "I Used Netscape Composer in 2024"]]></title><description><![CDATA[
<p>We only very briefly had that around the start of Win95, by the time 98 came around (IIRC) Explorer and definitely IE already had flat toolbar buttons that only had bevels when hovering. I remember fetishizing how clean those new style controls looked (esp IE5ish with its grey-on-grey patterned background texture!) but it definitely broke the consistent interaction idiom you mention</p>
]]></description><pubDate>Fri, 19 Apr 2024 08:29:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=40084515</link><dc:creator>dmw_ng</dc:creator><comments>https://news.ycombinator.com/item?id=40084515</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40084515</guid></item><item><title><![CDATA[New comment by dmw_ng in "50% discount on OpenAI pricing if you submit a batch and give them up to 24h"]]></title><description><![CDATA[
<p>I'm not promoting one way or the other, just pointing out why things are the way they are. Restarting a service with stateful networking is reason enough to avoid it where possible, watching entire buildings melt for 15 minutes because a single binary SEGV'd is a real outcome. For an extra helping of pain, add a herd of third party clients of random versions to a system that never needed the comms capabilities on offer and you have a problem to solve that never needed to exist in the first place</p>
]]></description><pubDate>Mon, 15 Apr 2024 20:09:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=40045023</link><dc:creator>dmw_ng</dc:creator><comments>https://news.ycombinator.com/item?id=40045023</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40045023</guid></item><item><title><![CDATA[New comment by dmw_ng in "50% discount on OpenAI pricing if you submit a batch and give them up to 24h"]]></title><description><![CDATA[
<p>There's a reason pretty much everything that does not require low latency replies avoids stateful networking - everything from RSS to video streaming prefers stateless polling designs because it is vastly easier for both parties to implement and scale. Meanwhile, I couldn't name a single system in widespread use built around a MQ paradigm in its public interface, except for actual MQ APIs, and many of those (e.g. from AWS) are still built on polling for the reasons just described</p>
]]></description><pubDate>Mon, 15 Apr 2024 18:42:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=40044169</link><dc:creator>dmw_ng</dc:creator><comments>https://news.ycombinator.com/item?id=40044169</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40044169</guid></item><item><title><![CDATA[New comment by dmw_ng in "Cloudflare R2 IA storage tier"]]></title><description><![CDATA[
<p>AWS already give you intelligent tiering for this, it's a very nice product but it's also just a nice way of hiding the same fees. Your $0.004/GB becomes $0.023/GB on first read for 1 month then $0.0125/GB for 2 months, so the average cost of storing it over those 3 months becomes $0.016/GB, and that's before considering monitoring fees</p>
]]></description><pubDate>Wed, 03 Apr 2024 15:23:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=39918679</link><dc:creator>dmw_ng</dc:creator><comments>https://news.ycombinator.com/item?id=39918679</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39918679</guid></item></channel></rss>