<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: joefourier</title><link>https://news.ycombinator.com/user?id=joefourier</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Fri, 03 Jul 2026 09:09:19 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=joefourier" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by joefourier in "SpaceX to buy Cursor for $60B"]]></title><description><![CDATA[
<p>I stopped using Cursor because of how terribly optimised it is (worse than VSCode despite being a fork). It would routinely take up 50% of the CPU resources on my MacBook M4 and gigabytes of RAM for absolutely no reason.<p>I switched to Zed, and I'm never going back to Electron/non-native IDEs.</p>
]]></description><pubDate>Tue, 16 Jun 2026 13:10:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=48554762</link><dc:creator>joefourier</dc:creator><comments>https://news.ycombinator.com/item?id=48554762</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48554762</guid></item><item><title><![CDATA[New comment by joefourier in "How to earn a billion dollars"]]></title><description><![CDATA[
<p>Why is it unethical? I'm both a freelance engineer and a business owner that sells software, and I've both sold my labour for equity/revenue share, and for a flat hourly rate.<p>If I charge a client $50k for some software and they made $1 million profit from it, good for them? As long as they pay our mutually agreed upon rate on time and there was no hostile negotiation, why should I feel suddenly entitled to more money if that wasn't in our contract? How do I know how much of the value is from my work and not their marketing or idea?<p>What you're saying seems as crazy as me saying that someone who bought my software for $99 and used it on a multi-million dollar project is being unethical unless they give me more money. How on Earth does that make sense? Should I be forced to switch to a royalty model? What if I make more selling copies at a flat rate, what if I don't want to have to investigate the finances of thousands of customers and have to deal with that whole trouble?<p>For me it's the same thing regardless of whether I'm selling my labour or a product. I can choose whether to accept a flat hourly rate, equity, or a mix of both, and usually the better deal is the hourly rate.<p>If I find a way to hire a software engineer for market rates (say, $200k/year in the US) and get $2M revenue from their work, good? They can ask for a raise or a bonus, we can renegotiate, they can leave if they're unhappy, but I'm not obligated to give them more money than was in our agreement anymore than they're obligated to give me their salary back in the project fails.</p>
]]></description><pubDate>Sun, 14 Jun 2026 17:22:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=48529938</link><dc:creator>joefourier</dc:creator><comments>https://news.ycombinator.com/item?id=48529938</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48529938</guid></item><item><title><![CDATA[New comment by joefourier in "How to earn a billion dollars"]]></title><description><![CDATA[
<p>> My response is “great! Let businesses take a lesson here: give all your employees a chunk of the company. Let’s all share in the success!”<p>Don't >95% of tech companies offer stock options or equity, from startups to FAANG?</p>
]]></description><pubDate>Sun, 14 Jun 2026 13:40:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=48527124</link><dc:creator>joefourier</dc:creator><comments>https://news.ycombinator.com/item?id=48527124</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48527124</guid></item><item><title><![CDATA[New comment by joefourier in "xAI is looking more like a datacentre REIT than a frontier lab"]]></title><description><![CDATA[
<p>Yeah most of the performance increases have mostly been from architectural improvements like reduced precision tensor cores. AFAIK FP4 is basically the limit for floating point matmuls, after which you need to switch to integer addition if you want to reduce bits, and I don’t think we’ve figured out 1-bit LLMs just yet.</p>
]]></description><pubDate>Tue, 09 Jun 2026 11:45:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=48459825</link><dc:creator>joefourier</dc:creator><comments>https://news.ycombinator.com/item?id=48459825</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48459825</guid></item><item><title><![CDATA[New comment by joefourier in "xAI is looking more like a datacentre REIT than a frontier lab"]]></title><description><![CDATA[
<p>Demand is so high and supply so low customers will go to anyone that has any gear, period. Anthropic is paying xAI for GPUs from 2022, not the latest Nvidia release.</p>
]]></description><pubDate>Mon, 08 Jun 2026 23:04:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=48453652</link><dc:creator>joefourier</dc:creator><comments>https://news.ycombinator.com/item?id=48453652</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48453652</guid></item><item><title><![CDATA[New comment by joefourier in "xAI is looking more like a datacentre REIT than a frontier lab"]]></title><description><![CDATA[
<p>And yet Anthropic is paying xAI over a billion dollars a month for those out of date GPUs in their first datacentre (H100s being nearly 4 years old at this point).<p>Even A100s are still barely available on the major clouds despite being 6 years old.</p>
]]></description><pubDate>Mon, 08 Jun 2026 22:51:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=48453513</link><dc:creator>joefourier</dc:creator><comments>https://news.ycombinator.com/item?id=48453513</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48453513</guid></item><item><title><![CDATA[New comment by joefourier in "Real-time LLM Inference on Standard GPUs: 3k tokens/s per request"]]></title><description><![CDATA[
<p>Do you think the work will still apply to speculative/alternative decoding methods like MTP and block diffusion, which are making batch=1 decoding less memory bound? Kernel launch overhead and memory transfer become less and less significant as a % of time when computing multiple tokens at once.</p>
]]></description><pubDate>Fri, 29 May 2026 17:10:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=48326137</link><dc:creator>joefourier</dc:creator><comments>https://news.ycombinator.com/item?id=48326137</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48326137</guid></item><item><title><![CDATA[New comment by joefourier in "The AI bubble isn't like the internet bubble"]]></title><description><![CDATA[
<p>You'd be surprised, people are somehow buying Tesla P40s and M40s on eBay for almost $300 and $180 respectively (M40 being the same gen as GTX 950). Google Colab still offers T4s and it's taken them years to add modern GPUs. Hope they're powering them with renewables at least.<p>And people in general are holding on to their old machines for very long periods of time now, especially CPUs. I've had to support first gen Intel i7s at work! That's pre AVX.</p>
]]></description><pubDate>Tue, 26 May 2026 12:43:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=48279008</link><dc:creator>joefourier</dc:creator><comments>https://news.ycombinator.com/item?id=48279008</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48279008</guid></item><item><title><![CDATA[New comment by joefourier in "The AI bubble isn't like the internet bubble"]]></title><description><![CDATA[
<p>Outside of training the biggest LLMs at big labs, GPU lifespan isn't as short as the OP made it out to sound. A100s are 6 years old and still a reliable work-horse, and the 80GB version hasn't depreciated that much on the used market. On the consumer side, 3090s are actually still selling for very close to 2020 MSRP.<p>Even the ancient V100 (soon to be 10 years old!) had somewhat of resurgence on the second-hand market, with a healthy market for interconnects in China.<p>If I had a datacenter and power consumption was not a concern, I'd be holding on to my A100s for years at least for inference.</p>
]]></description><pubDate>Tue, 26 May 2026 11:45:27 +0000</pubDate><link>https://news.ycombinator.com/item?id=48278398</link><dc:creator>joefourier</dc:creator><comments>https://news.ycombinator.com/item?id=48278398</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48278398</guid></item><item><title><![CDATA[New comment by joefourier in "Perceptual Image Codec: What Matters in Practical Learned Image Compression"]]></title><description><![CDATA[
<p>> And why is V100 even used? V100 is four generations old and not even supported anymore.<p>It wouldn’t surprise me that due to bureaucratic processes, it’s still somehow the most readily available GPU for Apple researchers despite being almost 10 years old now. I recall even last year seeing V100s used by Microsoft researchers who weren’t working on LLMs.</p>
]]></description><pubDate>Sun, 24 May 2026 19:58:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=48260506</link><dc:creator>joefourier</dc:creator><comments>https://news.ycombinator.com/item?id=48260506</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48260506</guid></item><item><title><![CDATA[New comment by joefourier in "Was my $48K GPU server worth it?"]]></title><description><![CDATA[
<p>Not a single new 64GB GPU, but multiple used GPUs.<p>They’ve significantly increased in price (so much for hardware depreciation…) but you can still get a modded 22GB 2080 ti for $320, or a Mi50 32GB for ~$450 each (used to be $150 a few months ago, alas), or a Mi50 16GB or <$200 but you’d need to stack 4 of them.<p>There’s also some more exotic configurations but those are probably the simplest options. You won’t get the performance of an RTX Pro 6000 Blackwell of course, and the power consumption will be pretty high so it’s only worth it if you have cheap electricity. But it is possible.</p>
]]></description><pubDate>Fri, 22 May 2026 10:35:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=48234071</link><dc:creator>joefourier</dc:creator><comments>https://news.ycombinator.com/item?id=48234071</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48234071</guid></item><item><title><![CDATA[New comment by joefourier in "Was my $48K GPU server worth it?"]]></title><description><![CDATA[
<p>What quant? You should have no problem running it at Q4 with 256K context, Q5 or Q6 even although maybe not at full context. I can run Q4 on a 4090 with just 24GB VRAM.</p>
]]></description><pubDate>Thu, 21 May 2026 22:13:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=48229473</link><dc:creator>joefourier</dc:creator><comments>https://news.ycombinator.com/item?id=48229473</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48229473</guid></item><item><title><![CDATA[New comment by joefourier in "Was my $48K GPU server worth it?"]]></title><description><![CDATA[
<p>Who is going to buy a $4299 M5 Max MBP with 64GB of RAM just to run Gemma 4 31b? Firstly you don't need 64GB for that model. Secondly if you want a machine that sits in the corner and does nothing but LLM inference, you don't buy a MacBook Pro, you buy some GPUs which are going to cost you a fraction of that (~$1k for ~64GB of VRAM is possible). The people buying Apple Silicon for inference general aim for the Mac Studios with enormous amounts of RAM (128-512GB), to run very large models.<p>The idea is obviously to be running the LLM on your work laptop. As a developer I'd need a laptop with 24GB of RAM for work anyway, and 48GB, which is enough for a very good quant of Gemini, is just $400 extra.</p>
]]></description><pubDate>Thu, 21 May 2026 19:04:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=48227491</link><dc:creator>joefourier</dc:creator><comments>https://news.ycombinator.com/item?id=48227491</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48227491</guid></item><item><title><![CDATA[New comment by joefourier in "Was my $48K GPU server worth it?"]]></title><description><![CDATA[
<p>Why didn't you take into account batching, input tokens, different costs of electricity, and the fact that a laptop can still hold a decent % of its resale value, and is useful for many other tasks than running an LLM?</p>
]]></description><pubDate>Thu, 21 May 2026 18:45:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=48227253</link><dc:creator>joefourier</dc:creator><comments>https://news.ycombinator.com/item?id=48227253</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48227253</guid></item><item><title><![CDATA[New comment by joefourier in "Access to frontier AI will soon be limited by economic and security constraints"]]></title><description><![CDATA[
<p>>  I'd compare it to OpenAI 5 years ago except I think even then OpenAI had way more!<p>Say what? 5 years ago OpenAI had received around $139 million in funding, and they’d just come out with GPT3 with 175B parameters, a 2048 context window, trained on 300B tokens on a 10,000 V100 cluster which would have cost maybe $4-13 million at the time for their training run.<p>Meanwhile Deepseek V3’s famously frugal training was $5M, and Chinese AI companies are raising billions in funding. Sure American AI companies are raising tens (and maybe hundreds in the case of OpenAI, if you count their circular funding rounds) of billions but they’re grossly inefficient, and we’ve already hit the limits of the scaling laws where there’s little point in increasing the number of parameters of a model.</p>
]]></description><pubDate>Fri, 15 May 2026 12:26:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=48147769</link><dc:creator>joefourier</dc:creator><comments>https://news.ycombinator.com/item?id=48147769</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48147769</guid></item><item><title><![CDATA[New comment by joefourier in "Princeton mandates proctoring for in-person exams, upending 133 year precedent"]]></title><description><![CDATA[
<p>It's incredibly common all over Europe, not just Switzerland. Not only the metros but the trams and even buses often rely on this system where there's no turnstile or barrier, you just walk in.<p>Not sure it's about being a high trust society or not, there's frequent inspections where they block the doors, and you get a hefty fine if you're caught without a valid ticket. I certainly wouldn't call Prague or Rome or Dublin high trust societies on par with a Swiss city.</p>
]]></description><pubDate>Wed, 13 May 2026 21:53:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=48128117</link><dc:creator>joefourier</dc:creator><comments>https://news.ycombinator.com/item?id=48128117</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48128117</guid></item><item><title><![CDATA[New comment by joefourier in "I have seen the dystopian future of elderly care"]]></title><description><![CDATA[
<p>Personally I feel like it would be less undignified and infantilising to have a machine take care of my basic bodily functions than a human being. There's no feeling of judgement or being shamed in front of someone else, and the machine could even restore a feeling of autonomy since it would feel like you're using a tool instead of being helplessly reliant on another person's help.</p>
]]></description><pubDate>Sun, 10 May 2026 17:35:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=48086004</link><dc:creator>joefourier</dc:creator><comments>https://news.ycombinator.com/item?id=48086004</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48086004</guid></item><item><title><![CDATA[New comment by joefourier in "I returned to AWS and was reminded why I left"]]></title><description><![CDATA[
<p>> I also used dedicated servers in the late ’90s (and they still offer great value today). But before AWS, provisioning new hardware typically took days, not minutes.<p>VPSes and non-custom configs for dedicated servers were pretty instant as far as I know, I think the advantage of AWS was more that you could scale up and down much more easily since you weren’t locked down in a monthly contract, and that you could automate server provisioning through an API.</p>
]]></description><pubDate>Sun, 10 May 2026 13:21:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=48083765</link><dc:creator>joefourier</dc:creator><comments>https://news.ycombinator.com/item?id=48083765</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48083765</guid></item><item><title><![CDATA[New comment by joefourier in "I returned to AWS and was reminded why I left"]]></title><description><![CDATA[
<p>> Cloud computing was an absolutely mind blowing revolution - suddenly your startup could run its own computer systems in minutes without need to install and run your own systems in a data center. This was an absolute game changer, and I really drank the AWS Kool Aid down to every last drop then I licked out the cup. I was all in on AWS in a big way.<p>Am I the only one who remembers that VPSes and dedicated hosting services were a thing before AWS came around? Yes you had to pay for a month at a time and scaling wasn’t as instant, but it wasn’t like the only option before cloud computing was having to drive to the datacentre and install your own server.</p>
]]></description><pubDate>Sun, 10 May 2026 12:24:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=48083415</link><dc:creator>joefourier</dc:creator><comments>https://news.ycombinator.com/item?id=48083415</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48083415</guid></item><item><title><![CDATA[New comment by joefourier in "Singapore introduces caning for boys who bully others at school"]]></title><description><![CDATA[
<p>> In every country, men commit almost all violent crimes. In school, boys physically bully other boys. Hence the physical punishment for them.<p>As I've said, and @echoangle repeated, caning is used for cyberbullying, which girls do too (at a rate relatively close to boys actually). If the law was caning in response to physical bullying, and it just so happened that the vast majority of offenders were boys, I would not object on the basic of sexism (I still would not approve of schools being allowed to physically punish students).<p>> Yes, for homo sapiens, the female is more fragile than the male. This is basic biology. I'm sure that in praying mantis society, females get harsher punishments.<p>There's no way the typical 16 year old girl is more fragile than the typical 9 year old boy, yet only the latter is subject to this punishment. Until children reach the age of 12 or so the strength difference is quite minor (and there's even a brief period where girls are taller and heavier).<p>Also it's absurd to punish demographics differently based on their statistical averages. Redheads are less sensitive to pain, should your hair colour determine how many strokes of the cane you get?</p>
]]></description><pubDate>Fri, 08 May 2026 22:24:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=48069498</link><dc:creator>joefourier</dc:creator><comments>https://news.ycombinator.com/item?id=48069498</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48069498</guid></item></channel></rss>