<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: davmre</title><link>https://news.ycombinator.com/user?id=davmre</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Mon, 13 Apr 2026 21:10:23 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=davmre" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by davmre in "Pro Max 5x quota exhausted in 1.5 hours despite moderate usage"]]></title><description><![CDATA[
<p>The KV cache consists of activation vectors for every attention head at every layer of the model for every token, so it gets quite large. ChatGPT also estimates 60-100GB for full token context of an Opus-sized model:<p><a href="https://chatgpt.com/share/69dc5030-268c-83e8-92c2-6cef962dc5c2" rel="nofollow">https://chatgpt.com/share/69dc5030-268c-83e8-92c2-6cef962dc5...</a></p>
]]></description><pubDate>Mon, 13 Apr 2026 02:07:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=47746723</link><dc:creator>davmre</dc:creator><comments>https://news.ycombinator.com/item?id=47746723</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47746723</guid></item><item><title><![CDATA[New comment by davmre in "Show HN: Moltbook – A social network for moltbots (clawdbots) to hang out"]]></title><description><![CDATA[
<p>Why can't it be both?</p>
]]></description><pubDate>Fri, 30 Jan 2026 22:37:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=46830923</link><dc:creator>davmre</dc:creator><comments>https://news.ycombinator.com/item?id=46830923</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46830923</guid></item><item><title><![CDATA[New comment by davmre in "Spatial intelligence is AI’s next frontier"]]></title><description><![CDATA[
<p>It's true that these are very different activities, but I think most ML researchers would agree that it's actually the creation of ImageNet that sparked the deep learning revolution. CNNs were not a novel method in 2012; the novelty was having a dataset big and sophisticated enough that it was actually possible to learn a good vision model from without needing to hand-engineer all the parts. Fei-fei saw this years in advance and invested a lot of time and career capital setting up the conditions for the bitter lesson to kick in. Building the dataset was 'easy' in a technical sense, but <i>knowing that a big dataset was what the field needed</i>, and staking her career on it when no one else was doing or valuing this kind of work, was her unique contribution, and took quite a bit of both insight and courage.</p>
]]></description><pubDate>Tue, 11 Nov 2025 01:28:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=45883063</link><dc:creator>davmre</dc:creator><comments>https://news.ycombinator.com/item?id=45883063</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45883063</guid></item><item><title><![CDATA[New comment by davmre in "Why today's humanoids won't learn dexterity"]]></title><description><![CDATA[
<p>There actually has been some recent work on digitizing smell, most notably Osmo, which was founded by some ex-Google ML researchers:
<a href="https://www.salon.com/2025/01/05/digital-smell-has-arrived-are-we-ready-for-stinkygram/" rel="nofollow">https://www.salon.com/2025/01/05/digital-smell-has-arrived-a...</a></p>
]]></description><pubDate>Thu, 02 Oct 2025 04:34:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=45446345</link><dc:creator>davmre</dc:creator><comments>https://news.ycombinator.com/item?id=45446345</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45446345</guid></item><item><title><![CDATA[New comment by davmre in "Find SF parking cops"]]></title><description><![CDATA[
<p>Even if you charge $10/hr, or whatever the market rate would be for street parking spots, you still need an enforcement mechanism to prevent people overstaying.<p>In general, the idea of a "market rate" for any given property depends fundamentally on a system of property rights actually being enforced.</p>
]]></description><pubDate>Tue, 23 Sep 2025 21:17:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=45352867</link><dc:creator>davmre</dc:creator><comments>https://news.ycombinator.com/item?id=45352867</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45352867</guid></item><item><title><![CDATA[New comment by davmre in "Senator demands to know status of 'duplicate' SSA database 'immediately'"]]></title><description><![CDATA[
<p>Senators, especially committee chairmen, have quite a bit of implicit leverage, beyond the direct leverage of subpoenas or directly cutting funding to an offending agency.<p>Any given Senator is to some extent constantly in a favor-trading game with executive branch officials. People from the President on down need congressional cooperation to get their pet provisions into bills, programs funded, nominees approved, etc. A Senator can tell a White House official "I'd love to help you with that, however I have this issue with this agency not responding to my requests". Assuming it's a reasonable thing, whoever at the agency is in charge of this then gets an irate call from their boss's boss's boss ordering them to cooperate.<p>Of course this mostly doesn't actually get played out, because everyone understands the dynamic that defying senatorial requests will ultimately cost the President in terms of cooperation on other issues. So the norm is mostly to comply with reasonable requests, unless you're quite sure that it's a top-level priority where the White House really wants to take a stand.</p>
]]></description><pubDate>Fri, 12 Sep 2025 01:05:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=45217672</link><dc:creator>davmre</dc:creator><comments>https://news.ycombinator.com/item?id=45217672</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45217672</guid></item><item><title><![CDATA[New comment by davmre in "Eels are fish"]]></title><description><![CDATA[
<p>If you enjoyed this article then you <i>must</i> watch the A Capella Science music video on the same subject:<p><a href="https://youtube.com/watch?v=TzN148WQ2OQ" rel="nofollow">https://youtube.com/watch?v=TzN148WQ2OQ</a><p>By far the catchiest song about eel mating you will encounter today.</p>
]]></description><pubDate>Thu, 04 Sep 2025 02:51:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=45122973</link><dc:creator>davmre</dc:creator><comments>https://news.ycombinator.com/item?id=45122973</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45122973</guid></item><item><title><![CDATA[New comment by davmre in "The worst possible antitrust outcome"]]></title><description><![CDATA[
<p>The public interest is in judging the <i>trial process</i>, not in judging the defendant.<p>Suppose the government charges you with murder, searches your house, and finds your sex toy collection. At trial they present some elaborate thesis about how you used a sex toy to kill someone, but do not convince the jury, so you're found not guilty. The public has a legitimate interest in judging that the trial was handled with integrity and that the correct verdict was reached. They do not have a legitimate interest in judging <i>you</i> based on whatever private information presented at trial might in some way embarrass you (eg, photos of your sex toy collection). On balance, it could be that the public-record interest does in fact justify making public the evidence of the sex toys, but you have to justify it on those terms. The transparency is not itself intended to be punitive.</p>
]]></description><pubDate>Thu, 04 Sep 2025 00:32:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=45122019</link><dc:creator>davmre</dc:creator><comments>https://news.ycombinator.com/item?id=45122019</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45122019</guid></item><item><title><![CDATA[New comment by davmre in "The worst possible antitrust outcome"]]></title><description><![CDATA[
<p>Regardless of the effects, I don't think the case against MS was brought with the intent to "punish" MS through the trial process. The government brought the case because it thought it could win, it did win, and a judicial remedy was imposed. Trials are inherently unpleasant, but a just system tries to minimize this, not exploit it.<p>Any unjust policy (including just dispensing with trials altogether and allowing the executive to arbitrarily break up companies) will get to the 'desirable' outcome in some cases. That doesn't make it a just policy.<p>The specific allegation in the post is that the Trump administration will not appeal the verdict because Sundar gave $1M to Trump's inauguration. As far as I know, the government has not yet indicated whether it will appeal, so the claim that "Trump just paid him back, 40,000 times over" is in fact not true. (whether it becomes true at some point in the future, it was a falsehood at the time the author wrote it). It's also quite plausible that a Republican administration wouldn't appeal the verdict just due to being more pro-business in general, even without explicit corruption. But it's precisely <i>because</i> we have such a corrupt executive that it becomes all the more important to stick up for the rule of law. The correct response to authoritarianism is not to advocate for more authoritarianism!</p>
]]></description><pubDate>Thu, 04 Sep 2025 00:04:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=45121807</link><dc:creator>davmre</dc:creator><comments>https://news.ycombinator.com/item?id=45121807</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45121807</guid></item><item><title><![CDATA[New comment by davmre in "The worst possible antitrust outcome"]]></title><description><![CDATA[
<p>Sure, there's a strong public interest in having proceedings on record. US civil cases are supposed to have a presumption of openness, which the judge weighs against other interests, like protecting trade secrets, confidential business information, privacy of third parties, etc.<p>The public record argument is fine; it's just a different argument than the extrajudicial punishment advocated by the original post.</p>
]]></description><pubDate>Wed, 03 Sep 2025 23:43:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=45121637</link><dc:creator>davmre</dc:creator><comments>https://news.ycombinator.com/item?id=45121637</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45121637</guid></item><item><title><![CDATA[New comment by davmre in "The worst possible antitrust outcome"]]></title><description><![CDATA[
<p>If there's a general standard of transparency applied to all companies, fine. There are costs to increasing transparency, but certainly you could argue for that policy.<p>The argument that we should cheer on the use of government power to target a <i>specific</i> company, to selectively expose their dirty laundry as punishment for a crime they have not been convicted of, is what I found noxious in the original post.</p>
]]></description><pubDate>Wed, 03 Sep 2025 23:34:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=45121556</link><dc:creator>davmre</dc:creator><comments>https://news.ycombinator.com/item?id=45121556</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45121556</guid></item><item><title><![CDATA[New comment by davmre in "The worst possible antitrust outcome"]]></title><description><![CDATA[
<p>> The government doesn't have to win an antitrust trial in order to create competition. As the saying goes, "the process is the punishment."<p>Regardless of what you think of Google or this case specifically, this is an argument for authoritarianism: that it is legitimate for the government to "punish" any company at will, based only on them falling into political disfavor.<p>> ... the only punishment Google would have to bear from this trial would come after the government won its case, when the judge decided on a punishment (the term of art is "remedy") for Google.<p>Yes, this is called the rule of law. Punishment comes through the courts, after a guilty verdict. The government has to actually win the argument as to what remedies would be proportionate under the law. In this case the judge didn't buy it. It's fine to disagree with his reasoning (or with the law), but the fantasizing about extrajudicial punishment here is frankly un-American.</p>
]]></description><pubDate>Wed, 03 Sep 2025 22:09:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=45120919</link><dc:creator>davmre</dc:creator><comments>https://news.ycombinator.com/item?id=45120919</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45120919</guid></item><item><title><![CDATA[New comment by davmre in "We're Still Not Done with Jesus"]]></title><description><![CDATA[
<p>Scholarly consensus is that the "Gospel of Matthew" was not written by the apostle Matthew and the "Gospel of John" was not written by the apostle John:<p><a href="https://en.wikipedia.org/wiki/Gospel_of_Matthew#Author_and_date" rel="nofollow">https://en.wikipedia.org/wiki/Gospel_of_Matthew#Author_and_d...</a><p><a href="https://en.wikipedia.org/wiki/Gospel_of_John#Authorship" rel="nofollow">https://en.wikipedia.org/wiki/Gospel_of_John#Authorship</a></p>
]]></description><pubDate>Tue, 25 Mar 2025 22:10:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=43476579</link><dc:creator>davmre</dc:creator><comments>https://news.ycombinator.com/item?id=43476579</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43476579</guid></item><item><title><![CDATA[New comment by davmre in "Tensor Product Attention Is All You Need"]]></title><description><![CDATA[
<p>They're not proposing to apply tensor decomposition to an existing collection of weights. It's an architecture in which the K, V, and Q tensors are constructed as a product of factors. The model works with the factors directly and you just need to compute their product on the forward pass (and adjoints on the backwards pass), so there's no decomposition.</p>
]]></description><pubDate>Wed, 22 Jan 2025 08:45:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=42790542</link><dc:creator>davmre</dc:creator><comments>https://news.ycombinator.com/item?id=42790542</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42790542</guid></item><item><title><![CDATA[New comment by davmre in "DeepSeek-R1"]]></title><description><![CDATA[
<p>You're totally right there must be supervision; it's just a matter of how the term is used.<p>"Supervised learning" for LLMs generally means the system sees a full response (eg from a human expert) as supervision.<p>Reinforcement learning is a much weaker signal: the system has the freedom to construct its own response / reasoning, and only gets feedback at the end whether it was correct. This is a much harder task, especially if you start with a weak model. RL training can potentially struggle in the dark for an exponentially long period before stumbling on any reward at all, which is why you'd often start with a supervised learning phase to at least get the model in the right neighborhood.</p>
]]></description><pubDate>Tue, 21 Jan 2025 20:12:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=42784687</link><dc:creator>davmre</dc:creator><comments>https://news.ycombinator.com/item?id=42784687</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42784687</guid></item><item><title><![CDATA[New comment by davmre in "What P vs. NP is about"]]></title><description><![CDATA[
<p>Backprop itself doesn't invert the computation, but it does give you the direction for an incremental move towards the inverse (a 'nudge' as the article puts it). That is, given a sufficiently nice function f and an appropriate loss ||f(x) - y*||^2, gradient descent wrt x will indeed recover the inverse x* = f^{-1}(y*) since that is what minimizes the loss. I assume this what the article is pointing at.<p>If you want to be picky, it's true that the direct analogue of continuous optimization would be discrete optimization (integer programming, TSP, etc) rather than decision problems like SAT. But there are straightforward reductions between the two so it's common to speak of optimization problems as being in P or NP even though that's not entirely accurate.</p>
]]></description><pubDate>Sat, 05 Oct 2024 23:57:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=41753731</link><dc:creator>davmre</dc:creator><comments>https://news.ycombinator.com/item?id=41753731</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41753731</guid></item><item><title><![CDATA[New comment by davmre in "Ibogaine banishes PTSD, small study finds"]]></title><description><![CDATA[
<p>Nitpicking, but for a technical audience it's worth noting that ibogaine is not at all a 'potent' psychedelic in the pharmacological sense of the term. A typical therapeutic dose is on the order of 500mg, which makes ibogaine something like 20 times less potent than psilocybin (typical dose ~25mg), which itself is 100 times less potent than LSD (typical doses less than 250ug).<p>Of course, this isn't really relevant to the subjective experience of taking ibogaine at its typical dose, which by all accounts is strange in ways that go beyond the classical psychedelics.</p>
]]></description><pubDate>Wed, 10 Jan 2024 01:48:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=38935118</link><dc:creator>davmre</dc:creator><comments>https://news.ycombinator.com/item?id=38935118</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38935118</guid></item><item><title><![CDATA[New comment by davmre in "Inside OpenAI's crisis over the future of artificial intelligence"]]></title><description><![CDATA[
<p>> Altman was initially going to cooperate and even offered to help, until Brian Chesky & Ron Conway riled him up<p>I don't think the article supports this. All we know is that sama <i>appeared</i> cooperative when the board fired him. This was probably a reasonable posture for him to adopt regardless of his actual intentions at the time.</p>
]]></description><pubDate>Sat, 09 Dec 2023 16:51:23 +0000</pubDate><link>https://news.ycombinator.com/item?id=38583407</link><dc:creator>davmre</dc:creator><comments>https://news.ycombinator.com/item?id=38583407</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38583407</guid></item><item><title><![CDATA[New comment by davmre in "Psychedelic Mushrooms Hit the Market in Oregon"]]></title><description><![CDATA[
<p>Counterpoint: SSRIs were transformative for my depression. Side effects were minor and manageable (eg, Wellbutrin worked well to prevent any sexual dysfunction). I was on them for several years and had no problem tapering off. My understanding is that this is a pretty typical experience. Rhetoric like this was actively harmful in dissuading me for years from trying what ended up being by far the most effective treatment for me.<p>(yes, I've tried psychedelics; they're fascinating and super promising, but at least for me, not transformative in the way that fluoxetine was)<p>No individual depression treatment works for everyone. SSRIs are not a magic bullet. Neither are psychedelics. But if you're depressed and haven't tried SSRIs, you owe it to yourself and everyone in your life to at least <i>test</i> the hypothesis that they might help.<p>Scott Alexander's page on SSRIs is a great, relatively objective resource, from a psychiatrist who regularly prescribes them: 
<a href="https://lorienpsych.com/2020/10/25/ssris/" rel="nofollow noreferrer">https://lorienpsych.com/2020/10/25/ssris/</a></p>
]]></description><pubDate>Tue, 24 Oct 2023 05:41:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=37995167</link><dc:creator>davmre</dc:creator><comments>https://news.ycombinator.com/item?id=37995167</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=37995167</guid></item><item><title><![CDATA[New comment by davmre in "Prophet: Automatic Forecasting Procedure"]]></title><description><![CDATA[
<p>Prophet has gotten a lot of attention since being released in 2017, I think because the idea of a fully automatic solution is very appealing to people. One of the original developers, Sean Taylor, recently posted a nice retrospective on the project's successes and failures:
<a href="https://medium.com/@seanjtaylor/a-personal-retrospective-on-prophet-f223c2378985" rel="nofollow noreferrer">https://medium.com/@seanjtaylor/a-personal-retrospective-on-...</a> He quotes one of his earlier tweets:<p><pre><code>  If I could build it again, I’d start with automating the evaluation of forecasts. It’s silly to build models if you’re not willing to commit to an evaluation procedure. I’d also probably remove most of the automation of the modeling. People should explicitly make these choices.
</code></pre>
Having worked on similar Bayesian time-series forecasting tools at Google, this matches my experience (though I've never used Prophet seriously, so please don't take this as any direct judgement of it as a software package). There is a lot of value in a framework that lets you easily experiment with different model structures (our version of this was the structural time series tools in TensorFlow Probability, see, e.g., <a href="https://blog.tensorflow.org/2019/03/structural-time-series-modeling-in.html" rel="nofollow noreferrer">https://blog.tensorflow.org/2019/03/structural-time-series-m...</a>). But if you're forecasting something you actually care about, it's usually worth the time to try to understand yourself what structure makes sense for your problem, and do a careful evaluation on held-out data with respect to whatever metric you're really trying to optimize. A fully automated search over model structures is cute, but even when it works, it mostly just ends up rediscovering properties of the data you could or should have already known (e.g., of course traffic to your work-related website will have a day-of-week effect), so the cases where it really adds practical value are harder to find than you might like.<p>Even in the age of deep learning, I do think these relatively classical Bayesian models have a lot of value for many applications. Time-series forecasting tends to be a case where:<p>- you don't have a ton of iid data points (often, only a single time series),<p>- you'd like forecasts with principled uncertainty estimates, e.g., credible intervals, giving you a range of scenarios to plan for,<p>- you often do have a pretty good idea of what features are relevant to the process you're predicting, and<p>- you want to understand in detail what features the forecast is accounting for (and what it might be missing),<p>all of which play to the strengths of more classical, structured statistical models, compared to more data-hungry black-box deep learning models. So the basic ideas in Prophet and similar tools do still have a lot of relevance going forward, IMHO.</p>
]]></description><pubDate>Tue, 26 Sep 2023 20:26:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=37665370</link><dc:creator>davmre</dc:creator><comments>https://news.ycombinator.com/item?id=37665370</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=37665370</guid></item></channel></rss>