<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: ma2rten</title><link>https://news.ycombinator.com/user?id=ma2rten</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Wed, 10 Jun 2026 02:00:27 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=ma2rten" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by ma2rten in "Ask HN: What was your "oh shit" moment with GenAI?"]]></title><description><![CDATA[
<p>My personal "oh shit" moment was in 2015, when this paper came out:
<a href="https://arxiv.org/abs/1506.05869" rel="nofollow">https://arxiv.org/abs/1506.05869</a><p>It showed me that a model trained only on movie subtitles data exhibited some (very primitive) reasoning. I have been working on Deep Learning and later LLMs ever since.</p>
]]></description><pubDate>Sun, 07 Jun 2026 04:30:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=48431803</link><dc:creator>ma2rten</dc:creator><comments>https://news.ycombinator.com/item?id=48431803</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48431803</guid></item><item><title><![CDATA[New comment by ma2rten in "Yann LeCun raises $1B to build AI that understands the physical world"]]></title><description><![CDATA[
<p>Erm, ... OpenAI has hyped when it started and it took 6 years to take off. It's way to early to declare the SSI and Thinking Machines have failed.</p>
]]></description><pubDate>Tue, 10 Mar 2026 15:19:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=47324468</link><dc:creator>ma2rten</dc:creator><comments>https://news.ycombinator.com/item?id=47324468</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47324468</guid></item><item><title><![CDATA[New comment by ma2rten in "OpenAI declares 'code red' as Google catches up in AI race"]]></title><description><![CDATA[
<p>Delaying doesn't necessarily mean they stop working on it. Also it might be a question of compute resource allocation as well.</p>
]]></description><pubDate>Tue, 02 Dec 2025 16:17:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=46122760</link><dc:creator>ma2rten</dc:creator><comments>https://news.ycombinator.com/item?id=46122760</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46122760</guid></item><item><title><![CDATA[New comment by ma2rten in "Show HN:emma019 Real-Time AI-Powered Texas Hold'em in Python and Flask"]]></title><description><![CDATA[
<p>You can add Show HN to the title for your own projects. They will show up in the show tab.</p>
]]></description><pubDate>Sat, 22 Nov 2025 00:02:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=46010608</link><dc:creator>ma2rten</dc:creator><comments>https://news.ycombinator.com/item?id=46010608</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46010608</guid></item><item><title><![CDATA[New comment by ma2rten in "How Airbus took off"]]></title><description><![CDATA[
<p>Europe is quite conservative, in the sense that they would not invest billions into an unproven venture. It makes sense that it would excel at an industry that requires putting safety above everything.</p>
]]></description><pubDate>Sun, 09 Nov 2025 05:14:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=45863098</link><dc:creator>ma2rten</dc:creator><comments>https://news.ycombinator.com/item?id=45863098</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45863098</guid></item><item><title><![CDATA[New comment by ma2rten in "BERT is just a single text diffusion step"]]></title><description><![CDATA[
<p>It's actually true on many levels, if you think about is needed for generating syntactically and grammatically correct sentences, coherent text and working code.</p>
]]></description><pubDate>Tue, 21 Oct 2025 02:30:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=45651812</link><dc:creator>ma2rten</dc:creator><comments>https://news.ycombinator.com/item?id=45651812</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45651812</guid></item><item><title><![CDATA[New comment by ma2rten in "BERT is just a single text diffusion step"]]></title><description><![CDATA[
<p>Interpretability research has found that Autoregressive LLMs also plan ahead what they are going to say.</p>
]]></description><pubDate>Mon, 20 Oct 2025 16:08:29 +0000</pubDate><link>https://news.ycombinator.com/item?id=45645523</link><dc:creator>ma2rten</dc:creator><comments>https://news.ycombinator.com/item?id=45645523</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45645523</guid></item><item><title><![CDATA[New comment by ma2rten in "Boeing has started working on a 737 MAX replacement"]]></title><description><![CDATA[
<p>Your use of the phrase makes no sense. It's the "no parking" that proofs the rule and not the exception.</p>
]]></description><pubDate>Wed, 01 Oct 2025 03:55:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=45434108</link><dc:creator>ma2rten</dc:creator><comments>https://news.ycombinator.com/item?id=45434108</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45434108</guid></item><item><title><![CDATA[New comment by ma2rten in "Are OpenAI and Anthropic losing money on inference?"]]></title><description><![CDATA[
<p>You can also look at the price of opensource models on openrouter, which are a fraction of the cost of closed source models. This is a market that is heavily commoditized, so I would expect it reflect the true cost with a small margin.</p>
]]></description><pubDate>Fri, 29 Aug 2025 02:47:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=45059554</link><dc:creator>ma2rten</dc:creator><comments>https://news.ycombinator.com/item?id=45059554</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45059554</guid></item><item><title><![CDATA[New comment by ma2rten in "Curious about the training data of OpenAI's new GPT-OSS models? I was too"]]></title><description><![CDATA[
<p>Presumably the model is trained in post-training to produce a response to a prompt, but not to reproduce the prompt itself. So if you prompt it with an empty prompt it's going to be out of distribution.</p>
]]></description><pubDate>Sun, 10 Aug 2025 08:19:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=44853667</link><dc:creator>ma2rten</dc:creator><comments>https://news.ycombinator.com/item?id=44853667</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44853667</guid></item><item><title><![CDATA[New comment by ma2rten in "MIT study explains why laws are written in an incomprehensible style"]]></title><description><![CDATA[
<p>The study seemed not very convincing to me, at least the way it was described in the article. To summarize: they asked crowdworkers to write a law who used legalese, but not when writing news stories about it or when explaining the law. From that the researchers concluded that people use legalese to convey authority.<p>But what if people just imitated the writing style of existing laws, but not with the intention to make it authoritative but because that is what they understood their task to be?</p>
]]></description><pubDate>Tue, 17 Dec 2024 05:53:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=42438708</link><dc:creator>ma2rten</dc:creator><comments>https://news.ycombinator.com/item?id=42438708</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42438708</guid></item><item><title><![CDATA[New comment by ma2rten in "Ask HN: How does Alexa avoid interrupting itself when saying its own name?"]]></title><description><![CDATA[
<p>This is the same problem as echo cancellation on calls. This is something that built into a lot of software and hardware.</p>
]]></description><pubDate>Sat, 29 Jun 2024 15:14:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=40831165</link><dc:creator>ma2rten</dc:creator><comments>https://news.ycombinator.com/item?id=40831165</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40831165</guid></item><item><title><![CDATA[New comment by ma2rten in "Maxtext: A simple, performant and scalable Jax LLM"]]></title><description><![CDATA[
<p>t5x was used to train PaLM 1.</p>
]]></description><pubDate>Wed, 24 Apr 2024 15:01:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=40145186</link><dc:creator>ma2rten</dc:creator><comments>https://news.ycombinator.com/item?id=40145186</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40145186</guid></item><item><title><![CDATA[New comment by ma2rten in "Travelling with Tailscale"]]></title><description><![CDATA[
<p><i>I have an upcoming trip to Europe, which I am quite excited about. I wanted to set up a Tailscale exit node to ensure that critical apps I depend on, such as banking portals continue working from outside the country.</i><p>I've never had an issue accessing banking portals from Europe.</p>
]]></description><pubDate>Mon, 15 Apr 2024 12:26:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=40039666</link><dc:creator>ma2rten</dc:creator><comments>https://news.ycombinator.com/item?id=40039666</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40039666</guid></item><item><title><![CDATA[New comment by ma2rten in "Apple cuts off Beeper Mini's access"]]></title><description><![CDATA[
<p>Apples cares about the privacy and security of iPhones as a differentiator.</p>
]]></description><pubDate>Fri, 08 Dec 2023 23:08:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=38575984</link><dc:creator>ma2rten</dc:creator><comments>https://news.ycombinator.com/item?id=38575984</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38575984</guid></item><item><title><![CDATA[New comment by ma2rten in "Gemini AI"]]></title><description><![CDATA[
<p>Noam.</p>
]]></description><pubDate>Wed, 06 Dec 2023 18:43:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=38547894</link><dc:creator>ma2rten</dc:creator><comments>https://news.ycombinator.com/item?id=38547894</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38547894</guid></item><item><title><![CDATA[New comment by ma2rten in "Gemini AI"]]></title><description><![CDATA[
<p>No this is not correct. Arguably OpenAI invented LLMs with GPT3 and the preceding scaling laws paper. I worked on LAMDA, it came after GPT4 and was not as capable. Google did invent the transformer, but all the authors of the paper have left since.</p>
]]></description><pubDate>Wed, 06 Dec 2023 18:06:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=38547429</link><dc:creator>ma2rten</dc:creator><comments>https://news.ycombinator.com/item?id=38547429</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38547429</guid></item><item><title><![CDATA[New comment by ma2rten in "OpenAI is exploring making its own AI chips"]]></title><description><![CDATA[
<p>Both Amazon and Google already do this, there are reports that Microsoft does as well.</p>
]]></description><pubDate>Fri, 06 Oct 2023 15:09:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=37791815</link><dc:creator>ma2rten</dc:creator><comments>https://news.ycombinator.com/item?id=37791815</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=37791815</guid></item><item><title><![CDATA[New comment by ma2rten in "How Transformers Work"]]></title><description><![CDATA[
<p>Yes, I think that is a reasonable way to think about it, in my opinion. However, with the language modeling objective it predicts the next token and because of the residual connections each intermediate layer is in the same space. So, maybe it would be more accurate to say that it is an increasingly accurate representation of the next token.</p>
]]></description><pubDate>Fri, 06 Oct 2023 11:33:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=37789352</link><dc:creator>ma2rten</dc:creator><comments>https://news.ycombinator.com/item?id=37789352</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=37789352</guid></item><item><title><![CDATA[New comment by ma2rten in "How Transformers Work"]]></title><description><![CDATA[
<p>Attention takes in all tokens in the sequence and outputs a new representation of the current token in context. Each layer of the transformer adds more context to the token.<p>I haven't read this explanation in detail and although they have some nice animations, I wouldn't go to FT to explain machine learning concepts. Here are two well known explanations that might be better:<p><a href="http://jalammar.github.io/illustrated-transformer/" rel="nofollow noreferrer">http://jalammar.github.io/illustrated-transformer/</a><p><a href="http://nlp.seas.harvard.edu/annotated-transformer/" rel="nofollow noreferrer">http://nlp.seas.harvard.edu/annotated-transformer/</a>.</p>
]]></description><pubDate>Thu, 05 Oct 2023 10:27:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=37776853</link><dc:creator>ma2rten</dc:creator><comments>https://news.ycombinator.com/item?id=37776853</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=37776853</guid></item></channel></rss>