<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: mlpro</title><link>https://news.ycombinator.com/user?id=mlpro</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sun, 24 May 2026 22:29:30 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=mlpro" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by mlpro in "Ideas are cheap, execution is cheaper"]]></title><description><![CDATA[
<p>Novel Ideas are never cheap, lol.</p>
]]></description><pubDate>Fri, 16 Jan 2026 03:09:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=46642551</link><dc:creator>mlpro</dc:creator><comments>https://news.ycombinator.com/item?id=46642551</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46642551</guid></item><item><title><![CDATA[New comment by mlpro in "Universal Reasoning Model (53.8% pass 1 ARC1 and 16.0% ARC 2)"]]></title><description><![CDATA[
<p>Lol. trying to copy the Universal Weight Subspace paper's naming to get famous.</p>
]]></description><pubDate>Tue, 23 Dec 2025 07:31:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=46363225</link><dc:creator>mlpro</dc:creator><comments>https://news.ycombinator.com/item?id=46363225</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46363225</guid></item><item><title><![CDATA[New comment by mlpro in "Coarse is better"]]></title><description><![CDATA[
<p>Lol, yeah.</p>
]]></description><pubDate>Sun, 21 Dec 2025 23:07:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=46349567</link><dc:creator>mlpro</dc:creator><comments>https://news.ycombinator.com/item?id=46349567</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46349567</guid></item><item><title><![CDATA[New comment by mlpro in "TRELLIS.2: state-of-the-art large 3D generative model (4B)"]]></title><description><![CDATA[
<p>Oh, look - a new 3D model with a new idea - more data.</p>
]]></description><pubDate>Sun, 21 Dec 2025 23:03:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=46349535</link><dc:creator>mlpro</dc:creator><comments>https://news.ycombinator.com/item?id=46349535</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46349535</guid></item><item><title><![CDATA[New comment by mlpro in "You have reached the end of the internet (2006)"]]></title><description><![CDATA[
<p>I don't understand.</p>
]]></description><pubDate>Sun, 21 Dec 2025 23:01:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=46349525</link><dc:creator>mlpro</dc:creator><comments>https://news.ycombinator.com/item?id=46349525</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46349525</guid></item><item><title><![CDATA[New comment by mlpro in "Waymo halts service during S.F. blackout after causing traffic jams"]]></title><description><![CDATA[
<p>Waymo should do a bit more research in reliability and explainability of their AI models.</p>
]]></description><pubDate>Sun, 21 Dec 2025 23:00:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=46349521</link><dc:creator>mlpro</dc:creator><comments>https://news.ycombinator.com/item?id=46349521</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46349521</guid></item><item><title><![CDATA[New comment by mlpro in "The universal weight subspace hypothesis"]]></title><description><![CDATA[
<p>Read the paper end to end today. I think its the most outrageous ideas of 2025 - at least amongst the papers I've read. So counterintuitive initially and yet so intuitive. Personally, kinda hate the implications. But, a paper like this was definitely needed.</p>
]]></description><pubDate>Thu, 11 Dec 2025 02:00:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=46226749</link><dc:creator>mlpro</dc:creator><comments>https://news.ycombinator.com/item?id=46226749</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46226749</guid></item><item><title><![CDATA[New comment by mlpro in "The universal weight subspace hypothesis"]]></title><description><![CDATA[
<p>They are not trained on the same data. Even a skim of the paper shows very disjoint data.<p>The LLMs are finetuned on very disjoint data. I checked some are on Chinese and other are for Math. The pretrained model provides a good initialization. I'm convinced.</p>
]]></description><pubDate>Tue, 09 Dec 2025 16:04:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=46206498</link><dc:creator>mlpro</dc:creator><comments>https://news.ycombinator.com/item?id=46206498</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46206498</guid></item><item><title><![CDATA[New comment by mlpro in "The universal weight subspace hypothesis"]]></title><description><![CDATA[
<p>I think its very surprising, although I would like the paper to show more experiments (they already have a lot, i know).<p>The ViT models are never really trained from scratch - they are always finetuned as they require large amounts of data to converge nicely. The pretraining just provides a nice initialization. Why would one expect two ViT's finetuned on two different things - image and text classification end up in the same subspace as they show? I think this is groundbreaking.<p>I don't really agree with the drift far from the parent model idea. I think they drift pretty far in terms of their norms. Even the small LoRA adapters drift pretty far from the base model.</p>
]]></description><pubDate>Tue, 09 Dec 2025 16:00:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=46206419</link><dc:creator>mlpro</dc:creator><comments>https://news.ycombinator.com/item?id=46206419</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46206419</guid></item><item><title><![CDATA[New comment by mlpro in "The universal weight subspace hypothesis"]]></title><description><![CDATA[
<p>Why would they be similar if they are trained on very different data? Also, trained from scratch models are also analyzed, imo.</p>
]]></description><pubDate>Tue, 09 Dec 2025 05:50:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=46201686</link><dc:creator>mlpro</dc:creator><comments>https://news.ycombinator.com/item?id=46201686</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46201686</guid></item><item><title><![CDATA[New comment by mlpro in "The universal weight subspace hypothesis"]]></title><description><![CDATA[
<p>It's about weights/parameters, not representations.</p>
]]></description><pubDate>Tue, 09 Dec 2025 05:38:08 +0000</pubDate><link>https://news.ycombinator.com/item?id=46201618</link><dc:creator>mlpro</dc:creator><comments>https://news.ycombinator.com/item?id=46201618</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46201618</guid></item><item><title><![CDATA[New comment by mlpro in "The universal weight subspace hypothesis"]]></title><description><![CDATA[
<p>The analysis is on image classification, LLMs, Diffusion models, etc.</p>
]]></description><pubDate>Tue, 09 Dec 2025 03:13:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=46200854</link><dc:creator>mlpro</dc:creator><comments>https://news.ycombinator.com/item?id=46200854</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46200854</guid></item><item><title><![CDATA[New comment by mlpro in "The universal weight subspace hypothesis"]]></title><description><![CDATA[
<p>It does seem to be working for novel tasks.</p>
]]></description><pubDate>Tue, 09 Dec 2025 03:12:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=46200848</link><dc:creator>mlpro</dc:creator><comments>https://news.ycombinator.com/item?id=46200848</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46200848</guid></item><item><title><![CDATA[New comment by mlpro in "The universal weight subspace hypothesis"]]></title><description><![CDATA[
<p>Not really. If the models are trained on different dataset - like one ViT trained on satellite images and another on medical X-rays - one would expect their parameters, which were randomly initialized to be completely different or even orthogonal.</p>
]]></description><pubDate>Tue, 09 Dec 2025 03:11:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=46200843</link><dc:creator>mlpro</dc:creator><comments>https://news.ycombinator.com/item?id=46200843</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46200843</guid></item></channel></rss>