<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: Smith42</title><link>https://news.ycombinator.com/user?id=Smith42</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Thu, 18 Jun 2026 08:09:19 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=Smith42" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by Smith42 in "US Government directive to suspend access to Fable 5 and Mythos 5"]]></title><description><![CDATA[
<p>It's always been this way ever since the first industrial revolution.</p>
]]></description><pubDate>Sat, 13 Jun 2026 01:33:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=48511467</link><dc:creator>Smith42</dc:creator><comments>https://news.ycombinator.com/item?id=48511467</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48511467</guid></item><item><title><![CDATA[New comment by Smith42 in "AI and the Ship of Theseus"]]></title><description><![CDATA[
<p>So write it! Shouldn't be much extra to add to the AGPL licence?</p>
]]></description><pubDate>Fri, 06 Mar 2026 15:28:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=47276145</link><dc:creator>Smith42</dc:creator><comments>https://news.ycombinator.com/item?id=47276145</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47276145</guid></item><item><title><![CDATA[Opencode]]></title><description><![CDATA[
<p>Article URL: <a href="https://opencode.ai/">https://opencode.ai/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=44562045">https://news.ycombinator.com/item?id=44562045</a></p>
<p>Points: 4</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 14 Jul 2025 16:29:21 +0000</pubDate><link>https://opencode.ai/</link><dc:creator>Smith42</dc:creator><comments>https://news.ycombinator.com/item?id=44562045</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44562045</guid></item><item><title><![CDATA[US halts student visa appointments and plans expanded social media vetting]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.bbc.co.uk/news/articles/cy75eenl46eo">https://www.bbc.co.uk/news/articles/cy75eenl46eo</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=44113751">https://news.ycombinator.com/item?id=44113751</a></p>
<p>Points: 8</p>
<p># Comments: 3</p>
]]></description><pubDate>Wed, 28 May 2025 08:10:32 +0000</pubDate><link>https://www.bbc.co.uk/news/articles/cy75eenl46eo</link><dc:creator>Smith42</dc:creator><comments>https://news.ycombinator.com/item?id=44113751</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44113751</guid></item><item><title><![CDATA[Mistral Small 3 Announcement]]></title><description><![CDATA[
<p>Article URL: <a href="https://twitter.com/MistralAI/status/1884968836606136636">https://twitter.com/MistralAI/status/1884968836606136636</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=42877950">https://news.ycombinator.com/item?id=42877950</a></p>
<p>Points: 15</p>
<p># Comments: 0</p>
]]></description><pubDate>Thu, 30 Jan 2025 14:24:51 +0000</pubDate><link>https://twitter.com/MistralAI/status/1884968836606136636</link><dc:creator>Smith42</dc:creator><comments>https://news.ycombinator.com/item?id=42877950</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42877950</guid></item><item><title><![CDATA[The Multimodal Universe: Enabling Large-Scale ML with 100TB of Astro Data]]></title><description><![CDATA[
<p>Article URL: <a href="https://arxiv.org/abs/2412.02527">https://arxiv.org/abs/2412.02527</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=42314726">https://news.ycombinator.com/item?id=42314726</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 04 Dec 2024 05:36:31 +0000</pubDate><link>https://arxiv.org/abs/2412.02527</link><dc:creator>Smith42</dc:creator><comments>https://news.ycombinator.com/item?id=42314726</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42314726</guid></item><item><title><![CDATA[New comment by Smith42 in "Will we run out of data? Limits of LLM scaling based on human-generated data"]]></title><description><![CDATA[
<p>We investigate the potential constraints on LLM scaling posed by the availability of public human-generated text data. We forecast the growing demand for training data based on current trends and estimate the total stock of public human text data. Our findings indicate that if current LLM development trends continue, models will be trained on datasets roughly equal in size to the available stock of public human text data between 2026 and 2032, or slightly earlier if models are overtrained. We explore how progress in language modeling can continue when human-generated text datasets cannot be scaled any further. We argue that synthetic data generation, transfer learning from data-rich domains, and data efficiency improvements might support further progress.</p>
]]></description><pubDate>Fri, 07 Jun 2024 17:09:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=40610636</link><dc:creator>Smith42</dc:creator><comments>https://news.ycombinator.com/item?id=40610636</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40610636</guid></item><item><title><![CDATA[Will we run out of data? Limits of LLM scaling based on human-generated data]]></title><description><![CDATA[
<p>Article URL: <a href="https://arxiv.org/abs/2211.04325">https://arxiv.org/abs/2211.04325</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=40610622">https://news.ycombinator.com/item?id=40610622</a></p>
<p>Points: 1</p>
<p># Comments: 1</p>
]]></description><pubDate>Fri, 07 Jun 2024 17:08:29 +0000</pubDate><link>https://arxiv.org/abs/2211.04325</link><dc:creator>Smith42</dc:creator><comments>https://news.ycombinator.com/item?id=40610622</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40610622</guid></item><item><title><![CDATA[New comment by Smith42 in "Astronomy Generates Mountains of Data. That's Perfect for AI – Universe Today"]]></title><description><![CDATA[
<p>That really isn't the case, and I am not sure how you could arrive at that unsubstantiated conclusion.</p>
]]></description><pubDate>Tue, 04 Jun 2024 12:52:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=40574030</link><dc:creator>Smith42</dc:creator><comments>https://news.ycombinator.com/item?id=40574030</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40574030</guid></item><item><title><![CDATA[Astronomy Generates Mountains of Data. That's Perfect for AI – Universe Today]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.universetoday.com/167153/astronomy-generates-mountains-of-data-thats-perfect-for-ai/">https://www.universetoday.com/167153/astronomy-generates-mountains-of-data-thats-perfect-for-ai/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=40573725">https://news.ycombinator.com/item?id=40573725</a></p>
<p>Points: 2</p>
<p># Comments: 2</p>
]]></description><pubDate>Tue, 04 Jun 2024 12:21:01 +0000</pubDate><link>https://www.universetoday.com/167153/astronomy-generates-mountains-of-data-thats-perfect-for-ai/</link><dc:creator>Smith42</dc:creator><comments>https://news.ycombinator.com/item?id=40573725</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40573725</guid></item><item><title><![CDATA[New comment by Smith42 in "AstroPT: Scaling Large Observation Models for Astronomy"]]></title><description><![CDATA[
<p>Abstract:<p>This work presents AstroPT, an autoregressive pretrained transformer developed with astronomical use-cases in mind. The AstroPT models presented here have been pretrained on 8.6 million 512 × 512 pixel grz-band galaxy postage stamp observations from the DESI Legacy Survey DR8. We train a selection of foundation models of increasing size from 1 million to 2.1 billion parameters, and find that AstroPT follows a similar saturating log-log scaling law to textual models. We also find that the models' performances on downstream tasks as measured by linear probing improves with model size up to the model parameter saturation point. We believe that collaborative community development paves the best route towards realising an open source `Large Observation Model' -- a model trained on data taken from the observational sciences at the scale seen in natural language processing. To this end, we release the source code, weights, and dataset for AstroPT under the MIT license, and invite potential collaborators to join us in collectively building and researching these models.</p>
]]></description><pubDate>Mon, 27 May 2024 09:21:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=40489111</link><dc:creator>Smith42</dc:creator><comments>https://news.ycombinator.com/item?id=40489111</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40489111</guid></item><item><title><![CDATA[AstroPT: Scaling Large Observation Models for Astronomy]]></title><description><![CDATA[
<p>Article URL: <a href="https://arxiv.org/abs/2405.14930">https://arxiv.org/abs/2405.14930</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=40489110">https://news.ycombinator.com/item?id=40489110</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Mon, 27 May 2024 09:21:45 +0000</pubDate><link>https://arxiv.org/abs/2405.14930</link><dc:creator>Smith42</dc:creator><comments>https://news.ycombinator.com/item?id=40489110</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40489110</guid></item><item><title><![CDATA[New comment by Smith42 in "Building a deep learning rig"]]></title><description><![CDATA[
<p>$15k!</p>
]]></description><pubDate>Sat, 24 Feb 2024 14:18:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=39491648</link><dc:creator>Smith42</dc:creator><comments>https://news.ycombinator.com/item?id=39491648</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39491648</guid></item><item><title><![CDATA[New comment by Smith42 in "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"]]></title><description><![CDATA[
<p>"Large Observation Model" has a nice ring to it</p>
]]></description><pubDate>Wed, 21 Feb 2024 15:56:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=39455378</link><dc:creator>Smith42</dc:creator><comments>https://news.ycombinator.com/item?id=39455378</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39455378</guid></item><item><title><![CDATA[New comment by Smith42 in "A decoder-only foundation model for time-series forecasting"]]></title><description><![CDATA[
<p>If you are interested in this also check out EarthPT, which is also a time series decoding transformer (and has the code and weights released under the MIT licence): <a href="https://arxiv.org/abs/2309.07207" rel="nofollow">https://arxiv.org/abs/2309.07207</a></p>
]]></description><pubDate>Sat, 03 Feb 2024 07:32:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=39238241</link><dc:creator>Smith42</dc:creator><comments>https://news.ycombinator.com/item?id=39238241</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39238241</guid></item><item><title><![CDATA[New comment by Smith42 in "Sxmo: Linux tiling window manager for phones"]]></title><description><![CDATA[
<p>What's new with SXMO? Haven't been keeping up since 2021. Is there a stable phone to run this on now?</p>
]]></description><pubDate>Sat, 27 Jan 2024 13:25:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=39155345</link><dc:creator>Smith42</dc:creator><comments>https://news.ycombinator.com/item?id=39155345</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39155345</guid></item><item><title><![CDATA[New comment by Smith42 in "EarthPT: A time series transformer foundation model"]]></title><description><![CDATA[
<p>Wanted to share the code release of EarthPT, a model that predicts future satellite observations in a zero shot setting! I'm the first author so please shoot any questions you have at me.<p>EarthPT is a 700 million parameter decoding transformer foundation model trained in an autoregressive self-supervised manner and developed specifically with EO use-cases in mind. EarthPT can accurately predict future satellite observations across the 400-2300 nm range well into the future (we found six months!).<p>The embeddings learnt by EarthPT hold semantically meaningful information and could be exploited for downstream tasks such as highly granular, dynamic land use classification.<p>The coolest takeaway for me is that EO data provides us with -- in theory -- quadrillions of training tokens. Therefore, if we assume that EarthPT follows neural scaling laws akin to those derived for Large Language Models (LLMs), there is currently no data-imposed limit to scaling EarthPT and other similar ‘Large Observation Models.’(!)<p>Code: <a href="https://github.com/aspiaspace/EarthPT">https://github.com/aspiaspace/EarthPT</a><p>Paper: <a href="https://arxiv.org/abs/2309.07207" rel="nofollow">https://arxiv.org/abs/2309.07207</a></p>
]]></description><pubDate>Fri, 19 Jan 2024 12:44:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=39054783</link><dc:creator>Smith42</dc:creator><comments>https://news.ycombinator.com/item?id=39054783</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39054783</guid></item><item><title><![CDATA[EarthPT: A time series transformer foundation model]]></title><description><![CDATA[
<p>Article URL: <a href="https://github.com/aspiaspace/EarthPT">https://github.com/aspiaspace/EarthPT</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=39054782">https://news.ycombinator.com/item?id=39054782</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Fri, 19 Jan 2024 12:44:18 +0000</pubDate><link>https://github.com/aspiaspace/EarthPT</link><dc:creator>Smith42</dc:creator><comments>https://news.ycombinator.com/item?id=39054782</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39054782</guid></item><item><title><![CDATA[New comment by Smith42 in "OpenAI researchers warned board of AI breakthrough ahead of CEO ouster"]]></title><description><![CDATA[
<p>Wishful thinkin buddy</p>
]]></description><pubDate>Thu, 23 Nov 2023 00:17:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=38387303</link><dc:creator>Smith42</dc:creator><comments>https://news.ycombinator.com/item?id=38387303</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38387303</guid></item><item><title><![CDATA[New comment by Smith42 in "OpenAI Just Killed an Entire Market in 45 Minutes"]]></title><description><![CDATA[
<p>Anyone have a paste of the article? There is a paywall</p>
]]></description><pubDate>Thu, 16 Nov 2023 16:10:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=38291302</link><dc:creator>Smith42</dc:creator><comments>https://news.ycombinator.com/item?id=38291302</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38291302</guid></item></channel></rss>