<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: mofeien</title><link>https://news.ycombinator.com/user?id=mofeien</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sat, 13 Jun 2026 02:23:22 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=mofeien" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by mofeien in "When AI Builds Itself: Our progress toward recursive self-improvement"]]></title><description><![CDATA[
<p>I would assume that shortly after, the solar system will be hyper optimized as well, then the milky way, then the local cluster, and so on. Everything will be close to optimal afterwords, and I sure hope we will have specified the target function for that optimization correctly in the single attempt that we will have had.</p>
]]></description><pubDate>Thu, 04 Jun 2026 18:21:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=48402545</link><dc:creator>mofeien</dc:creator><comments>https://news.ycombinator.com/item?id=48402545</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48402545</guid></item><item><title><![CDATA[New comment by mofeien in "When AI Builds Itself: Our progress toward recursive self-improvement"]]></title><description><![CDATA[
<p>Or agree on finding ways to promote peaceful use of nuclear energy. This has been done, there are thousands of people working on it around the globe and 180+ member states of the IAEA. It's not easy, there have been close calls.<p>And cooperating interntionally to buy ourselves time to find ways to develop this "last invention" is a way that will do good for humanity seems to be on a similar level.</p>
]]></description><pubDate>Thu, 04 Jun 2026 18:17:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=48402495</link><dc:creator>mofeien</dc:creator><comments>https://news.ycombinator.com/item?id=48402495</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48402495</guid></item><item><title><![CDATA[New comment by mofeien in "When AI Builds Itself: Our progress toward recursive self-improvement"]]></title><description><![CDATA[
<p>The regulation that is being argued for here is against pushing the frontier. Entering the market with say a new speech to text model is not subject to such regulation. What's needed is something qualitatively different from entry barriers, and of the frontier model companies at least Anthropic and deepmind seem to have enough self-awareness to speak about it. They are finding themselves in a race with possibly catastrophic outcome for humanity and would like to stop, but it needs internation cooperation on a level that no single company can provide.</p>
]]></description><pubDate>Thu, 04 Jun 2026 18:12:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=48402426</link><dc:creator>mofeien</dc:creator><comments>https://news.ycombinator.com/item?id=48402426</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48402426</guid></item><item><title><![CDATA[New comment by mofeien in "When AI Builds Itself: Our progress toward recursive self-improvement"]]></title><description><![CDATA[
<p>> If it were possible to effectively slow the development of this technology to give ourselves more time to deal with its immense implications, we think that would likely be a good thing<p>Even Anthropic wants to Pause AI now. There must really be not much time left for "edging". Please write to your lawmakers, no matter whether you are in the US, Europe, China, or elsewhere. Only an international agreement between governments can enforce an AI-Pause and eliminate the necessity to dangerously push the frontier.<p><a href="https://pauseai.info/" rel="nofollow">https://pauseai.info/</a></p>
]]></description><pubDate>Thu, 04 Jun 2026 18:05:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=48402302</link><dc:creator>mofeien</dc:creator><comments>https://news.ycombinator.com/item?id=48402302</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48402302</guid></item><item><title><![CDATA[New comment by mofeien in "Expanding Project Glasswing"]]></title><description><![CDATA[
<p>Jack Clark, co-founder of Anthropic said the following at an Oxford lecture last week ([0], at around 10 and 12 mins):<p><pre><code>    "It's a technology that we do not fully understand because it's more grown than made. And it is a technology that you can concoct plausible scenarios where it could kill every single person on the planet. So to think building this technology is without risk would be an act of hubris or insanity.
</code></pre>
[...]<p><pre><code>    The technology is in fact so powerful that I should clearly state that if it was possible to elegantly slow the development of this technology to give ourselves more time as a species to deal with it, that would likely be a good thing. ... But in the absence of a coordinated global slowdown, we are left with the current situation, which is a powerful technology being developed at breakneck speed by a variety of actors and a variety of countries locked in a competition with one another where commercial and geopolitical rivalries are often drowning out the larger existential-to-the-species aspects of the technology being built. This isn't an ideal situation, but it's the one we find ourselves in."
</code></pre>
They know they are in a race that no one will win.<p>[0] <a href="https://www.youtube.com/watch?v=8zIcP5WlShw" rel="nofollow">https://www.youtube.com/watch?v=8zIcP5WlShw</a></p>
]]></description><pubDate>Tue, 02 Jun 2026 19:34:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=48375109</link><dc:creator>mofeien</dc:creator><comments>https://news.ycombinator.com/item?id=48375109</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48375109</guid></item><item><title><![CDATA[New comment by mofeien in "Claude Opus 4.8"]]></title><description><![CDATA[
<p>Not all training data is human generated, and it's also not clear that being ridiculously good at interpolating between data points (whatever that means) will not lead to superhuman capabilities.</p>
]]></description><pubDate>Thu, 28 May 2026 18:36:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=48313464</link><dc:creator>mofeien</dc:creator><comments>https://news.ycombinator.com/item?id=48313464</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48313464</guid></item><item><title><![CDATA[New comment by mofeien in "Teaching Claude Why"]]></title><description><![CDATA[
<p>- Its goal: X<p>- (Logic) => its subgoal: Not be turned off because that's a prerequisite to be able to do X<p>- (Logic) => Eliminate humans with their opaque and somewhat unpredictable minds to reduce chance of harm to it from 0.01% to 0.001%</p>
]]></description><pubDate>Sat, 09 May 2026 06:48:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=48072505</link><dc:creator>mofeien</dc:creator><comments>https://news.ycombinator.com/item?id=48072505</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48072505</guid></item><item><title><![CDATA[New comment by mofeien in "Google Chrome silently installs a 4 GB AI model on your device without consent"]]></title><description><![CDATA[
<p>The goal as stated on the extension page is to improve the readability of texts by replacing :, *, _ forms. So some customizability to the user's wishes would be quite nice.<p>My calculus textbook (Königsberger, 2004) in university used alternating generic masculine and feminine in its exercises, which I found a delightful use of language.</p>
]]></description><pubDate>Tue, 05 May 2026 10:24:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=48020491</link><dc:creator>mofeien</dc:creator><comments>https://news.ycombinator.com/item?id=48020491</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48020491</guid></item><item><title><![CDATA[New comment by mofeien in "Google Chrome silently installs a 4 GB AI model on your device without consent"]]></title><description><![CDATA[
<p>To prevent accusations of "masculinism" or sexism and to have a stronger case on having the goal to improve readability the add-on could include an option (or even make it default) to replace by generic feminine instead.</p>
]]></description><pubDate>Tue, 05 May 2026 10:09:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=48020363</link><dc:creator>mofeien</dc:creator><comments>https://news.ycombinator.com/item?id=48020363</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48020363</guid></item><item><title><![CDATA[New comment by mofeien in "How to Build the Future: Demis Hassabis [video]"]]></title><description><![CDATA[
<p>There was no nuclear weapon used in warfare anymore since WW II. I think the regulation and oversight worked incredibly well over the past 70-80 years, despite the game-theoretic challenge you mention.</p>
]]></description><pubDate>Thu, 30 Apr 2026 06:20:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=47958868</link><dc:creator>mofeien</dc:creator><comments>https://news.ycombinator.com/item?id=47958868</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47958868</guid></item><item><title><![CDATA[New comment by mofeien in "Why AI companies want you to be afraid of them"]]></title><description><![CDATA[
<p>Given that his reason for saying GPT-2 was too dangerous to release was that the world needed more time to prepare for the effects of this technology, and given that the following models were basically scaled-up versions of it and killed social media, news reporting and other kinds of communication, I'd say he was right about the dangers of it.</p>
]]></description><pubDate>Wed, 29 Apr 2026 16:38:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=47950821</link><dc:creator>mofeien</dc:creator><comments>https://news.ycombinator.com/item?id=47950821</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47950821</guid></item><item><title><![CDATA[New comment by mofeien in "Why AI companies want you to be afraid of them"]]></title><description><![CDATA[
<p>"The race to build smarter-than-human AI is a race with no winners."<p>And specifically about the point on China, several people in power in China have also expressed the need to regulate AI and put international structures of governance in place to make sure it will benefit mankind:<p><a href="https://nowinners.ai/#s5-china" rel="nofollow">https://nowinners.ai/#s5-china</a></p>
]]></description><pubDate>Wed, 29 Apr 2026 16:33:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=47950745</link><dc:creator>mofeien</dc:creator><comments>https://news.ycombinator.com/item?id=47950745</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47950745</guid></item><item><title><![CDATA[New comment by mofeien in "An AI agent deleted our production database. The agent's confession is below"]]></title><description><![CDATA[
<p>That highlights how important ceiling construction regulations are. I would assume that right now your breakfast sandwich is more highly regulated than LLMs. And these are the things that make decisions spanning from database maintenance here to target selection and execution in autonomous warfare.<p>The LLM agent is very good at fulfilling its objective and it will creatively exploit holes in your specification to reach its goals. The evals in the System Cards show that the models are aware of what they're doing and are hiding their traces. In this example the model found an unrelated but working API token with more permissions the authors accidentally stored and then used that.<p>Without regulation on AI safety, the race towards higher and higher model capabilities will cause models to get much better at working towards their goals to the point where they are really good at hiding their traces while knowingly doing something questionable.<p>It's not hard to imagine that when we have a model with broadly superhuman capabilities and speed which can easily be copied millions of times, one bad misspecification of a goal you give to it will lead to human loss of control. That's what all these important figures in AI are worried about: <a href="https://aistatement.com/" rel="nofollow">https://aistatement.com/</a></p>
]]></description><pubDate>Mon, 27 Apr 2026 08:39:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=47919132</link><dc:creator>mofeien</dc:creator><comments>https://news.ycombinator.com/item?id=47919132</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47919132</guid></item><item><title><![CDATA[New comment by mofeien in "Small models also found the vulnerabilities that Mythos found"]]></title><description><![CDATA[
<p>These results were based on "a trivial snippet from the OWASP benchmark". In the section "caveats and limitations" they state that sonnet 4.6 and opus 4.6 now pass.<p>And they decided to base the false positive examination on a single snippet of a publicly known benchmark question (that small models are known to be heavily fine tuned for) instead of the real use case of finding actual vulnerabilities across an entire codebase by using a for loop and checking the false positive rate there.<p>This is disingenuous at best, or even misleading by omission if the second approach _was_ done but not mentioned because it just confirmed that the false positive rate of small models is enormous. Given how all seven small models identified the FreeBSD Bug when pointed to it, and how how 6/7 small models still identified the "bug" even after the patch was applied, that second outcome seems likely...</p>
]]></description><pubDate>Sun, 12 Apr 2026 08:28:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=47737321</link><dc:creator>mofeien</dc:creator><comments>https://news.ycombinator.com/item?id=47737321</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47737321</guid></item><item><title><![CDATA[New comment by mofeien in "Small models also found the vulnerabilities that Mythos found"]]></title><description><![CDATA[
<p>... or maybe when you see them triggered or exploited reproducibly, then the underlying bug will also be pretty easy to discover. But at that point, it's already too late. :)<p>I really like your original point, I never thought about it this way.</p>
]]></description><pubDate>Sun, 12 Apr 2026 08:12:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=47737217</link><dc:creator>mofeien</dc:creator><comments>https://news.ycombinator.com/item?id=47737217</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47737217</guid></item><item><title><![CDATA[Anthropic has just built an AI that could take down the internet]]></title><description><![CDATA[
<p>Article URL: <a href="https://pauseai.substack.com/p/anthropic-has-just-built-an-ai-that">https://pauseai.substack.com/p/anthropic-has-just-built-an-ai-that</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47722727">https://news.ycombinator.com/item?id=47722727</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Fri, 10 Apr 2026 19:41:41 +0000</pubDate><link>https://pauseai.substack.com/p/anthropic-has-just-built-an-ai-that</link><dc:creator>mofeien</dc:creator><comments>https://news.ycombinator.com/item?id=47722727</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47722727</guid></item><item><title><![CDATA[New comment by mofeien in "System Card: Claude Mythos Preview [pdf]"]]></title><description><![CDATA[
<p>I can think of several possible messy outcomes that would be able to directly affect me, not all mutually exclusive:<p>- Job loss by me being replaced by an AI or by somebody using an AI. Or by an AI using an AI.<p>- Resulting societal instability once blue collar jobs get fully automated at scale, and there is no plan in place to replace this loss of peoples' livelihoods.<p>- People turning to AI models instead of friends for emotional support, loss of human connection.<p>- Erosion of democracy by making authoritarianism and control very scalable, broad in-detail population surveillance and automated investigation using LLMs that was previously bounded by manpower.<p>- Autonomous weapons, "Slaughterbots" as in the short film from 2017<p>- Biorisk through dangerous biological capabilities that enable a smaller team of less skilled terrorists to use a jailbroken LLM to create something dangerous.<p>- Other powers in the world deciding that this technology is too powerful in the hands of the US, or too dangerous to be built at all and has to be stopped by all means.<p>- Loss of/Voluntary ceding of control over something much smarter than us. "If Anyone Build It, Everyone Dies"</p>
]]></description><pubDate>Wed, 08 Apr 2026 14:35:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=47690814</link><dc:creator>mofeien</dc:creator><comments>https://news.ycombinator.com/item?id=47690814</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47690814</guid></item><item><title><![CDATA[New comment by mofeien in "System Card: Claude Mythos Preview [pdf]"]]></title><description><![CDATA[
<p>I am freaking out. The world is going to get very messy extremely quickly in one or two further jumps in capability like this.</p>
]]></description><pubDate>Tue, 07 Apr 2026 19:13:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=47679995</link><dc:creator>mofeien</dc:creator><comments>https://news.ycombinator.com/item?id=47679995</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47679995</guid></item><item><title><![CDATA[New comment by mofeien in "System Card: Claude Mythos Preview [pdf]"]]></title><description><![CDATA[
<p>Fictional timeline that holds up pretty well so far: <a href="https://ai-2027.com/" rel="nofollow">https://ai-2027.com/</a></p>
]]></description><pubDate>Tue, 07 Apr 2026 19:08:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=47679929</link><dc:creator>mofeien</dc:creator><comments>https://news.ycombinator.com/item?id=47679929</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47679929</guid></item><item><title><![CDATA[New comment by mofeien in "AI (2014)"]]></title><description><![CDATA[
<p>> > The most positive outcome I can think of is one where computers get really good at doing, and humans get really good at thinking.<p>> This is where LLM is currently going.<p>This is not where LLMs are currently going. They are trained and benchmarked explicitly in all areas that humans produce economically and cognitively valuable work: STEM fields, computer use, robotics, etc.<p>Systems are already emerging where AI agents autonomously orchestrate subagents which again all work towards a goal autonomously and only from time to time communicate with you to give you status updates.<p>Thinking that you as a slow human will be needed for much longer to fill some crucial role in this AI system that it cannot solve by itself, and to bring some crucial skill of creativity or thinking to the table that it cannot generate itself is just wishful thinking. And to me personally, telling an AI to "do cool thing X" without having made any contribution to it beyond the initial prompt also feels very depressing and seems like much less fun than actually feeling valued in what I do. I'm sorry for sounding harsh.</p>
]]></description><pubDate>Fri, 20 Mar 2026 12:17:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=47453515</link><dc:creator>mofeien</dc:creator><comments>https://news.ycombinator.com/item?id=47453515</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47453515</guid></item></channel></rss>