<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: z7</title><link>https://news.ycombinator.com/user?id=z7</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Thu, 04 Jun 2026 11:37:23 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=z7" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by z7 in "Child's Play: Tech's new generation and the end of thinking"]]></title><description><![CDATA[
<p>"As Alexander predicted in 'AI 2027,' OpenAI did release a major new model in 2025; unlike in his forecast, it’s been a damp squib. Advances seem to be plateauing; the conversation in tech circles is now less about superintelligence and more about the possibility of an AI bubble."<p>I'm not sure how many AI researchers would find this accurate. It seems to me that under conditions of ambiguity people often default to describing their preferred version of reality.</p>
]]></description><pubDate>Sat, 21 Feb 2026 03:09:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=47097094</link><dc:creator>z7</dc:creator><comments>https://news.ycombinator.com/item?id=47097094</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47097094</guid></item><item><title><![CDATA[New comment by z7 in "Tesla’s autonomous vehicles are crashing at a rate much higher than human drivers"]]></title><description><![CDATA[
<p>The comparison isn't really like-for-like. NHTSA SGO AV reports can include very minor, low-speed contact events that would often never show up as police-reported crashes for human drivers, meaning the Tesla crash count may be drawing from a broader category than the human baseline it's being compared to.<p>There's also a denominator problem. The mileage figure appears to be cumulative miles "as of November," while the crashes are drawn from a specific July-November window in Austin. It's not clear that those miles line up with the same geography and time period.<p>The sample size is tiny (nine crashes), uncertainty is huge, and the analysis doesn't distinguish between at-fault and not-at-fault incidents, or between preventable and non-preventable ones.<p>Also, the comparison to Waymo is stated without harmonizing crash definitions and reporting practices.</p>
]]></description><pubDate>Fri, 30 Jan 2026 11:21:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=46823084</link><dc:creator>z7</dc:creator><comments>https://news.ycombinator.com/item?id=46823084</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46823084</guid></item><item><title><![CDATA[New comment by z7 in "Over 36,500 killed in Iran's deadliest massacre, documents reveal"]]></title><description><![CDATA[
<p>> The West is not complicit in the actions of the Iranian regime<p>What about the 1953 CIA/MI6 coup that overthrew Iran's elected prime minister?</p>
]]></description><pubDate>Tue, 27 Jan 2026 11:27:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=46778586</link><dc:creator>z7</dc:creator><comments>https://news.ycombinator.com/item?id=46778586</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46778586</guid></item><item><title><![CDATA[New comment by z7 in "Self-hosting a NAT Gateway"]]></title><description><![CDATA[
<p>"You only live once."<p>Why state this as absolute fact? Seems a bit lacking in epistemic humility.</p>
]]></description><pubDate>Sat, 22 Nov 2025 01:19:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=46011162</link><dc:creator>z7</dc:creator><comments>https://news.ycombinator.com/item?id=46011162</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46011162</guid></item><item><title><![CDATA[New comment by z7 in "Hi, it's me, Wikipedia, and I am ready for your apology"]]></title><description><![CDATA[
<p>Here's the Grokipedia submission (currently censored / flagged):<p><a href="https://news.ycombinator.com/item?id=45726459">https://news.ycombinator.com/item?id=45726459</a></p>
]]></description><pubDate>Tue, 28 Oct 2025 16:16:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=45734813</link><dc:creator>z7</dc:creator><comments>https://news.ycombinator.com/item?id=45734813</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45734813</guid></item><item><title><![CDATA[New comment by z7 in "It's insulting to read AI-generated blog posts"]]></title><description><![CDATA[
<p>Hypothetically, what if the AI-generated blog post were better than what the human author of the blog would have written?</p>
]]></description><pubDate>Mon, 27 Oct 2025 17:16:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=45723663</link><dc:creator>z7</dc:creator><comments>https://news.ycombinator.com/item?id=45723663</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45723663</guid></item><item><title><![CDATA[New comment by z7 in "The dawn of the post-literate society – and the end of civilisation"]]></title><description><![CDATA[
<p>List of dates predicted for apocalyptic events:<p><a href="https://en.wikipedia.org/wiki/List_of_dates_predicted_for_apocalyptic_events" rel="nofollow">https://en.wikipedia.org/wiki/List_of_dates_predicted_for_ap...</a></p>
]]></description><pubDate>Sat, 20 Sep 2025 15:30:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=45314199</link><dc:creator>z7</dc:creator><comments>https://news.ycombinator.com/item?id=45314199</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45314199</guid></item><item><title><![CDATA[New comment by z7 in "DeepMind and OpenAI win gold at ICPC"]]></title><description><![CDATA[
<p>Current cope collection:<p>- It's not a fair match, these models have more compute and memory than humans<p>- Contestants weren't really elite, they're just college level programmers, not the world's best<p>- This doesn't matter for the real world, competitive programming is very different from regular software engineering<p>- It's marketing, they're just cranking up the compute to unrealistic levels to gain PR points<p>- It's brute force, not intelligence</p>
]]></description><pubDate>Wed, 17 Sep 2025 22:42:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=45282313</link><dc:creator>z7</dc:creator><comments>https://news.ycombinator.com/item?id=45282313</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45282313</guid></item><item><title><![CDATA[New comment by z7 in "An LLM is a lossy encyclopedia"]]></title><description><![CDATA[
<p>An encyclopaedia is a lossy representation of reality.</p>
]]></description><pubDate>Tue, 02 Sep 2025 17:10:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=45105954</link><dc:creator>z7</dc:creator><comments>https://news.ycombinator.com/item?id=45105954</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45105954</guid></item><item><title><![CDATA[New comment by z7 in "His psychosis was a mystery–until doctors learned about ChatGPT's health advice"]]></title><description><![CDATA[
<p>Meanwhile this new paper claims that GPT-5 surpasses medical professionals in medical reasoning:<p>"On MedXpertQA MM, GPT-5 improves reasoning and understanding scores by +29.62% and +36.18% over GPT-4o, respectively, and surpasses pre-licensed human experts by +24.23% in reasoning and +29.40% in understanding."<p><a href="https://arxiv.org/abs/2508.08224" rel="nofollow">https://arxiv.org/abs/2508.08224</a></p>
]]></description><pubDate>Wed, 13 Aug 2025 13:40:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=44888330</link><dc:creator>z7</dc:creator><comments>https://news.ycombinator.com/item?id=44888330</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44888330</guid></item><item><title><![CDATA[New comment by z7 in "GPT-5"]]></title><description><![CDATA[
<p>Yes, but the jump in performance from o3 is well beyond marginal while also fitting an exponential trend, which undermines the parent's claim on two counts.</p>
]]></description><pubDate>Thu, 07 Aug 2025 22:49:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=44831361</link><dc:creator>z7</dc:creator><comments>https://news.ycombinator.com/item?id=44831361</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44831361</guid></item><item><title><![CDATA[New comment by z7 in "GPT-5"]]></title><description><![CDATA[
<p>>The actual benchmark improvements are marginal at best<p>GPT-5 demonstrates exponential growth in task completion times:<p><a href="https://metr.org/blog/2025-03-19-measuring-ai-ability-to-complete-long-tasks/" rel="nofollow">https://metr.org/blog/2025-03-19-measuring-ai-ability-to-com...</a></p>
]]></description><pubDate>Thu, 07 Aug 2025 19:20:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=44829143</link><dc:creator>z7</dc:creator><comments>https://news.ycombinator.com/item?id=44829143</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44829143</guid></item><item><title><![CDATA[New comment by z7 in "GPT-5"]]></title><description><![CDATA[
<p>GPT-5 is #1 on WebDev Arena with +75 pts over Gemini 2.5 Pro and +100 pts over Claude Opus 4:<p><a href="https://lmarena.ai/leaderboard" rel="nofollow">https://lmarena.ai/leaderboard</a></p>
]]></description><pubDate>Thu, 07 Aug 2025 17:31:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=44827548</link><dc:creator>z7</dc:creator><comments>https://news.ycombinator.com/item?id=44827548</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44827548</guid></item><item><title><![CDATA[New comment by z7 in "OpenAI claims gold-medal performance at IMO 2025"]]></title><description><![CDATA[
<p>Some previous predictions:<p>In 2021 Paul Christiano wrote he would update from 30% to "50% chance of hard takeoff" if we saw an IMO gold by 2025.<p>He thought there was an 8% chance of this happening.<p>Eliezer Yudkowsky said "at least 16%".<p>Source:<p><a href="https://www.lesswrong.com/posts/sWLLdG6DWJEy3CH7n/imo-challenge-bet-with-eliezer" rel="nofollow">https://www.lesswrong.com/posts/sWLLdG6DWJEy3CH7n/imo-challe...</a></p>
]]></description><pubDate>Sat, 19 Jul 2025 10:34:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=44614269</link><dc:creator>z7</dc:creator><comments>https://news.ycombinator.com/item?id=44614269</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44614269</guid></item><item><title><![CDATA[New comment by z7 in "Grok 4 Launch [video]"]]></title><description><![CDATA[
<p>How do you explain Grok 4 achieving new SOTA on ARC-AGI-2, nearly doubling the previous commercial SOTA?<p><a href="https://x.com/arcprize/status/1943168950763950555" rel="nofollow">https://x.com/arcprize/status/1943168950763950555</a></p>
]]></description><pubDate>Thu, 10 Jul 2025 14:17:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=44521386</link><dc:creator>z7</dc:creator><comments>https://news.ycombinator.com/item?id=44521386</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44521386</guid></item><item><title><![CDATA[New comment by z7 in "Grok 4 Launch [video]"]]></title><description><![CDATA[
<p>"Grok 4 (Thinking) achieves new SOTA on ARC-AGI-2 with 15.9%."<p>"This nearly doubles the previous commercial SOTA and tops the current Kaggle competition SOTA."<p><a href="https://x.com/arcprize/status/1943168950763950555" rel="nofollow">https://x.com/arcprize/status/1943168950763950555</a></p>
]]></description><pubDate>Thu, 10 Jul 2025 10:09:41 +0000</pubDate><link>https://news.ycombinator.com/item?id=44519308</link><dc:creator>z7</dc:creator><comments>https://news.ycombinator.com/item?id=44519308</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44519308</guid></item><item><title><![CDATA[New comment by z7 in "O3 beats a master-level GeoGuessr player, even with fake EXIF data"]]></title><description><![CDATA[
<p>Quoting Chollet:<p>>I have repeatedly said that "can LLM reason?" was the wrong question to ask. Instead the right question is, "can they adapt to novelty?".<p><a href="https://x.com/fchollet/status/1866348355204595826" rel="nofollow">https://x.com/fchollet/status/1866348355204595826</a></p>
]]></description><pubDate>Tue, 29 Apr 2025 17:10:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=43835335</link><dc:creator>z7</dc:creator><comments>https://news.ycombinator.com/item?id=43835335</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43835335</guid></item><item><title><![CDATA[New comment by z7 in "AI 2027"]]></title><description><![CDATA[
<p>It's just predicting tokens:<p><a href="https://old.reddit.com/r/singularity/comments/1jl5qfs/its_just_predicting_tokens_v2/" rel="nofollow">https://old.reddit.com/r/singularity/comments/1jl5qfs/its_ju...</a></p>
]]></description><pubDate>Sat, 05 Apr 2025 15:12:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=43594087</link><dc:creator>z7</dc:creator><comments>https://news.ycombinator.com/item?id=43594087</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43594087</guid></item><item><title><![CDATA[New comment by z7 in "Marine Le Pen banned from running in 2027 and given four-year sentence"]]></title><description><![CDATA[
<p>Why are you hallucinating feelings? Also, appeal to authority. ("Why are your feelings relevant to the wizarding laws of Hogwarts?")</p>
]]></description><pubDate>Mon, 31 Mar 2025 15:50:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=43536398</link><dc:creator>z7</dc:creator><comments>https://news.ycombinator.com/item?id=43536398</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43536398</guid></item><item><title><![CDATA[New comment by z7 in "4o Image Generation"]]></title><description><![CDATA[
<p>>For starters, this completely blocks generation of anything remotely related to copy-protected IPs<p>It did Dragon Ball Z here:<p><a href="https://old.reddit.com/r/ChatGPT/comments/1jjtcn9/the_new_image_generator_released_today_is_so_good" rel="nofollow">https://old.reddit.com/r/ChatGPT/comments/1jjtcn9/the_new_im...</a><p>Rick and Morty:<p><a href="https://old.reddit.com/r/ChatGPT/comments/1jjtcn9/the_new_image_generator_released_today_is_so_good/mjpvh24" rel="nofollow">https://old.reddit.com/r/ChatGPT/comments/1jjtcn9/the_new_im...</a><p>South Park:<p><a href="https://old.reddit.com/r/ChatGPT/comments/1jjyn5q/openais_new_4o_image_generation_is_insane/mjrdc4c" rel="nofollow">https://old.reddit.com/r/ChatGPT/comments/1jjyn5q/openais_ne...</a></p>
]]></description><pubDate>Wed, 26 Mar 2025 03:31:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=43478602</link><dc:creator>z7</dc:creator><comments>https://news.ycombinator.com/item?id=43478602</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43478602</guid></item></channel></rss>