<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: Davidzheng</title><link>https://news.ycombinator.com/user?id=Davidzheng</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sat, 30 May 2026 21:52:14 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=Davidzheng" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by Davidzheng in "Qwen3.7-Max: The Agent Frontier"]]></title><description><![CDATA[
<p>Someone correct me if I'm wrong, but I think the max models are usually non-open</p>
]]></description><pubDate>Wed, 20 May 2026 14:21:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=48208320</link><dc:creator>Davidzheng</dc:creator><comments>https://news.ycombinator.com/item?id=48208320</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48208320</guid></item><item><title><![CDATA[New comment by Davidzheng in "The Zulip Foundation"]]></title><description><![CDATA[
<p>My only gripe is that on my phone sometimes it takes like 30 seconds to load, which doesn't seem to happen for almost anything else</p>
]]></description><pubDate>Sat, 16 May 2026 02:45:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=48156374</link><dc:creator>Davidzheng</dc:creator><comments>https://news.ycombinator.com/item?id=48156374</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48156374</guid></item><item><title><![CDATA[New comment by Davidzheng in "A recent experience with ChatGPT 5.5 Pro"]]></title><description><![CDATA[
<p>Deep Think still makes many many many more mistakes than GPT 5.5 Pro on math</p>
]]></description><pubDate>Sat, 09 May 2026 11:47:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=48074165</link><dc:creator>Davidzheng</dc:creator><comments>https://news.ycombinator.com/item?id=48074165</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48074165</guid></item><item><title><![CDATA[New comment by Davidzheng in "AlphaEvolve: Gemini-powered coding agent scaling impact across fields"]]></title><description><![CDATA[
<p>I don't think there is a fundamental divide between implementation-level speedups and optimizations on the one hand and algorithmic/architectural optimizations on the other</p>
]]></description><pubDate>Fri, 08 May 2026 05:31:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=48058954</link><dc:creator>Davidzheng</dc:creator><comments>https://news.ycombinator.com/item?id=48058954</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48058954</guid></item><item><title><![CDATA[New comment by Davidzheng in "System Card: Claude Mythos Preview [pdf]"]]></title><description><![CDATA[
<p>But would miracle drugs and trading algorithms be as profitable as AI research/chip design/energy research? Probably, if AI is by far the biggest source of growth in the economy, the majority of the AI's internal usage should (as incentivized by economics) in some way work towards making itself better.</p>
]]></description><pubDate>Wed, 08 Apr 2026 08:45:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=47687212</link><dc:creator>Davidzheng</dc:creator><comments>https://news.ycombinator.com/item?id=47687212</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47687212</guid></item><item><title><![CDATA[New comment by Davidzheng in "Project Glasswing: Securing critical software for the AI era"]]></title><description><![CDATA[
<p>There's still RL</p>
]]></description><pubDate>Wed, 08 Apr 2026 06:18:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=47686066</link><dc:creator>Davidzheng</dc:creator><comments>https://news.ycombinator.com/item?id=47686066</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47686066</guid></item><item><title><![CDATA[New comment by Davidzheng in "From 0% to 36% on Day 1 of ARC-AGI-3"]]></title><description><![CDATA[
<p>I agree it's not cheating in that restricted sense. But I'm not really convinced that it can't be cheating in a more general sense. You can try like 10^10 variations of harnesses and select the one that performs best. And probably if you then look at it, it will not look like it's necessarily cheating. But you have biased the estimator by selecting the harness according to the score it achieved.</p>
]]></description><pubDate>Fri, 27 Mar 2026 04:08:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=47539032</link><dc:creator>Davidzheng</dc:creator><comments>https://news.ycombinator.com/item?id=47539032</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47539032</guid></item><item><title><![CDATA[New comment by Davidzheng in "ARC-AGI-3"]]></title><description><![CDATA[
<p>There's no objective measure for comparing intelligence; we only say LLMs are jagged relative to humans.</p>
]]></description><pubDate>Thu, 26 Mar 2026 01:13:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=47525575</link><dc:creator>Davidzheng</dc:creator><comments>https://news.ycombinator.com/item?id=47525575</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47525575</guid></item><item><title><![CDATA[New comment by Davidzheng in "ARC-AGI-3"]]></title><description><![CDATA[
<p>How can you tell?</p>
]]></description><pubDate>Thu, 26 Mar 2026 01:12:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=47525563</link><dc:creator>Davidzheng</dc:creator><comments>https://news.ycombinator.com/item?id=47525563</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47525563</guid></item><item><title><![CDATA[New comment by Davidzheng in "ARC-AGI-3"]]></title><description><![CDATA[
<p>Important to remember that intelligence is not a singular thing and when the last gap is closed, most aspects will be highly superhuman</p>
]]></description><pubDate>Thu, 26 Mar 2026 01:11:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=47525556</link><dc:creator>Davidzheng</dc:creator><comments>https://news.ycombinator.com/item?id=47525556</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47525556</guid></item><item><title><![CDATA[New comment by Davidzheng in "Israel Strikes Oil Facilities in Iran"]]></title><description><![CDATA[
<p>Obviously yes in the form that the comment you replied to refers to--the US would be much more careful striking a country with nuclear weapons. So while the invasion may not be caused by proximity, it can be allowed because Iran doesn't have one.</p>
]]></description><pubDate>Sun, 08 Mar 2026 03:42:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=47294182</link><dc:creator>Davidzheng</dc:creator><comments>https://news.ycombinator.com/item?id=47294182</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47294182</guid></item><item><title><![CDATA[New comment by Davidzheng in "Statement on the comments from Secretary of War Pete Hegseth"]]></title><description><![CDATA[
<p>How much value is there in individual values?<p>Many of us remember that OpenAI was also started by people with strong personal values. Their charter said that they would not monetize after reaching AGI, that their fiduciary duty is to humanity, and that the non-profit board would curtail the ambitions of the for-profit incentives. Was this not also believed by a sizeable portion of the employees there at the time? And what is left of these values after the financial incentives grew?<p>The market forces from the huge economic upside of AI devalue individual values in two ways. First, they reward those who choose whatever accelerates AI the most over any individuals who are more careful and act on individual values--the latter simply lose power in the long run until their virtue has no influence. As Anthropic says in their mission statements, it is not of much use to humanity to be virtuous if you are irrelevant. Second, as is true for many technologies, economic prosperity is deeply linked to human welfare, and slowing or limiting progress leads to real immediate harm to the human population. Thus any government regulations which are against AI progress will always be unpopular, because the values that warn of future harm from AIs are fighting against the values of saving people from diseases and starvation today.</p>
]]></description><pubDate>Sat, 28 Feb 2026 08:08:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=47192074</link><dc:creator>Davidzheng</dc:creator><comments>https://news.ycombinator.com/item?id=47192074</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47192074</guid></item><item><title><![CDATA[New comment by Davidzheng in "Two insider cases we've recently closed"]]></title><description><![CDATA[
<p>Election odds, chance of US bombing Iran, and many others</p>
]]></description><pubDate>Fri, 27 Feb 2026 03:30:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=47176113</link><dc:creator>Davidzheng</dc:creator><comments>https://news.ycombinator.com/item?id=47176113</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47176113</guid></item><item><title><![CDATA[New comment by Davidzheng in "Statement from Dario Amodei on our discussions with the Department of War"]]></title><description><![CDATA[
<p>Then maybe Dario will realize that the moral superiority on which he bases his advocacy against Chinese open models is naive at best.</p>
]]></description><pubDate>Fri, 27 Feb 2026 00:59:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=47174828</link><dc:creator>Davidzheng</dc:creator><comments>https://news.ycombinator.com/item?id=47174828</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47174828</guid></item><item><title><![CDATA[New comment by Davidzheng in "Statement from Dario Amodei on our discussions with the Department of War"]]></title><description><![CDATA[
<p>Neither of these things is a useful signal. Other labs surely trained on similar material (presumably not even buying hard copies). Also how "bothered" someone is about their predictions is a bad indicator -- the prediction, taken at face value, is meant to ask people to prepare for something he could not stop even if he wanted to.<p>None of this means I am a huge fan of Dario - I think he over-idealizes the implementation of democratic ideals in western countries and is unhealthily obsessed with the US "winning" over China based on this. But I don't like the reasons you listed.</p>
]]></description><pubDate>Fri, 27 Feb 2026 00:57:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=47174805</link><dc:creator>Davidzheng</dc:creator><comments>https://news.ycombinator.com/item?id=47174805</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47174805</guid></item><item><title><![CDATA[Quo Vadis, LLM Benchmarks?]]></title><description><![CDATA[
<p>Article URL: <a href="https://florianbrand.com/posts/benches-2026">https://florianbrand.com/posts/benches-2026</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47171835">https://news.ycombinator.com/item?id=47171835</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Thu, 26 Feb 2026 20:49:50 +0000</pubDate><link>https://florianbrand.com/posts/benches-2026</link><dc:creator>Davidzheng</dc:creator><comments>https://news.ycombinator.com/item?id=47171835</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47171835</guid></item><item><title><![CDATA[New comment by Davidzheng in "Nano Banana 2: Google's latest AI image generation model"]]></title><description><![CDATA[
<p>Re: But they aren't alive, they don't live in the world and have experiences, and they can't create something truly new.<p>Is it possible for a character in a novel to have novel experiences? Or for you to experience a novel dream? I would argue yes. You can know the rules of the environment and the starting conditions, but with a bit of randomness (or not) you can generate from that novel experiences which were unexpected - so too, from the data & distribution that AIs are already trained on, they can have new experiences.<p>Another source of novelty is good verifiers/recognition of a class of objects which are hard to construct but easy to verify - here the AI can search and from that obtain novel solutions which were unthought of before.<p>N.B. novelty itself is basically trivial - just generate random strings. But both of the above are mechanisms to generate novel samples inside some constraint of "meaningfulness"</p>
]]></description><pubDate>Thu, 26 Feb 2026 19:58:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=47171224</link><dc:creator>Davidzheng</dc:creator><comments>https://news.ycombinator.com/item?id=47171224</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47171224</guid></item><item><title><![CDATA[New comment by Davidzheng in "How will OpenAI compete?"]]></title><description><![CDATA[
<p>Not if they can leverage their superior abundance of compute/intelligence to invade other industries.</p>
]]></description><pubDate>Thu, 26 Feb 2026 06:28:27 +0000</pubDate><link>https://news.ycombinator.com/item?id=47162603</link><dc:creator>Davidzheng</dc:creator><comments>https://news.ycombinator.com/item?id=47162603</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47162603</guid></item><item><title><![CDATA[New comment by Davidzheng in "How will OpenAI compete?"]]></title><description><![CDATA[
<p>Once the majority of work at a company can be done by AI, Anthropic has an alternative revenue stream to selling AIs to that company--directly competing with that company with a completely integrated AI system. There are of course many barriers to entry/various advantages of incumbents--but it's possible to see a world in which the company selling the AI has a huge advantage too.</p>
]]></description><pubDate>Thu, 26 Feb 2026 06:27:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=47162590</link><dc:creator>Davidzheng</dc:creator><comments>https://news.ycombinator.com/item?id=47162590</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47162590</guid></item><item><title><![CDATA[New comment by Davidzheng in "Ask HN: Have top AI research institutions just given up on the idea of safety?"]]></title><description><![CDATA[
<p>Well, I do think there's some degree of unsafeness which is inextricably linked to capability--for example, if the model, when deployed with full control of a machine, is capable of large-scale cyberattacks and blackmail.</p>
]]></description><pubDate>Wed, 25 Feb 2026 15:52:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=47153186</link><dc:creator>Davidzheng</dc:creator><comments>https://news.ycombinator.com/item?id=47153186</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47153186</guid></item></channel></rss>