<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: blazespin</title><link>https://news.ycombinator.com/user?id=blazespin</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Thu, 09 Apr 2026 03:45:26 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=blazespin" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by blazespin in "Muse Spark: Scaling towards personal superintelligence"]]></title><description><![CDATA[
<p>Because bots and trillion-dollar IPOs and even bigger stakes.  People need to better appreciate the level of manipulation going on.  Social media has an outsized impact.  Bots and even people are getting paid to post and upvote/downvote narratives.</p>
]]></description><pubDate>Wed, 08 Apr 2026 21:26:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=47696492</link><dc:creator>blazespin</dc:creator><comments>https://news.ycombinator.com/item?id=47696492</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47696492</guid></item><item><title><![CDATA[New comment by blazespin in "GLM-5.1: Towards Long-Horizon Tasks"]]></title><description><![CDATA[
<p>Anthropic's reply?  A model you can't use.</p>
]]></description><pubDate>Tue, 07 Apr 2026 22:00:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=47681883</link><dc:creator>blazespin</dc:creator><comments>https://news.ycombinator.com/item?id=47681883</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47681883</guid></item><item><title><![CDATA[New comment by blazespin in "Project Glasswing: Securing critical software for the AI era"]]></title><description><![CDATA[
<p>Dario is big on beating China, and no doubt he believes cybersecurity is how to do that. You can tell, but Anthropic is sht at everything else.  Nobody uses it for real research.</p>
]]></description><pubDate>Tue, 07 Apr 2026 21:19:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=47681501</link><dc:creator>blazespin</dc:creator><comments>https://news.ycombinator.com/item?id=47681501</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47681501</guid></item><item><title><![CDATA[New comment by blazespin in "System Card: Claude Mythos Preview [pdf]"]]></title><description><![CDATA[
<p>Anthropic needs money like the 112B OpenAI got.  They could be hyping and this is good hype.  Who knows how benchmaxxed they are.<p>If they provide access to third-party benchmarking (not just one) then maybe I'll believe it.  Until then...</p>
]]></description><pubDate>Tue, 07 Apr 2026 20:51:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=47681204</link><dc:creator>blazespin</dc:creator><comments>https://news.ycombinator.com/item?id=47681204</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47681204</guid></item><item><title><![CDATA[New comment by blazespin in "System Card: Claude Mythos Preview [pdf]"]]></title><description><![CDATA[
<p>Yeah, need some good RE benchmarks for the LLMs.  :)<p>RE is a very interesting problem. A lot more than SWE can be RE'd.  I've found the LLMs are reluctant to assist, though you can work around it.</p>
]]></description><pubDate>Tue, 07 Apr 2026 20:48:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=47681177</link><dc:creator>blazespin</dc:creator><comments>https://news.ycombinator.com/item?id=47681177</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47681177</guid></item><item><title><![CDATA[New comment by blazespin in "A new Polymarket account made over $500k betting on the U.S. strike against Iran"]]></title><description><![CDATA[
<p>All I've ever seen is it encouraging people and not discouraging them.  The thrill of the easy money, right?<p>Forget it, Jake, it's Polymarket.</p>
]]></description><pubDate>Sun, 01 Mar 2026 20:00:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=47210114</link><dc:creator>blazespin</dc:creator><comments>https://news.ycombinator.com/item?id=47210114</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47210114</guid></item><item><title><![CDATA[New comment by blazespin in "A new Polymarket account made over $500k betting on the U.S. strike against Iran"]]></title><description><![CDATA[
<p>Yeah, trying to beat the market on actually predicting will get you pretty lame returns.  You'd probably do better in non-zero-sum games like the stock market.  At least there you get the benefit of the market always going up eventually.<p>No, the best way to win on Polymarket is purely by insider trading.  Which is why it's a useful thing to watch.  Insider news.<p>That said, the definition of 'insider trading' is always tricky.  At what point does it become insider?  Some things people call insider, others just call clever detective work.</p>
]]></description><pubDate>Sun, 01 Mar 2026 19:56:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=47210082</link><dc:creator>blazespin</dc:creator><comments>https://news.ycombinator.com/item?id=47210082</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47210082</guid></item><item><title><![CDATA[New comment by blazespin in "A new Polymarket account made over $500k betting on the U.S. strike against Iran"]]></title><description><![CDATA[
<p>"People don't play in corrupt markets for very long." ... ahhh, it's not a bug with Polymarket - it's a feature.</p>
]]></description><pubDate>Sun, 01 Mar 2026 19:55:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=47210072</link><dc:creator>blazespin</dc:creator><comments>https://news.ycombinator.com/item?id=47210072</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47210072</guid></item><item><title><![CDATA[New comment by blazespin in ""Cancel ChatGPT" movement goes mainstream after OpenAI closes deal with U.S. Dow"]]></title><description><![CDATA[
<p>I'd say it's more a kowtow to his voters and democracy.  Whatever you think of him, he was democratically elected.  Do remember to show up this November, though, and remind all your friends.</p>
]]></description><pubDate>Sun, 01 Mar 2026 19:39:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=47209922</link><dc:creator>blazespin</dc:creator><comments>https://news.ycombinator.com/item?id=47209922</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47209922</guid></item><item><title><![CDATA[New comment by blazespin in "US blocks all offshore wind construction, says reason is classified"]]></title><description><![CDATA[
<p>Trump literally said he wants Venezuela to return the oil it 'stole' when it nationalized.</p>
]]></description><pubDate>Tue, 23 Dec 2025 19:41:37 +0000</pubDate><link>https://news.ycombinator.com/item?id=46368614</link><dc:creator>blazespin</dc:creator><comments>https://news.ycombinator.com/item?id=46368614</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46368614</guid></item><item><title><![CDATA[Advanced fusion control breakthrough brings clean, reliable energy closer]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.jpost.com/science/article-880512">https://www.jpost.com/science/article-880512</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=46302026">https://news.ycombinator.com/item?id=46302026</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 17 Dec 2025 16:55:31 +0000</pubDate><link>https://www.jpost.com/science/article-880512</link><dc:creator>blazespin</dc:creator><comments>https://news.ycombinator.com/item?id=46302026</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46302026</guid></item><item><title><![CDATA[New comment by blazespin in "AI agents are starting to eat SaaS"]]></title><description><![CDATA[
<p>There is significant uncertainty in all of this, which is really the most damaging aspect.  If AI improves, and it is threatening to, then growth in SaaS may decline to a point where investing in it needs to be reconsidered.<p>The problem is, nobody knows how much and how fast AI will improve or how much it will cost if it does.<p>That uncertainty alone is very problematic, and I think its impact on everything it can potentially touch is being underestimated.<p>For now though, I've seen a wall form in benchmarks like swe-rebench and swebench pro. Greenfield is expanding, but maintenance is still a problem.<p>I think AI needs to get much better at maintenance before serious companies can choose build over buy for anything but the most trivial apps.</p>
]]></description><pubDate>Mon, 15 Dec 2025 00:47:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=46268931</link><dc:creator>blazespin</dc:creator><comments>https://news.ycombinator.com/item?id=46268931</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46268931</guid></item><item><title><![CDATA[New comment by blazespin in "The Gorman Paradox: Where Are All the AI-Generated Apps?"]]></title><description><![CDATA[
<p>Nobody collapses, everything just shrinks.<p>And we're seeing that in the labor numbers.<p>Sometimes things are harder to see because it's chipping away everywhere at the margins.</p>
]]></description><pubDate>Sun, 14 Dec 2025 23:16:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=46268175</link><dc:creator>blazespin</dc:creator><comments>https://news.ycombinator.com/item?id=46268175</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46268175</guid></item><item><title><![CDATA[New comment by blazespin in "DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning [pdf]"]]></title><description><![CDATA[
<p>who likely wins, fify</p>
]]></description><pubDate>Fri, 28 Nov 2025 20:13:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=46082362</link><dc:creator>blazespin</dc:creator><comments>https://news.ycombinator.com/item?id=46082362</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46082362</guid></item><item><title><![CDATA[New comment by blazespin in "DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning [pdf]"]]></title><description><![CDATA[
<p>If you read the paper, that is the intention: to guide stuff like Lean.<p>I don't think LLM is a great pure RLVR</p>
]]></description><pubDate>Fri, 28 Nov 2025 20:11:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=46082347</link><dc:creator>blazespin</dc:creator><comments>https://news.ycombinator.com/item?id=46082347</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46082347</guid></item><item><title><![CDATA[New comment by blazespin in "DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning [pdf]"]]></title><description><![CDATA[
<p>Advanced math solving, as the results indicate.  Informal proof reasoning is advancing faster than formal proof reasoning because the latter is slow and compute intensive.<p>I suspect it's also because there isn't a lot of data to train on.</p>
]]></description><pubDate>Thu, 27 Nov 2025 23:08:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=46073990</link><dc:creator>blazespin</dc:creator><comments>https://news.ycombinator.com/item?id=46073990</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46073990</guid></item><item><title><![CDATA[New comment by blazespin in "DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning [pdf]"]]></title><description><![CDATA[
<p>Verifying math requires something like Lean, which is a huge bottleneck, as the paper explains.<p>Plus there isn't a lot of training data in Lean.<p>Most gains come from training on stuff already out there, not really the RLVR part, which just amps it up a bit.</p>
]]></description><pubDate>Thu, 27 Nov 2025 23:07:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=46073985</link><dc:creator>blazespin</dc:creator><comments>https://news.ycombinator.com/item?id=46073985</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46073985</guid></item><item><title><![CDATA[New comment by blazespin in "DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning [pdf]"]]></title><description><![CDATA[
<p>More training data on advanced math.  Lean is cool, but it's mostly about formalizing stuff we already know.</p>
]]></description><pubDate>Thu, 27 Nov 2025 23:05:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=46073974</link><dc:creator>blazespin</dc:creator><comments>https://news.ycombinator.com/item?id=46073974</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46073974</guid></item><item><title><![CDATA[New comment by blazespin in "Google Antigravity exfiltrates data via indirect prompt injection attack"]]></title><description><![CDATA[
<p>You can't process untrustworthy data, period.  There are so many things that can go wrong with that.</p>
]]></description><pubDate>Wed, 26 Nov 2025 01:19:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=46053019</link><dc:creator>blazespin</dc:creator><comments>https://news.ycombinator.com/item?id=46053019</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46053019</guid></item><item><title><![CDATA[New comment by blazespin in "Correction: Anthropic attack did not have 1000/s requests"]]></title><description><![CDATA[
<p>Given they made geopolitical accusations and dozens of mainstream publications repeated the "thousands of requests per second", this seems like a grossly negligent flub that should not be dismissed as a mere typo.<p>I frequently, alone, do 1000s of requests over a period of time, especially ones that are mostly cache hits, which can be $10-$50 in API costs.<p>This was not a "large scale" attack by any means.</p>
]]></description><pubDate>Sat, 15 Nov 2025 07:04:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=45935592</link><dc:creator>blazespin</dc:creator><comments>https://news.ycombinator.com/item?id=45935592</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45935592</guid></item></channel></rss>