<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: pyentropy</title><link>https://news.ycombinator.com/user?id=pyentropy</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sat, 13 Jun 2026 14:04:28 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=pyentropy" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by pyentropy in "Ask HN: How are thinking efforts implemented?"]]></title><description><![CDATA[
<p>I'm considering the possibility that it's <i>good</i> to break the prefix and cache because the LLM itself was rewarded (during post-training) with different prefixes/system prompts, each containing reasoning traces of the correct size.<p>I might be very very wrong though and LLMs disagree with me, insisting that cache is preserved and the system message doesn't have to change (even though it often contains effort level in context) if effort level changes across turns, and that all you have to do is tell the inference lib that parses think tags to early-close think tags that are too long.</p>
]]></description><pubDate>Sun, 07 Jun 2026 21:40:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=48438845</link><dc:creator>pyentropy</dc:creator><comments>https://news.ycombinator.com/item?id=48438845</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48438845</guid></item><item><title><![CDATA[New comment by pyentropy in "Ask HN: How are thinking efforts implemented?"]]></title><description><![CDATA[
<p>Examples with inference of different reasoning effort levels is in the OpenAI docs as well - <a href="https://developers.openai.com/cookbook/articles/openai-harmony#example-system-message" rel="nofollow">https://developers.openai.com/cookbook/articles/openai-harmo...</a><p><a href="https://docs.vllm.ai/en/latest/features/reasoning_outputs/#automatic-enable_thinking-activation" rel="nofollow">https://docs.vllm.ai/en/latest/features/reasoning_outputs/#a...</a><p><a href="https://developers.openai.com/api/docs/guides/reasoning" rel="nofollow">https://developers.openai.com/api/docs/guides/reasoning</a></p>
]]></description><pubDate>Sun, 07 Jun 2026 21:09:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=48438591</link><dc:creator>pyentropy</dc:creator><comments>https://news.ycombinator.com/item?id=48438591</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48438591</guid></item><item><title><![CDATA[New comment by pyentropy in "Ask HN: How are thinking efforts implemented?"]]></title><description><![CDATA[
<p>LLM-judge/parallel branching ≠ multi-token prediction ≠ reasoning effort.<p>See <a href="https://developers.openai.com/cookbook/articles/openai-harmony#example-system-message" rel="nofollow">https://developers.openai.com/cookbook/articles/openai-harmo...</a> and src/openai/types/shared/reasoning_effort.py</p>
]]></description><pubDate>Sun, 07 Jun 2026 21:09:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=48438577</link><dc:creator>pyentropy</dc:creator><comments>https://news.ycombinator.com/item?id=48438577</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48438577</guid></item><item><title><![CDATA[New comment by pyentropy in "Ask HN: How are thinking efforts implemented?"]]></title><description><![CDATA[
<p>The number of tokens you predict at time (multi or not) has nothing to do with whether the model wants to emit any, some or a lot of reasoning tokens in reasoning tag -- similar to how branch prediction will not really change the for loop iteration count.</p>
]]></description><pubDate>Sun, 07 Jun 2026 16:38:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=48436454</link><dc:creator>pyentropy</dc:creator><comments>https://news.ycombinator.com/item?id=48436454</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48436454</guid></item><item><title><![CDATA[New comment by pyentropy in "Ask HN: How are thinking efforts implemented?"]]></title><description><![CDATA[
<p>Take a look at the harmony repo which specifies the internal OpenAI format - the effort level is specified in the context after the <|start|> tag - <a href="https://github.com/openai/harmony" rel="nofollow">https://github.com/openai/harmony</a><p>Note that inference libs also have parsers that put hard limits on reasoning tokens with separate counters (similar to how you can put a limit on token generation per completion versus waiting for an <eos>). For that, take a look at vllm reasoning docs.</p>
]]></description><pubDate>Sun, 07 Jun 2026 15:48:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=48435972</link><dc:creator>pyentropy</dc:creator><comments>https://news.ycombinator.com/item?id=48435972</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48435972</guid></item><item><title><![CDATA[Dancing to the State of the Art? How candidate lists influence LKH TSP solvers]]></title><description><![CDATA[
<p>Article URL: <a href="https://arxiv.org/abs/2407.03927">https://arxiv.org/abs/2407.03927</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=45050543">https://news.ycombinator.com/item?id=45050543</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Thu, 28 Aug 2025 10:40:21 +0000</pubDate><link>https://arxiv.org/abs/2407.03927</link><dc:creator>pyentropy</dc:creator><comments>https://news.ycombinator.com/item?id=45050543</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45050543</guid></item><item><title><![CDATA[New comment by pyentropy in "Nvidia’s $589B DeepSeek rout"]]></title><description><![CDATA[
<p>If H800 is a memory-constrained model that NVIDIA built to avoid the Chinese export ban on H100 with equivalent fp8 performance,
it makes zero sense to believe Elon Musk, Dario Armodei and Alexandr Wang's claims that DeepSeek smuggled H100s.<p>The only reason why a team would allocate time on memory optimizations and writing NVPTX code rather than focusing on posttraining is if they severely struggled with memory during training.<p>I mean, take a look at the numbers:<p><a href="https://www.fibermall.com/blog/nvidia-ai-chip.htm#A100_vs_A800_H100_vs_H800" rel="nofollow">https://www.fibermall.com/blog/nvidia-ai-chip.htm#A100_vs_A8...</a><p>This is a massive trick pulled by Jensen, take the H100 design whose sales are regulated by the government, make it look 40x weaker and call it H800, while conveniently leaving 8-bit computation as fast as H100. Then bring it to China and let companies stockpile without disclosing production or sales numbers, and have no export controls.<p>Eventually, after 7 months, US govt starts noticing the H800 sales and introduces new export controls, but it's too late. By this point, DeepSeek has started research using fp8. They slowly build bigger and bigger models, work on the bandwidth and memory consumptions, until they make r1 - their reasoning model.</p>
]]></description><pubDate>Tue, 28 Jan 2025 09:10:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=42850333</link><dc:creator>pyentropy</dc:creator><comments>https://news.ycombinator.com/item?id=42850333</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42850333</guid></item><item><title><![CDATA[Why do I have a blog (and has it ever paid off?)]]></title><description><![CDATA[
<p>Article URL: <a href="https://fikisipi.substack.com/p/why-do-i-have-a-blog-and-has-it-ever">https://fikisipi.substack.com/p/why-do-i-have-a-blog-and-has-it-ever</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=41135233">https://news.ycombinator.com/item?id=41135233</a></p>
<p>Points: 1</p>
<p># Comments: 1</p>
]]></description><pubDate>Fri, 02 Aug 2024 01:08:47 +0000</pubDate><link>https://fikisipi.substack.com/p/why-do-i-have-a-blog-and-has-it-ever</link><dc:creator>pyentropy</dc:creator><comments>https://news.ycombinator.com/item?id=41135233</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41135233</guid></item><item><title><![CDATA[New comment by pyentropy in "Why haven't biologists cured cancer?"]]></title><description><![CDATA[
<p>You should start a blog... or maybe not - pursue the battle in academia/work and occasionally drop nuggets of wisdom like this somewhere. But do not delete them.</p>
]]></description><pubDate>Sun, 07 Jul 2024 11:26:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=40896816</link><dc:creator>pyentropy</dc:creator><comments>https://news.ycombinator.com/item?id=40896816</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40896816</guid></item><item><title><![CDATA[Busy Beaver, the current BB(5) conjecture and bbchallenge.org]]></title><description><![CDATA[
<p>Article URL: <a href="https://fikisipi.substack.com/p/busy-beaver-the-current-bb5-conjecture">https://fikisipi.substack.com/p/busy-beaver-the-current-bb5-conjecture</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=40780146">https://news.ycombinator.com/item?id=40780146</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 24 Jun 2024 20:10:43 +0000</pubDate><link>https://fikisipi.substack.com/p/busy-beaver-the-current-bb5-conjecture</link><dc:creator>pyentropy</dc:creator><comments>https://news.ycombinator.com/item?id=40780146</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40780146</guid></item><item><title><![CDATA[New comment by pyentropy in "Is Aschenbrenner's 165 page paper on AI the naivety of a 25 year old?"]]></title><description><![CDATA[
<p>I updated the post with a a link to counter-argument from Sabine Hossenfelder, the arguments from Zvi and three points from my side.</p>
]]></description><pubDate>Thu, 13 Jun 2024 22:23:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=40675513</link><dc:creator>pyentropy</dc:creator><comments>https://news.ycombinator.com/item?id=40675513</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40675513</guid></item><item><title><![CDATA[New comment by pyentropy in "Is Aschenbrenner's 165 page paper on AI the naivety of a 25 year old?"]]></title><description><![CDATA[
<p>I updated the post with a a link to counter-argument from Sabine Hossenfelder, the arguments from Zvi and three points from my side.</p>
]]></description><pubDate>Thu, 13 Jun 2024 22:22:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=40675505</link><dc:creator>pyentropy</dc:creator><comments>https://news.ycombinator.com/item?id=40675505</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40675505</guid></item><item><title><![CDATA[New comment by pyentropy in "Is Aschenbrenner's 165 page paper on AI the naivety of a 25 year old?"]]></title><description><![CDATA[
<p>Scott worked at OpenAI Safety and he likes it: <a href="https://scottaaronson.blog/?p=8047" rel="nofollow">https://scottaaronson.blog/?p=8047</a><p>But is the "-ed" in worked a problem?</p>
]]></description><pubDate>Thu, 13 Jun 2024 16:18:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=40671551</link><dc:creator>pyentropy</dc:creator><comments>https://news.ycombinator.com/item?id=40671551</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40671551</guid></item><item><title><![CDATA[New comment by pyentropy in "Is Aschenbrenner's 165 page paper on AI the naivety of a 25 year old?"]]></title><description><![CDATA[
<p>Thank you.</p>
]]></description><pubDate>Thu, 13 Jun 2024 16:17:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=40671537</link><dc:creator>pyentropy</dc:creator><comments>https://news.ycombinator.com/item?id=40671537</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40671537</guid></item><item><title><![CDATA[New comment by pyentropy in "Is Aschenbrenner's 165 page paper on AI the naivety of a 25 year old?"]]></title><description><![CDATA[
<p>It is a question. I tried to put what my opinion is on a few statements but I absolutely cannot summarize 160 pages (Business Insider did using GPT, which I find insulting and funny) nor have a 100% opinion on something that involves national security, secrets and other stuff that I don't have access to.</p>
]]></description><pubDate>Thu, 13 Jun 2024 16:17:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=40671532</link><dc:creator>pyentropy</dc:creator><comments>https://news.ycombinator.com/item?id=40671532</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40671532</guid></item><item><title><![CDATA[Is Aschenbrenner's 165 page paper on AI the naivety of a 25 year old?]]></title><description><![CDATA[
<p>Article URL: <a href="https://fikisipi.substack.com/p/is-aschenbrenners-165-page-paper">https://fikisipi.substack.com/p/is-aschenbrenners-165-page-paper</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=40667545">https://news.ycombinator.com/item?id=40667545</a></p>
<p>Points: 44</p>
<p># Comments: 68</p>
]]></description><pubDate>Thu, 13 Jun 2024 09:20:52 +0000</pubDate><link>https://fikisipi.substack.com/p/is-aschenbrenners-165-page-paper</link><dc:creator>pyentropy</dc:creator><comments>https://news.ycombinator.com/item?id=40667545</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40667545</guid></item><item><title><![CDATA[Short post: A look at Devin, the AI-powered software engineer]]></title><description><![CDATA[
<p>Article URL: <a href="https://fikisipi.substack.com/p/short-post-a-look-at-devin-the-ai">https://fikisipi.substack.com/p/short-post-a-look-at-devin-the-ai</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=39686936">https://news.ycombinator.com/item?id=39686936</a></p>
<p>Points: 4</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 13 Mar 2024 01:10:25 +0000</pubDate><link>https://fikisipi.substack.com/p/short-post-a-look-at-devin-the-ai</link><dc:creator>pyentropy</dc:creator><comments>https://news.ycombinator.com/item?id=39686936</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39686936</guid></item><item><title><![CDATA[My 2023 prediction mistakes and the new Metaculus scoring function]]></title><description><![CDATA[
<p>Article URL: <a href="https://fikisipi.substack.com/p/my-2023-prediction-mistakes-and-the">https://fikisipi.substack.com/p/my-2023-prediction-mistakes-and-the</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=38914346">https://news.ycombinator.com/item?id=38914346</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Mon, 08 Jan 2024 16:26:26 +0000</pubDate><link>https://fikisipi.substack.com/p/my-2023-prediction-mistakes-and-the</link><dc:creator>pyentropy</dc:creator><comments>https://news.ycombinator.com/item?id=38914346</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38914346</guid></item><item><title><![CDATA[New comment by pyentropy in "In the long run, we're all Dad"]]></title><description><![CDATA[
<p>You haven't read Scott's blog enough :)<p>He's an atheist psychiatrist. However, he enjoys how natural selection, social dynamics and reputation can also be modeled by the moral rules of most religions. For example, going to therapy isn't that different from practicing confessions in a church.</p>
]]></description><pubDate>Fri, 22 Dec 2023 13:40:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=38734017</link><dc:creator>pyentropy</dc:creator><comments>https://news.ycombinator.com/item?id=38734017</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38734017</guid></item><item><title><![CDATA[Time.mk: disrupting the Macedonian online media using clustering algorithms]]></title><description><![CDATA[
<p>Article URL: <a href="https://fikisipi.substack.com/p/the-man-that-disrupted-macedonian">https://fikisipi.substack.com/p/the-man-that-disrupted-macedonian</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=38703249">https://news.ycombinator.com/item?id=38703249</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 19 Dec 2023 23:11:19 +0000</pubDate><link>https://fikisipi.substack.com/p/the-man-that-disrupted-macedonian</link><dc:creator>pyentropy</dc:creator><comments>https://news.ycombinator.com/item?id=38703249</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38703249</guid></item></channel></rss>