<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: veryluckyxyz</title><link>https://news.ycombinator.com/user?id=veryluckyxyz</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Tue, 14 Apr 2026 09:56:43 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=veryluckyxyz" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by veryluckyxyz in "SkyRL brings Tinker to your GPUs (2025)"]]></title><description><![CDATA[
<p>Consider fixing your “published date” on the site.</p>
]]></description><pubDate>Wed, 18 Feb 2026 22:35:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=47067376</link><dc:creator>veryluckyxyz</dc:creator><comments>https://news.ycombinator.com/item?id=47067376</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47067376</guid></item><item><title><![CDATA[New comment by veryluckyxyz in "SkyRL brings Tinker to your GPUs (2025)"]]></title><description><![CDATA[
<p>The date is wrong. It is 2026, not 2025</p>
]]></description><pubDate>Wed, 18 Feb 2026 22:34:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=47067370</link><dc:creator>veryluckyxyz</dc:creator><comments>https://news.ycombinator.com/item?id=47067370</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47067370</guid></item><item><title><![CDATA[Generalizing Test-Time Compute-Optimal Scaling as an Optimizable Graph]]></title><description><![CDATA[
<p>Article URL: <a href="https://huggingface.co/papers/2511.00086">https://huggingface.co/papers/2511.00086</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=45819069">https://news.ycombinator.com/item?id=45819069</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 05 Nov 2025 04:27:28 +0000</pubDate><link>https://huggingface.co/papers/2511.00086</link><dc:creator>veryluckyxyz</dc:creator><comments>https://news.ycombinator.com/item?id=45819069</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45819069</guid></item><item><title><![CDATA[Hidden drivers of HRM's performance on ARC-AGI]]></title><description><![CDATA[
<p>Article URL: <a href="https://arcprize.org/blog/hrm-analysis">https://arcprize.org/blog/hrm-analysis</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=45511888">https://news.ycombinator.com/item?id=45511888</a></p>
<p>Points: 31</p>
<p># Comments: 2</p>
]]></description><pubDate>Wed, 08 Oct 2025 03:54:36 +0000</pubDate><link>https://arcprize.org/blog/hrm-analysis</link><dc:creator>veryluckyxyz</dc:creator><comments>https://news.ycombinator.com/item?id=45511888</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45511888</guid></item><item><title><![CDATA[Set Block Decoding Is a Language Model Inference Accelerator]]></title><description><![CDATA[
<p>Article URL: <a href="https://arxiv.org/abs/2509.04185">https://arxiv.org/abs/2509.04185</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=45176890">https://news.ycombinator.com/item?id=45176890</a></p>
<p>Points: 4</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 09 Sep 2025 02:59:58 +0000</pubDate><link>https://arxiv.org/abs/2509.04185</link><dc:creator>veryluckyxyz</dc:creator><comments>https://news.ycombinator.com/item?id=45176890</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45176890</guid></item><item><title><![CDATA[Deep Think with Confidence]]></title><description><![CDATA[
<p>Article URL: <a href="https://jiaweizzhao.github.io/deepconf/">https://jiaweizzhao.github.io/deepconf/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=45006388">https://news.ycombinator.com/item?id=45006388</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Sun, 24 Aug 2025 18:16:32 +0000</pubDate><link>https://jiaweizzhao.github.io/deepconf/</link><dc:creator>veryluckyxyz</dc:creator><comments>https://news.ycombinator.com/item?id=45006388</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45006388</guid></item><item><title><![CDATA[A Batch Size and Token NUM- BER Agnostic Learning Rate Scheduler]]></title><description><![CDATA[
<p>Article URL: <a href="https://arxiv.org/abs/2408.13359">https://arxiv.org/abs/2408.13359</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=44166370">https://news.ycombinator.com/item?id=44166370</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 03 Jun 2025 04:42:25 +0000</pubDate><link>https://arxiv.org/abs/2408.13359</link><dc:creator>veryluckyxyz</dc:creator><comments>https://news.ycombinator.com/item?id=44166370</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44166370</guid></item><item><title><![CDATA[Easily Understand Rdma Technology]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.naddod.com/blog/easily-understand-rdma-technology">https://www.naddod.com/blog/easily-understand-rdma-technology</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=44150599">https://news.ycombinator.com/item?id=44150599</a></p>
<p>Points: 1</p>
<p># Comments: 1</p>
]]></description><pubDate>Sun, 01 Jun 2025 13:07:32 +0000</pubDate><link>https://www.naddod.com/blog/easily-understand-rdma-technology</link><dc:creator>veryluckyxyz</dc:creator><comments>https://news.ycombinator.com/item?id=44150599</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44150599</guid></item><item><title><![CDATA[Model Merging in Pre-Training of Large Language Models]]></title><description><![CDATA[
<p>Article URL: <a href="https://arxiv.org/abs/2505.12082">https://arxiv.org/abs/2505.12082</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=44047429">https://news.ycombinator.com/item?id=44047429</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 21 May 2025 01:12:29 +0000</pubDate><link>https://arxiv.org/abs/2505.12082</link><dc:creator>veryluckyxyz</dc:creator><comments>https://news.ycombinator.com/item?id=44047429</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44047429</guid></item><item><title><![CDATA[Understanding Perception and Reasoning Through Model Merging]]></title><description><![CDATA[
<p>Article URL: <a href="https://arxiv.org/abs/2505.05464">https://arxiv.org/abs/2505.05464</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43991510">https://news.ycombinator.com/item?id=43991510</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Thu, 15 May 2025 03:20:27 +0000</pubDate><link>https://arxiv.org/abs/2505.05464</link><dc:creator>veryluckyxyz</dc:creator><comments>https://news.ycombinator.com/item?id=43991510</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43991510</guid></item><item><title><![CDATA[Building and better understanding vision-language models (2024)]]></title><description><![CDATA[
<p>Article URL: <a href="https://huggingface.co/papers/2408.12637">https://huggingface.co/papers/2408.12637</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43946375">https://news.ycombinator.com/item?id=43946375</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Sat, 10 May 2025 15:22:46 +0000</pubDate><link>https://huggingface.co/papers/2408.12637</link><dc:creator>veryluckyxyz</dc:creator><comments>https://news.ycombinator.com/item?id=43946375</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43946375</guid></item><item><title><![CDATA[HF smolagents computer-agent demo]]></title><description><![CDATA[
<p>Article URL: <a href="https://huggingface.co/spaces/smolagents/computer-agent">https://huggingface.co/spaces/smolagents/computer-agent</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43915097">https://news.ycombinator.com/item?id=43915097</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 07 May 2025 13:03:15 +0000</pubDate><link>https://huggingface.co/spaces/smolagents/computer-agent</link><dc:creator>veryluckyxyz</dc:creator><comments>https://news.ycombinator.com/item?id=43915097</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43915097</guid></item><item><title><![CDATA[Do Reasoning Models Show Better Verbalized Calibration?]]></title><description><![CDATA[
<p>Article URL: <a href="https://arxiv.org/abs/2504.06564">https://arxiv.org/abs/2504.06564</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43738952">https://news.ycombinator.com/item?id=43738952</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Sat, 19 Apr 2025 19:48:49 +0000</pubDate><link>https://arxiv.org/abs/2504.06564</link><dc:creator>veryluckyxyz</dc:creator><comments>https://news.ycombinator.com/item?id=43738952</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43738952</guid></item><item><title><![CDATA[Robustly identifying concepts introduced during chat fine-tuning with crosscoder]]></title><description><![CDATA[
<p>Article URL: <a href="https://arxiv.org/abs/2504.02922">https://arxiv.org/abs/2504.02922</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43669700">https://news.ycombinator.com/item?id=43669700</a></p>
<p>Points: 6</p>
<p># Comments: 0</p>
]]></description><pubDate>Sun, 13 Apr 2025 02:57:21 +0000</pubDate><link>https://arxiv.org/abs/2504.02922</link><dc:creator>veryluckyxyz</dc:creator><comments>https://news.ycombinator.com/item?id=43669700</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43669700</guid></item><item><title><![CDATA[Retrieval with Learned Similarities]]></title><description><![CDATA[
<p>Article URL: <a href="https://arxiv.org/pdf/2407.15462vsl.com">https://arxiv.org/pdf/2407.15462vsl.com</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43441592">https://news.ycombinator.com/item?id=43441592</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Fri, 21 Mar 2025 22:53:42 +0000</pubDate><link>https://arxiv.org/pdf/2407.15462vsl.com</link><dc:creator>veryluckyxyz</dc:creator><comments>https://news.ycombinator.com/item?id=43441592</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43441592</guid></item><item><title><![CDATA[The Curse of Depth in Large Language Models]]></title><description><![CDATA[
<p>Article URL: <a href="https://arxiv.org/abs/2502.05795">https://arxiv.org/abs/2502.05795</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43432182">https://news.ycombinator.com/item?id=43432182</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Fri, 21 Mar 2025 05:49:48 +0000</pubDate><link>https://arxiv.org/abs/2502.05795</link><dc:creator>veryluckyxyz</dc:creator><comments>https://news.ycombinator.com/item?id=43432182</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43432182</guid></item><item><title><![CDATA[New comment by veryluckyxyz in "Looking Back at Speculative Decoding"]]></title><description><![CDATA[
<p><a href="https://pytorch.org/blog/hitchhikers-guide-speculative-decoding/" rel="nofollow">https://pytorch.org/blog/hitchhikers-guide-speculative-decod...</a><p><a href="https://colab.research.google.com/github/sanchit-gandhi/notebooks/blob/main/speculative_decoding.ipynb#scrollTo=af0b3757-72dc-48a8-9d9d-fc135386cae5" rel="nofollow">https://colab.research.google.com/github/sanchit-gandhi/note...</a></p>
]]></description><pubDate>Sat, 01 Mar 2025 06:29:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=43216538</link><dc:creator>veryluckyxyz</dc:creator><comments>https://news.ycombinator.com/item?id=43216538</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43216538</guid></item><item><title><![CDATA[Looking Back at Speculative Decoding]]></title><description><![CDATA[
<p>Article URL: <a href="https://research.google/blog/looking-back-at-speculative-decoding/">https://research.google/blog/looking-back-at-speculative-decoding/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43216518">https://news.ycombinator.com/item?id=43216518</a></p>
<p>Points: 36</p>
<p># Comments: 5</p>
]]></description><pubDate>Sat, 01 Mar 2025 06:24:47 +0000</pubDate><link>https://research.google/blog/looking-back-at-speculative-decoding/</link><dc:creator>veryluckyxyz</dc:creator><comments>https://news.ycombinator.com/item?id=43216518</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43216518</guid></item><item><title><![CDATA[Long-Context GRPO]]></title><description><![CDATA[
<p>Article URL: <a href="https://unsloth.ai/blog/grpo">https://unsloth.ai/blog/grpo</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=43124091">https://news.ycombinator.com/item?id=43124091</a></p>
<p>Points: 60</p>
<p># Comments: 22</p>
]]></description><pubDate>Fri, 21 Feb 2025 04:39:51 +0000</pubDate><link>https://unsloth.ai/blog/grpo</link><dc:creator>veryluckyxyz</dc:creator><comments>https://news.ycombinator.com/item?id=43124091</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43124091</guid></item><item><title><![CDATA[HippoRAG: Neurobiologically Inspired Long-Term Memory for LLMs (2024)]]></title><description><![CDATA[
<p>Article URL: <a href="https://arxiv.org/abs/2405.14831">https://arxiv.org/abs/2405.14831</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=42969750">https://news.ycombinator.com/item?id=42969750</a></p>
<p>Points: 65</p>
<p># Comments: 4</p>
]]></description><pubDate>Fri, 07 Feb 2025 05:34:59 +0000</pubDate><link>https://arxiv.org/abs/2405.14831</link><dc:creator>veryluckyxyz</dc:creator><comments>https://news.ycombinator.com/item?id=42969750</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=42969750</guid></item></channel></rss>