<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: Palmik</title><link>https://news.ycombinator.com/user?id=Palmik</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Mon, 25 May 2026 21:50:33 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=Palmik" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by Palmik in "DeepSeek makes the V4 Pro price discount permanent"]]></title><description><![CDATA[
<p>DeepSeek V4's KV cache is very efficient due to its heavily compressed and sparse attention architecture.<p>DeepSeek V3.2 which uses DSA only (sparse attention, but without compression from HCA and CSA) is a smaller model but uses 10x more memory at 1M context window compared to DS V4 Pro.<p>Also, I have to say, DeepSeek's API has a very good cache hit rate. With the same workload, I see ~80% KV cache hit rate with the DS API vs ~50% with the major western inference providers for open weight models.</p>
]]></description><pubDate>Sat, 23 May 2026 08:25:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=48245863</link><dc:creator>Palmik</dc:creator><comments>https://news.ycombinator.com/item?id=48245863</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48245863</guid></item><item><title><![CDATA[New comment by Palmik in "DeepSeek makes the V4 Pro price discount permanent"]]></title><description><![CDATA[
<p>I really hope Huawei ramps up Ascend production and DeepSeek open sources their optimized inference engine (they already open source a lot of their kernels -- kudos to them). This could shake things up.</p>
]]></description><pubDate>Sat, 23 May 2026 08:18:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=48245835</link><dc:creator>Palmik</dc:creator><comments>https://news.ycombinator.com/item?id=48245835</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48245835</guid></item><item><title><![CDATA[New comment by Palmik in "DeepSeek makes the V4 Pro price discount permanent"]]></title><description><![CDATA[
<p>There are several things at play:<p>Inference stack efficiency: Many of these providers take off the shelf sglang / vllm / trtllm and hope for the best. Meanwhile DeepSeek team is known for pushing the boundary of optimizations.<p>Now, sglang and vllm are great pieces of software, but take DeepSeek's Sparse Attention (DSA). Introduced 1.5 years ago (<a href="https://arxiv.org/abs/2512.02556" rel="nofollow">https://arxiv.org/abs/2512.02556</a>), used by DeepSeek 3.2, GLM 5, DeepSeek V4. Only <i>now</i> is it slowly strating to get optimized in the major inference engines: (<a href="https://github.com/sgl-project/sglang/issues/19380" rel="nofollow">https://github.com/sgl-project/sglang/issues/19380</a> <a href="https://github.com/sgl-project/sglang/pull/22851" rel="nofollow">https://github.com/sgl-project/sglang/pull/22851</a> etc.). Of course, DS V4 adds extra optimizations into the model architecture on top of DSA, and those will take more time to be taken full advantage of by the open source inference engines.<p>Privacy: Betting that people will pay extra for inference hosted outside China. This is especially true with DeepSeek, because DeepSeek is transparent about using API data for model improvements.<p>And few other things (scale (matters a lot for MoEs), reliability, soft enterprise lock in, etc.)<p>---<p>There is also, likely, tacit collusion at play here. Look at GLM 5 and GLM 5.1 prices. GLM 5 and 5.1 cost the same to run, but providers decided to charge much more for 5.1 because it is much better model, and because Z.AI raised their price as well.</p>
]]></description><pubDate>Sat, 23 May 2026 08:17:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=48245824</link><dc:creator>Palmik</dc:creator><comments>https://news.ycombinator.com/item?id=48245824</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48245824</guid></item><item><title><![CDATA[House Committees Probe Cursor Parent, Airbnb over Chinese AI]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.semafor.com/article/04/29/2026/house-committee-probes-cursor-parent-airbnb-over-chinese-ai">https://www.semafor.com/article/04/29/2026/house-committee-probes-cursor-parent-airbnb-over-chinese-ai</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48033664">https://news.ycombinator.com/item?id=48033664</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 06 May 2026 08:20:09 +0000</pubDate><link>https://www.semafor.com/article/04/29/2026/house-committee-probes-cursor-parent-airbnb-over-chinese-ai</link><dc:creator>Palmik</dc:creator><comments>https://news.ycombinator.com/item?id=48033664</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48033664</guid></item><item><title><![CDATA[New comment by Palmik in "DeepSeek V4 – almost on the frontier"]]></title><description><![CDATA[
<p>Why was the title changed from "DeepSeek V4—almost on the frontier, a fraction of the price" to "DeepSeek V4—almost on the frontier"?</p>
]]></description><pubDate>Sun, 03 May 2026 05:50:27 +0000</pubDate><link>https://news.ycombinator.com/item?id=47993783</link><dc:creator>Palmik</dc:creator><comments>https://news.ycombinator.com/item?id=47993783</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47993783</guid></item><item><title><![CDATA[New comment by Palmik in "Anthropic Joins the Blender Development Fund as Corporate Patron"]]></title><description><![CDATA[
<p>Surely art also exists in textual realm.</p>
]]></description><pubDate>Tue, 28 Apr 2026 19:55:27 +0000</pubDate><link>https://news.ycombinator.com/item?id=47939739</link><dc:creator>Palmik</dc:creator><comments>https://news.ycombinator.com/item?id=47939739</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47939739</guid></item><item><title><![CDATA[Anthropic Claude Code HERMES.md billing flaw]]></title><description><![CDATA[
<p>Article URL: <a href="https://consumerrights.wiki/w/Anthropic_Claude_Code_HERMES.md_billing_flaw">https://consumerrights.wiki/w/Anthropic_Claude_Code_HERMES.md_billing_flaw</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47931492">https://news.ycombinator.com/item?id=47931492</a></p>
<p>Points: 1</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 28 Apr 2026 07:39:14 +0000</pubDate><link>https://consumerrights.wiki/w/Anthropic_Claude_Code_HERMES.md_billing_flaw</link><dc:creator>Palmik</dc:creator><comments>https://news.ycombinator.com/item?id=47931492</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47931492</guid></item><item><title><![CDATA[New comment by Palmik in "DeepSeek-V4 on Day 0: From Fast Inference to Verified RL with SGLang and Miles"]]></title><description><![CDATA[
<p>I don't think "friendly" and "publishing benchmarks" are at odds with each other.<p>Model makers (both open and closed weight) typically publish benchmarks against other models and when they do not, people rightfully call them out.<p>Including comparison against "other OSS engine" is just not helpful (what if it's a sandbagged baseline like HF Transformers?)</p>
]]></description><pubDate>Sun, 26 Apr 2026 05:40:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=47907670</link><dc:creator>Palmik</dc:creator><comments>https://news.ycombinator.com/item?id=47907670</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47907670</guid></item><item><title><![CDATA[New comment by Palmik in "DeepSeek-V4 on Day 0: From Fast Inference to Verified RL with SGLang and Miles"]]></title><description><![CDATA[
<p>Similar article for vLLM: <a href="https://vllm-website-pdzeaspbm-inferact-inc.vercel.app/blog/deepseek-v4" rel="nofollow">https://vllm-website-pdzeaspbm-inferact-inc.vercel.app/blog/...</a><p>Bechmarks from InferenceX (they do not have apples-to-apples setups to compare the different engines for whatever reason): <a href="https://inferencex.semianalysis.com/inference?i_hc=1&g_model=DeepSeek-V4-Pro&g_rundate=2026-04-25&g_runid=24943464864&i_prec=fp4%2Cfp8" rel="nofollow">https://inferencex.semianalysis.com/inference?i_hc=1&g_model...</a><p>I find it odd that sglang, vLLM, TRTLLM don't seem to want to publish benchmarks comparing each other. They used to, but now there seems to be some unspoken rule against it.<p>At least we get comparison against "other OSS engine" this time, but that could be HF's Transformers as well :)</p>
]]></description><pubDate>Sun, 26 Apr 2026 04:48:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=47907446</link><dc:creator>Palmik</dc:creator><comments>https://news.ycombinator.com/item?id=47907446</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47907446</guid></item><item><title><![CDATA[DeepSeek V4 in vLLM: Efficient Long-Context Attention]]></title><description><![CDATA[
<p>Article URL: <a href="https://vllm-website-pdzeaspbm-inferact-inc.vercel.app/blog/deepseek-v4">https://vllm-website-pdzeaspbm-inferact-inc.vercel.app/blog/deepseek-v4</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47902025">https://news.ycombinator.com/item?id=47902025</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Sat, 25 Apr 2026 15:02:53 +0000</pubDate><link>https://vllm-website-pdzeaspbm-inferact-inc.vercel.app/blog/deepseek-v4</link><dc:creator>Palmik</dc:creator><comments>https://news.ycombinator.com/item?id=47902025</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47902025</guid></item><item><title><![CDATA[New comment by Palmik in "DeepSeek v4"]]></title><description><![CDATA[
<p>Or there will be DSv4.1/2/3 ;)</p>
]]></description><pubDate>Fri, 24 Apr 2026 09:58:26 +0000</pubDate><link>https://news.ycombinator.com/item?id=47887989</link><dc:creator>Palmik</dc:creator><comments>https://news.ycombinator.com/item?id=47887989</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47887989</guid></item><item><title><![CDATA[New comment by Palmik in "ChatGPT Images 2.0"]]></title><description><![CDATA[
<p>Misleading conclusion.<p>This model is 8 times cheaper than Gemini for 1K images. Gemini is extremely overpriced.<p>1K image with Gemini is roughly $0.08 and only $0.01 with GPT Image.</p>
]]></description><pubDate>Wed, 22 Apr 2026 20:47:01 +0000</pubDate><link>https://news.ycombinator.com/item?id=47869101</link><dc:creator>Palmik</dc:creator><comments>https://news.ycombinator.com/item?id=47869101</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47869101</guid></item><item><title><![CDATA[New comment by Palmik in "ChatGPT Images 2.0"]]></title><description><![CDATA[
<p>Did you enable thinking for your experiment? Are you sure you were on the 2.0 rather than 1.5 version?</p>
]]></description><pubDate>Wed, 22 Apr 2026 16:38:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=47865979</link><dc:creator>Palmik</dc:creator><comments>https://news.ycombinator.com/item?id=47865979</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47865979</guid></item><item><title><![CDATA[New comment by Palmik in "ChatGPT Images 2.0"]]></title><description><![CDATA[
<p>I do not think this is a good prompt or useful benchmark, but nonetheless, it seems to work better for me: <a href="https://chatgpt.com/share/69e88a94-ded8-8395-b5dc-abceb2f44d02" rel="nofollow">https://chatgpt.com/share/69e88a94-ded8-8395-b5dc-abceb2f44d...</a></p>
]]></description><pubDate>Wed, 22 Apr 2026 08:47:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=47860861</link><dc:creator>Palmik</dc:creator><comments>https://news.ycombinator.com/item?id=47860861</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47860861</guid></item><item><title><![CDATA[NSA is using Anthropic's Mythos despite blacklist]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.axios.com/2026/04/19/nsa-anthropic-mythos-pentagon">https://www.axios.com/2026/04/19/nsa-anthropic-mythos-pentagon</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47832222">https://news.ycombinator.com/item?id=47832222</a></p>
<p>Points: 485</p>
<p># Comments: 348</p>
]]></description><pubDate>Mon, 20 Apr 2026 10:00:35 +0000</pubDate><link>https://www.axios.com/2026/04/19/nsa-anthropic-mythos-pentagon</link><dc:creator>Palmik</dc:creator><comments>https://news.ycombinator.com/item?id=47832222</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47832222</guid></item><item><title><![CDATA[New comment by Palmik in "Show HN: Smol machines – subsecond coldstart, portable virtual machines"]]></title><description><![CDATA[
<p>Could it be made even faster using some of the ideas from <a href="https://github.com/zerobootdev/zeroboot" rel="nofollow">https://github.com/zerobootdev/zeroboot</a> ?</p>
]]></description><pubDate>Sat, 18 Apr 2026 07:34:12 +0000</pubDate><link>https://news.ycombinator.com/item?id=47813938</link><dc:creator>Palmik</dc:creator><comments>https://news.ycombinator.com/item?id=47813938</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47813938</guid></item><item><title><![CDATA[New comment by Palmik in "Mozilla Announces "Thunderbolt" as an Open-Source, Enterprise AI Client"]]></title><description><![CDATA[
<p>Official announcement: <a href="https://www.thunderbolt.io/announcing-thunderbolt" rel="nofollow">https://www.thunderbolt.io/announcing-thunderbolt</a></p>
]]></description><pubDate>Thu, 16 Apr 2026 19:10:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=47798058</link><dc:creator>Palmik</dc:creator><comments>https://news.ycombinator.com/item?id=47798058</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47798058</guid></item><item><title><![CDATA[Mozilla Announces "Thunderbolt" as an Open-Source, Enterprise AI Client]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.phoronix.com/news/Mozilla-Thunderbolt">https://www.phoronix.com/news/Mozilla-Thunderbolt</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47798042">https://news.ycombinator.com/item?id=47798042</a></p>
<p>Points: 25</p>
<p># Comments: 11</p>
]]></description><pubDate>Thu, 16 Apr 2026 19:09:07 +0000</pubDate><link>https://www.phoronix.com/news/Mozilla-Thunderbolt</link><dc:creator>Palmik</dc:creator><comments>https://news.ycombinator.com/item?id=47798042</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47798042</guid></item><item><title><![CDATA[Google, Pentagon discuss classified AI deal, the Information reports]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.reuters.com/technology/google-pentagon-discuss-classified-ai-deal-information-reports-2026-04-16/">https://www.reuters.com/technology/google-pentagon-discuss-classified-ai-deal-information-reports-2026-04-16/</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47796950">https://news.ycombinator.com/item?id=47796950</a></p>
<p>Points: 6</p>
<p># Comments: 2</p>
]]></description><pubDate>Thu, 16 Apr 2026 17:45:23 +0000</pubDate><link>https://www.reuters.com/technology/google-pentagon-discuss-classified-ai-deal-information-reports-2026-04-16/</link><dc:creator>Palmik</dc:creator><comments>https://news.ycombinator.com/item?id=47796950</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47796950</guid></item><item><title><![CDATA[OpenAI, Anthropic, Google Unite to Combat Model Copying in China]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.bloomberg.com/news/articles/2026-04-06/openai-anthropic-google-unite-to-combat-model-copying-in-china">https://www.bloomberg.com/news/articles/2026-04-06/openai-anthropic-google-unite-to-combat-model-copying-in-china</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47672120">https://news.ycombinator.com/item?id=47672120</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Tue, 07 Apr 2026 08:11:43 +0000</pubDate><link>https://www.bloomberg.com/news/articles/2026-04-06/openai-anthropic-google-unite-to-combat-model-copying-in-china</link><dc:creator>Palmik</dc:creator><comments>https://news.ycombinator.com/item?id=47672120</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47672120</guid></item></channel></rss>