<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: disiplus</title><link>https://news.ycombinator.com/user?id=disiplus</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Wed, 10 Jun 2026 10:12:42 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=disiplus" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by disiplus in "I think Anthropic and OpenAI have found product-market fit"]]></title><description><![CDATA[
<p>Which model are you running?</p>
]]></description><pubDate>Sun, 31 May 2026 16:30:48 +0000</pubDate><link>https://news.ycombinator.com/item?id=48347061</link><dc:creator>disiplus</dc:creator><comments>https://news.ycombinator.com/item?id=48347061</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48347061</guid></item><item><title><![CDATA[New comment by disiplus in "The solution might be cancelling my AI subscription"]]></title><description><![CDATA[
<p>I'm diagnosed with ADHD, and ultimately it does not change anything for me, even though I had the same idea as you. The reason is that I can now start even more stuff in parallel. Some of it gets further along than before, because I can just prompt more when I'm in focus, but instead of finishing I add more features.</p>
]]></description><pubDate>Sun, 31 May 2026 16:24:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=48347000</link><dc:creator>disiplus</dc:creator><comments>https://news.ycombinator.com/item?id=48347000</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48347000</guid></item><item><title><![CDATA[New comment by disiplus in "I think Anthropic and OpenAI have found product-market fit"]]></title><description><![CDATA[
<p>Same, but you need more than 100k of hardware to run something like Kimi K2.6 for a bigger team. On the other hand, there is ds4 flash, which you can run on a MacBook with 128 GB of RAM, and that one is perfectly usable for a lot of tasks.<p><a href="https://github.com/antirez/ds4" rel="nofollow">https://github.com/antirez/ds4</a></p>
]]></description><pubDate>Wed, 27 May 2026 20:31:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=48300199</link><dc:creator>disiplus</dc:creator><comments>https://news.ycombinator.com/item?id=48300199</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48300199</guid></item><item><title><![CDATA[New comment by disiplus in "Agents can now create Cloudflare accounts, buy domains, and deploy"]]></title><description><![CDATA[
<p>The problem is not the website; the problem is discovery, and discovery happens on Instagram, TikTok, and other social networks. You don't have any incentive to build a website for a regular audience. What you might do is build an audience on a social network and then try to move it to a website.<p>But at that point you're big enough to build it properly.</p>
]]></description><pubDate>Wed, 06 May 2026 10:13:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=48034481</link><dc:creator>disiplus</dc:creator><comments>https://news.ycombinator.com/item?id=48034481</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48034481</guid></item><item><title><![CDATA[New comment by disiplus in "Accelerating Gemma 4: faster inference with multi-token prediction drafters"]]></title><description><![CDATA[
<p>It depends. A super small one is fine-tuned to do function calling, instead of sending the request to a big model and waiting: you ask for last month's revenue, I do a small LLM function call -> show the results. Some bigger ones handle analysis, summarization, and classification.
What is great about the smaller ones, and I'm looking at 2B and 4B, is that you can get huge throughput with just vLLM and a couple of consumer GPUs.
What I usually do is basically distill a big model into a smaller one.</p>
]]></description><pubDate>Tue, 05 May 2026 17:31:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=48025727</link><dc:creator>disiplus</dc:creator><comments>https://news.ycombinator.com/item?id=48025727</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48025727</guid></item><item><title><![CDATA[New comment by disiplus in "Accelerating Gemma 4: faster inference with multi-token prediction drafters"]]></title><description><![CDATA[
<p>I don't know what you are talking about; I replaced an older GPT-4o with a fine-tuned Qwen. There is a huge amount of "AI" that can be done with those models, or partly by those models. A huge number of people would not notice the difference. And if you prepare the context correctly, an even bigger slice of people would not notice.</p>
]]></description><pubDate>Tue, 05 May 2026 17:02:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=48025270</link><dc:creator>disiplus</dc:creator><comments>https://news.ycombinator.com/item?id=48025270</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48025270</guid></item><item><title><![CDATA[New comment by disiplus in "Accelerating Gemma 4: faster inference with multi-token prediction drafters"]]></title><description><![CDATA[
<p>Nice, I will run it later against Qwen3.6 27B. The speed was one of the reasons why I was running Qwen and not Gemma. The difference was big; there is some magic that happens when you have more than 100 tps.</p>
]]></description><pubDate>Tue, 05 May 2026 16:54:27 +0000</pubDate><link>https://news.ycombinator.com/item?id=48025139</link><dc:creator>disiplus</dc:creator><comments>https://news.ycombinator.com/item?id=48025139</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48025139</guid></item><item><title><![CDATA[New comment by disiplus in "DeepSeek v4"]]></title><description><![CDATA[
<p>It depends on how many users you have and what "production grade" means for you, but something like 500k gets you an 8x B200 machine.</p>
]]></description><pubDate>Fri, 24 Apr 2026 05:06:19 +0000</pubDate><link>https://news.ycombinator.com/item?id=47885778</link><dc:creator>disiplus</dc:creator><comments>https://news.ycombinator.com/item?id=47885778</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47885778</guid></item><item><title><![CDATA[New comment by disiplus in "Kimi K2.6: Advancing open-source coding"]]></title><description><![CDATA[
<p>I was part of the beta, and it's a properly good model; in some sense I forgot that I'm not on Opus or GPT. Opus is still better. GPT is the one struggling for me. It has some niche in backend work, but you can get the same from Opus with skills, and it's lacking in almost all other areas.</p>
]]></description><pubDate>Mon, 20 Apr 2026 20:40:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=47840225</link><dc:creator>disiplus</dc:creator><comments>https://news.ycombinator.com/item?id=47840225</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47840225</guid></item><item><title><![CDATA[New comment by disiplus in "ChatGPT Pro now starts at $100/month"]]></title><description><![CDATA[
<p>It looks like it's called prolite.<p><a href="https://snipboard.io/jmGKfM.jpg" rel="nofollow">https://snipboard.io/jmGKfM.jpg</a></p>
]]></description><pubDate>Thu, 09 Apr 2026 18:42:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=47707850</link><dc:creator>disiplus</dc:creator><comments>https://news.ycombinator.com/item?id=47707850</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47707850</guid></item><item><title><![CDATA[New comment by disiplus in "I've sold out"]]></title><description><![CDATA[
<p>yet</p>
]]></description><pubDate>Wed, 08 Apr 2026 10:53:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=47688390</link><dc:creator>disiplus</dc:creator><comments>https://news.ycombinator.com/item?id=47688390</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47688390</guid></item><item><title><![CDATA[New comment by disiplus in "GLM-5.1: Towards Long-Horizon Tasks"]]></title><description><![CDATA[
<p>I have GLM and Kimi. Kimi was better in most cases and was my replacement for Claude when I ran out of tokens. Now I'm finding myself using GLM more than Kimi. It's funny that GLM vs. Kimi is like Codex vs. Claude, where GLM and Codex are better for backend and Kimi and Claude are better for frontend.<p>As Kimi did a huge amount of Claude distillation, this seems to be somewhat grounded in data.<p><a href="https://www.anthropic.com/news/detecting-and-preventing-distillation-attacks" rel="nofollow">https://www.anthropic.com/news/detecting-and-preventing-dist...</a></p>
]]></description><pubDate>Tue, 07 Apr 2026 19:18:09 +0000</pubDate><link>https://news.ycombinator.com/item?id=47680049</link><dc:creator>disiplus</dc:creator><comments>https://news.ycombinator.com/item?id=47680049</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47680049</guid></item><item><title><![CDATA[New comment by disiplus in "GLM-5.1: Towards Long-Horizon Tasks"]]></title><description><![CDATA[
<p>Yeah, it seems they did not align it too much, at least for now. Yesterday it helped me bypass the bot detection on a local marketplace where I wanted to scrape some listings for my personal alerting system. All the others failed, but GLM-5.1 found a set of parameters and tweaks that keep my containerized browser from being detected.</p>
]]></description><pubDate>Tue, 07 Apr 2026 19:13:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=47680000</link><dc:creator>disiplus</dc:creator><comments>https://news.ycombinator.com/item?id=47680000</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47680000</guid></item><item><title><![CDATA[New comment by disiplus in "GLM-5.1: Towards Long-Horizon Tasks"]]></title><description><![CDATA[
<p>Basically my experience as well. Sometimes it can go past 100k and be OK, but mostly it breaks down.</p>
]]></description><pubDate>Tue, 07 Apr 2026 19:08:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=47679931</link><dc:creator>disiplus</dc:creator><comments>https://news.ycombinator.com/item?id=47679931</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47679931</guid></item><item><title><![CDATA[New comment by disiplus in "GLM-5.1: Towards Long-Horizon Tasks"]]></title><description><![CDATA[
<p>When it works and it's not slow, it can impress. Yesterday, for example, it solved something that Kimi K2.5 could not, and Kimi was the best open source model for me. But it is still slow sometimes. I have z.ai and Kimi subscriptions for when I run out of tokens on Claude (Max) and Codex (Plus).<p>I have a feeling it would be nearing Opus 4.5 level if they could fix it going crazy after about 100k tokens.</p>
]]></description><pubDate>Tue, 07 Apr 2026 18:15:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=47679203</link><dc:creator>disiplus</dc:creator><comments>https://news.ycombinator.com/item?id=47679203</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47679203</guid></item><item><title><![CDATA[New comment by disiplus in "Denmark was reportedly preparing for full-scale war with the US over Greenland"]]></title><description><![CDATA[
<p>The post mentions France, Germany, and the Nordic nations. France, Holland, and the Nordic nations helped in the early stages of the US.</p>
]]></description><pubDate>Thu, 19 Mar 2026 12:21:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=47438112</link><dc:creator>disiplus</dc:creator><comments>https://news.ycombinator.com/item?id=47438112</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47438112</guid></item><item><title><![CDATA[New comment by disiplus in "I am directing the Department of War to designate Anthropic a supply-chain risk"]]></title><description><![CDATA[
<p>It will also cost OpenAI dearly if they don't communicate clearly, because I for one will push internally to switch from OpenAI (we are actually on Azure) to Anthropic. Besides that, I would switch my private account as well.</p>
]]></description><pubDate>Fri, 27 Feb 2026 22:58:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=47187073</link><dc:creator>disiplus</dc:creator><comments>https://news.ycombinator.com/item?id=47187073</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47187073</guid></item><item><title><![CDATA[New comment by disiplus in "Google restricting Google AI Pro/Ultra subscribers for using OpenClaw"]]></title><description><![CDATA[
<p>I have them all. They're not just as good. Whoever tells you that looked only at the benchmarks, not at real use. They all fall short at some point.<p>Kimi K2.5 is the best one, but it's still not at the level of what Anthropic released with Opus 4.5.</p>
]]></description><pubDate>Mon, 23 Feb 2026 13:54:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=47122355</link><dc:creator>disiplus</dc:creator><comments>https://news.ycombinator.com/item?id=47122355</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47122355</guid></item><item><title><![CDATA[New comment by disiplus in "Ggml.ai joins Hugging Face to ensure the long-term progress of Local AI"]]></title><description><![CDATA[
<p>I think in the West we think everything is blocked. But for example, if you book an eSIM, when you visit you already get direct access to Western services because they route it to some other server. Hong Kong is totally different: they basically use WhatsApp and Google Maps, and everything worked when I was there.</p>
]]></description><pubDate>Fri, 20 Feb 2026 15:23:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=47089143</link><dc:creator>disiplus</dc:creator><comments>https://news.ycombinator.com/item?id=47089143</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47089143</guid></item><item><title><![CDATA[New comment by disiplus in "Ggml.ai joins Hugging Face to ensure the long-term progress of Local AI"]]></title><description><![CDATA[
<p>Yeah, they're the good guys. I suspect the open source work is mostly advertisements for them to sell consulting and services to enterprises. Otherwise, the work they do doesn't make sense to offer for free.</p>
]]></description><pubDate>Fri, 20 Feb 2026 15:20:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=47089109</link><dc:creator>disiplus</dc:creator><comments>https://news.ycombinator.com/item?id=47089109</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47089109</guid></item></channel></rss>