<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: idiliv</title><link>https://news.ycombinator.com/user?id=idiliv</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Fri, 05 Jun 2026 02:28:29 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=idiliv" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by idiliv in "Uber's $1,500/month AI limit is a useful signal for AI tool pricing"]]></title><description><![CDATA[
<p>Uber is likely on an enterprise plan - these charge tokens at API cost, which can be much more expensive than the $20 flat rate.</p>
]]></description><pubDate>Wed, 03 Jun 2026 18:54:49 +0000</pubDate><link>https://news.ycombinator.com/item?id=48388204</link><dc:creator>idiliv</dc:creator><comments>https://news.ycombinator.com/item?id=48388204</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48388204</guid></item><item><title><![CDATA[New comment by idiliv in "GLM-4.7-Flash"]]></title><description><![CDATA[
<p>Sometimes model developers coordinate with inference platforms to time releases in sync.</p>
]]></description><pubDate>Mon, 19 Jan 2026 15:44:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=46680228</link><dc:creator>idiliv</dc:creator><comments>https://news.ycombinator.com/item?id=46680228</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46680228</guid></item><item><title><![CDATA[New comment by idiliv in "Replace OCR with Vision Language Models"]]></title><description><![CDATA[
<p>Wait, but we're doing that already, and it works well (Qwen 2.5 VL)? If need be, you can always resort to structured generation to enforce schema conformity?</p>
]]></description><pubDate>Wed, 26 Feb 2025 22:20:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=43188925</link><dc:creator>idiliv</dc:creator><comments>https://news.ycombinator.com/item?id=43188925</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43188925</guid></item><item><title><![CDATA[New comment by idiliv in "AI engineers claim new algorithm reduces AI power consumption by 95%"]]></title><description><![CDATA[
<p>Duplicate, posted on October 9: <a href="https://news.ycombinator.com/item?id=41784591">https://news.ycombinator.com/item?id=41784591</a></p>
]]></description><pubDate>Sat, 19 Oct 2024 20:11:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=41890412</link><dc:creator>idiliv</dc:creator><comments>https://news.ycombinator.com/item?id=41890412</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41890412</guid></item><item><title><![CDATA[New comment by idiliv in "Llama 3.2 released: Multimodal, 1B to 90B sizes"]]></title><description><![CDATA[
<p>Where do you see the MMLU-Pro evaluation for Llama 3.2 90B? On the link I only see Llama 3.2 90B evaluated against multimodal benchmarks.</p>
]]></description><pubDate>Wed, 25 Sep 2024 18:03:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=41650115</link><dc:creator>idiliv</dc:creator><comments>https://news.ycombinator.com/item?id=41650115</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41650115</guid></item><item><title><![CDATA[New comment by idiliv in "Coffee Stats – Maximize Caffeine Intake and Get to Bed at Night"]]></title><description><![CDATA[
<p>Is the "Ultra Deep" analysis worth it over the standard "Deep" analysis?</p>
]]></description><pubDate>Mon, 23 Sep 2024 15:07:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=41626977</link><dc:creator>idiliv</dc:creator><comments>https://news.ycombinator.com/item?id=41626977</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41626977</guid></item><item><title><![CDATA[New comment by idiliv in "Learning to Reason with LLMs"]]></title><description><![CDATA[
<p>In the demo, O1 implements an incorrect version of the "squirrel finder" game?<p>The instructions state that the squirrel icon should spawn after three seconds,
yet it spawns immediately in the first game (also noted by the guy doing the demo).<p>Edit: I'm referring to the demo video here: <a href="https://openai.com/index/introducing-openai-o1-preview/" rel="nofollow">https://openai.com/index/introducing-openai-o1-preview/</a></p>
]]></description><pubDate>Thu, 12 Sep 2024 18:20:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=41523892</link><dc:creator>idiliv</dc:creator><comments>https://news.ycombinator.com/item?id=41523892</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=41523892</guid></item><item><title><![CDATA[Adversarial Perturbations Cannot Reliably Protect Artists from Generative AI]]></title><description><![CDATA[
<p>Article URL: <a href="https://arxiv.org/abs/2406.12027">https://arxiv.org/abs/2406.12027</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=40748080">https://news.ycombinator.com/item?id=40748080</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Fri, 21 Jun 2024 10:32:02 +0000</pubDate><link>https://arxiv.org/abs/2406.12027</link><dc:creator>idiliv</dc:creator><comments>https://news.ycombinator.com/item?id=40748080</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40748080</guid></item><item><title><![CDATA[New comment by idiliv in "IKEA's retailer's solved global 'unhappy worker' crisis by raising salaries"]]></title><description><![CDATA[
<p>How are flexible working hours equivalent to more money?</p>
]]></description><pubDate>Fri, 14 Jun 2024 07:59:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=40678642</link><dc:creator>idiliv</dc:creator><comments>https://news.ycombinator.com/item?id=40678642</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40678642</guid></item><item><title><![CDATA[New comment by idiliv in "AMD's MI300X Outperforms Nvidia's H100 for LLM Inference"]]></title><description><![CDATA[
<p>You can rent them online for ~ 4-5 $ per hour per GPU. Not cheap, but definitely feasible as a weekend project.</p>
]]></description><pubDate>Thu, 13 Jun 2024 11:56:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=40668538</link><dc:creator>idiliv</dc:creator><comments>https://news.ycombinator.com/item?id=40668538</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40668538</guid></item><item><title><![CDATA[New comment by idiliv in "Mistral AI Launches New 8x22B MOE Model"]]></title><description><![CDATA[
<p>Just tried this again and I also arrive at 16.92B. Not sure what I did wrong the first time, thanks for double-checking this!</p>
]]></description><pubDate>Wed, 10 Apr 2024 20:09:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=39995200</link><dc:creator>idiliv</dc:creator><comments>https://news.ycombinator.com/item?id=39995200</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39995200</guid></item><item><title><![CDATA[New comment by idiliv in "Mistral AI Launches New 8x22B MOE Model"]]></title><description><![CDATA[
<p>Oh, and to answer your actual question: Assuming that the model is released with 16 bits per parameter, then it as 281GB / 16 bit = 140.5 parameters.</p>
]]></description><pubDate>Wed, 10 Apr 2024 07:55:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=39988119</link><dc:creator>idiliv</dc:creator><comments>https://news.ycombinator.com/item?id=39988119</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39988119</guid></item><item><title><![CDATA[New comment by idiliv in "Mistral AI Launches New 8x22B MOE Model"]]></title><description><![CDATA[
<p>In Mixtral 8x7B, the 8 means that the model uses Mixture-of-Experts (MoE) layers with 8 experts. The 7B means that if you were to remove 7 of the 8 experts in each  layer, then you would end up with a 7B model (which would have exactly the same architecture as Mistral 7B). Therefore, a 1x7B model has 7B params. An 8x7B model has 1 * 7B + (8-1) * sz_expert params, where sz_expert is some constant value that the MoE layers increase by when adding one expert. In the case of Mixtral 8x7B the model size is 46.3GB, so, sz_expert ≈ 5.6B.<p>If these assumptions port over to 8x22B, then 8x22B has, at 281GB,  sz_expert ≈ 13.8B.</p>
]]></description><pubDate>Wed, 10 Apr 2024 07:52:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=39988103</link><dc:creator>idiliv</dc:creator><comments>https://news.ycombinator.com/item?id=39988103</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39988103</guid></item><item><title><![CDATA[New comment by idiliv in "Martin Kleppmann talk on local-first (LoFi)"]]></title><description><![CDATA[
<p>Hi Martin! It's Robert from Cambridge (you were my DOS :)). Glad to see your name pop up on HN!</p>
]]></description><pubDate>Tue, 20 Feb 2024 20:08:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=39446232</link><dc:creator>idiliv</dc:creator><comments>https://news.ycombinator.com/item?id=39446232</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39446232</guid></item><item><title><![CDATA[New comment by idiliv in "Sora: Creating video from text"]]></title><description><![CDATA[
<p>People here seem mostly impressed by the high resolution of these examples.<p>Based on my experience doing research on Stable Diffusion, scaling up the resolution is the conceptually easy part that only requires larger models and more high-resolution training data.<p>The hard part is semantic alignment with the prompt. Attempts to scale Stable Diffusion, like SDXL, have resulted only in marginally better prompt understanding (likely due to the continued reliance on CLIP prompt embeddings).<p>So, the key question here is how well Sora does prompt alignment.</p>
]]></description><pubDate>Thu, 15 Feb 2024 19:08:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=39387009</link><dc:creator>idiliv</dc:creator><comments>https://news.ycombinator.com/item?id=39387009</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39387009</guid></item><item><title><![CDATA[New comment by idiliv in "Huge proportion of internet is AI-generated slime, researchers find"]]></title><description><![CDATA[
<p>Hmm, are you sure that translations of LLMs like ChatGPT are not incorporating cultural context?</p>
]]></description><pubDate>Sat, 20 Jan 2024 17:50:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=39070274</link><dc:creator>idiliv</dc:creator><comments>https://news.ycombinator.com/item?id=39070274</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39070274</guid></item><item><title><![CDATA[New comment by idiliv in "Benchmarks and comparison of LLM AI models and API hosting providers"]]></title><description><![CDATA[
<p>I'm curious how they evaluated model quality. The only information I could find is "Quality: Index based on several quality benchmarks".</p>
]]></description><pubDate>Tue, 16 Jan 2024 19:17:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=39017632</link><dc:creator>idiliv</dc:creator><comments>https://news.ycombinator.com/item?id=39017632</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=39017632</guid></item><item><title><![CDATA[New comment by idiliv in "OpenAI Engineers Earning $800k a Year Turn Rare Skillset into Leverage"]]></title><description><![CDATA[
<p>They could join Mistral AI, which has published weights for at least some of its models. Another option is Meta AI, which has published weights for Llama and Llama 2.</p>
]]></description><pubDate>Mon, 25 Dec 2023 10:38:17 +0000</pubDate><link>https://news.ycombinator.com/item?id=38761482</link><dc:creator>idiliv</dc:creator><comments>https://news.ycombinator.com/item?id=38761482</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38761482</guid></item><item><title><![CDATA[Hugging Face releases Optimum-Nvidia to accelerate LLM inference]]></title><description><![CDATA[
<p>Article URL: <a href="https://huggingface.co/blog/optimum-nvidia">https://huggingface.co/blog/optimum-nvidia</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=38554585">https://news.ycombinator.com/item?id=38554585</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Thu, 07 Dec 2023 09:38:48 +0000</pubDate><link>https://huggingface.co/blog/optimum-nvidia</link><dc:creator>idiliv</dc:creator><comments>https://news.ycombinator.com/item?id=38554585</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38554585</guid></item><item><title><![CDATA[New comment by idiliv in "Sorry, but a new prompt for GPT-4 is not a paper"]]></title><description><![CDATA[
<p>Parent post is talking about LLMs, i.e. Large LMs. Research on LLMs is indeed in its infancy.</p>
]]></description><pubDate>Tue, 05 Dec 2023 14:58:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=38531645</link><dc:creator>idiliv</dc:creator><comments>https://news.ycombinator.com/item?id=38531645</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38531645</guid></item></channel></rss>