<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: rohansood15</title><link>https://news.ycombinator.com/user?id=rohansood15</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Thu, 09 Apr 2026 11:10:09 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=rohansood15" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by rohansood15 in "In Japan, the robot isn't coming for your job; it's filling the one nobody wants"]]></title><description><![CDATA[
<p>Why do you think that making people do what they don't want to do for more money is the most effective way to distribute wealth?</p>
]]></description><pubDate>Mon, 06 Apr 2026 00:59:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=47655669</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=47655669</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47655669</guid></item><item><title><![CDATA[New comment by rohansood15 in "Tell HN: Anthropic no longer allowing Claude Code subscriptions to use OpenClaw"]]></title><description><![CDATA[
<p>This email gives out the endgame - eventually, Claude subscription would be ~30% cheaper than API costs.<p>Our engineering team averages 1.5k per dev per month on credit costs, without busting Max limits today.</p>
]]></description><pubDate>Sat, 04 Apr 2026 01:47:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=47634772</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=47634772</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47634772</guid></item><item><title><![CDATA[New comment by rohansood15 in "TurboQuant: Redefining AI efficiency with extreme compression"]]></title><description><![CDATA[
<p>The paper is about vector quantization, which affects KV cache not model weights/sizes.</p>
]]></description><pubDate>Wed, 25 Mar 2026 16:38:10 +0000</pubDate><link>https://news.ycombinator.com/item?id=47519719</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=47519719</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47519719</guid></item><item><title><![CDATA[New comment by rohansood15 in "TurboQuant: Redefining AI efficiency with extreme compression"]]></title><description><![CDATA[
<p>Thank you.</p>
]]></description><pubDate>Wed, 25 Mar 2026 16:32:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=47519643</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=47519643</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47519643</guid></item><item><title><![CDATA[Private LLM Inference on Consumer Blackwell GPUs]]></title><description><![CDATA[
<p>Article URL: <a href="https://arxiv.org/abs/2601.09527">https://arxiv.org/abs/2601.09527</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=47361528">https://news.ycombinator.com/item?id=47361528</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Fri, 13 Mar 2026 07:17:13 +0000</pubDate><link>https://arxiv.org/abs/2601.09527</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=47361528</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47361528</guid></item><item><title><![CDATA[New comment by rohansood15 in "Gemini 3.1 Flash-Lite: Built for intelligence at scale"]]></title><description><![CDATA[
<p>What's the cheaper alternative from Gemini for Flash-2.5-lite level intelligence when it gets deprecated on 22nd July 2026?</p>
]]></description><pubDate>Tue, 03 Mar 2026 17:55:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=47236133</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=47236133</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47236133</guid></item><item><title><![CDATA[New comment by rohansood15 in "Gemini 3.1 Flash-Lite: Built for intelligence at scale"]]></title><description><![CDATA[
<p>Yea but there is a whole world of tasks for which Flash 2.5-lite was sufficiently intelligent. Given Google's depreciation policy, there will soon be no way to get that intelligence at that price.</p>
]]></description><pubDate>Tue, 03 Mar 2026 17:52:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=47236092</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=47236092</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47236092</guid></item><item><title><![CDATA[New comment by rohansood15 in "Gemini 3.1 Flash-Lite: Built for intelligence at scale"]]></title><description><![CDATA[
<p>For the last 2 years, startup wisdom has been that models will continue to get cheaper and better. Claude first, and now Gemini has shown that it's not the case.<p>We priced an enterprise contract using Flash 1.5 pricing last summer, and today that contract would be unit economic negative if we used Flash 3. Flash 2.5 and now Flash 3.1 Lite barely breaks even.<p>I predict open-source models and fine-tuning are going to make a real comeback this year for economic reasons.</p>
]]></description><pubDate>Tue, 03 Mar 2026 17:33:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=47235791</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=47235791</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47235791</guid></item><item><title><![CDATA[New comment by rohansood15 in "The Codex App"]]></title><description><![CDATA[
<p>+1 to this. Been using Codex the last few months, and this morning I asked it to plan a change. It gave me generic instructions like 'Check if you're using X' or 'Determine if logic is doing Y' - I was like WTF.</p>
]]></description><pubDate>Wed, 04 Feb 2026 03:12:00 +0000</pubDate><link>https://news.ycombinator.com/item?id=46880987</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=46880987</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46880987</guid></item><item><title><![CDATA[New comment by rohansood15 in "Show HN: Build Web Automations via Demonstration"]]></title><description><![CDATA[
<p>Playwright codegen.</p>
]]></description><pubDate>Wed, 28 Jan 2026 16:12:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=46797243</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=46797243</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46797243</guid></item><item><title><![CDATA[New comment by rohansood15 in "There is an AI code review bubble"]]></title><description><![CDATA[
<p>> Would you trust a Cursor review of Claude-written code more, less, or the same as a Cursor review of Cursor-written code?<p>You're assuming models/prompts insist on a previous iteration of their work being right. They don't. Models try to follow instructions, so if you ask them to find issues, they will. 'Trust' is a human problem, not a model/harness problem.<p>> Our view is that code validation will be completely autonomous in the medium term.<p>If reviews are going to be autonomous, they'd be part of the coding agent. Nobody would see it as an independent activity, you mentioned above.<p>> Our first step towards making this easier is a native Claude Code plugin.<p>Claude can review code based on a specific set of instructions/context in an MD file. An additional plugin is unnecessary.<p>My view is that to operate in this space, you gotta build a coding agent or get acquired by one. The writing was on the wall a year ago.</p>
]]></description><pubDate>Tue, 27 Jan 2026 02:35:24 +0000</pubDate><link>https://news.ycombinator.com/item?id=46774819</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=46774819</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46774819</guid></item><item><title><![CDATA[New comment by rohansood15 in "Launch HN: Tweeks (YC W25) – Browser extension to deshittify the web"]]></title><description><![CDATA[
<p>I agree that it should be open-source, but I think it can still be a YC company. Improving the user experience on the web is definitely a billion-dollar market.</p>
]]></description><pubDate>Thu, 13 Nov 2025 18:58:07 +0000</pubDate><link>https://news.ycombinator.com/item?id=45918981</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=45918981</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45918981</guid></item><item><title><![CDATA[New comment by rohansood15 in "Devpush – Open-source and self-hostable alternative to Vercel, Render, Netlify"]]></title><description><![CDATA[
<p>This looks really slick, though it's a bummer that there isn't a quick way to try the hosted version. You mentioned the Vercel UX in the comments, and I think the single-click install on the hosted version is a significant part of it.<p>EDIT: Just got approved for access - thanks!</p>
]]></description><pubDate>Tue, 07 Oct 2025 13:37:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=45502920</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=45502920</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45502920</guid></item><item><title><![CDATA[New comment by rohansood15 in "The RAG Obituary: Killed by agents, buried by context windows"]]></title><description><![CDATA[
<p>I don't get why folks are so dismissive here.<p>If you ever saw Claude Code/Codex use grep, you will find that it constructs complex queries that encompass a whole range of keywords which may not even be present in the original user query. So the 'semantic meaning' isn't actually lost.<p>And nobody is putting an entire enterprise's knowledge base inside the context window. How many enterprise tasks are there that need referencing more that a dozen docs? And even those that do, can be broken down into sub-tasks of manageable size.<p>Lastly, nobody here mentions how much of a pain it is to build, maintain and secure an enterprise vector database. People spend months cleaning the data, chunking and vectorizing it, only for newer versions of the same data making it redundant overnight. And good look recreating your entire permissioning and access control stack on top of the vector database you just created.<p>The RAG obituary is a bit provocative, and maybe that's intentional. But it's surprising how negative/dismissive the reactions in this thread are.</p>
]]></description><pubDate>Thu, 02 Oct 2025 16:02:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=45451464</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=45451464</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45451464</guid></item><item><title><![CDATA[New comment by rohansood15 in "Improved Gemini 2.5 Flash and Flash-Lite"]]></title><description><![CDATA[
<p>2.0 Flash is significantly cheaper than 2.5 Flash, and is/was better than 2.5-Flash-Lite before this latest update. It's a great workhorse model for basic text parsing/summary/image understanding etc. Though looks like 2.5-Flash-Lite will make it redundant.</p>
]]></description><pubDate>Fri, 26 Sep 2025 16:48:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=45388541</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=45388541</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45388541</guid></item><item><title><![CDATA[New comment by rohansood15 in "Gemma 3 270M: Compact model for hyper-efficient AI"]]></title><description><![CDATA[
<p>This is why we should have a downvote button on HN.<p>They say you shouldn't attribute to malice what can be attributed to incompetence, but this sure seems like malice.<p>The whole point of a 270M model is to condense the intelligence, and not the knowledge. Of course it doesn't fare well on a quiz.</p>
]]></description><pubDate>Fri, 15 Aug 2025 08:52:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=44910055</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=44910055</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44910055</guid></item><item><title><![CDATA[New comment by rohansood15 in "GPT-5: Key characteristics, pricing and system card"]]></title><description><![CDATA[
<p>What do you think it is 'mocking'? It is exactly the behavior that would make the tests work. And unless I give it access to production, it has no way to verify tasks like how values (in this case secrets/envs) are being passed.<p>Plus, this is all besides the point. Simon argued that the model hallucinates less, not a specific product.</p>
]]></description><pubDate>Sun, 10 Aug 2025 06:34:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=44853224</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=44853224</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44853224</guid></item><item><title><![CDATA[New comment by rohansood15 in "What the Windsurf sale means for the AI coding ecosystem"]]></title><description><![CDATA[
<p>The company can also issue a share buyback. Doesn't have to be profits. And you're right about the preference rights.<p>Employees who haven't vested their shares can't complain/enforce tag-along/sue for minority investor rights.</p>
]]></description><pubDate>Sun, 10 Aug 2025 06:30:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=44853204</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=44853204</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44853204</guid></item><item><title><![CDATA[New comment by rohansood15 in "What the Windsurf sale means for the AI coding ecosystem"]]></title><description><![CDATA[
<p>1.2B went to investors, the remaining 1.2B was actually an incentive/payout for the founders/employees that google took. The company basically has whatever money it had in the bank, plus a bit more from Google - but no investor liabilities.</p>
]]></description><pubDate>Sat, 09 Aug 2025 06:53:06 +0000</pubDate><link>https://news.ycombinator.com/item?id=44844589</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=44844589</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44844589</guid></item><item><title><![CDATA[New comment by rohansood15 in "GPT-5: Key characteristics, pricing and system card"]]></title><description><![CDATA[
<p>On multiple occasions, Claude Code claims it completed a task when it actually just wrote mock code. It will also answer questions with certainity (for e.g. where is this value being passed), but in reality it is making it up. So if you haven't been seeing hallucinations on Opus/Sonnet, you probably aren't looking deep enough.</p>
]]></description><pubDate>Fri, 08 Aug 2025 02:58:40 +0000</pubDate><link>https://news.ycombinator.com/item?id=44832939</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=44832939</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44832939</guid></item></channel></rss>