<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: rohansood15</title><link>https://news.ycombinator.com/user?id=rohansood15</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sun, 14 Jun 2026 22:22:55 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=rohansood15" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by rohansood15 in "Fable situation update from David Sacks"]]></title><description><![CDATA[
<p>Cyber-attacks to start, and real-world terrorist attacks/bombings (inc. chemical/biological weapons) later.</p>
]]></description><pubDate>Sat, 13 Jun 2026 18:03:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=48519788</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=48519788</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48519788</guid></item><item><title><![CDATA[Fable situation update from David Sacks]]></title><description><![CDATA[
<p>Article URL: <a href="https://twitter.com/DavidSacks/status/2065853007619588171">https://twitter.com/DavidSacks/status/2065853007619588171</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48519695">https://news.ycombinator.com/item?id=48519695</a></p>
<p>Points: 10</p>
<p># Comments: 7</p>
]]></description><pubDate>Sat, 13 Jun 2026 17:54:53 +0000</pubDate><link>https://twitter.com/DavidSacks/status/2065853007619588171</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=48519695</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48519695</guid></item><item><title><![CDATA[New comment by rohansood15 in "Statement on US government directive to suspend access to Fable 5 and Mythos 5"]]></title><description><![CDATA[
<p>I mean, we all pay via CC, so it's not like they can't know who you are if they wanted to.</p>
]]></description><pubDate>Sat, 13 Jun 2026 01:37:27 +0000</pubDate><link>https://news.ycombinator.com/item?id=48511507</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=48511507</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48511507</guid></item><item><title><![CDATA[New comment by rohansood15 in "AWS Bedrock to require sharing data with Anthropic for Mythos and future models"]]></title><description><![CDATA[
<p>It is only abuse-flagged data, and even there, they're not sharing that data with OpenAI. But for Anthropic they are.</p>
]]></description><pubDate>Wed, 10 Jun 2026 12:34:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=48475364</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=48475364</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48475364</guid></item><item><title><![CDATA[New comment by rohansood15 in "AWS Bedrock to require sharing data with Anthropic for Mythos and future models"]]></title><description><![CDATA[
<p>Pretty sure this doesn't work for any regulated enterprise or government client. But AWS knows this, so I am curious why they'd agree to it.</p>
]]></description><pubDate>Wed, 10 Jun 2026 08:55:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=48473451</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=48473451</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48473451</guid></item><item><title><![CDATA[New comment by rohansood15 in "Launch HN: General Instinct (YC P26) – Frontier models on edge devices"]]></title><description><![CDATA[
<p>I can't find it. Can you state your performance versus comparable 3-bit quantization from Unsloth/Bartowski? Edit: I appreciate that you seem to have open-sourced the quantization pipeline. This is not to question your work, but to understand where the outputs stand relative to the SoTA for quantization.</p>
]]></description><pubDate>Fri, 05 Jun 2026 17:58:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=48416016</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=48416016</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48416016</guid></item><item><title><![CDATA[New comment by rohansood15 in "Launch HN: General Instinct (YC P26) – Frontier models on edge devices"]]></title><description><![CDATA[
<p>Have you benchmarked against other 3-bit dynamic quants like Unsloth? I am sorry, but this framing against a full-precision, newer, smaller MoE just seems misleading. Also, Gemma-4-26B-A4B is not the SOTA for edge. Even at launch, that would be the 31B.</p>
]]></description><pubDate>Fri, 05 Jun 2026 17:40:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=48415809</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=48415809</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48415809</guid></item><item><title><![CDATA[New comment by rohansood15 in "OpenAI frontier models and Codex are now available on AWS"]]></title><description><![CDATA[
<p>Anthropic better get that IPO out soon. Their incredible revenue run-up was basically a result of botched Gemini releases and OpenAI having their hands tied behind their Azure backs.<p>Anthropic models were quite literally the only viable serverless API (i.e. Bedrock) models on AWS. They didn't even bother releasing the recent Qwen 3.5/3.6 series. Combined with the token efficiency/ROI focus, I would really like to see how Anthropic ends Q3.</p>
]]></description><pubDate>Tue, 02 Jun 2026 02:31:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=48365265</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=48365265</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48365265</guid></item><item><title><![CDATA[New comment by rohansood15 in "I think Anthropic and OpenAI have found product-market fit"]]></title><description><![CDATA[
<p>Are you comparing single-user requests or multiple concurrent requests when you say comparable to rented GPU? Most of the cost efficiencies kick in with concurrent/batch requests. A single H100 node can provide like 5k input + 2k output tok/s on a model like Qwen 3.6 35B-A3B with 30+ concurrent requests.</p>
]]></description><pubDate>Thu, 28 May 2026 01:57:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=48303417</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=48303417</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48303417</guid></item><item><title><![CDATA[New comment by rohansood15 in "Green card seekers must leave U.S. to apply, Trump administration says"]]></title><description><![CDATA[
<p>So ask for it. Seems like your issue isn't immigration, it is abuse. The recent changes don't do much to fix that, imo.</p>
]]></description><pubDate>Sun, 24 May 2026 01:47:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=48253523</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=48253523</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48253523</guid></item><item><title><![CDATA[New comment by rohansood15 in "Green card seekers must leave U.S. to apply, Trump administration says"]]></title><description><![CDATA[
<p>Let's say the government can't care for 100M people because of a lack of doctors. Now they could train one over 10 years, or you could have one of the smartest doctors in the world come be 100M+1. Would you take that?<p>Now expand that across the socio-economic spectrum (not enough plumbers, teachers, AI experts, researchers etc). That is what legal immigration is meant for.</p>
]]></description><pubDate>Sun, 24 May 2026 00:33:58 +0000</pubDate><link>https://news.ycombinator.com/item?id=48253074</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=48253074</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48253074</guid></item><item><title><![CDATA[New comment by rohansood15 in "Gemini 3.5 Flash"]]></title><description><![CDATA[
<p>Subjective, but if we compare to compute, not everyone needs the most expensive laptops or supercomputers for their work.<p>I think frontier models will be invaluable for scientific research, defense, financial analysis and such. But the average person would probably be reasonably well-served with a local model.<p>If you're in sales, customer service, product management and such - the leading open models at the 30B mark are already good enough.</p>
]]></description><pubDate>Wed, 20 May 2026 01:08:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=48201809</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=48201809</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48201809</guid></item><item><title><![CDATA[New comment by rohansood15 in "Apple Silicon costs less than OpenRouter"]]></title><description><![CDATA[
<p>Which part of this is a 'prediction'?</p>
]]></description><pubDate>Tue, 19 May 2026 08:11:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=48190591</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=48190591</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48190591</guid></item><item><title><![CDATA[New comment by rohansood15 in "Apple Silicon costs less than OpenRouter"]]></title><description><![CDATA[
<p>I don't think I follow?</p>
]]></description><pubDate>Tue, 19 May 2026 06:47:13 +0000</pubDate><link>https://news.ycombinator.com/item?id=48190064</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=48190064</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48190064</guid></item><item><title><![CDATA[New comment by rohansood15 in "Apple Silicon costs less than OpenRouter"]]></title><description><![CDATA[
<p>Thanks for the info. Daniel fixed it - and no it wasn't an LLM error. :P</p>
]]></description><pubDate>Tue, 19 May 2026 05:37:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=48189594</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=48189594</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48189594</guid></item><item><title><![CDATA[New comment by rohansood15 in "Apple Silicon costs less than OpenRouter"]]></title><description><![CDATA[
<p>I used the same assumptions as the original HN post <a href="https://news.ycombinator.com/item?id=48168198">https://news.ycombinator.com/item?id=48168198</a></p>
]]></description><pubDate>Tue, 19 May 2026 05:22:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=48189496</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=48189496</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48189496</guid></item><item><title><![CDATA[New comment by rohansood15 in "Apple Silicon costs less than OpenRouter"]]></title><description><![CDATA[
<p>The title got auto-corrected; my post said 'less', not 'more'.</p>
]]></description><pubDate>Tue, 19 May 2026 05:16:43 +0000</pubDate><link>https://news.ycombinator.com/item?id=48189450</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=48189450</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48189450</guid></item><item><title><![CDATA[New comment by rohansood15 in "Apple Silicon costs less than OpenRouter"]]></title><description><![CDATA[
<p>Nope, HN changed the title.<p><a href="https://imgur.com/a/UgJqWEh" rel="nofollow">https://imgur.com/a/UgJqWEh</a></p>
]]></description><pubDate>Tue, 19 May 2026 05:09:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=48189397</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=48189397</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48189397</guid></item><item><title><![CDATA[New comment by rohansood15 in "Apple Silicon costs less than OpenRouter"]]></title><description><![CDATA[
<p>The title is Apple Silicon costs LESS than OpenRouter. Not sure why it got updated to this - maybe because I referenced the original HN post?<p>Here's the full post:<p>TL;DR: When you account for batching, caching and input tokens, together with residual value, a MacBook Pro is actually 14% cheaper than OpenRouter. This becomes a whopping 3x (i.e. 65%) cheaper if you consider MoE models like Gemma 4 26B.<p>There was a well-meaning post yesterday by @DataDrivenAngel comparing costs of self-hosting LLMs v/s using OpenRouter (HN link). The analysis, however, had a few flaws as pointed out by the HN community, and I ran benchmarks on my M4 Max 128GB to adjust for those.<p>1. The estimate was based entirely on output tokens, instead of a real-world input-output token mix. The numbers look very different if you consider a 4:1 or 5:1 input to output token ratio.<p>2. Batching/concurrency/caching improves token throughput, and if you're running multiple coding agents/work trees the performance gain can be significant.<p>3. A MacBook Pro is an asset purchase, and retains significant residual value through its life. Probably not unreasonable to expect ~1.5-2.5k resale value after 3-5 years of use.<p>I ran vllm bench using a reasonable approximation for a coding agent workload with concurrency 4 for Gemma 4 31B (same as the original post), and got the following results:<p>-----------------------------------<p>Serving Benchmark Gemma 4 31B
Successful requests: 20
Maximum request concurrency: 4
Benchmark duration (s): 263.19
Total input tokens: 35000
Total generated tokens: 6400
Request throughput (req/s): 0.08
Output token throughput (tok/s): 24.32
Peak output token throughput (tok/s): 36
Peak concurrent requests: 8
Total token throughput (tok/s): 157.3<p>Scenario (ownership period, blended local cost per million tokens, vs. OpenRouter)
3 years $0.15 Local cheaper (~6%)
5 years $0.14 Local cheaper (~13%)
7 years $0.13 Local cheaper (~19%)<p>-----------------------------------<p>Once you work out the math (using the original assumptions on power costs and a 5-year timeline), you get to a blended cost of ~$0.14 per million tokens for local, v/s ~$0.16 for OpenRouter. That is not a massive win. But it is close enough to flip the narrative from local being more expensive to 'it depends'.<p>But it doesn't end there. If you use an MoE model like Gemma 4 26B, the blended cost drops to $0.038 per million tokens, v/s OpenRouter's $0.1 per million. That is a ~3x difference.<p>-----------------------------------<p>Serving Benchmark Gemma 4 26B (MoE)
Successful requests: 20
Maximum request concurrency: 4
Benchmark duration (s): 60.05
Total input tokens: 30002
Total generated tokens: 4870
Request throughput (req/s): 0.33
Output token throughput (tok/s): 81.1
Peak output token throughput (tok/s): 128
Peak concurrent requests: 8
Total token throughput (tok/s): 580.72<p>Scenario (ownership period, blended local cost per million tokens, vs. OpenRouter)
3 years $0.040 Local cheaper (~60%)
5 years $0.038 Local cheaper (~62%)
7 years $0.035 Local cheaper (~65%)<p>-----------------------------------<p>This is not meant as an attack on the original analysis - I am sure the synthetic bench I used has a few holes, and buying price/residual value varies a fair bit. Plus, I don't think anybody will run their MBP for inference for 5 years straight.
But with worsening GPU supply and the inevitable price/access squeeze, I think local LLMs have a huge role to play. And this is on top of the privacy benefits. A misperceived price differential should not be what slows down adoption.</p>
]]></description><pubDate>Tue, 19 May 2026 05:05:57 +0000</pubDate><link>https://news.ycombinator.com/item?id=48189375</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=48189375</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48189375</guid></item><item><title><![CDATA[New comment by rohansood15 in "Apple Silicon costs less than OpenRouter"]]></title><description><![CDATA[
<p>That's some HN shenanigans. I swear I copy-pasted my original title. <a href="https://imgur.com/a/UgJqWEh" rel="nofollow">https://imgur.com/a/UgJqWEh</a></p>
]]></description><pubDate>Tue, 19 May 2026 04:59:53 +0000</pubDate><link>https://news.ycombinator.com/item?id=48189348</link><dc:creator>rohansood15</dc:creator><comments>https://news.ycombinator.com/item?id=48189348</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48189348</guid></item></channel></rss>