<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: kiratp</title><link>https://news.ycombinator.com/user?id=kiratp</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sun, 14 Jun 2026 06:41:46 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=kiratp" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by kiratp in "An update on recent Claude Code quality reports"]]></title><description><![CDATA[
<p>Agents making forward progress hours apart is an expected pattern and inference engines are being adapted to serve that purpose well.<p>It’s hard to do it without killing performance and requires engineering in the DC to have fast access to SSDs etc.<p>Disclosure: work on ai@msft. Opinions my own.</p>
]]></description><pubDate>Fri, 24 Apr 2026 02:01:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=47884623</link><dc:creator>kiratp</dc:creator><comments>https://news.ycombinator.com/item?id=47884623</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47884623</guid></item><item><title><![CDATA[New comment by kiratp in "An update on recent Claude Code quality reports"]]></title><description><![CDATA[
<p>OpenAI does this for all API calls<p>> Our systems will smartly ignore any reasoning items that aren’t relevant to your functions, and only retain those in context that are relevant. You can pass reasoning items from previous responses either using the previous_response_id parameter, or by manually passing in all the output items from a past response into the input of a new one.<p><a href="https://developers.openai.com/api/docs/guides/reasoning" rel="nofollow">https://developers.openai.com/api/docs/guides/reasoning</a><p>Disclosure - work on AI@msft</p>
]]></description><pubDate>Fri, 24 Apr 2026 01:42:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=47884517</link><dc:creator>kiratp</dc:creator><comments>https://news.ycombinator.com/item?id=47884517</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47884517</guid></item><item><title><![CDATA[New comment by kiratp in "An update on recent Claude Code quality reports"]]></title><description><![CDATA[
<p>By caching they mean “cached in GPU memory”. That’s a very very scarce resource.<p>Caching to RAM and disk is a thing but it’s hard to keep performance up with that and it’s early days of that tech being deployed anywhere.<p>Disclosure: work on AI at Microsoft. Above is just common industry info (see work happening in vLLM for example)</p>
]]></description><pubDate>Fri, 24 Apr 2026 01:37:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=47884487</link><dc:creator>kiratp</dc:creator><comments>https://news.ycombinator.com/item?id=47884487</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47884487</guid></item><item><title><![CDATA[New comment by kiratp in "Anthropic takes $5B from Amazon and pledges $100B in cloud spending in return"]]></title><description><![CDATA[
<p>At the full current retail API price.<p>Business buyers are paying API prices, not subscription<p>Disclosure: Work at Microsoft on AI</p>
]]></description><pubDate>Tue, 21 Apr 2026 18:35:31 +0000</pubDate><link>https://news.ycombinator.com/item?id=47852637</link><dc:creator>kiratp</dc:creator><comments>https://news.ycombinator.com/item?id=47852637</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47852637</guid></item><item><title><![CDATA[New comment by kiratp in "1M context is now generally available for Opus 4.6 and Sonnet 4.6"]]></title><description><![CDATA[
<p>GitHub Copilot CLI lets you use all these models (unless your employer disables them.<p><a href="https://github.com/features/copilot/cli" rel="nofollow">https://github.com/features/copilot/cli</a><p>Disclosure: work at Msft</p>
]]></description><pubDate>Sat, 14 Mar 2026 03:12:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=47372941</link><dc:creator>kiratp</dc:creator><comments>https://news.ycombinator.com/item?id=47372941</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47372941</guid></item><item><title><![CDATA[New comment by kiratp in "NASA announces overhaul of Artemis program amid safety concerns, delays"]]></title><description><![CDATA[
<p>Same contractors (Beoing) who built Starliner...<p>Explaining Why NASA's Starliner Report Is So Bad > <a href="https://www.youtube.com/watch?v=L96asfTvJ_A" rel="nofollow">https://www.youtube.com/watch?v=L96asfTvJ_A</a></p>
]]></description><pubDate>Fri, 27 Feb 2026 17:58:22 +0000</pubDate><link>https://news.ycombinator.com/item?id=47183387</link><dc:creator>kiratp</dc:creator><comments>https://news.ycombinator.com/item?id=47183387</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47183387</guid></item><item><title><![CDATA[New comment by kiratp in "Sam Altman’s DRAM Deal"]]></title><description><![CDATA[
<p>This is missing a key part of the picture - Nvidia just announced that partners will need to source RAM themselves.<p>OpenAI is basically ensuring that they can actually get the chips they need for the DCs they are building.<p>I can’t guess as to what move came first (Nvidia policy change or these DRAM deals) but I would bet this is a large if not larger factor here than “bloc my competitors.</p>
]]></description><pubDate>Sat, 06 Dec 2025 03:52:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=46170521</link><dc:creator>kiratp</dc:creator><comments>https://news.ycombinator.com/item?id=46170521</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46170521</guid></item><item><title><![CDATA[New comment by kiratp in "Fizz Buzz without conditionals or booleans"]]></title><description><![CDATA[
<p>A loop either never halts or has a conditional. I guess a compiler could elide a “while True:” to a branch-less jump instruction.<p>One hack would be to use recursion and let stack exhaustion stop you.</p>
]]></description><pubDate>Wed, 19 Nov 2025 04:41:51 +0000</pubDate><link>https://news.ycombinator.com/item?id=45975930</link><dc:creator>kiratp</dc:creator><comments>https://news.ycombinator.com/item?id=45975930</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45975930</guid></item><item><title><![CDATA[New comment by kiratp in "Fizz Buzz without conditionals or booleans"]]></title><description><![CDATA[
<p>A for loop has a conditional in it.<p>Unless by conditionals we mean “no if/else” and not “no branch instructions”.</p>
]]></description><pubDate>Wed, 19 Nov 2025 04:27:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=45975842</link><dc:creator>kiratp</dc:creator><comments>https://news.ycombinator.com/item?id=45975842</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45975842</guid></item><item><title><![CDATA[New comment by kiratp in "Fizz Buzz without conditionals or booleans"]]></title><description><![CDATA[
<p>A for loop has an implicit conditional in its stop condition check.</p>
]]></description><pubDate>Wed, 19 Nov 2025 04:26:21 +0000</pubDate><link>https://news.ycombinator.com/item?id=45975834</link><dc:creator>kiratp</dc:creator><comments>https://news.ycombinator.com/item?id=45975834</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45975834</guid></item><item><title><![CDATA[New comment by kiratp in "UnitedHealth pays its own physician groups 17% more than outside ones"]]></title><description><![CDATA[
<p>This only applies to large employers. Smaller ones are just presentef a limited list of plans to pick from, and the plans change every year. Most of the time, as a startup, you can’t buy a Mag7 equivalent health plan for any amount of money off the marketplace</p>
]]></description><pubDate>Tue, 04 Nov 2025 04:03:15 +0000</pubDate><link>https://news.ycombinator.com/item?id=45807305</link><dc:creator>kiratp</dc:creator><comments>https://news.ycombinator.com/item?id=45807305</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45807305</guid></item><item><title><![CDATA[New comment by kiratp in "FSF announces Librephone project"]]></title><description><![CDATA[
<p>Should the app builder’s ability to “trust” that the hardware will protect them from the user supersede the user’s ability to be able to trust that the hardware will protect them from the app?<p>In other words, should the device be responsible to enforcing DRM (and more) against its owner?</p>
]]></description><pubDate>Wed, 15 Oct 2025 03:32:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=45587815</link><dc:creator>kiratp</dc:creator><comments>https://news.ycombinator.com/item?id=45587815</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45587815</guid></item><item><title><![CDATA[New comment by kiratp in "The Tiny Teams Playbook"]]></title><description><![CDATA[
<p>The kind of people in these small teams are not ones to think "work is just work".</p>
]]></description><pubDate>Mon, 13 Oct 2025 03:13:20 +0000</pubDate><link>https://news.ycombinator.com/item?id=45564366</link><dc:creator>kiratp</dc:creator><comments>https://news.ycombinator.com/item?id=45564366</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45564366</guid></item><item><title><![CDATA[New comment by kiratp in "Meta’s live demo fails; “AI” recording plays before the actor takes the steps"]]></title><description><![CDATA[
<p>You can put the AI on rails by just prompting by it. The latest models are very steerable.<p>System prompt: “stick to steps 1-n. Step 1 is…”<p>I can say confidently because our company does this. And we have F500 customers in production.</p>
]]></description><pubDate>Fri, 19 Sep 2025 01:16:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=45296906</link><dc:creator>kiratp</dc:creator><comments>https://news.ycombinator.com/item?id=45296906</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45296906</guid></item><item><title><![CDATA[New comment by kiratp in "Meta’s live demo fails; “AI” recording plays before the actor takes the steps"]]></title><description><![CDATA[
<p>I see no evidence of that. It seems like they tried to put the AI “on rails” with predefined steps and things went wrong.</p>
]]></description><pubDate>Fri, 19 Sep 2025 00:43:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=45296725</link><dc:creator>kiratp</dc:creator><comments>https://news.ycombinator.com/item?id=45296725</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45296725</guid></item><item><title><![CDATA[New comment by kiratp in "Meta’s live demo fails; “AI” recording plays before the actor takes the steps"]]></title><description><![CDATA[
<p>So much negativity.<p>I’m just excited that our industry is lead by optimists and our culture enables our corporations to invest huge sums into taking us forward technologically.<p>Meta could have just done a stock buyback but instead they made a computer that can talk, see, solve problems and paint virtual things into the real world in front of your eyes!<p>I commend them on attempting a live demo.</p>
]]></description><pubDate>Fri, 19 Sep 2025 00:41:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=45296705</link><dc:creator>kiratp</dc:creator><comments>https://news.ycombinator.com/item?id=45296705</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45296705</guid></item><item><title><![CDATA[New comment by kiratp in "A postmortem of three recent issues"]]></title><description><![CDATA[
<p>This is due to RoPE scaling.<p>> All the notable open-source frameworks implement static YaRN, which means the scaling factor remains constant regardless of input length, potentially impacting performance on shorter texts. We advise adding the rope_scaling configuration only when processing long contexts is required. It is also recommended to modify the factor as needed. For example, if the typical context length for your application is 524,288 tokens, it would be better to set factor as 2.0.<p><a href="https://huggingface.co/Qwen/Qwen3-Next-80B-A3B-Thinking" rel="nofollow">https://huggingface.co/Qwen/Qwen3-Next-80B-A3B-Thinking</a></p>
]]></description><pubDate>Wed, 17 Sep 2025 22:45:14 +0000</pubDate><link>https://news.ycombinator.com/item?id=45282342</link><dc:creator>kiratp</dc:creator><comments>https://news.ycombinator.com/item?id=45282342</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45282342</guid></item><item><title><![CDATA[New comment by kiratp in "Claude now has access to a server-side container environment"]]></title><description><![CDATA[
<p>Hardware can be the same but scheduling is a whole different beast.<p>Also, if you pull too manny resources from training your next model to make inference revenue today, you’ll fall behind in the larger race.</p>
]]></description><pubDate>Tue, 09 Sep 2025 18:38:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=45186553</link><dc:creator>kiratp</dc:creator><comments>https://news.ycombinator.com/item?id=45186553</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45186553</guid></item><item><title><![CDATA[New comment by kiratp in "Claude now has access to a server-side container environment"]]></title><description><![CDATA[
<p>> Importantly, we never intentionally degrade model quality as a result of demand or other factors, and the issues mentioned above stem from unrelated bugs.<p>Things they could do that would not technically contradict that:<p>- Quantize KV cache<p>- Data aware model quantization where their own evals will show "equivalent perf" but the overall model quality suffers.<p>Simple fact is that it takes longer to deploy physical compute but somehow they are able to serve more and more inference from a slowly growing pool of hardware. Something has to give...</p>
]]></description><pubDate>Tue, 09 Sep 2025 17:54:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=45185705</link><dc:creator>kiratp</dc:creator><comments>https://news.ycombinator.com/item?id=45185705</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45185705</guid></item><item><title><![CDATA[New comment by kiratp in "US economy added just 22,000 jobs in August, unemployment highest in 4 yrs"]]></title><description><![CDATA[
<p>Source?</p>
]]></description><pubDate>Fri, 05 Sep 2025 15:02:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=45139386</link><dc:creator>kiratp</dc:creator><comments>https://news.ycombinator.com/item?id=45139386</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45139386</guid></item></channel></rss>