<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: jpau</title><link>https://news.ycombinator.com/user?id=jpau</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Wed, 10 Jun 2026 02:24:33 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=jpau" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[Amazon Strikes $6B Deal with Snowflake for Agentic Computing Chips]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.wsj.com/tech/amazon-strikes-6-billion-deal-with-snowflake-for-its-agentic-computing-chips-d04114d8">https://www.wsj.com/tech/amazon-strikes-6-billion-deal-with-snowflake-for-its-agentic-computing-chips-d04114d8</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=48302517">https://news.ycombinator.com/item?id=48302517</a></p>
<p>Points: 4</p>
<p># Comments: 0</p>
]]></description><pubDate>Thu, 28 May 2026 00:10:04 +0000</pubDate><link>https://www.wsj.com/tech/amazon-strikes-6-billion-deal-with-snowflake-for-its-agentic-computing-chips-d04114d8</link><dc:creator>jpau</dc:creator><comments>https://news.ycombinator.com/item?id=48302517</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48302517</guid></item><item><title><![CDATA[New comment by jpau in "Gemini 3.5 Flash: frontier intelligence with action"]]></title><description><![CDATA[
<p>Standard pricing is showing for me as $1.50 / $9.<p>(I suspect you're viewing the "flex" pricing).</p>
]]></description><pubDate>Tue, 19 May 2026 18:26:27 +0000</pubDate><link>https://news.ycombinator.com/item?id=48197249</link><dc:creator>jpau</dc:creator><comments>https://news.ycombinator.com/item?id=48197249</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48197249</guid></item><item><title><![CDATA[New comment by jpau in "Launch HN: TeamOut (YC W22) – AI agent for planning company retreats"]]></title><description><![CDATA[
<p>> For venue recommendations [...] we do not rely purely on the language model. We embed both user requirements and venues into vector representations and retrieve candidates using similarity search. Hard constraints such as capacity and dates are applied first, and results are ranked before being presented.<p>Huh this surprised me as a forgone opportunity.<p>I heard second-hand about the process for organizing our last offsite. Searching for venues was not the time-consuming part.<p>The time-consuming part was actually engaging with the venues to confirm specific details not available online. Our teammate who did this engaged with _hundreds_ of venues. It was a lot of work on their part ... and probably not the most fun part of their job.<p>That seems like an ideal agent scenario?</p>
]]></description><pubDate>Wed, 25 Feb 2026 17:47:47 +0000</pubDate><link>https://news.ycombinator.com/item?id=47154961</link><dc:creator>jpau</dc:creator><comments>https://news.ycombinator.com/item?id=47154961</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47154961</guid></item><item><title><![CDATA[New comment by jpau in "GPT-5.3-Codex"]]></title><description><![CDATA[
<p>Interesting that this was released without a prior GPT-5.3 release. I wonder if that means we won't see a GPT-5.3?</p>
]]></description><pubDate>Thu, 05 Feb 2026 20:34:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=46904842</link><dc:creator>jpau</dc:creator><comments>https://news.ycombinator.com/item?id=46904842</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46904842</guid></item><item><title><![CDATA[New comment by jpau in "Tell HN: Google increased existing finetuned model latency by 5x"]]></title><description><![CDATA[
<p>Hey we're also a Vertex tuning customer in a similar spot. We're seeing other capacity issues, although not a leap in latency. Can you DM me? I'd love to trade notes. <a href="https://x.com/hellofromjames" rel="nofollow">https://x.com/hellofromjames</a></p>
]]></description><pubDate>Thu, 27 Nov 2025 00:27:16 +0000</pubDate><link>https://news.ycombinator.com/item?id=46063962</link><dc:creator>jpau</dc:creator><comments>https://news.ycombinator.com/item?id=46063962</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46063962</guid></item><item><title><![CDATA[New comment by jpau in "Why isn't everyone using Cerebras?"]]></title><description><![CDATA[
<p>I love Cerebras. I also love that they've started to scale rate limits to useful levels (which is relatively new).<p>I still don't know how long they'll support our chosen model.<p>On Oct 22 I got an email saying that<p>```<p>- qwen-3-coder-480b will be available until Nov 5, 2025<p>- qwen-3-235b-a22b-thinking-2507 will be available until Nov 14, 2025<p>```<p>That's not a lot of notice!<p>I don't want to spend all my time benchmarking new models for features I already built. I don't want my users' experience to be disturbed every few months.</p>
]]></description><pubDate>Sat, 15 Nov 2025 00:39:42 +0000</pubDate><link>https://news.ycombinator.com/item?id=45933959</link><dc:creator>jpau</dc:creator><comments>https://news.ycombinator.com/item?id=45933959</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45933959</guid></item><item><title><![CDATA[The Irony of the LLM Treadmill]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.jamespeterson.blog/p/the-irony-of-the-llm-treadmill">https://www.jamespeterson.blog/p/the-irony-of-the-llm-treadmill</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=45790930">https://news.ycombinator.com/item?id=45790930</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Sun, 02 Nov 2025 15:19:01 +0000</pubDate><link>https://www.jamespeterson.blog/p/the-irony-of-the-llm-treadmill</link><dc:creator>jpau</dc:creator><comments>https://news.ycombinator.com/item?id=45790930</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45790930</guid></item><item><title><![CDATA[The Irony of the LLM Treadmill]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.jamespeterson.blog/p/the-irony-of-the-llm-treadmill">https://www.jamespeterson.blog/p/the-irony-of-the-llm-treadmill</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=45759385">https://news.ycombinator.com/item?id=45759385</a></p>
<p>Points: 2</p>
<p># Comments: 0</p>
]]></description><pubDate>Thu, 30 Oct 2025 12:42:02 +0000</pubDate><link>https://www.jamespeterson.blog/p/the-irony-of-the-llm-treadmill</link><dc:creator>jpau</dc:creator><comments>https://news.ycombinator.com/item?id=45759385</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45759385</guid></item><item><title><![CDATA[The Irony of the LLM Treadmill]]></title><description><![CDATA[
<p>Article URL: <a href="https://www.jamespeterson.blog/p/the-irony-of-the-llm-treadmill">https://www.jamespeterson.blog/p/the-irony-of-the-llm-treadmill</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=45747049">https://news.ycombinator.com/item?id=45747049</a></p>
<p>Points: 3</p>
<p># Comments: 0</p>
]]></description><pubDate>Wed, 29 Oct 2025 14:09:08 +0000</pubDate><link>https://www.jamespeterson.blog/p/the-irony-of-the-llm-treadmill</link><dc:creator>jpau</dc:creator><comments>https://news.ycombinator.com/item?id=45747049</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45747049</guid></item><item><title><![CDATA[New comment by jpau in "Show HN: Vibe Linking"]]></title><description><![CDATA[
<p>> A URL shortener that runs a lightweight model (gemini-1.5-flash)<p>I think gemini-1.5-flash is EOL'd from tomorrow (Sep 25th)
<a href="https://cloud.google.com/vertex-ai/generative-ai/docs/learn/model-versions" rel="nofollow">https://cloud.google.com/vertex-ai/generative-ai/docs/learn/...</a><p>RIP gemini-1.5</p>
]]></description><pubDate>Wed, 24 Sep 2025 20:20:33 +0000</pubDate><link>https://news.ycombinator.com/item?id=45365498</link><dc:creator>jpau</dc:creator><comments>https://news.ycombinator.com/item?id=45365498</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45365498</guid></item><item><title><![CDATA[AI coding: plateauing but also accelerating]]></title><description><![CDATA[
<p>Article URL: <a href="https://ghiculescu.substack.com/p/ai-coding-plateauing-but-also-accelerating">https://ghiculescu.substack.com/p/ai-coding-plateauing-but-also-accelerating</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=44977014">https://news.ycombinator.com/item?id=44977014</a></p>
<p>Points: 2</p>
<p># Comments: 1</p>
]]></description><pubDate>Thu, 21 Aug 2025 19:31:08 +0000</pubDate><link>https://ghiculescu.substack.com/p/ai-coding-plateauing-but-also-accelerating</link><dc:creator>jpau</dc:creator><comments>https://news.ycombinator.com/item?id=44977014</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44977014</guid></item><item><title><![CDATA[New comment by jpau in "Claude Sonnet 4 now supports 1M tokens of context"]]></title><description><![CDATA[
<p>Google[1] also has a "long context" pricing structure. OpenAI may be considering something similar, since they do not offer their priority processing SLAs[2] for context >128K.<p>[1] <a href="https://cloud.google.com/vertex-ai/generative-ai/pricing" rel="nofollow">https://cloud.google.com/vertex-ai/generative-ai/pricing</a><p>[2] <a href="https://openai.com/api-priority-processing/" rel="nofollow">https://openai.com/api-priority-processing/</a></p>
]]></description><pubDate>Tue, 12 Aug 2025 20:57:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=44881736</link><dc:creator>jpau</dc:creator><comments>https://news.ycombinator.com/item?id=44881736</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44881736</guid></item><item><title><![CDATA[New comment by jpau in "Claude 4"]]></title><description><![CDATA[
<p>Interesting!<p>Is there anything to read into needing twice the "Avg Attempts", or is this column relatively uninteresting in the overall context of the bench?</p>
]]></description><pubDate>Thu, 22 May 2025 20:23:11 +0000</pubDate><link>https://news.ycombinator.com/item?id=44066461</link><dc:creator>jpau</dc:creator><comments>https://news.ycombinator.com/item?id=44066461</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44066461</guid></item><item><title><![CDATA[New comment by jpau in "Claude 4"]]></title><description><![CDATA[
<p>Seems to be a nod to each size being treated as its own product.<p>Claude 3 arrived as a family (Haiku, Sonnet, Opus), but no release since has included all three sizes.<p>A release of "claude-3-7-sonnet" alone seems incomplete without Haiku/Opus, when perhaps Sonnet has its own development roadmap (claude-sonnet-*).</p>
]]></description><pubDate>Thu, 22 May 2025 20:18:39 +0000</pubDate><link>https://news.ycombinator.com/item?id=44066419</link><dc:creator>jpau</dc:creator><comments>https://news.ycombinator.com/item?id=44066419</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44066419</guid></item><item><title><![CDATA[New comment by jpau in "Ask HN: I'm an MIT senior and still unemployed – and so are most of my friends"]]></title><description><![CDATA[
<p>Sorry to hear about the challenge.<p>You and your friends should email me with your resume and anything you're proud to have built. I'll extend that to any MIT senior/recent grad who wants to discuss moving to SF and helping us apply LLMs to build product features that solve interesting customer problems.<p>I'm at james.peterson@fathom.video. Include "[responding to HN thread 43614795]" in the title. I'd love to chat.</p>
]]></description><pubDate>Mon, 07 Apr 2025 19:09:35 +0000</pubDate><link>https://news.ycombinator.com/item?id=43614795</link><dc:creator>jpau</dc:creator><comments>https://news.ycombinator.com/item?id=43614795</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43614795</guid></item><item><title><![CDATA[New comment by jpau in "BigQuery pricing model cost us $10k in 22 seconds"]]></title><description><![CDATA[
<p>I am grateful for GCP's quotas that help us prevent similar own-goals.<p>While this specific error is something we know to avoid, I'm sure quotas have helped us avoid the pain of other errors. So I'm somewhat sympathetic.<p>I think it's important to read the post's language and judgements in the context of someone who just got a large unexpected bill (an expensive lesson).</p>
]]></description><pubDate>Tue, 25 Mar 2025 16:57:54 +0000</pubDate><link>https://news.ycombinator.com/item?id=43473449</link><dc:creator>jpau</dc:creator><comments>https://news.ycombinator.com/item?id=43473449</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43473449</guid></item><item><title><![CDATA[New comment by jpau in "Ask HN: How do people create those sleek looking demos for startups?"]]></title><description><![CDATA[
<p>I use screen.studio</p>
]]></description><pubDate>Thu, 02 May 2024 03:14:05 +0000</pubDate><link>https://news.ycombinator.com/item?id=40232333</link><dc:creator>jpau</dc:creator><comments>https://news.ycombinator.com/item?id=40232333</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=40232333</guid></item><item><title><![CDATA[New comment by jpau in "Tell HN: Anthropic's Claude Instant price cut by ~half [pdf]"]]></title><description><![CDATA[
<p>I noticed Anthropic updated their prices, but haven't seen this posted anywhere.<p>Claude Instant is now 10% of Claude 2's pricing: $0.80 per million input tokens, and $2.40 per million completion tokens (down from I think $1.63 and $5.51 respectively).</p>
]]></description><pubDate>Wed, 13 Dec 2023 05:47:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=38623200</link><dc:creator>jpau</dc:creator><comments>https://news.ycombinator.com/item?id=38623200</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38623200</guid></item><item><title><![CDATA[Tell HN: Anthropic's Claude Instant price cut by ~half [pdf]]]></title><description><![CDATA[
<p>Article URL: <a href="https://www-files.anthropic.com/production/images/model_pricing_dec2023.pdf">https://www-files.anthropic.com/production/images/model_pricing_dec2023.pdf</a></p>
<p>Comments URL: <a href="https://news.ycombinator.com/item?id=38623199">https://news.ycombinator.com/item?id=38623199</a></p>
<p>Points: 1</p>
<p># Comments: 1</p>
]]></description><pubDate>Wed, 13 Dec 2023 05:47:32 +0000</pubDate><link>https://www-files.anthropic.com/production/images/model_pricing_dec2023.pdf</link><dc:creator>jpau</dc:creator><comments>https://news.ycombinator.com/item?id=38623199</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=38623199</guid></item><item><title><![CDATA[New comment by jpau in "OpenAI plans major updates to lure developers with lower costs"]]></title><description><![CDATA[
<p>Altman mentioned[1][2] earlier that they were working on a "stateful" API for release this year.<p>> 2023: A stateful API — When you call the chat API today, you have to repeatedly pass through the same conversation history and pay for the same tokens again and again. In the future there will be a version of the API that remembers the conversation history.<p>Maybe it's a RAG-based thing, but that'd be underwhelming given the promise.<p>Wizard of Oz, or true magic?<p>(In the same interview, Altman also claimed progress toward releasing million-token context windows this year. Wowzers)<p>[1] <a href="https://humanloop.com/blog/openai-plans">https://humanloop.com/blog/openai-plans</a>, removed at OAI's request<p>[2] Archived at <a href="https://web.archive.org/web/20230531203946/https://humanloop.com/blog/openai-plans" rel="nofollow noreferrer">https://web.archive.org/web/20230531203946/https://humanloop...</a></p>
]]></description><pubDate>Thu, 12 Oct 2023 05:10:45 +0000</pubDate><link>https://news.ycombinator.com/item?id=37853649</link><dc:creator>jpau</dc:creator><comments>https://news.ycombinator.com/item?id=37853649</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=37853649</guid></item></channel></rss>