Hacker News: jpau

Amazon Strikes $6B Deal with Snowflake for Agentic Computing Chips

jpau — Thu, 28 May 2026 00:10:04 +0000

Article URL: https://www.wsj.com/tech/amazon-strikes-6-billion-deal-with-snowflake-for-its-agentic-computing-chips-d04114d8

Comments URL: https://news.ycombinator.com/item?id=48302517

Points: 4

# Comments: 0

New comment by jpau in "Gemini 3.5 Flash: frontier intelligence with action"

jpau — Tue, 19 May 2026 18:26:27 +0000

Standard pricing is showing for me as $1.50 / $9.

(I suspect you're viewing the "flex" pricing).

New comment by jpau in "Launch HN: TeamOut (YC W22) – AI agent for planning company retreats"

jpau — Wed, 25 Feb 2026 17:47:47 +0000

> For venue recommendations [...] we do not rely purely on the language model. We embed both user requirements and venues into vector representations and retrieve candidates using similarity search. Hard constraints such as capacity and dates are applied first, and results are ranked before being presented.

Huh, this surprised me. It seems like a forgone opportunity.

I heard second-hand about the process for organizing our last offsite. Searching for venues was not the time-consuming part.

The time-consuming part was actually engaging with the venues to confirm specific details not available online. Our teammate who did this engaged with _hundreds_ of venues. It was a lot of work on their part ... and probably not the most fun part of their job.

That seems like an ideal agent scenario?
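For reference, the retrieval flow in the quoted passage (hard constraints first, then similarity ranking) could be sketched roughly like this. It is a toy illustration under my own assumptions, not TeamOut's implementation: the `Venue` fields, the `recommend` function, the venue names, and the three-dimensional vectors are all hypothetical stand-ins for real embeddings.

```python
from dataclasses import dataclass
from math import sqrt


@dataclass
class Venue:
    name: str
    capacity: int
    available_dates: set[str]
    embedding: list[float]  # hypothetical precomputed vector


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = sqrt(sum(x * x for x in a)) * sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def recommend(venues, query_embedding, headcount, date, top_k=3):
    # Step 1: apply hard constraints (capacity and date) before any ranking.
    eligible = [
        v for v in venues
        if v.capacity >= headcount and date in v.available_dates
    ]
    # Step 2: rank the survivors by similarity to the request embedding.
    ranked = sorted(
        eligible,
        key=lambda v: cosine(v.embedding, query_embedding),
        reverse=True,
    )
    return ranked[:top_k]


# Made-up data purely for illustration.
venues = [
    Venue("Lakeside Lodge", 40, {"2026-06-12"}, [0.9, 0.1, 0.0]),
    Venue("City Loft", 15, {"2026-06-12"}, [0.8, 0.2, 0.1]),
    Venue("Mountain Ranch", 60, {"2026-06-12"}, [0.1, 0.9, 0.2]),
    Venue("Beach House", 50, {"2026-07-01"}, [0.9, 0.1, 0.1]),
]
query = [1.0, 0.0, 0.0]
results = recommend(venues, query, headcount=30, date="2026-06-12")
print([v.name for v in results])  # ['Lakeside Lodge', 'Mountain Ranch']
```

Note that nothing in this flow confirms details that are not already in the venue data, which is the part I'd expect an agent to help with.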

New comment by jpau in "GPT-5.3-Codex"

jpau — Thu, 05 Feb 2026 20:34:38 +0000

Interesting that this was released without a prior GPT-5.3 release. I wonder if that means we won't see a GPT-5.3?

New comment by jpau in "Tell HN: Google increased existing finetuned model latency by 5x"

jpau — Thu, 27 Nov 2025 00:27:16 +0000

Hey, we're also a Vertex tuning customer in a similar spot. We're seeing other capacity issues, although not a leap in latency. Can you DM me? I'd love to trade notes. https://x.com/hellofromjames

New comment by jpau in "Why isn't everyone using Cerebras?"

jpau — Sat, 15 Nov 2025 00:39:42 +0000

I love Cerebras. I also love that they've started to scale rate limits to useful levels (which is relatively new).

I still don't know how long they'll support our chosen model.

On Oct 22 I got an email saying that:

```
- qwen-3-coder-480b will be available until Nov 5, 2025
- qwen-3-235b-a22b-thinking-2507 will be available until Nov 14, 2025
```

That's not a lot of notice!

I don't want to spend all my time benchmarking new models for features I already built. I don't want my users' experience to be disturbed every few months.

The Irony of the LLM Treadmill

jpau — Sun, 02 Nov 2025 15:19:01 +0000

Article URL: https://www.jamespeterson.blog/p/the-irony-of-the-llm-treadmill

Comments URL: https://news.ycombinator.com/item?id=45790930

Points: 2

# Comments: 0

The Irony of the LLM Treadmill

jpau — Thu, 30 Oct 2025 12:42:02 +0000

Article URL: https://www.jamespeterson.blog/p/the-irony-of-the-llm-treadmill

Comments URL: https://news.ycombinator.com/item?id=45759385

Points: 2

# Comments: 0

The Irony of the LLM Treadmill

jpau — Wed, 29 Oct 2025 14:09:08 +0000

Article URL: https://www.jamespeterson.blog/p/the-irony-of-the-llm-treadmill

Comments URL: https://news.ycombinator.com/item?id=45747049

Points: 3

# Comments: 0

New comment by jpau in "Show HN: Vibe Linking"

jpau — Wed, 24 Sep 2025 20:20:33 +0000

> A URL shortener that runs a lightweight model (gemini-1.5-flash)

I think gemini-1.5-flash is EOL'd as of tomorrow (Sep 25th): https://cloud.google.com/vertex-ai/generative-ai/docs/learn/...

RIP gemini-1.5

AI coding: plateauing but also accelerating

jpau — Thu, 21 Aug 2025 19:31:08 +0000

Article URL: https://ghiculescu.substack.com/p/ai-coding-plateauing-but-also-accelerating

Comments URL: https://news.ycombinator.com/item?id=44977014

Points: 2

# Comments: 1

New comment by jpau in "Claude Sonnet 4 now supports 1M tokens of context"

jpau — Tue, 12 Aug 2025 20:57:32 +0000

Google[1] also has a "long context" pricing structure. OpenAI may be considering something similar, since they do not offer their priority processing SLAs[2] for context >128K.

[1] https://cloud.google.com/vertex-ai/generative-ai/pricing

[2] https://openai.com/api-priority-processing/

New comment by jpau in "Claude 4"

jpau — Thu, 22 May 2025 20:23:11 +0000

Interesting!

Is there anything to read into needing twice the "Avg Attempts", or is this column relatively uninteresting in the overall context of the bench?

New comment by jpau in "Claude 4"

jpau — Thu, 22 May 2025 20:18:39 +0000

Seems to be a nod to each size being treated as its own product.

Claude 3 arrived as a family (Haiku, Sonnet, Opus), but no release since has included all three sizes.

A release of "claude-3-7-sonnet" alone seems incomplete without Haiku/Opus, when perhaps Sonnet has its own development roadmap (claude-sonnet-*).

New comment by jpau in "Ask HN: I'm an MIT senior and still unemployed – and so are most of my friends"

jpau — Mon, 07 Apr 2025 19:09:35 +0000

Sorry to hear about the challenge.

You and your friends should email me with your resume and anything you're proud to have built. I'll extend that to any MIT senior/recent grad who wants to discuss moving to SF and helping us apply LLMs to build product features that solve interesting customer problems.

I'm at james.peterson@fathom.video. Include "[responding to HN thread 43614795]" in the title. I'd love to chat.

New comment by jpau in "BigQuery pricing model cost us $10k in 22 seconds"

jpau — Tue, 25 Mar 2025 16:57:54 +0000

I am grateful for GCP's quotas that help us prevent similar own-goals.

While this specific error is something we know to avoid, I'm sure quotas have helped us avoid the pain of other errors. So I'm somewhat sympathetic.

I think it's important to read the post's language and judgements in the context of someone who just got a large, unexpected bill (an expensive lesson).

New comment by jpau in "Ask HN: How do people create those sleek looking demos for startups?"

jpau — Thu, 02 May 2024 03:14:05 +0000

I use screen.studio

New comment by jpau in "Tell HN: Anthropic's Claude Instant price cut by ~half [pdf]"

jpau — Wed, 13 Dec 2023 05:47:32 +0000

I noticed Anthropic updated their prices, but haven't seen this posted anywhere.

Claude Instant is now 10% of Claude 2's pricing: $0.80 per million input tokens and $2.40 per million completion tokens (down from, I think, $1.63 and $5.51 respectively).
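A quick sanity check on the "~half" figure, as a rough sketch. The old prices are the ones I recalled above and may be off, and the Claude 2 prices are only what the "10%" ratio implies rather than figures quoted here.

```python
# Prices in USD per million tokens.
old = {"input": 1.63, "completion": 5.51}  # recalled from memory, may be off
new = {"input": 0.80, "completion": 2.40}

# Percentage reduction for each token type.
cut = {kind: (1 - new[kind] / old[kind]) * 100 for kind in new}

# Claude 2 prices implied by "Claude Instant is now 10% of Claude 2's pricing".
implied_claude_2 = {kind: round(new[kind] * 10, 2) for kind in new}

for kind in new:
    print(f"{kind}: {cut[kind]:.1f}% cheaper, implied Claude 2 price ${implied_claude_2[kind]:.2f}")
```

With those numbers the cut works out to roughly 51% on input and 56% on completion tokens.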

Tell HN: Anthropic's Claude Instant price cut by ~half [pdf]

jpau — Wed, 13 Dec 2023 05:47:32 +0000

Article URL: https://www-files.anthropic.com/production/images/model_pricing_dec2023.pdf

Comments URL: https://news.ycombinator.com/item?id=38623199

Points: 1

# Comments: 1

New comment by jpau in "OpenAI plans major updates to lure developers with lower costs"

jpau — Thu, 12 Oct 2023 05:10:45 +0000

Altman mentioned[1][2] earlier that they were working on a "stateful" API for release this year.

> 2023: A stateful API — When you call the chat API today, you have to repeatedly pass through the same conversation history and pay for the same tokens again and again. In the future there will be a version of the API that remembers the conversation history.

Maybe it's a RAG-based thing, but that'd be underwhelming given the promise.

Wizard of Oz, or true magic?

(In the same interview, Altman also claimed progress toward releasing million-token context windows this year. Wowzers)

[1] https://humanloop.com/blog/openai-plans, removed at OAI's request

[2] Archived at https://web.archive.org/web/20230531203946/https://humanloop...
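To put rough numbers on the "pay for the same tokens again and again" point in the quote, here is a minimal sketch. The token counts per turn are made up, and the "stateful" figure assumes such an API would bill only the new tokens each turn, which the quote implies but does not confirm.

```python
def stateless_input_tokens(turns: int, user_tokens: int, reply_tokens: int) -> int:
    """Input tokens billed when every request resends the full history."""
    total = 0
    history = 0  # tokens accumulated from earlier turns
    for _ in range(turns):
        total += history + user_tokens          # resend history plus the new message
        history += user_tokens + reply_tokens   # the reply joins the history too
    return total


def stateful_input_tokens(turns: int, user_tokens: int) -> int:
    """Hypothetical: the server remembers history, so only new tokens are billed."""
    return turns * user_tokens


# Illustrative conversation: 10 turns, 50-token messages, 150-token replies.
stateless = stateless_input_tokens(turns=10, user_tokens=50, reply_tokens=150)
stateful = stateful_input_tokens(turns=10, user_tokens=50)
print(stateless, stateful, stateless / stateful)  # 9500 500 19.0
```

The stateless total grows quadratically with the number of turns, which is why a truly stateful API would matter more than a retrieval layer bolted on top.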