<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: __jl__</title><link>https://news.ycombinator.com/user?id=__jl__</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Sat, 18 Apr 2026 06:17:04 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=__jl__" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by __jl__ in "GPT-5.4"]]></title><description><![CDATA[
<p>I see your point. I do find Anthropic's approach cleaner, though, particularly when you add in mini and nano. That makes five models priced differently. Some share the same core name, others don't: GPT 5 nano, GPT 5 mini, GPT 5.1, GPT 5.2, GPT 5.4. And we are not even talking about thinking budgets.<p>But generally: these are not consumer-facing products, and I agree that someone who uses the API should be able to figure out the price points of the different models.</p>
]]></description><pubDate>Thu, 05 Mar 2026 22:27:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=47268163</link><dc:creator>__jl__</dc:creator><comments>https://news.ycombinator.com/item?id=47268163</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47268163</guid></item><item><title><![CDATA[New comment by __jl__ in "GPT-5.4"]]></title><description><![CDATA[
<p>What a model mess!<p>OpenAI now has three price points: GPT 5.1, GPT 5.2, and now GPT 5.4. Their version numbers jump across different model lines, with codex at 5.3 and what they now call instant also at 5.3.<p>Anthropic is really the only one that has managed to get this under control: three models, priced at three different levels, and new models are immediately available everywhere.<p>Google essentially only has preview models! The last GA release is 2.5. As a developer, I can either use an outdated model or have zero assurance that the model won't be discontinued within weeks.</p>
]]></description><pubDate>Thu, 05 Mar 2026 20:54:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=47267148</link><dc:creator>__jl__</dc:creator><comments>https://news.ycombinator.com/item?id=47267148</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47267148</guid></item><item><title><![CDATA[New comment by __jl__ in "Gemini 3.1 Pro"]]></title><description><![CDATA[
<p>Another preview release. Does that mean the models Google recommends for production are still 2.5 Flash and Pro? I'm not talking about what people are actually doing, just the Google recommendation. Kind of crazy if that is the case.</p>
]]></description><pubDate>Thu, 19 Feb 2026 16:32:55 +0000</pubDate><link>https://news.ycombinator.com/item?id=47075579</link><dc:creator>__jl__</dc:creator><comments>https://news.ycombinator.com/item?id=47075579</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=47075579</guid></item><item><title><![CDATA[New comment by __jl__ in "GPT-5.3-Codex"]]></title><description><![CDATA[
<p>Impressive jump for GPT-5.3-codex and crazy to see two top coding models come out on the same day...</p>
]]></description><pubDate>Thu, 05 Feb 2026 18:16:28 +0000</pubDate><link>https://news.ycombinator.com/item?id=46902772</link><dc:creator>__jl__</dc:creator><comments>https://news.ycombinator.com/item?id=46902772</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46902772</guid></item><item><title><![CDATA[New comment by __jl__ in "Unrolling the Codex agent loop"]]></title><description><![CDATA[
<p>Yes you can and I really like it as a feature. But it ties you to OpenAI…</p>
]]></description><pubDate>Sat, 24 Jan 2026 01:31:50 +0000</pubDate><link>https://news.ycombinator.com/item?id=46740225</link><dc:creator>__jl__</dc:creator><comments>https://news.ycombinator.com/item?id=46740225</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46740225</guid></item><item><title><![CDATA[New comment by __jl__ in "Gemini 3 Flash: Frontier intelligence built for speed"]]></title><description><![CDATA[
<p>I will have to try that. My Cursor bill got pretty high with Opus 4.5. I never considered Opus before the 4.5 price drop, but now it's hard to change... :)</p>
]]></description><pubDate>Wed, 17 Dec 2025 17:47:56 +0000</pubDate><link>https://news.ycombinator.com/item?id=46302889</link><dc:creator>__jl__</dc:creator><comments>https://news.ycombinator.com/item?id=46302889</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46302889</guid></item><item><title><![CDATA[New comment by __jl__ in "Gemini 3 Flash: Frontier intelligence built for speed"]]></title><description><![CDATA[
<p>Mostly at the time of release, except for 1.5 Flash, which got a price drop in Aug 2024.<p>Google has been discontinuing older models after a transition period of several months, so I would expect the same for the 2.5 models. But that process only starts once the release versions of the 3.x models are out (Pro and Flash are in preview right now).</p>
]]></description><pubDate>Wed, 17 Dec 2025 17:45:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=46302852</link><dc:creator>__jl__</dc:creator><comments>https://news.ycombinator.com/item?id=46302852</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46302852</guid></item><item><title><![CDATA[New comment by __jl__ in "Gemini 3 Flash: Frontier intelligence built for speed"]]></title><description><![CDATA[
<p>This is awesome. No preview release either, which is great for production.<p>They are pushing prices higher with each release, though:
API pricing is up to $0.50/M for input and $3.00/M for output.<p>For comparison:<p>Gemini 3.0 Flash: $0.50/M for input and $3.00/M for output<p>Gemini 2.5 Flash: $0.30/M for input and $2.50/M for output<p>Gemini 2.0 Flash: $0.15/M for input and $0.60/M for output<p>Gemini 1.5 Flash: $0.075/M for input and $0.30/M for output (after price drop)<p>Gemini 3.0 Pro: $2.00/M for input and $12.00/M for output<p>Gemini 2.5 Pro: $1.25/M for input and $10.00/M for output<p>Gemini 1.5 Pro: $1.25/M for input and $5.00/M for output<p>I think image input pricing went up even more.<p>Correction: It is a preview model...</p>
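To make the jump concrete, a quick back-of-the-envelope using the prices above (the 10k-input / 1k-output request size is an arbitrary assumption):

```python
# Rough per-request cost at the per-million-token prices quoted above.

def request_cost(input_price_per_m, output_price_per_m,
                 input_tokens=10_000, output_tokens=1_000):
    """Dollar cost of one request at the given $/M-token prices."""
    return (input_tokens * input_price_per_m +
            output_tokens * output_price_per_m) / 1_000_000

flash_20 = request_cost(0.15, 0.60)   # Gemini 2.0 Flash
flash_30 = request_cost(0.50, 3.00)   # Gemini 3.0 Flash
print(f"2.0 Flash: ${flash_20:.4f}  3.0 Flash: ${flash_30:.4f}  "
      f"({flash_30 / flash_20:.1f}x)")
```

At that token mix, 3.0 Flash comes out to roughly 3.8x the per-request cost of 2.0 Flash; the exact multiple shifts with the input/output ratio.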
]]></description><pubDate>Wed, 17 Dec 2025 16:58:25 +0000</pubDate><link>https://news.ycombinator.com/item?id=46302073</link><dc:creator>__jl__</dc:creator><comments>https://news.ycombinator.com/item?id=46302073</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=46302073</guid></item><item><title><![CDATA[New comment by __jl__ in "Gemini 3"]]></title><description><![CDATA[
<p>API pricing is up to $2/M for input and $12/M for output<p>For comparison:
Gemini 2.5 Pro was $1.25/M for input and $10/M for output
Gemini 1.5 Pro was $1.25/M for input and $5/M for output</p>
]]></description><pubDate>Tue, 18 Nov 2025 15:30:38 +0000</pubDate><link>https://news.ycombinator.com/item?id=45967561</link><dc:creator>__jl__</dc:creator><comments>https://news.ycombinator.com/item?id=45967561</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45967561</guid></item><item><title><![CDATA[New comment by __jl__ in "Gemini 3 Pro Model Card [pdf]"]]></title><description><![CDATA[
<p>API pricing is up to $2/M for input and $12/M for output<p>For comparison:
Gemini 2.5 Pro was $1.25/M for input and $10/M for output
Gemini 1.5 Pro was $1.25/M for input and $5/M for output</p>
]]></description><pubDate>Tue, 18 Nov 2025 15:30:03 +0000</pubDate><link>https://news.ycombinator.com/item?id=45967554</link><dc:creator>__jl__</dc:creator><comments>https://news.ycombinator.com/item?id=45967554</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45967554</guid></item><item><title><![CDATA[New comment by __jl__ in "Gemini 3 Pro Model Card [pdf]"]]></title><description><![CDATA[
<p>Same here. They have been aggressively increasing prices with each iteration (maybe because they started so low). I still hope that is not the case this time. GPT 5.1 is priced pretty aggressively, so maybe that is an incentive to keep the current Gemini API prices.</p>
]]></description><pubDate>Tue, 18 Nov 2025 14:05:52 +0000</pubDate><link>https://news.ycombinator.com/item?id=45966169</link><dc:creator>__jl__</dc:creator><comments>https://news.ycombinator.com/item?id=45966169</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45966169</guid></item><item><title><![CDATA[New comment by __jl__ in "GPT-5.1 for Developers"]]></title><description><![CDATA[
<p>The prompt caching change is awesome for any agent. Claude is far behind, with increased costs for caching and manual caching checkpoints. It certainly depends on your application, but prompt caching is also ignored in a lot of cost comparisons.</p>
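A minimal sketch of the difference (request shapes only, no network calls; the field names follow the two providers' public docs, and the model names are placeholders):

```python
# Anthropic requires an explicit cache checkpoint; OpenAI caches matching
# prompt prefixes automatically with no markers in the request.

def anthropic_style_request(system_prompt: str, user_msg: str) -> dict:
    """Anthropic: the caller marks where the cacheable prefix ends."""
    return {
        "model": "claude-sonnet-4-5",  # placeholder model name
        "system": [{
            "type": "text",
            "text": system_prompt,
            "cache_control": {"type": "ephemeral"},  # manual checkpoint
        }],
        "messages": [{"role": "user", "content": user_msg}],
    }

def openai_style_request(system_prompt: str, user_msg: str) -> dict:
    """OpenAI: no cache markers; identical prefixes are cached server-side."""
    return {
        "model": "gpt-5.1",  # placeholder model name
        "messages": [
            {"role": "system", "content": system_prompt},
            {"role": "user", "content": user_msg},
        ],
    }
```

The practical upshot for an agent: with automatic prefix caching you never have to decide where the stable part of a long, growing conversation ends.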
]]></description><pubDate>Thu, 13 Nov 2025 22:17:02 +0000</pubDate><link>https://news.ycombinator.com/item?id=45921377</link><dc:creator>__jl__</dc:creator><comments>https://news.ycombinator.com/item?id=45921377</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45921377</guid></item><item><title><![CDATA[New comment by __jl__ in "Cursor 1.7"]]></title><description><![CDATA[
<p>Since we have Cursor people joining, let me bring up my constant problems around applying code changes. For background, I mostly work with "chat":<p>1. The apply button does not appear. This used to be mostly a problem with Gemini 2.5 Pro and GPT-5, but now it sometimes happens with all models. Very annoying, because I have to apply the changes manually.<p>2. Cursor doesn't recognize which file to apply changes to and just uses the currently open file. Also very annoying, and it's impossible to redirect the changes to the file I actually wanted once they have been applied to the wrong one.</p>
]]></description><pubDate>Wed, 01 Oct 2025 17:22:36 +0000</pubDate><link>https://news.ycombinator.com/item?id=45440385</link><dc:creator>__jl__</dc:creator><comments>https://news.ycombinator.com/item?id=45440385</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=45440385</guid></item><item><title><![CDATA[New comment by __jl__ in "Making 2.5 Flash and 2.5 Pro GA, and introducing Gemini 2.5 Flash-Lite"]]></title><description><![CDATA[
<p>1.5 -> 2.0 was a price increase as well (double, I think, and something like 4x for image input)<p>Now 2.0 -> 2.5 is another hefty price increase.</p>
]]></description><pubDate>Tue, 17 Jun 2025 18:42:32 +0000</pubDate><link>https://news.ycombinator.com/item?id=44302396</link><dc:creator>__jl__</dc:creator><comments>https://news.ycombinator.com/item?id=44302396</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44302396</guid></item><item><title><![CDATA[New comment by __jl__ in "Cursor 1.0"]]></title><description><![CDATA[
<p>Same! :)</p>
]]></description><pubDate>Thu, 05 Jun 2025 10:51:59 +0000</pubDate><link>https://news.ycombinator.com/item?id=44190356</link><dc:creator>__jl__</dc:creator><comments>https://news.ycombinator.com/item?id=44190356</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44190356</guid></item><item><title><![CDATA[New comment by __jl__ in "voyage-3.5 and voyage-3.5-lite: improved quality for a new retrieval frontier"]]></title><description><![CDATA[
<p>Voyage models are great in my experience, and I am planning to test 3.5. I'm almost more interested in 3.5-lite, though. Great price.<p>My concern: the Voyage API has been unreliable. They were bought by MongoDB, which makes me a little uneasy.<p>The Gemini embedding model looks great, but it's in preview and there haven't been any updates for a while (including at I/O). I'm also not sure how committed Google is to embedding models.</p>
]]></description><pubDate>Sat, 24 May 2025 23:33:18 +0000</pubDate><link>https://news.ycombinator.com/item?id=44084418</link><dc:creator>__jl__</dc:creator><comments>https://news.ycombinator.com/item?id=44084418</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44084418</guid></item><item><title><![CDATA[New comment by __jl__ in "Claude 4"]]></title><description><![CDATA[
<p>Thanks. I looked a couple minutes ago and couldn't see it. For anyone curious, pricing remains the same as previous Anthropic models.</p>
]]></description><pubDate>Thu, 22 May 2025 16:46:44 +0000</pubDate><link>https://news.ycombinator.com/item?id=44063864</link><dc:creator>__jl__</dc:creator><comments>https://news.ycombinator.com/item?id=44063864</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44063864</guid></item><item><title><![CDATA[New comment by __jl__ in "Claude 4"]]></title><description><![CDATA[
<p>Anyone found information on API pricing?</p>
]]></description><pubDate>Thu, 22 May 2025 16:43:04 +0000</pubDate><link>https://news.ycombinator.com/item?id=44063808</link><dc:creator>__jl__</dc:creator><comments>https://news.ycombinator.com/item?id=44063808</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=44063808</guid></item><item><title><![CDATA[New comment by __jl__ in "OpenAI reaches agreement to buy Windsurf for $3B"]]></title><description><![CDATA[
<p>Here are my two cents on Cursor's versus Windsurf's approach:<p>CURSOR shifted to a more agentic approach, even for chat requests, to reduce input tokens.<p>Previously, they used the good old RAG pattern with code dumps: request with user-added files -> retrieval (when Codebase is enabled) -> LLM request with the combined context from the user and retrieval.<p>Now they seem to be doing something like this:
Request -> LLM with tools to search the code base and/or user-added files<p>I get constant search tool calls, even for user-added files. Big reduction in input tokens, but I think performance suffers as well.<p>WINDSURF is still willing to dump code into the context, which gives them an edge in some cases (presumably at the cost of input tokens).<p>Windsurf is willing to spend to acquire customers (lower subscription cost, higher expenses for LLM calls). Cursor has a huge customer base and is working on making it sustainable by a) reducing costs (see above) and b) increasing revenue (e.g. "Pro" requests for $0.05 with more input and output tokens).</p>
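The second flow can be sketched as a toy loop (the scripted model and two-file codebase are stand-ins, not Cursor internals):

```python
# Agentic context gathering: the model calls a search tool for targeted
# snippets instead of having the whole retrieval dump stuffed up front.

CODEBASE = {
    "auth.py": "def login(user): ...",
    "db.py": "def connect(): ...",
}

def fake_llm(messages):
    """Scripted stand-in model: search once, then answer from the result."""
    if not any(m["role"] == "tool" for m in messages):
        return {"tool_call": {"name": "search", "query": "login"}}
    return {"answer": "login() lives in auth.py"}

def search(query):
    """Tool: return only matching files, not the whole codebase."""
    return {f: src for f, src in CODEBASE.items() if query in src}

def agent_loop(request):
    messages = [{"role": "user", "content": request}]
    while True:
        reply = fake_llm(messages)
        if "answer" in reply:
            return reply["answer"]
        hits = search(reply["tool_call"]["query"])
        messages.append({"role": "tool", "content": str(hits)})

print(agent_loop("where is login handled?"))  # prints: login() lives in auth.py
```

Each loop iteration adds only the snippets the model asked for, which is exactly where the input-token savings (and the risk of missing context) come from.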
]]></description><pubDate>Tue, 06 May 2025 13:18:30 +0000</pubDate><link>https://news.ycombinator.com/item?id=43904767</link><dc:creator>__jl__</dc:creator><comments>https://news.ycombinator.com/item?id=43904767</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43904767</guid></item><item><title><![CDATA[New comment by __jl__ in "Google Gemini has the worst LLM API"]]></title><description><![CDATA[
<p>The only problem is that the genai API at <a href="https://ai.google.dev" rel="nofollow">https://ai.google.dev</a> is far less reliable and can be problematic for production use cases. Right around the time Gemini 2.0 launched, it was down for days on end without any communication. They are putting a lot of effort into improving it, but it's much less reliable than OpenAI's, which matters for production. They can also reject your request based on overall system load (not your individual limits), which is very unpredictable. They advertise 2000 requests per minute; when I tried several weeks ago, I couldn't even get 500 per minute.</p>
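For what it's worth, the standard mitigation for load-based rejections is exponential backoff with jitter. A self-contained sketch (flaky_call is a stand-in for the real Gemini request, and its failure pattern is made up):

```python
import random
import time

def flaky_call(attempt, fail_until=3):
    """Stand-in for the API call: overloaded for the first few attempts."""
    if attempt < fail_until:
        raise RuntimeError("503: model overloaded")
    return "ok"

def call_with_backoff(max_attempts=6, base=0.01):
    for attempt in range(max_attempts):
        try:
            return flaky_call(attempt)
        except RuntimeError:
            if attempt == max_attempts - 1:
                raise  # out of retries, surface the error
            # exponential backoff with jitter so clients don't retry in lockstep
            time.sleep(base * (2 ** attempt) * random.uniform(0.5, 1.5))

print(call_with_backoff())  # prints: ok
```

Backoff helps with transient shedding, but it obviously can't fix multi-day outages or make the advertised rate limits real.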
]]></description><pubDate>Sun, 04 May 2025 14:53:46 +0000</pubDate><link>https://news.ycombinator.com/item?id=43887058</link><dc:creator>__jl__</dc:creator><comments>https://news.ycombinator.com/item?id=43887058</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=43887058</guid></item></channel></rss>