<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Hacker News: benbencodes</title><link>https://news.ycombinator.com/user?id=benbencodes</link><description>Hacker News RSS</description><docs>https://hnrss.org/</docs><generator>hnrss v2.1.1</generator><lastBuildDate>Wed, 20 May 2026 08:07:34 +0000</lastBuildDate><atom:link href="https://hnrss.org/user?id=benbencodes" rel="self" type="application/rss+xml"></atom:link><item><title><![CDATA[New comment by benbencodes in "Gemini 3.5 Flash"]]></title><description><![CDATA[
<p>Pricing is now live on ai.google.dev/pricing:<p>Gemini 3.5 Flash: $0.75 input / $4.50 output per 1M tokens, 1M context window. The output price explicitly "includes thinking tokens" — which is why it's higher than a typical flash-class model.<p>For comparison within the Gemini lineup:
- Gemini 2.5 Flash: $0.30 / $2.50
- Gemini 3.1 Flash-Lite: $0.25 / $1.50
- Gemini 3.1 Pro Preview: $2.00 / $12.00<p>So 3.5 Flash is ~2.5x more expensive input vs 2.5 Flash. The pricing and "including thinking tokens" framing position it as a reasoning-capable flash model rather than just a pure speed optimization.</p>
]]></description><pubDate>Tue, 19 May 2026 18:20:34 +0000</pubDate><link>https://news.ycombinator.com/item?id=48197164</link><dc:creator>benbencodes</dc:creator><comments>https://news.ycombinator.com/item?id=48197164</comments><guid isPermaLink="false">https://news.ycombinator.com/item?id=48197164</guid></item></channel></rss>