Gemini 3.5 Flash - Hacker News

Posted by spectraldrift 18 hours ago

https://ai.google.dev/gemini-api/docs/models/gemini-3.5-flas...

827 points | 571 comments

jdw64 16 hours ago

Honestly, I feel like the new Gemini 3.5 Flash is a failure. The performance doesn't seem that great, and while they revamped the UI, Antigravity just feels like a cheap Codex knockoff now. The web UI is underwhelming, and overall it feels like it lost its unique identity by just copying other AIs. It’s a flop in both performance and price point. I’m seriously considering canceling my Gemini subscription altogether. Using Chinese AI models might actually be a better option at this point.

warthog 17 hours ago

GPT-5.5 still seems to perform better than this on the benchmarks.

Plus the vibe of the Gemini models is so weird, particularly when it comes to tool calling.

At this point I kinda need them to shock me to make the switch

Fairburn 14 hours ago

Google shot its shot with that alternative history artwork generation fiasco. Don't know why anyone would be too hot for them now. Dime a dozen at this point.

qgin 14 hours ago

I think the number of people still holding a grudge for that today is small.

arjie 13 hours ago

Early Claude was a weak simulation of Goody2.ai. Things change. Being a lover or hater of a model doesn’t make sense. It’s just tech. Run evals. Then use.

helloplanets 12 hours ago

Nano Banana is one of the most used image gen models

AgentMasterRace 8 hours ago

Gemini 3.1 probation is literally the worst AI when I cycle from Opus to GPT-5.5 then finally Gemini. It's actually insane that it's a frontier model. I rage at it more than my wife.

benbencodes 18 hours ago

Pricing is now live on ai.google.dev/pricing:

Gemini 3.5 Flash: $0.75 input / $4.50 output per 1M tokens, 1M context window. The output price explicitly "includes thinking tokens" — which is why it's higher than a typical flash-class model.

For comparison within the Gemini lineup:

- Gemini 2.5 Flash: $0.30 / $2.50
- Gemini 3.1 Flash-Lite: $0.25 / $1.50
- Gemini 3.1 Pro Preview: $2.00 / $12.00

So 3.5 Flash is about 2.5x more expensive on input than 2.5 Flash. The pricing and "including thinking tokens" framing position it as a reasoning-capable flash model rather than just a pure speed optimization.
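
To make the "includes thinking tokens" point concrete, here is a rough Python sketch of how a per-request cost could be estimated. It uses the rates quoted in this comment, which replies in the thread dispute as being batch or flex rates rather than standard ones. It also assumes thinking tokens are billed at the output rate, which is one reading of the quoted phrase. The function name and the example token counts are hypothetical.

```python
# USD per 1M tokens, as quoted in the comment above (not verified here;
# replies in the thread say these are batch/flex rates, not standard).
INPUT_PRICE_PER_M = 0.75
OUTPUT_PRICE_PER_M = 4.50


def request_cost(input_tokens: int, visible_output_tokens: int, thinking_tokens: int) -> float:
    """Estimate the USD cost of a single request.

    Assumption: thinking tokens are billed at the same rate as visible
    output tokens, so both are summed before applying the output price.
    """
    billed_output = visible_output_tokens + thinking_tokens
    total = input_tokens * INPUT_PRICE_PER_M + billed_output * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 10k input, 1k visible output, 4k thinking tokens.
    with_thinking = request_cost(10_000, 1_000, 4_000)
    without_thinking = request_cost(10_000, 1_000, 0)
    print(f"with thinking:    ${with_thinking:.4f}")
    print(f"without thinking: ${without_thinking:.4f}")
```

Under these assumptions the thinking tokens dominate the bill whenever they outnumber the visible output, which is why the output rate matters more than the input rate for reasoning-heavy workloads.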

lyjackal 17 hours ago

You’re quoting the batch pricing. On-demand is $1.50 per 1M input tokens and $9 per 1M output tokens. That is effectively comparable in cost to Gemini 2.5 Pro, in a flash-tier model.
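
The gap between the two sets of figures is easy to check with a short Python sketch. The prices are copied from this thread and not independently verified, and the thread does not say which tier the Gemini 2.5 Flash figure refers to, so the ratios are only as good as those quotes. The dictionary keys and the `price_ratio` helper are hypothetical names.

```python
# (input, output) in USD per 1M tokens, exactly as quoted in this thread.
# Assumption: the 2.5 Flash figure is a standard-tier price; the thread
# does not state its tier.
PRICES = {
    "Gemini 3.5 Flash (batch)": (0.75, 4.50),
    "Gemini 3.5 Flash (on demand)": (1.50, 9.00),
    "Gemini 2.5 Flash": (0.30, 2.50),
}


def price_ratio(model: str, baseline: str) -> tuple[float, float]:
    """Return (input_ratio, output_ratio) of `model` relative to `baseline`."""
    model_in, model_out = PRICES[model]
    base_in, base_out = PRICES[baseline]
    return model_in / base_in, model_out / base_out


if __name__ == "__main__":
    for tier in ("Gemini 3.5 Flash (batch)", "Gemini 3.5 Flash (on demand)"):
        in_ratio, out_ratio = price_ratio(tier, "Gemini 2.5 Flash")
        print(f"{tier}: {in_ratio:.1f}x input, {out_ratio:.1f}x output vs Gemini 2.5 Flash")
```

With the quoted numbers, the on-demand rate works out to twice the batch rate on both input and output, so any multiple computed from the batch figures understates the standard-tier increase by half.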

conorh 17 hours ago

I think you have your pricing wrong there. Gemini 3.5 Flash is $1.50 input and $9 output.

mchusma 17 hours ago

Okay, it's somewhere between Haiku and Sonnet level pricing, at somewhere between Sonnet and Opus level performance. It's a great option. I was hoping to see Opus-class intelligence at Haiku-level pricing out of Google, and this is close to that!

mchusma 17 hours ago

Never mind, after looking at more benchmarks, it seems closer to Sonnet-level intelligence at slightly lower cost. Speed is great for latency-sensitive applications, but if this were half the cost it would have been priced to win.

If this is the big model release out of Google, it's a disappointment.

ls_stats 17 hours ago

You are seeing batch inference pricing; standard inference is $1.50/$9. I was excited until I saw that price.

jpau 17 hours ago

Standard pricing is showing for me as $1.50 / $9.

(I suspect you're viewing the "flex" pricing).

Tiberium 17 hours ago

Please delete/edit your AI-written and factually wrong post.

MallocVoidstar 16 hours ago

In addition to people pointing out your LLM got the pricing wrong,

> The pricing and "including thinking tokens" framing position it as a reasoning-capable flash model rather than just a pure speed optimization

Every Gemini model starting with 2.5 has been a reasoning model.