> 3.1 Flash-Lite (reasoning) (reasoning) doesn't say much. Is it low/med/high re...

vlmutolo · 2026-03-04T01:02:35 1772586155

Wow, that’s very interesting. I wish more benchmarks were reported along with the total cost of running that benchmark. Dollars per token is kind of useless for the reasons you mentioned.

XCSme · 2026-03-04T01:05:27 1772586327

Yup, MiniMax M-2.5 is a standout in that aspect. It's $/token is very low, because it reasons forever (fun fact, that's also the reason why it's #1 on OpenRouter, because it simply burns through tokens, and OpenRouter ranking is based on tokens usage)...

XCSme · 2026-03-04T01:06:06 1772586366

https://aibenchy.com/compare/google-gemini-3-1-flash-lite-pr...