Gemini 3.5 Flash: Google ships fastest frontier model at $1.50 per million tokens
AnalysisGemini 3.5 Flash landed in the Gemini API on May 19, priced at $1.50 per million input tokens and $9.00 per million output tokens, with a 1,048,576-token context window. It runs four times faster than comparable frontier models and beats Gemini 3.1 Pro on the benchmarks that look most like real agent work: 76.2% on Terminal-Bench 2.1, 83.6% on MCP Atlas (a test suite for model-context-protocol tool use), and 84.2% on CharXiv Reasoning. That combination of price, speed, and agentic performance puts it on a short list for default model choice in agent loops, where latency and token cost multiply with every iteration. One notable default change from the preview version: thinking_level drops from high to medium unless explicitly overridden.