
Google just did something unusual with a Flash-tier model: it made it better than its own Pro. Gemini 3.5 Flash, announced at Google I/O 2026, is the first model in the new Gemini 3.5 family , and its headline claim is that a mid-tier, fast model now outperforms the previous flagship on the benchmarks that matter most to builders: coding, agentic workflows, and multimodal reasoning.
It outperforms Gemini 3.1 Pro on challenging benchmarks like Terminal-Bench 2.1 (76.2%), GDPval-AA (1656 Elo), and MCP Atlas (83.6%), while leading in multimodal understanding with 84.2% on CharXiv Reasoning , and it does all of this at 4x the output token speed of other frontier models. That combination of quality and throughput is what makes this release worth paying attention to.
What broke the Flash ceiling?
Google broke one of the unwritten rules of AI model releases: the cheap, fast Flash tier now outperforms the previous flagship Pro model on coding and agentic benchmarks. 3.5 Flash scores 76.2% on Terminal-Bench 2.1 while running 4x faster than comparable frontier models , and it costs less than Gemini 3.1 Pro.
Unlike models that bolt vision onto a text foundation, Gemini 3.5 Flash was built multimodal from the ground up. It can analyze complex charts and extract quantitative insights, understand the spatial relationships in images and video frames, process audio for transcription or reasoning tasks, and read PDFs structurally rather than just as text streams. The tweet that sparked this article is a perfect illustration: the model watches a reference photo of a lighting setup, reasons about the spatial arrangement of light sources, and generates a fully interactive 3D visualizer from that analysis alone.
Don't miss what's next in AI
Join 300,000+ engineers and researchers who get the signal, not the noise.
- Full access to in-depth AI research breakdowns
- Be the first to know what's trending before it hits mainstream
- Daily curated papers, repos, and industry moves

