5/16/2026, 1:00:00 PM · foundation-models

Google launches Gemini 3.5 Flash at I/O 2026

Google released Gemini 3.5 Flash as the first model in its 3.5 series, shipping it to consumer apps, the Gemini API, and Android development tools at its annual developer conference.

Google announced Gemini 3.5 Flash at its I/O developer conference. <cite index="3-7,3-8,3-9,3-10">The company describes it as the first in its latest series of models combining frontier intelligence with action, and has made it generally available via its agent-first development platform Google Antigravity, the Gemini API in Google AI Studio, and Android Studio.</cite>

Model positioning and benchmarks

The release inverts the usual hierarchy between Google's Flash and Pro tiers. <cite index="3-12,3-13">Google says Gemini 3.5 Flash delivers intelligence that rivals large flagship models at speeds expected from the Flash series, and that it outperforms Gemini 3.1 Pro on coding and agentic benchmarks including Terminal-Bench 2.1 (76.2%), GDPval-AA (1,656 Elo), and MCP Atlas (83.6%).</cite> <cite index="1-15,1-16">It also scores 84.2% on CharXiv Reasoning, a benchmark for multimodal understanding.</cite>

<cite index="5-9">Google says 3.5 Flash delivers four times the output token generation speed of competing frontier models.</cite> <cite index="3-28,3-29,3-30">The company positions the model for long-horizon agentic tasks, claiming work that previously took a developer days or an auditor weeks can now be completed in a fraction of the time, often at less than half the cost of other frontier models, with the model planning, building, and iterating on applications, codebases, or financial documents.</cite>

A more capable tier is not yet shipping. <cite index="3-33,3-34">Google said it is also working on Gemini 3.5 Pro, which is already being used internally and is expected to roll out next month.</cite>

Distribution and developer tooling

The model is rolling out to Google's consumer surfaces immediately. <cite index="3-19,3-20">AI Mode, which Google says has surpassed 1 billion monthly users, is being upgraded with Gemini 3.5 Flash as the new default model globally.</cite>

For developers, the launch is paired with a refresh of Google's agent platform. <cite index="5-11,5-12">Google updated Antigravity, its agentic development platform, to version 2.0, with two primary views: an Editor view that functions like an IDE with an agent sidebar, and a Manager view that orchestrates multiple agents working in parallel across workspaces.</cite> <cite index="5-13">Within this environment, Google claims a hyper-optimized version of Gemini 3.5 Flash can reach significantly faster generation speeds than standard frontier model deployments.</cite>

The stable model identifier is gemini-3.5-flash, replacing the earlier preview SKU. API pricing was published at launch. <cite index="7-30,7-31">Gemini 3.5 Flash is free in the Gemini app and in AI Mode in Google Search, while developers pay $1.50 per million input tokens and $9.00 per million output tokens through the API.</cite>
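
To make the quoted rates concrete, the following is a minimal sketch of a per-request cost estimate. It uses only the two per-million-token prices reported above. The function and constant names are illustrative, and the calculation assumes flat pricing with no caching discounts, volume tiers, or other fees, none of which the source addresses.

```python
# Rates as quoted in the article (USD per one million tokens).
INPUT_USD_PER_MILLION = 1.50
OUTPUT_USD_PER_MILLION = 9.00


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the API charge in USD, assuming flat per-token pricing.

    Hypothetical helper for illustration; it is not part of any Google SDK.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens / 1_000_000 * INPUT_USD_PER_MILLION
    output_cost = output_tokens / 1_000_000 * OUTPUT_USD_PER_MILLION
    return input_cost + output_cost


# Example: a long document (200k tokens in) with a 20k-token answer.
cost = estimate_cost_usd(200_000, 20_000)
print(f"Estimated cost: ${cost:.2f}")
```

Under these assumptions the example request comes to about $0.48, of which $0.18 is output. Because output tokens are priced at six times the input rate, output-heavy agentic workloads would be dominated by generation cost rather than context size.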

Enterprise adoption

Google cited several enterprise customers piloting or deploying the model. <cite index="1-25,1-26,1-27,1-28,1-29,1-30,1-31,1-32,1-33,1-34">Google said Shopify is running subagents in parallel for data analysis to power merchant growth forecasts; Macquarie Bank is piloting the model for customer onboarding, reasoning over complex documents of 100 or more pages to retrieve information and make recommendations; Salesforce is integrating 3.5 Flash into Agentforce to automate enterprise tasks using subagents that retain context across multi-turn tool calling; and Ramp uses it for invoice OCR (Optical Character Recognition).</cite>

Scale context

The model release was framed against rapid growth in Google's AI usage. <cite index="5-10">Google CEO Sundar Pichai said the company is now processing more than 3.2 quadrillion tokens per month, up from 480 trillion tokens per month at I/O 2025.</cite> <cite index="5-23">Google has committed to between $180 billion and $190 billion in infrastructure investment this year, roughly six times its 2022 capital expenditure.</cite>
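
As a back-of-envelope check on these figures, the short sketch below works out the year-over-year multiple in token volume and the 2022 capital expenditure implied by the "roughly six times" comparison. The variable names are illustrative, and the implied 2022 range is derived arithmetic, not a figure reported by the source.

```python
TRILLION = 10**12

# Monthly token volumes as quoted in the article.
tokens_io_2025 = 480 * TRILLION
tokens_io_2026 = 3_200 * TRILLION  # 3.2 quadrillion

token_growth = tokens_io_2026 / tokens_io_2025
print(f"Monthly token volume grew about {token_growth:.1f}x in a year")

# Infrastructure commitment for this year, in USD billions (as quoted).
capex_low_usd_bn, capex_high_usd_bn = 180, 190
CAPEX_MULTIPLE = 6  # "roughly six times" the 2022 figure, per the article

# Derived estimate only: the source does not state the 2022 number.
implied_2022_low = capex_low_usd_bn / CAPEX_MULTIPLE
implied_2022_high = capex_high_usd_bn / CAPEX_MULTIPLE
print(
    f"Implied 2022 capex: about ${implied_2022_low:.1f}B "
    f"to ${implied_2022_high:.1f}B"
)
```

On the quoted numbers, token volume grew by a factor of about 6.7, and the implied 2022 capital expenditure falls at roughly $30 billion to $32 billion.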

Sources

  [1] Google Introduces Gemini 3.5 Flash at I/O 2026: A Faster and Cheaper Model for AI Agents and Coding - MarkTechPost
  [2] Everything Google announced at I/O 2026: Gemini, Search, Android XR, & more
  [3] 100 things we announced at Google I/O 2026
  [4] Google I/O 2026: Gemini 3.5 Flash, Spark & Agentic AI
  [5] Google I/O 2026: Gemini 3.5 Flash Debuts, AI Search Era Begins and More | HotHardware
  [6] Google I/O 2026: Every Major Announcement — Gemini 3.5, Spark & More - AICC - AI.cc
  [7] Gemini 3.5 Review: What Google Launched at I/O 2026