AI Model Release Tracker - Timeline of Major AI Models from 2022-2026

Gemini 3.1 Pro vs Gemini 3.5 Flash

Google
Gemini 3.1 Pro
Google
Gemini 3.5 Flash
Overview
CompanyGoogleGoogle
Release dateFeb 19 2026May 19 2026
AccessProprietaryProprietary
Benchmarks
Nonsense detection
BullshitBench v2
37%Best
20%
Agentic coding
SWE-Bench Pro
54.2%
55.1%Best
Coding
SWE-Bench Verified
80.6%
Agentic coding
CursorBench v3.1
49.8%
Next.js coding
Next.js Evals
75%
Agentic terminal coding
Terminal-Bench 2.1
70.3%
76.2%Best
Agentic terminal coding
Terminal-Bench 2.0
68.5%
Multi-step tool use
MCP Atlas
78.2%
83.6%Best
General tool use
Toolathlon
48.8%
56.5%Best
Web browsing
BrowseComp
85.9%
Multidisciplinary reasoning
Humanity's Last Exam · no tools
44.4%Best
40.2%
Multidisciplinary reasoning
Humanity's Last Exam · with tools
51.4%
Abstract reasoning
ARC-AGI-2
77.1%Best
72.1%
Advanced math
FrontierMath · Tier 1–3
36.9%
Advanced math
FrontierMath · Tier 4
16.7%
Science
GPQA Diamond
94.3%
Agentic computer use
OSWorld-Verified
76.2%
78.4%Best
Agentic financial analysis
Finance Agent v2
43%
57.9%Best
Knowledge work
GDPval-AA
1314
1656Best
Knowledge work
GDPval (win/tie rate)
67.3%
Chart reasoning
CharXiv Reasoning
83.3%
84.2%Best
Multimodal reasoning
MMMU-Pro
80.5%
83.6%Best
Spatial reasoning
Blueprint-Bench 2
26.5%
33.6%Best
Long context
MRCR v2 (8-needle) · 128k average
84.9%Best
77.3%
Long context
MRCR v2 (8-needle) · 1M pointwise
26.3%
26.6%Best
Timeline
Release gapGemini 3.1 Pro shipped 89 days before Gemini 3.5 Flash

Which is better: Gemini 3.1 Pro or Gemini 3.5 Flash?

Gemini 3.5 Flash leads Gemini 3.1 Pro on 11 of the 15 benchmarks they both report. Gemini 3.1 Pro shipped 89 days before Gemini 3.5 Flash, so benchmark comparisons should account for the intervening progress.

Published specifications for these two models are limited — see each model page for the latest details.

On BullshitBench v2, Gemini 3.1 Pro leads at 37% vs Gemini 3.5 Flash at 20%. On SWE-Bench Pro, Gemini 3.5 Flash leads at 55.1% vs Gemini 3.1 Pro at 54.2%. On Terminal-Bench 2.1, Gemini 3.5 Flash leads at 76.2% vs Gemini 3.1 Pro at 70.3%. On MCP Atlas, Gemini 3.5 Flash leads at 83.6% vs Gemini 3.1 Pro at 78.2%. On Toolathlon, Gemini 3.5 Flash leads at 56.5% vs Gemini 3.1 Pro at 48.8%. On Humanity's Last Exam · no tools, Gemini 3.1 Pro leads at 44.4% vs Gemini 3.5 Flash at 40.2%. On ARC-AGI-2, Gemini 3.1 Pro leads at 77.1% vs Gemini 3.5 Flash at 72.1%. On OSWorld-Verified, Gemini 3.5 Flash leads at 78.4% vs Gemini 3.1 Pro at 76.2%. On Finance Agent v2, Gemini 3.5 Flash leads at 57.9% vs Gemini 3.1 Pro at 43%. On GDPval-AA, Gemini 3.5 Flash leads at 1656 vs Gemini 3.1 Pro at 1314. On CharXiv Reasoning, Gemini 3.5 Flash leads at 84.2% vs Gemini 3.1 Pro at 83.3%. On MMMU-Pro, Gemini 3.5 Flash leads at 83.6% vs Gemini 3.1 Pro at 80.5%. On Blueprint-Bench 2, Gemini 3.5 Flash leads at 33.6% vs Gemini 3.1 Pro at 26.5%. On MRCR v2 (8-needle) · 128k average, Gemini 3.1 Pro leads at 84.9% vs Gemini 3.5 Flash at 77.3%. On MRCR v2 (8-needle) · 1M pointwise, Gemini 3.5 Flash leads at 26.6% vs Gemini 3.1 Pro at 26.3%.

Frequently asked questions

Gemini 3.1 Pro was released by Google on Feb 19 2026.

Gemini 3.5 Flash was released by Google on May 19 2026.

Gemini 3.5 Flash leads on SWE-Bench Pro — Gemini 3.1 Pro 54.2% vs Gemini 3.5 Flash 55.1%.

Gemini 3.1 Pro leads on Humanity's Last Exam · no tools — Gemini 3.1 Pro 44.4% vs Gemini 3.5 Flash 40.2%.

Other comparisons