AI Model Release Tracker - Timeline of Major AI Models from 2022-2026

Claude Sonnet 5 vs Gemini 3.0 Flash

Anthropic
Claude Sonnet 5
Google
Gemini 3.0 Flash
Overview
CompanyAnthropicGoogle
Release dateJun 30 2026Dec 17 2025
AccessProprietaryProprietary
Benchmarks
Agentic coding
SWE-Bench Pro
63.2%Best
49.6%
Coding
SWE-Bench Verified
78%
Agentic coding
CursorBench v3.1
61.2%
Agentic terminal coding
Terminal-Bench 2.1
80.4%Best
58%
Multi-step tool use
MCP Atlas
62%
General tool use
Toolathlon
49.4%
Multidisciplinary reasoning
Humanity's Last Exam · no tools
43.2%Best
33.7%
Multidisciplinary reasoning
Humanity's Last Exam · with tools
57.4%
Abstract reasoning
ARC-AGI-2
33.6%
Science
GPQA Diamond
90.4%
Agentic computer use
OSWorld-Verified
81.2%Best
65.1%
Agentic financial analysis
Finance Agent v2
42.6%
Knowledge work
GDPval-AA
1618Best
1204
Chart reasoning
CharXiv Reasoning
80.3%
Multimodal reasoning
MMMU-Pro
81.2%
Spatial reasoning
Blueprint-Bench 2
0%
Long context
MRCR v2 (8-needle) · 128k average
67.2%
Long context
MRCR v2 (8-needle) · 1M pointwise
22.1%
Timeline
Release gapGemini 3.0 Flash shipped 195 days before Claude Sonnet 5

Which is better: Claude Sonnet 5 or Gemini 3.0 Flash?

Claude Sonnet 5 leads Gemini 3.0 Flash on 5 of the 5 benchmarks they both report. Gemini 3.0 Flash shipped 195 days before Claude Sonnet 5, so benchmark comparisons should account for the intervening progress.

Published specifications for these two models are limited — see each model page for the latest details.

On SWE-Bench Pro, Claude Sonnet 5 leads at 63.2% vs Gemini 3.0 Flash at 49.6%. On Terminal-Bench 2.1, Claude Sonnet 5 leads at 80.4% vs Gemini 3.0 Flash at 58%. On Humanity's Last Exam · no tools, Claude Sonnet 5 leads at 43.2% vs Gemini 3.0 Flash at 33.7%. On OSWorld-Verified, Claude Sonnet 5 leads at 81.2% vs Gemini 3.0 Flash at 65.1%. On GDPval-AA, Claude Sonnet 5 leads at 1618 vs Gemini 3.0 Flash at 1204.

Frequently asked questions

Claude Sonnet 5 was released by Anthropic on Jun 30 2026.

Gemini 3.0 Flash was released by Google on Dec 17 2025.

Claude Sonnet 5 leads on SWE-Bench Pro — Claude Sonnet 5 63.2% vs Gemini 3.0 Flash 49.6%.

Claude Sonnet 5 leads on Humanity's Last Exam · no tools — Claude Sonnet 5 43.2% vs Gemini 3.0 Flash 33.7%.

Other comparisons