AI Model Release Tracker - Timeline of Major AI Models from 2022-2026

Claude Sonnet 4 vs Gemini 3.0 Flash

Benchmarks won (of the 2 both models report): Claude Sonnet 4 0, Gemini 3.0 Flash 2

Overview
                 Claude Sonnet 4    Gemini 3.0 Flash
Company          Anthropic          Google
Release date     May 22, 2025       Dec 17, 2025
Model type       (not listed)       (not listed)
Open source      No                 No
Specifications
Parameters       (not listed)       (not listed)
Context window   (not listed)       (not listed)
Benchmarks
Agentic coding
SWE-Bench Pro
49.6%
Coding
SWE-Bench Verified
Claude Sonnet 4: 72.7%
Gemini 3.0 Flash: 78% (Best)
Agentic terminal coding
Terminal-Bench 2.1
58%
Multi-step tool use
MCP Atlas
62%
General tool use
Toolathlon
49.4%
Multidisciplinary reasoning
Humanity's Last Exam · no tools
33.7%
Abstract reasoning
ARC-AGI-2
33.6%
Science
GPQA Diamond
Claude Sonnet 4: 75.4%
Gemini 3.0 Flash: 90.4% (Best)
Agentic computer use
OSWorld-Verified
65.1%
Agentic financial analysis
Finance Agent v2
42.6%
Knowledge work
GDPval-AA
1204
Chart reasoning
CharXiv Reasoning
80.3%
Multimodal reasoning
MMMU-Pro
81.2%
Spatial reasoning
Blueprint-Bench 2
0%
Long context
MRCR v2 (8-needle) · 128k average
67.2%
Long context
MRCR v2 (8-needle) · 1M pointwise
22.1%
Timeline
Release gap: Claude Sonnet 4 shipped 209 days before Gemini 3.0 Flash
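
The 209-day figure can be checked with plain date arithmetic. A minimal sketch, assuming the two release dates listed in the overview; the variable names are illustrative and not part of the tracker:

```python
from datetime import date

# Release dates as listed in the overview for each model.
claude_sonnet_4_release = date(2025, 5, 22)
gemini_3_flash_release = date(2025, 12, 17)

# Subtracting two dates yields a timedelta; .days is the whole-day gap.
release_gap_days = (gemini_3_flash_release - claude_sonnet_4_release).days
print(release_gap_days)
```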

Which is better: Claude Sonnet 4 or Gemini 3.0 Flash?

Gemini 3.0 Flash leads Claude Sonnet 4 on both of the benchmarks that the two models have in common (SWE-Bench Verified and GPQA Diamond). Claude Sonnet 4 shipped 209 days before Gemini 3.0 Flash, so benchmark comparisons should account for the intervening progress.

Published specifications for these two models are limited — see each model page for the latest details.

On SWE-Bench Verified, Gemini 3.0 Flash leads at 78% vs Claude Sonnet 4 at 72.7%. On GPQA Diamond, Gemini 3.0 Flash leads at 90.4% vs Claude Sonnet 4 at 75.4%.
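
The head-to-head tally counts only benchmarks where both models report a score. A minimal sketch of that counting rule, assuming higher is better and using only the two shared scores quoted above; the dictionary layout and the `count_wins` function are hypothetical, not the tracker's own code:

```python
# Scores for the benchmarks both models report (percent, higher is better).
scores = {
    "SWE-Bench Verified": {"Claude Sonnet 4": 72.7, "Gemini 3.0 Flash": 78.0},
    "GPQA Diamond": {"Claude Sonnet 4": 75.4, "Gemini 3.0 Flash": 90.4},
}

def count_wins(scores):
    """Count how many shared benchmarks each model leads; ties count for nobody."""
    wins = {model: 0 for result in scores.values() for model in result}
    for result in scores.values():
        if len(result) < 2:
            continue  # a benchmark only one model reports is not a comparison
        best = max(result.values())
        leaders = [model for model, score in result.items() if score == best]
        if len(leaders) == 1:
            wins[leaders[0]] += 1
    return wins

wins = count_wins(scores)
print(wins)
```

Benchmarks that only one model reports are skipped under this rule, which is why the tally is out of 2 rather than out of every benchmark listed.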

Frequently asked questions

When was Claude Sonnet 4 released?
Claude Sonnet 4 was released by Anthropic on May 22, 2025.
When was Gemini 3.0 Flash released?
Gemini 3.0 Flash was released by Google on Dec 17, 2025.
Which is better at coding, Claude Sonnet 4 or Gemini 3.0 Flash?
Gemini 3.0 Flash leads on SWE-Bench Verified, scoring 78% against 72.7% for Claude Sonnet 4.
Which scores higher on GPQA Diamond, Claude Sonnet 4 or Gemini 3.0 Flash?
Gemini 3.0 Flash leads on GPQA Diamond, scoring 90.4% against 75.4% for Claude Sonnet 4.
