AI Model Release Tracker - Timeline of Major AI Models from 2022-2026

Home/Compare/Claude Sonnet 4.6 vs Gemini 3.0 Flash

Claude Sonnet 4.6 vs Gemini 3.0 Flash

8 vs 4 benchmarks won

Anthropic
Claude Sonnet 4.6
Google
Gemini 3.0 Flash
Overview
CompanyAnthropicGoogle
Release dateFeb 17 2026Dec 17 2025
Model type
Open sourceNoNo
Specifications
Parameters
Context window
Benchmarks
Agentic coding
SWE-Bench Pro
49.6%
Coding
SWE-Bench Verified
79.6%Best
78%
Agentic terminal coding
Terminal-Bench 2.1
58%
Multi-step tool use
MCP Atlas
69.5%Best
62%
General tool use
Toolathlon
49.4%
Multidisciplinary reasoning
Humanity's Last Exam · no tools
33.2%
33.7%Best
Abstract reasoning
ARC-AGI-2
58.3%Best
33.6%
Science
GPQA Diamond
89.9%
90.4%Best
Agentic computer use
OSWorld-Verified
72.5%Best
65.1%
Agentic financial analysis
Finance Agent v2
51%Best
42.6%
Knowledge work
GDPval-AA
1676Best
1204
Chart reasoning
CharXiv Reasoning
72.4%
80.3%Best
Multimodal reasoning
MMMU-Pro
74.5%
81.2%Best
Spatial reasoning
Blueprint-Bench 2
6.7%Best
0%
Long context
MRCR v2 (8-needle) · 128k average
84.9%Best
67.2%
Long context
MRCR v2 (8-needle) · 1M pointwise
22.1%
Timeline
Release gapGemini 3.0 Flash shipped 62 days before Claude Sonnet 4.6

Which is better: Claude Sonnet 4.6 or Gemini 3.0 Flash?

Claude Sonnet 4.6 leads Gemini 3.0 Flash on 8 of the 12 benchmarks they both report. Gemini 3.0 Flash shipped 62 days before Claude Sonnet 4.6, so benchmark comparisons should account for the intervening progress.

Published specifications for these two models are limited — see each model page for the latest details.

On SWE-Bench Verified, Claude Sonnet 4.6 leads at 79.6% vs Gemini 3.0 Flash at 78%. On MCP Atlas, Claude Sonnet 4.6 leads at 69.5% vs Gemini 3.0 Flash at 62%. On Humanity's Last Exam · no tools, Gemini 3.0 Flash leads at 33.7% vs Claude Sonnet 4.6 at 33.2%. On ARC-AGI-2, Claude Sonnet 4.6 leads at 58.3% vs Gemini 3.0 Flash at 33.6%. On GPQA Diamond, Gemini 3.0 Flash leads at 90.4% vs Claude Sonnet 4.6 at 89.9%. On OSWorld-Verified, Claude Sonnet 4.6 leads at 72.5% vs Gemini 3.0 Flash at 65.1%. On Finance Agent v2, Claude Sonnet 4.6 leads at 51% vs Gemini 3.0 Flash at 42.6%. On GDPval-AA, Claude Sonnet 4.6 leads at 1676 vs Gemini 3.0 Flash at 1204. On CharXiv Reasoning, Gemini 3.0 Flash leads at 80.3% vs Claude Sonnet 4.6 at 72.4%. On MMMU-Pro, Gemini 3.0 Flash leads at 81.2% vs Claude Sonnet 4.6 at 74.5%. On Blueprint-Bench 2, Claude Sonnet 4.6 leads at 6.7% vs Gemini 3.0 Flash at 0%. On MRCR v2 (8-needle) · 128k average, Claude Sonnet 4.6 leads at 84.9% vs Gemini 3.0 Flash at 67.2%.

Frequently asked questions

When was Claude Sonnet 4.6 released?
Claude Sonnet 4.6 was released by Anthropic on Feb 17 2026.
When was Gemini 3.0 Flash released?
Gemini 3.0 Flash was released by Google on Dec 17 2025.
Which is better at coding, Claude Sonnet 4.6 or Gemini 3.0 Flash?
Claude Sonnet 4.6 leads on SWE-Bench Verified — Claude Sonnet 4.6 79.6% vs Gemini 3.0 Flash 78%.
Which scores higher on Humanity's Last Exam, Claude Sonnet 4.6 or Gemini 3.0 Flash?
Gemini 3.0 Flash leads on Humanity's Last Exam · no tools — Claude Sonnet 4.6 33.2% vs Gemini 3.0 Flash 33.7%.

Other comparisons