AI Model Release Tracker - Timeline of Major AI Models from 2022-2026

Home/Compare/Claude 3.5 Sonnet vs Gemini 3.0 Flash

Claude 3.5 Sonnet vs Gemini 3.0 Flash

0 vs 2 benchmarks won

Anthropic
Claude 3.5 Sonnet
Google
Gemini 3.0 Flash
Overview
CompanyAnthropicGoogle
Release dateJun 20 2024Dec 17 2025
Model type
Open sourceNoNo
Specifications
Parameters
Context window
Benchmarks
Agentic coding
SWE-Bench Pro
49.6%
Coding
SWE-Bench Verified
33.4%
78%Best
Agentic terminal coding
Terminal-Bench 2.1
58%
Multi-step tool use
MCP Atlas
62%
General tool use
Toolathlon
49.4%
Multidisciplinary reasoning
Humanity's Last Exam · no tools
33.7%
Abstract reasoning
ARC-AGI-2
33.6%
Science
GPQA Diamond
59.4%
90.4%Best
Agentic computer use
OSWorld-Verified
65.1%
Agentic financial analysis
Finance Agent v2
42.6%
Knowledge work
GDPval-AA
1204
Chart reasoning
CharXiv Reasoning
80.3%
Multimodal reasoning
MMMU-Pro
81.2%
Spatial reasoning
Blueprint-Bench 2
0%
Long context
MRCR v2 (8-needle) · 128k average
67.2%
Long context
MRCR v2 (8-needle) · 1M pointwise
22.1%
Timeline
Release gapClaude 3.5 Sonnet shipped 545 days before Gemini 3.0 Flash

Which is better: Claude 3.5 Sonnet or Gemini 3.0 Flash?

Gemini 3.0 Flash leads Claude 3.5 Sonnet on 2 of the 2 benchmarks they both report (SWE-Bench Verified, GPQA Diamond). Claude 3.5 Sonnet shipped 545 days before Gemini 3.0 Flash, so benchmark comparisons should account for the intervening progress.

Published specifications for these two models are limited — see each model page for the latest details.

On SWE-Bench Verified, Gemini 3.0 Flash leads at 78% vs Claude 3.5 Sonnet at 33.4%. On GPQA Diamond, Gemini 3.0 Flash leads at 90.4% vs Claude 3.5 Sonnet at 59.4%.

Frequently asked questions

When was Claude 3.5 Sonnet released?
Claude 3.5 Sonnet was released by Anthropic on Jun 20 2024.
When was Gemini 3.0 Flash released?
Gemini 3.0 Flash was released by Google on Dec 17 2025.
Which is better at coding, Claude 3.5 Sonnet or Gemini 3.0 Flash?
Gemini 3.0 Flash leads on SWE-Bench Verified — Claude 3.5 Sonnet 33.4% vs Gemini 3.0 Flash 78%.
Which scores higher on GPQA Diamond, Claude 3.5 Sonnet or Gemini 3.0 Flash?
Gemini 3.0 Flash leads on GPQA Diamond — Claude 3.5 Sonnet 59.4% vs Gemini 3.0 Flash 90.4%.

Other comparisons