AI Model Release Tracker - Timeline of Major AI Models from 2022-2026

Claude Sonnet 4.5 vs Gemini 3.5 Flash

Anthropic
Claude Sonnet 4.5
Google
Gemini 3.5 Flash
Overview
CompanyAnthropicGoogle
Release dateSep 29 2025May 19 2026
AccessProprietaryProprietary
Benchmarks
Nonsense detection
BullshitBench v2
79%Best
20%
Agentic coding
SWE-Bench Pro
55.1%
Coding
SWE-Bench Verified
77.2%
Agentic coding
CursorBench v3.1
49.8%
Next.js coding
Next.js Evals
50%
Agentic terminal coding
Terminal-Bench 2.1
76.2%
Multi-step tool use
MCP Atlas
83.6%
General tool use
Toolathlon
56.5%
Multidisciplinary reasoning
Humanity's Last Exam · no tools
40.2%
Abstract reasoning
ARC-AGI-2
72.1%
Science
GPQA Diamond
83.4%
Agentic computer use
OSWorld-Verified
78.4%
Agentic financial analysis
Finance Agent v2
57.9%
Knowledge work
GDPval-AA
1656
Chart reasoning
CharXiv Reasoning
84.2%
Multimodal reasoning
MMMU-Pro
83.6%
Multimodal
MMMU
68%
Spatial reasoning
Blueprint-Bench 2
33.6%
Long context
MRCR v2 (8-needle) · 128k average
77.3%
Long context
MRCR v2 (8-needle) · 1M pointwise
26.6%
Timeline
Release gapClaude Sonnet 4.5 shipped 232 days before Gemini 3.5 Flash

Which is better: Claude Sonnet 4.5 or Gemini 3.5 Flash?

Claude Sonnet 4.5 leads Gemini 3.5 Flash on 1 of the 1 benchmark they both report (BullshitBench v2). Claude Sonnet 4.5 shipped 232 days before Gemini 3.5 Flash, so benchmark comparisons should account for the intervening progress.

Published specifications for these two models are limited — see each model page for the latest details.

On BullshitBench v2, Claude Sonnet 4.5 leads at 79% vs Gemini 3.5 Flash at 20%.

Frequently asked questions

Claude Sonnet 4.5 was released by Anthropic on Sep 29 2025.

Gemini 3.5 Flash was released by Google on May 19 2026.

Other comparisons