AI Model Release Tracker - Timeline of Major AI Models from 2022-2026

Claude Sonnet 5 vs Gemini 3.1 Pro

Anthropic
Claude Sonnet 5
Google
Gemini 3.1 Pro
Overview
CompanyAnthropicGoogle
Release dateJun 30 2026Feb 19 2026
AccessProprietaryProprietary
Benchmarks
Nonsense detection
BullshitBench v2
37%
Agentic coding
SWE-Bench Pro
63.2%Best
54.2%
Coding
SWE-Bench Verified
80.6%
Agentic coding
CursorBench v3.1
61.2%
Agentic terminal coding
Terminal-Bench 2.1
80.4%Best
70.3%
Agentic terminal coding
Terminal-Bench 2.0
68.5%
Multi-step tool use
MCP Atlas
78.2%
General tool use
Toolathlon
48.8%
Web browsing
BrowseComp
85.9%
Multidisciplinary reasoning
Humanity's Last Exam · no tools
43.2%
44.4%Best
Multidisciplinary reasoning
Humanity's Last Exam · with tools
57.4%Best
51.4%
Abstract reasoning
ARC-AGI-2
77.1%
Advanced math
FrontierMath · Tier 1–3
36.9%
Advanced math
FrontierMath · Tier 4
16.7%
Science
GPQA Diamond
94.3%
Agentic computer use
OSWorld-Verified
81.2%Best
76.2%
Agentic financial analysis
Finance Agent v2
43%
Knowledge work
GDPval-AA
1618Best
1314
Knowledge work
GDPval (win/tie rate)
67.3%
Chart reasoning
CharXiv Reasoning
83.3%
Multimodal reasoning
MMMU-Pro
80.5%
Spatial reasoning
Blueprint-Bench 2
26.5%
Long context
MRCR v2 (8-needle) · 128k average
84.9%
Long context
MRCR v2 (8-needle) · 1M pointwise
26.3%
Timeline
Release gapGemini 3.1 Pro shipped 131 days before Claude Sonnet 5

Which is better: Claude Sonnet 5 or Gemini 3.1 Pro?

Claude Sonnet 5 leads Gemini 3.1 Pro on 5 of the 6 benchmarks they both report. Gemini 3.1 Pro shipped 131 days before Claude Sonnet 5, so benchmark comparisons should account for the intervening progress.

Published specifications for these two models are limited — see each model page for the latest details.

On SWE-Bench Pro, Claude Sonnet 5 leads at 63.2% vs Gemini 3.1 Pro at 54.2%. On Terminal-Bench 2.1, Claude Sonnet 5 leads at 80.4% vs Gemini 3.1 Pro at 70.3%. On Humanity's Last Exam · no tools, Gemini 3.1 Pro leads at 44.4% vs Claude Sonnet 5 at 43.2%. On Humanity's Last Exam · with tools, Claude Sonnet 5 leads at 57.4% vs Gemini 3.1 Pro at 51.4%. On OSWorld-Verified, Claude Sonnet 5 leads at 81.2% vs Gemini 3.1 Pro at 76.2%. On GDPval-AA, Claude Sonnet 5 leads at 1618 vs Gemini 3.1 Pro at 1314.

Frequently asked questions

Claude Sonnet 5 was released by Anthropic on Jun 30 2026.

Gemini 3.1 Pro was released by Google on Feb 19 2026.

Claude Sonnet 5 leads on SWE-Bench Pro — Claude Sonnet 5 63.2% vs Gemini 3.1 Pro 54.2%.

Gemini 3.1 Pro leads on Humanity's Last Exam · no tools — Claude Sonnet 5 43.2% vs Gemini 3.1 Pro 44.4%.

Other comparisons