AI Model Release Tracker - Timeline of Major AI Models from 2022-2026

Claude Sonnet 5 vs GPT-5.5

Anthropic
Claude Sonnet 5
OpenAI
GPT-5.5
Overview
CompanyAnthropicOpenAI
Release dateJun 30 2026Apr 23 2026
AccessProprietaryProprietary
Specifications
Context window
1.05M
Benchmarks
Nonsense detection
BullshitBench v2
47%
Agentic coding
SWE-Bench Pro
63.2%Best
58.6%
Multilingual coding
SWE-Bench Multilingual
77.8%
Agentic coding
CursorBench v3.1
61.2%
64.3%Best
Agentic terminal coding
Terminal-Bench 2.1
80.4%Best
78.2%
Agentic terminal coding
Terminal-Bench 2.0
82.7%
Software engineering
Expert-SWE (Internal)
73.1%
Multi-step tool use
MCP Atlas
75.3%
General tool use
Toolathlon
55.6%
Web browsing
BrowseComp
84.4%
Cybersecurity
CyberGym
81.8%
Multidisciplinary reasoning
Humanity's Last Exam · no tools
43.2%Best
41.4%
Multidisciplinary reasoning
Humanity's Last Exam · with tools
57.4%Best
52.2%
Abstract reasoning
ARC-AGI-2
84.6%
Advanced math
FrontierMath · Tier 1–3
51.7%
Advanced math
FrontierMath · Tier 4
35.4%
Science
GPQA Diamond
93.6%
Agentic computer use
OSWorld-Verified
81.2%Best
78.7%
Agentic financial analysis
Finance Agent v2
51.8%
Knowledge work
GDPval-AA
1618
1769Best
Knowledge work
GDPval (win/tie rate)
84.9%
Chart reasoning
CharXiv Reasoning
84.1%
Multimodal reasoning
MMMU-Pro
81.2%
Spatial reasoning
Blueprint-Bench 2
36.2%
Long context
MRCR v2 (8-needle) · 128k average
94.8%
Timeline
Release gapGPT-5.5 shipped 68 days before Claude Sonnet 5

Which is better: Claude Sonnet 5 or GPT-5.5?

Claude Sonnet 5 leads GPT-5.5 on 5 of the 7 benchmarks they both report. GPT-5.5 shipped 68 days before Claude Sonnet 5, so benchmark comparisons should account for the intervening progress.

Published specifications for these two models are limited — see each model page for the latest details.

On SWE-Bench Pro, Claude Sonnet 5 leads at 63.2% vs GPT-5.5 at 58.6%. On CursorBench v3.1, GPT-5.5 leads at 64.3% vs Claude Sonnet 5 at 61.2%. On Terminal-Bench 2.1, Claude Sonnet 5 leads at 80.4% vs GPT-5.5 at 78.2%. On Humanity's Last Exam · no tools, Claude Sonnet 5 leads at 43.2% vs GPT-5.5 at 41.4%. On Humanity's Last Exam · with tools, Claude Sonnet 5 leads at 57.4% vs GPT-5.5 at 52.2%. On OSWorld-Verified, Claude Sonnet 5 leads at 81.2% vs GPT-5.5 at 78.7%. On GDPval-AA, GPT-5.5 leads at 1769 vs Claude Sonnet 5 at 1618.

Frequently asked questions

Claude Sonnet 5 was released by Anthropic on Jun 30 2026.

GPT-5.5 was released by OpenAI on Apr 23 2026.

Claude Sonnet 5 leads on SWE-Bench Pro — Claude Sonnet 5 63.2% vs GPT-5.5 58.6%.

Claude Sonnet 5 leads on Humanity's Last Exam · no tools — Claude Sonnet 5 43.2% vs GPT-5.5 41.4%.

Other comparisons