AI Model Release Tracker - Timeline of Major AI Models from 2022-2026

Home/Compare/Claude Opus 4.8 vs GPT-5.5

Claude Opus 4.8 vs GPT-5.5

6 vs 1 benchmarks won

Anthropic
Claude Opus 4.8
OpenAI
GPT-5.5
Overview
CompanyAnthropicOpenAI
Release dateMay 28 2026Apr 23 2026
Model type
Open sourceNoNo
Specifications
Parameters
Context window
Benchmarks
Agentic coding
SWE-Bench Pro
69.2%Best
58.6%
Multilingual coding
SWE-Bench Multilingual
77.8%
Agentic coding
CursorBench v3.1
59.2%
Agentic terminal coding
Terminal-Bench 2.1
74.6%
78.2%Best
Agentic terminal coding
Terminal-Bench 2.0
82.7%
Software engineering
Expert-SWE (Internal)
73.1%
Multi-step tool use
MCP Atlas
75.3%
General tool use
Toolathlon
55.6%
Web browsing
BrowseComp
84.4%
Cybersecurity
CyberGym
81.8%
Multidisciplinary reasoning
Humanity's Last Exam · no tools
49.8%Best
41.4%
Multidisciplinary reasoning
Humanity's Last Exam · with tools
57.9%Best
52.2%
Abstract reasoning
ARC-AGI-2
84.6%
Advanced math
FrontierMath · Tier 1–3
51.7%
Advanced math
FrontierMath · Tier 4
35.4%
Science
GPQA Diamond
93.6%
Agentic computer use
OSWorld-Verified
83.4%Best
78.7%
Agentic financial analysis
Finance Agent v2
53.9%Best
51.8%
Knowledge work
GDPval-AA
1890Best
1769
Knowledge work
GDPval (win/tie rate)
84.9%
Chart reasoning
CharXiv Reasoning
84.1%
Multimodal reasoning
MMMU-Pro
81.2%
Spatial reasoning
Blueprint-Bench 2
36.2%
Long context
MRCR v2 (8-needle) · 128k average
94.8%
Timeline
Release gapGPT-5.5 shipped 35 days before Claude Opus 4.8

Which is better: Claude Opus 4.8 or GPT-5.5?

Claude Opus 4.8 leads GPT-5.5 on 6 of the 7 benchmarks they both report. GPT-5.5 shipped 35 days before Claude Opus 4.8, so benchmark comparisons should account for the intervening progress.

Published specifications for these two models are limited — see each model page for the latest details.

On SWE-Bench Pro, Claude Opus 4.8 leads at 69.2% vs GPT-5.5 at 58.6%. On Terminal-Bench 2.1, GPT-5.5 leads at 78.2% vs Claude Opus 4.8 at 74.6%. On Humanity's Last Exam · no tools, Claude Opus 4.8 leads at 49.8% vs GPT-5.5 at 41.4%. On Humanity's Last Exam · with tools, Claude Opus 4.8 leads at 57.9% vs GPT-5.5 at 52.2%. On OSWorld-Verified, Claude Opus 4.8 leads at 83.4% vs GPT-5.5 at 78.7%. On Finance Agent v2, Claude Opus 4.8 leads at 53.9% vs GPT-5.5 at 51.8%. On GDPval-AA, Claude Opus 4.8 leads at 1890 vs GPT-5.5 at 1769.

Frequently asked questions

When was Claude Opus 4.8 released?
Claude Opus 4.8 was released by Anthropic on May 28 2026.
When was GPT-5.5 released?
GPT-5.5 was released by OpenAI on Apr 23 2026.
Which is better at coding, Claude Opus 4.8 or GPT-5.5?
Claude Opus 4.8 leads on SWE-Bench Pro — Claude Opus 4.8 69.2% vs GPT-5.5 58.6%.
Which scores higher on Humanity's Last Exam, Claude Opus 4.8 or GPT-5.5?
Claude Opus 4.8 leads on Humanity's Last Exam · no tools — Claude Opus 4.8 49.8% vs GPT-5.5 41.4%.

Other comparisons