AI Model Release Tracker - Timeline of Major AI Models from 2022-2026


Claude Opus 4.7 vs GPT-5.5

Benchmarks won: Claude Opus 4.7 7, GPT-5.5 15

Anthropic: Claude Opus 4.7
OpenAI: GPT-5.5
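Overview
Field           Claude Opus 4.7  GPT-5.5
Company         Anthropic        OpenAI
Release date    Apr 16 2026      Apr 23 2026
Model type      not listed       not listed
Open source     No               No

Specifications
Field           Claude Opus 4.7  GPT-5.5
Parameters      not listed       not listed
Context window  not listed       not listed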
Benchmarks
Category                     Benchmark                          Claude Opus 4.7  GPT-5.5
Agentic coding               SWE-Bench Pro                      64.3% (best)     58.6%
Multilingual coding          SWE-Bench Multilingual             80.5% (best)     77.8%
Agentic coding               CursorBench v3.1                   61.6% (best)     59.2%
Agentic terminal coding      Terminal-Bench 2.1                 66.1%            78.2% (best)
Agentic terminal coding      Terminal-Bench 2.0                 69.4%            82.7% (best)
Multi-step tool use          MCP Atlas                          79.1% (best)     75.3%
Web browsing                 BrowseComp                         79.3%            84.4% (best)
Cybersecurity                CyberGym                           73.1%            81.8% (best)
Multidisciplinary reasoning  Humanity's Last Exam · no tools    46.9% (best)     41.4%
Multidisciplinary reasoning  Humanity's Last Exam · with tools  54.7% (best)     52.2%
Abstract reasoning           ARC-AGI-2                          75.8%            84.6% (best)
Advanced math                FrontierMath · Tier 1–3            43.8%            51.7% (best)
Advanced math                FrontierMath · Tier 4              22.9%            35.4% (best)
Science                      GPQA Diamond                       94.2% (best)     93.6%
Agentic computer use         OSWorld-Verified                   78%              78.7% (best)
Agentic financial analysis   Finance Agent v2                   51.5%            51.8% (best)
Knowledge work               GDPval-AA                          1753             1769 (best)
Knowledge work               GDPval (win/tie rate)              80.3%            84.9% (best)
Chart reasoning              CharXiv Reasoning                  82.1%            84.1% (best)
Multimodal reasoning         MMMU-Pro                           75.2%            81.2% (best)
Spatial reasoning            Blueprint-Bench 2                  24.5%            36.2% (best)
Long context                 MRCR v2 (8-needle) · 128k average  59.3%            94.8% (best)

Single score listed (the model it belongs to is not recoverable from this listing):
Category                     Benchmark                          Score
Coding                       SWE-Bench Verified                 87.6%
Software engineering         Expert-SWE (Internal)              73.1%
General tool use             Toolathlon                         55.6%
Timeline
Release gap: Claude Opus 4.7 shipped 7 days before GPT-5.5
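
The 7-day figure follows directly from the two release dates listed on this page. A minimal sketch of that arithmetic, where the function and variable names are illustrative assumptions rather than anything the tracker publishes:

```python
from datetime import date

# Release dates as listed in the overview on this page.
claude_release = date(2026, 4, 16)  # Claude Opus 4.7
gpt_release = date(2026, 4, 23)     # GPT-5.5


def release_gap_days(first: date, second: date) -> int:
    """Return how many days `second` shipped after `first` (hypothetical helper)."""
    return (second - first).days


gap = release_gap_days(claude_release, gpt_release)
print(f"Claude Opus 4.7 shipped {gap} days before GPT-5.5")
```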

Which is better: Claude Opus 4.7 or GPT-5.5?

GPT-5.5 leads Claude Opus 4.7 on 15 of the 22 benchmarks that both models report; Claude Opus 4.7 leads on the remaining 7. Claude Opus 4.7 shipped 7 days before GPT-5.5, so benchmark comparisons should account for the intervening progress.
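
The win counts can be reproduced from the scores on this page. The sketch below assumes that a higher score is better on every benchmark and that only benchmarks with a score for both models are counted; `SCORES` and `tally_wins` are illustrative names, not part of the tracker.

```python
# (benchmark, Claude Opus 4.7 score, GPT-5.5 score), transcribed from this page.
# Only benchmarks with a score listed for both models are included.
SCORES = [
    ("SWE-Bench Pro", 64.3, 58.6),
    ("SWE-Bench Multilingual", 80.5, 77.8),
    ("CursorBench v3.1", 61.6, 59.2),
    ("Terminal-Bench 2.1", 66.1, 78.2),
    ("Terminal-Bench 2.0", 69.4, 82.7),
    ("MCP Atlas", 79.1, 75.3),
    ("BrowseComp", 79.3, 84.4),
    ("CyberGym", 73.1, 81.8),
    ("Humanity's Last Exam · no tools", 46.9, 41.4),
    ("Humanity's Last Exam · with tools", 54.7, 52.2),
    ("ARC-AGI-2", 75.8, 84.6),
    ("FrontierMath · Tier 1–3", 43.8, 51.7),
    ("FrontierMath · Tier 4", 22.9, 35.4),
    ("GPQA Diamond", 94.2, 93.6),
    ("OSWorld-Verified", 78.0, 78.7),
    ("Finance Agent v2", 51.5, 51.8),
    ("GDPval-AA", 1753, 1769),
    ("GDPval (win/tie rate)", 80.3, 84.9),
    ("CharXiv Reasoning", 82.1, 84.1),
    ("MMMU-Pro", 75.2, 81.2),
    ("Blueprint-Bench 2", 24.5, 36.2),
    ("MRCR v2 (8-needle) · 128k average", 59.3, 94.8),
]


def tally_wins(scores):
    """Count wins per model, assuming a higher score is better everywhere."""
    claude_wins = sum(1 for _, claude, gpt in scores if claude > gpt)
    gpt_wins = sum(1 for _, claude, gpt in scores if gpt > claude)
    ties = len(scores) - claude_wins - gpt_wins
    return claude_wins, gpt_wins, ties


claude_wins, gpt_wins, ties = tally_wins(SCORES)
print(f"Claude Opus 4.7: {claude_wins}, GPT-5.5: {gpt_wins}, ties: {ties}")
```

A raw win count weights every benchmark equally, so it says nothing about how large each lead is.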

Published specifications for these two models are limited — see each model page for the latest details.

Claude Opus 4.7 leads on 7 benchmarks (Claude Opus 4.7 score first, GPT-5.5 second):
SWE-Bench Pro: 64.3% vs 58.6%
SWE-Bench Multilingual: 80.5% vs 77.8%
CursorBench v3.1: 61.6% vs 59.2%
MCP Atlas: 79.1% vs 75.3%
Humanity's Last Exam · no tools: 46.9% vs 41.4%
Humanity's Last Exam · with tools: 54.7% vs 52.2%
GPQA Diamond: 94.2% vs 93.6%

GPT-5.5 leads on 15 benchmarks (GPT-5.5 score first, Claude Opus 4.7 second):
Terminal-Bench 2.1: 78.2% vs 66.1%
Terminal-Bench 2.0: 82.7% vs 69.4%
BrowseComp: 84.4% vs 79.3%
CyberGym: 81.8% vs 73.1%
ARC-AGI-2: 84.6% vs 75.8%
FrontierMath · Tier 1–3: 51.7% vs 43.8%
FrontierMath · Tier 4: 35.4% vs 22.9%
OSWorld-Verified: 78.7% vs 78%
Finance Agent v2: 51.8% vs 51.5%
GDPval-AA: 1769 vs 1753
GDPval (win/tie rate): 84.9% vs 80.3%
CharXiv Reasoning: 84.1% vs 82.1%
MMMU-Pro: 81.2% vs 75.2%
Blueprint-Bench 2: 36.2% vs 24.5%
MRCR v2 (8-needle) · 128k average: 94.8% vs 59.3%
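
The size of each lead varies widely, which a per-benchmark margin makes visible. The sketch below uses a five-benchmark subset of the percentage scores above; `SAMPLE` and `margins` are illustrative names, and the sign convention (positive means GPT-5.5 leads) is an assumption made for this example.

```python
# (benchmark, Claude Opus 4.7 %, GPT-5.5 %): a subset of the scores on this page.
SAMPLE = [
    ("SWE-Bench Pro", 64.3, 58.6),
    ("GPQA Diamond", 94.2, 93.6),
    ("OSWorld-Verified", 78.0, 78.7),
    ("Finance Agent v2", 51.5, 51.8),
    ("MRCR v2 (8-needle) · 128k average", 59.3, 94.8),
]


def margins(scores):
    """Return (benchmark, margin) pairs sorted by largest absolute gap first.

    Margin is GPT-5.5 minus Claude Opus 4.7 in percentage points, rounded to
    one decimal place to hide floating-point noise.
    """
    pairs = [(name, round(gpt - claude, 1)) for name, claude, gpt in scores]
    return sorted(pairs, key=lambda pair: abs(pair[1]), reverse=True)


for name, margin in margins(SAMPLE):
    leader = "GPT-5.5" if margin > 0 else "Claude Opus 4.7"
    print(f"{name}: {leader} leads by {abs(margin)} points")
```

In this subset the long-context gap is far larger than the others, while the GPQA Diamond, OSWorld-Verified, and Finance Agent v2 leads are each under one percentage point.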

Frequently asked questions

When was Claude Opus 4.7 released?
Claude Opus 4.7 was released by Anthropic on Apr 16 2026.
When was GPT-5.5 released?
GPT-5.5 was released by OpenAI on Apr 23 2026.
Which is better at coding, Claude Opus 4.7 or GPT-5.5?
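It depends on the benchmark. Claude Opus 4.7 leads on SWE-Bench Pro (64.3% vs 58.6%), SWE-Bench Multilingual (80.5% vs 77.8%), and CursorBench v3.1 (61.6% vs 59.2%), while GPT-5.5 leads on Terminal-Bench 2.1 (78.2% vs 66.1%) and Terminal-Bench 2.0 (82.7% vs 69.4%).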
Which scores higher on Humanity's Last Exam, Claude Opus 4.7 or GPT-5.5?
Claude Opus 4.7 leads in both settings: 46.9% vs 41.4% without tools, and 54.7% vs 52.2% with tools.
