AI Model Release Tracker - Timeline of Major AI Models from 2022-2026

Home/Compare/Claude 3.7 Sonnet vs GPT-5.1

Claude 3.7 Sonnet vs GPT-5.1

0 vs 3 benchmarks won

Anthropic
Claude 3.7 Sonnet
OpenAI
GPT-5.1
Overview
CompanyAnthropicOpenAI
Release dateFeb 24 2025Nov 12 2025
Model type
Open sourceNoNo
Specifications
Parameters
Context window
Benchmarks
Science reasoning
GPQA Diamond
68%
88.1%
Software engineering
SWE-Bench Verified
62.3%
76.3%
Multimodal understanding
MMMU
76%
Timeline
Release gapClaude 3.7 Sonnet shipped 261 days before GPT-5.1

Which is better: Claude 3.7 Sonnet or GPT-5.1?

GPT-5.1 leads Claude 3.7 Sonnet on 3 of the tracked benchmarks (GPQA Diamond, SWE-Bench Verified, MMMU). Claude 3.7 Sonnet shipped 261 days before GPT-5.1, so benchmark comparisons should account for the intervening progress.

Published specifications for these two models are limited — see each model page for the latest details.

On GPQA Diamond, GPT-5.1 scores 88.1%, 20.1 points above Claude 3.7 Sonnet at 68%. On SWE-Bench Verified, GPT-5.1 scores 76.3%, 14 points above Claude 3.7 Sonnet at 62.3%.

Frequently asked questions

When was Claude 3.7 Sonnet released?
Claude 3.7 Sonnet was released by Anthropic on Feb 24 2025.
When was GPT-5.1 released?
GPT-5.1 was released by OpenAI on Nov 12 2025.
Which is better on GPQA Diamond, Claude 3.7 Sonnet or GPT-5.1?
GPT-5.1 scores higher on GPQA Diamond — Claude 3.7 Sonnet 68% vs GPT-5.1 88.1%.
Which is better at coding, Claude 3.7 Sonnet or GPT-5.1?
On SWE-Bench Verified (real-world software-engineering tasks), GPT-5.1 leads — Claude 3.7 Sonnet scores 62.3% and GPT-5.1 scores 76.3%.

Other comparisons