3 vs 0 benchmarks won
Anthropic Claude Sonnet 4.5 | Google Gemini 3.0 Pro Image | |
|---|---|---|
| Overview | ||
| Company | Anthropic | |
| Release date | Sep 29 2025 | Nov 20 2025 |
| Model type | — | — |
| Open source | No | No |
| Specifications | ||
Parameters | — | — |
Context window | — | — |
| Benchmarks | ||
Science reasoning GPQA Diamond | 83.4% | — |
Software engineering SWE-Bench Verified | 77.2% | — |
Multimodal understanding MMMU | 68% | — |
| Timeline | ||
| Release gap | Claude Sonnet 4.5 shipped 52 days before Gemini 3.0 Pro Image | |
Claude Sonnet 4.5 leads Gemini 3.0 Pro Image on 3 of the tracked benchmarks (GPQA Diamond, SWE-Bench Verified, MMMU). Claude Sonnet 4.5 shipped 52 days before Gemini 3.0 Pro Image, so benchmark comparisons should account for the intervening progress.
Published specifications for these two models are limited — see each model page for the latest details.
Direct benchmark comparisons are unavailable — at least one of these models has not published scores on GPQA Diamond, SWE-Bench Verified, or MMMU.