1 vs 1 benchmarks won
Anthropic Claude Sonnet 4.6 | Google Gemini 3.0 Flash | |
|---|---|---|
| Overview | ||
| Company | Anthropic | |
| Release date | Feb 17 2026 | Dec 17 2025 |
| Model type | — | — |
| Open source | No | No |
| Specifications | ||
Parameters | — | — |
Context window | — | — |
| Benchmarks | ||
Science reasoning GPQA Diamond | 89.9% | 90.4% |
Software engineering SWE-Bench Verified | 79.6% | 78% |
Multimodal understanding MMMU | — | — |
| Timeline | ||
| Release gap | Gemini 3.0 Flash shipped 62 days before Claude Sonnet 4.6 | |
Claude Sonnet 4.6 and Gemini 3.0 Flash are evenly matched across the benchmarks they both publish. Gemini 3.0 Flash shipped 62 days before Claude Sonnet 4.6, so benchmark comparisons should account for the intervening progress.
Published specifications for these two models are limited — see each model page for the latest details.
On GPQA Diamond, Gemini 3.0 Flash scores 90.4%, 0.5 points above Claude Sonnet 4.6 at 89.9%. On SWE-Bench Verified, Claude Sonnet 4.6 scores 79.6%, 1.6 points above Gemini 3.0 Flash at 78%.