AI Model Release Tracker - Timeline of Major AI Models from 2022-2026

Coding

SWE-Bench Verified

Real coding tasks pulled from open-source projects — the AI has to find and fix actual bugs. A human-checked version of the original SWE-Bench. Higher is better.

Rankings

Higher is better
← All benchmarks