AI Model Release Tracker - Timeline of Major AI Models from 2022-2026

Agentic coding

SWE-Bench Pro

Can the AI fix real bugs in real software? It's handed actual problems from open-source projects and has to write code that genuinely solves them. Higher is better.

Rankings
