AI Model Release Tracker - Timeline of Major AI Models from 2022-2026

Multilingual coding

SWE-Bench Multilingual

Like SWE-Bench, but the coding problems span many programming languages rather than Python alone. It tests how well a model's coding ability carries across languages. Higher is better.

Rankings
