AI Model Release Tracker - Timeline of Major AI Models from 2022-2026

Knowledge work

GDPval-AA

Measures how well an AI model performs economically valuable knowledge work, judged against human experts. Scores are shown as an Elo-style rating, like the one used in chess; higher is better.
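To give a feel for what a gap between two Elo-style ratings means, here is a minimal sketch using the standard chess Elo expected-score formula. It is an assumption that GDPval-AA ratings follow this exact formula and the conventional 400-point scale; the page only says the rating is "like a chess Elo". The function name `expected_score` and the sample ratings are hypothetical.

```python
def expected_score(rating_a: float, rating_b: float, scale: float = 400.0) -> float:
    """Expected score of A against B under the standard chess Elo model.

    Assumption: the benchmark uses the conventional logistic curve with a
    400-point scale. This is the textbook chess formula, not a confirmed
    detail of GDPval-AA.
    """
    return 1.0 / (1.0 + 10.0 ** ((rating_b - rating_a) / scale))


if __name__ == "__main__":
    # Hypothetical ratings, chosen only to illustrate the curve.
    for a, b in [(1500, 1500), (1600, 1500), (1900, 1500)]:
        print(f"{a} vs {b}: expected score {expected_score(a, b):.3f}")
```

Under this model, equal ratings give an expected score of 0.5, and the expected score rises toward 1 as the rating gap grows, which is why a higher rating ranks better.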

Rankings

Higher is better