AI Model Release Tracker - Timeline of Major AI Models from 2022-2026

Long context

MRCR v2 (8-needle)128k average

Tests whether the AI can find specific details buried inside a very long document (around 128k tokens — roughly a long book). Higher is better.

Rankings

Higher is better
← All benchmarks