Long context
MRCR v2 (8-needle)128k average
Tests whether the AI can find specific details buried inside a very long document (around 128k tokens — roughly a long book). Higher is better.
Tests whether the AI can find specific details buried inside a very long document (around 128k tokens — roughly a long book). Higher is better.