Be first to know when a new model drops.Get instant alerts · $4/mo

Home Latest Analytics Pricing Contact

Claude Sonnet 5

Next.js coding

Next.js Evals

Vercel's open eval of how well AI coding agents build and migrate real Next.js apps — measured as the share of tasks the agent completes successfully. Higher is better.

Rankings

Higher is better

Cursor · May 18 2026

Anthropic · Jun 9 2026

Claude Opus 4.8

Anthropic · May 28 2026

Z.ai · Jun 16 2026

OpenAI · Feb 5 2026

OpenAI · Mar 5 2026

OpenAI · Apr 23 2026

Claude Opus 4.6

Anthropic · Feb 5 2026

Google · Feb 19 2026

Cursor · Mar 20 2026

Z.ai · Apr 7 2026

Claude Opus 4.7

Anthropic · Apr 16 2026

Moonshot AI · Jun 12 2026

Google · Nov 18 2025

Cursor · Feb 8 2026

Moonshot AI · Apr 21 2026

Claude Sonnet 4.6

Anthropic · Feb 17 2026

Claude Sonnet 4.5

Anthropic · Sep 29 2025

Moonshot AI · Jan 27 2026

← All benchmarks