AI Model Release Tracker - Timeline of Major AI Models from 2022-2026

Agentic terminal coding

Terminal-Bench 2.0

Can the AI work in a command-line terminal, running commands and finishing technical setup tasks the way a developer would? This is version 2.0 of the test. Higher is better.

Rankings
