AI Model Release Tracker - Timeline of Major AI Models from 2022-2026
General tool use
Toolathlon
Tests how well a model uses everyday real-world tools and apps to complete tasks. Higher is better.
Rankings
Rank  Model             Developer  Released     Score
1     Gemini 3.5 Flash  Google     May 19 2026  56.5%
2     GPT-5.5           OpenAI     Apr 23 2026  55.6%
3     GPT-5.4           OpenAI     Mar 5 2026   54.6%
4     Gemini 3.0 Flash  Google     Dec 17 2025  49.4%
5     Gemini 3.1 Pro    Google     Feb 19 2026  48.8%