Agentic terminal coding
Terminal-Bench 2.1
Can the AI work in a command-line terminal — running commands and finishing technical setup tasks the way a developer would? Higher is better.