AI Model Release Tracker - Timeline of Major AI Models from 2022-2026

Home/Compare/Claude Opus 4.1 vs Devstral Small 2 (24B)

Claude Opus 4.1 vs Devstral Small 2 (24B)

2 vs 0 benchmarks won

Anthropic
Claude Opus 4.1
Mistral
Devstral Small 2 (24B)
Overview
CompanyAnthropicMistral
Release dateAug 5 2025Dec 9 2025
Model type
Open sourceNoYes
Specifications
Parameters
Context window
Benchmarks
Science reasoning
GPQA Diamond
80.9%
Software engineering
SWE-Bench Verified
74.5%
Multimodal understanding
MMMU
Timeline
Release gapClaude Opus 4.1 shipped 126 days before Devstral Small 2 (24B)

Which is better: Claude Opus 4.1 or Devstral Small 2 (24B)?

Claude Opus 4.1 leads Devstral Small 2 (24B) on 2 of the tracked benchmarks (GPQA Diamond, SWE-Bench Verified, MMMU). Claude Opus 4.1 shipped 126 days before Devstral Small 2 (24B), so benchmark comparisons should account for the intervening progress.

Devstral Small 2 (24B) is an open-source / open-weight model; Claude Opus 4.1 is proprietary.

Direct benchmark comparisons are unavailable — at least one of these models has not published scores on GPQA Diamond, SWE-Bench Verified, or MMMU.

Frequently asked questions

When was Claude Opus 4.1 released?
Claude Opus 4.1 was released by Anthropic on Aug 5 2025.
When was Devstral Small 2 (24B) released?
Devstral Small 2 (24B) was released by Mistral on Dec 9 2025.
Is Claude Opus 4.1 or Devstral Small 2 (24B) open source?
Devstral Small 2 (24B) is an open-source / open-weight model released by Mistral. Claude Opus 4.1 is a proprietary model released by Anthropic.

Other comparisons