When was Claude 3.7 Sonnet released?

Claude 3.7 Sonnet was released by Anthropic on Feb 24 2025.

When was Muse Spark released?

Muse Spark was released by Meta on Apr 8 2026.

Which is better at coding, Claude 3.7 Sonnet or Muse Spark?

Muse Spark leads on SWE-Bench Verified — Claude 3.7 Sonnet 62.3% vs Muse Spark 77.4%.

Which scores higher on GPQA Diamond, Claude 3.7 Sonnet or Muse Spark?

Muse Spark leads on GPQA Diamond — Claude 3.7 Sonnet 68% vs Muse Spark 89.5%.

Claude 3.7 Sonnet vs Muse Spark

	Anthropic Claude 3.7 Sonnet	Meta Muse Spark
Overview
Company	Anthropic	Meta
Release date	Feb 24 2025	Apr 8 2026
Access	Proprietary	Proprietary
Benchmarks
Nonsense detection BullshitBench v2	49%	—
Agentic coding SWE-Bench Pro	—	55%
Coding SWE-Bench Verified	62.3%	77.4% (best)
Agentic coding DeepSWE 1.1	—	10%
Agentic terminal coding Terminal-Bench 2.1	—	67.3%
Multi-step tool use MCP Atlas	—	82.2%
Professional tool use JobBench	—	17%
Personal tool use Toolathlon-Verified	—	49.4%
Multidisciplinary reasoning Humanity's Last Exam · with tools	—	50.4%
Abstract reasoning ARC-AGI-2	—	42.5%
Science GPQA Diamond	68%	89.5% (best)
Agentic computer use OSWorld-Verified	—	53.3%
Chart reasoning CharXiv Reasoning	—	88.9%
Visual reasoning BabyVision	—	39.9%
Multimodal MMMU	—	80.4%
Community preference Arena Elo (Text)	—	1488
Timeline
Release gap	Claude 3.7 Sonnet shipped 408 days before Muse Spark

Which is better: Claude 3.7 Sonnet or Muse Spark?

Muse Spark leads Claude 3.7 Sonnet on both benchmarks that the two models report in common (SWE-Bench Verified and GPQA Diamond). Claude 3.7 Sonnet shipped 408 days before Muse Spark, so any comparison should account for the progress made in between.

Published specifications for these two models are limited — see each model page for the latest details.

On SWE-Bench Verified, Muse Spark leads 77.4% to 62.3%, a margin of 15.1 percentage points. On GPQA Diamond, Muse Spark leads 89.5% to 68%, a margin of 21.5 percentage points.
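
The release gap and the head-to-head tally quoted above can be reproduced with a few lines of Python. This is a minimal sketch, not the method the page itself uses: the variable names and the `scores` layout are assumptions made for illustration, and only four of the table's rows are included to keep it short. Scores and dates are copied from the table; `None` stands for a benchmark a model does not report.

```python
from datetime import date

# Release dates as listed in the comparison table.
claude_release = date(2025, 2, 24)
muse_release = date(2026, 4, 8)

# Subtracting two dates gives a timedelta; .days is the whole-day gap.
gap_days = (muse_release - claude_release).days

# (Claude 3.7 Sonnet, Muse Spark) scores in percent, taken from the table.
# None marks a benchmark that the model does not report.
scores = {
    "SWE-Bench Verified": (62.3, 77.4),
    "GPQA Diamond": (68.0, 89.5),
    "SWE-Bench Pro": (None, 55.0),
    "ARC-AGI-2": (None, 42.5),
}

# Only benchmarks reported by both models can be compared head to head.
shared = {name: pair for name, pair in scores.items() if None not in pair}

# Benchmarks where Muse Spark scores higher, and its lead in percentage points.
muse_wins = [name for name, (claude, muse) in shared.items() if muse > claude]
margins = {name: round(muse - claude, 1) for name, (claude, muse) in shared.items()}

print(f"Release gap: {gap_days} days")
print(f"Muse Spark leads on {len(muse_wins)} of {len(shared)} shared benchmarks")
for name, margin in margins.items():
    print(f"{name}: {margin} points")
```

Filtering to shared benchmarks first matters here because most rows have a score for only one model; counting those rows as wins would overstate the lead.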
