AI Model Release Tracker - Timeline of Major AI Models from 2022-2026

Claude 3.7 Sonnet vs GLM-4.7

Anthropic
Claude 3.7 Sonnet
Z.ai
GLM-4.7
Overview
CompanyAnthropicZ.ai
Release dateFeb 24 2025Dec 22 2025
AccessProprietaryOpen Weight
Specifications
Context window
128k
Benchmarks
Nonsense detection
BullshitBench v2
49%
Coding
SWE-Bench Verified
62.3%
73.8%Best
Agentic terminal coding
Terminal-Bench 2.0
41%
Web browsing
BrowseComp
52%
Multidisciplinary reasoning
Humanity's Last Exam · no tools
24.8%
Multidisciplinary reasoning
Humanity's Last Exam · with tools
42.8%
Science
GPQA Diamond
68%
85.7%Best
Timeline
Release gapClaude 3.7 Sonnet shipped 301 days before GLM-4.7

Which is better: Claude 3.7 Sonnet or GLM-4.7?

GLM-4.7 leads Claude 3.7 Sonnet on 2 of the 2 benchmarks they both report (SWE-Bench Verified, GPQA Diamond). Claude 3.7 Sonnet shipped 301 days before GLM-4.7, so benchmark comparisons should account for the intervening progress.

Claude 3.7 Sonnet is proprietary, while GLM-4.7 is open weight.

On SWE-Bench Verified, GLM-4.7 leads at 73.8% vs Claude 3.7 Sonnet at 62.3%. On GPQA Diamond, GLM-4.7 leads at 85.7% vs Claude 3.7 Sonnet at 68%.

Frequently asked questions

Claude 3.7 Sonnet was released by Anthropic on Feb 24 2025.

GLM-4.7 was released by Z.ai on Dec 22 2025.

GLM-4.7 leads on SWE-Bench Verified — Claude 3.7 Sonnet 62.3% vs GLM-4.7 73.8%.

GLM-4.7 leads on GPQA Diamond — Claude 3.7 Sonnet 68% vs GLM-4.7 85.7%.

Claude 3.7 Sonnet is a proprietary model released by Anthropic. GLM-4.7 is an open weight model released by Z.ai.

Other comparisons