AI Model Release Tracker - Timeline of Major AI Models from 2022-2026

Claude Opus 4.7 vs GLM-5.1

Anthropic
Claude Opus 4.7
Z.ai
GLM-5.1
Overview
Company: Anthropic (Claude Opus 4.7), Z.ai (GLM-5.1)
Release date: Apr 16 2026 (Claude Opus 4.7), Apr 7 2026 (GLM-5.1)
Access: Proprietary (Claude Opus 4.7), Open Weight (GLM-5.1)
Specifications
Parameters
744B
Context window
1M (Claude Opus 4.7)
200k (GLM-5.1)
Benchmarks
Nonsense detection
BullshitBench v2
83%
Agentic coding
SWE-Bench Pro
64.3%
Coding
SWE-Bench Verified
87.6%
Multilingual coding
SWE-Bench Multilingual
80.5%
Agentic coding
CursorBench v3.1
61.6%
Agentic terminal coding
Terminal-Bench 2.1
66.1%
Agentic terminal coding
Terminal-Bench 2.0
69.4% (Claude Opus 4.7, best)
63.5% (GLM-5.1)
Multi-step tool use
MCP Atlas
79.1%
Web browsing
BrowseComp
79.3% (Claude Opus 4.7, best)
68% (GLM-5.1)
Cybersecurity
CyberGym
73.1%
Multidisciplinary reasoning
Humanity's Last Exam · no tools
46.9%
Multidisciplinary reasoning
Humanity's Last Exam · with tools
54.7%
Abstract reasoning
ARC-AGI-2
75.8%
Advanced math
FrontierMath · Tier 1–3
43.8%
Advanced math
FrontierMath · Tier 4
22.9%
Science
GPQA Diamond
94.2% (Claude Opus 4.7, best)
86.2% (GLM-5.1)
Agentic computer use
OSWorld-Verified
78%
Agentic financial analysis
Finance Agent v2
51.5%
Knowledge work
GDPval-AA
1753
Knowledge work
GDPval (win/tie rate)
80.3%
Chart reasoning
CharXiv Reasoning
82.1%
Multimodal reasoning
MMMU-Pro
75.2%
Spatial reasoning
Blueprint-Bench 2
24.5%
Long context
MRCR v2 (8-needle) · 128k average
59.3%
Timeline
Release gap: GLM-5.1 shipped 9 days before Claude Opus 4.7

Which is better: Claude Opus 4.7 or GLM-5.1?

Claude Opus 4.7 leads GLM-5.1 on all 3 benchmarks that both models report (Terminal-Bench 2.0, BrowseComp, GPQA Diamond). GLM-5.1 shipped 9 days before Claude Opus 4.7, so benchmark comparisons should account for that release gap.

Context windows are 1M (Claude Opus 4.7) vs 200k (GLM-5.1). Claude Opus 4.7 is proprietary, while GLM-5.1 is open weight.

On Terminal-Bench 2.0, Claude Opus 4.7 leads at 69.4% vs GLM-5.1 at 63.5%. On BrowseComp, Claude Opus 4.7 leads at 79.3% vs GLM-5.1 at 68%. On GPQA Diamond, Claude Opus 4.7 leads at 94.2% vs GLM-5.1 at 86.2%.
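The leads quoted above can be restated as percentage-point margins, and the 9-day release gap checked with simple date arithmetic. The following is a minimal sketch in Python, not part of the tracker itself. The names `shared_scores`, `margins`, and `release_gap_days` are hypothetical, and the scores and dates are copied from this page.

```python
from datetime import date

# Scores in percent, copied from the comparison above.
# Tuple order is (Claude Opus 4.7, GLM-5.1). The dict name is illustrative.
shared_scores = {
    "Terminal-Bench 2.0": (69.4, 63.5),
    "BrowseComp": (79.3, 68.0),
    "GPQA Diamond": (94.2, 86.2),
}


def margins(scores):
    """Return the first model's lead over the second, in percentage points."""
    return {name: round(a - b, 1) for name, (a, b) in scores.items()}


def release_gap_days(earlier, later):
    """Return the number of days between two release dates."""
    return (later - earlier).days


if __name__ == "__main__":
    for name, gap in margins(shared_scores).items():
        print(f"{name}: Claude Opus 4.7 leads by {gap} points")
    gap_days = release_gap_days(date(2026, 4, 7), date(2026, 4, 16))
    print(f"Release gap: {gap_days} days")
```

These margins are differences in percentage points, not relative improvements, and they cover only the three benchmarks for which both models report a score.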

Frequently asked questions

Claude Opus 4.7 was released by Anthropic on Apr 16 2026.

GLM-5.1 was released by Z.ai on Apr 7 2026.

Claude Opus 4.7 leads on Terminal-Bench 2.0 — Claude Opus 4.7 69.4% vs GLM-5.1 63.5%.

Claude Opus 4.7 leads on GPQA Diamond — Claude Opus 4.7 94.2% vs GLM-5.1 86.2%.

Claude Opus 4.7 has a 1M context window; GLM-5.1 has 200k.

Claude Opus 4.7 is a proprietary model released by Anthropic. GLM-5.1 is an open weight model released by Z.ai.
