AI Model Release Tracker - Timeline of Major AI Models from 2022-2026

Claude Opus 4.7 vs GLM-4.7

Anthropic
Claude Opus 4.7
Z.ai
GLM-4.7
Overview
CompanyAnthropicZ.ai
Release dateApr 16 2026Dec 22 2025
AccessProprietaryOpen Weight
Specifications
Context window
1M
128k
Benchmarks
Nonsense detection
BullshitBench v2
83%
Agentic coding
SWE-Bench Pro
64.3%
Coding
SWE-Bench Verified
87.6%Best
73.8%
Multilingual coding
SWE-Bench Multilingual
80.5%
Agentic coding
CursorBench v3.1
61.6%
Agentic terminal coding
Terminal-Bench 2.1
66.1%
Agentic terminal coding
Terminal-Bench 2.0
69.4%Best
41%
Multi-step tool use
MCP Atlas
79.1%
Web browsing
BrowseComp
79.3%Best
52%
Cybersecurity
CyberGym
73.1%
Multidisciplinary reasoning
Humanity's Last Exam · no tools
46.9%Best
24.8%
Multidisciplinary reasoning
Humanity's Last Exam · with tools
54.7%Best
42.8%
Abstract reasoning
ARC-AGI-2
75.8%
Advanced math
FrontierMath · Tier 1–3
43.8%
Advanced math
FrontierMath · Tier 4
22.9%
Science
GPQA Diamond
94.2%Best
85.7%
Agentic computer use
OSWorld-Verified
78%
Agentic financial analysis
Finance Agent v2
51.5%
Knowledge work
GDPval-AA
1753
Knowledge work
GDPval (win/tie rate)
80.3%
Chart reasoning
CharXiv Reasoning
82.1%
Multimodal reasoning
MMMU-Pro
75.2%
Spatial reasoning
Blueprint-Bench 2
24.5%
Long context
MRCR v2 (8-needle) · 128k average
59.3%
Timeline
Release gapGLM-4.7 shipped 115 days before Claude Opus 4.7

Which is better: Claude Opus 4.7 or GLM-4.7?

Claude Opus 4.7 leads GLM-4.7 on 6 of the 6 benchmarks they both report. GLM-4.7 shipped 115 days before Claude Opus 4.7, so benchmark comparisons should account for the intervening progress.

Context windows are 1M (Claude Opus 4.7) vs 128k (GLM-4.7). Claude Opus 4.7 is proprietary, while GLM-4.7 is open weight.

On SWE-Bench Verified, Claude Opus 4.7 leads at 87.6% vs GLM-4.7 at 73.8%. On Terminal-Bench 2.0, Claude Opus 4.7 leads at 69.4% vs GLM-4.7 at 41%. On BrowseComp, Claude Opus 4.7 leads at 79.3% vs GLM-4.7 at 52%. On Humanity's Last Exam · no tools, Claude Opus 4.7 leads at 46.9% vs GLM-4.7 at 24.8%. On Humanity's Last Exam · with tools, Claude Opus 4.7 leads at 54.7% vs GLM-4.7 at 42.8%. On GPQA Diamond, Claude Opus 4.7 leads at 94.2% vs GLM-4.7 at 85.7%.

Frequently asked questions

Claude Opus 4.7 was released by Anthropic on Apr 16 2026.

GLM-4.7 was released by Z.ai on Dec 22 2025.

Claude Opus 4.7 leads on SWE-Bench Verified — Claude Opus 4.7 87.6% vs GLM-4.7 73.8%.

Claude Opus 4.7 leads on Humanity's Last Exam · no tools — Claude Opus 4.7 46.9% vs GLM-4.7 24.8%.

Claude Opus 4.7 has a 1M context window; GLM-4.7 has 128k.

Claude Opus 4.7 is a proprietary model released by Anthropic. GLM-4.7 is an open weight model released by Z.ai.

Other comparisons