AI Model Release Tracker - Timeline of Major AI Models from 2022-2026

GPT-5.5 vs GLM-4.7

OpenAI
GPT-5.5
Z.ai
GLM-4.7
Overview
CompanyOpenAIZ.ai
Release dateApr 23 2026Dec 22 2025
AccessProprietaryOpen Weight
Specifications
Context window
1.05M
128k
Benchmarks
Nonsense detection
BullshitBench v2
47%
Agentic coding
SWE-Bench Pro
58.6%
Coding
SWE-Bench Verified
73.8%
Multilingual coding
SWE-Bench Multilingual
77.8%
Agentic coding
CursorBench v3.1
59.2%
Agentic terminal coding
Terminal-Bench 2.1
78.2%
Agentic terminal coding
Terminal-Bench 2.0
82.7%Best
41%
Software engineering
Expert-SWE (Internal)
73.1%
Multi-step tool use
MCP Atlas
75.3%
General tool use
Toolathlon
55.6%
Web browsing
BrowseComp
84.4%Best
52%
Cybersecurity
CyberGym
81.8%
Multidisciplinary reasoning
Humanity's Last Exam · no tools
41.4%Best
24.8%
Multidisciplinary reasoning
Humanity's Last Exam · with tools
52.2%Best
42.8%
Abstract reasoning
ARC-AGI-2
84.6%
Advanced math
FrontierMath · Tier 1–3
51.7%
Advanced math
FrontierMath · Tier 4
35.4%
Science
GPQA Diamond
93.6%Best
85.7%
Agentic computer use
OSWorld-Verified
78.7%
Agentic financial analysis
Finance Agent v2
51.8%
Knowledge work
GDPval-AA
1769
Knowledge work
GDPval (win/tie rate)
84.9%
Chart reasoning
CharXiv Reasoning
84.1%
Multimodal reasoning
MMMU-Pro
81.2%
Spatial reasoning
Blueprint-Bench 2
36.2%
Long context
MRCR v2 (8-needle) · 128k average
94.8%
Timeline
Release gapGLM-4.7 shipped 122 days before GPT-5.5

Which is better: GPT-5.5 or GLM-4.7?

GPT-5.5 leads GLM-4.7 on 5 of the 5 benchmarks they both report (Terminal-Bench 2.0, BrowseComp, Humanity's Last Exam, GPQA Diamond). GLM-4.7 shipped 122 days before GPT-5.5, so benchmark comparisons should account for the intervening progress.

Context windows are 1.05M (GPT-5.5) vs 128k (GLM-4.7). GPT-5.5 is proprietary, while GLM-4.7 is open weight.

On Terminal-Bench 2.0, GPT-5.5 leads at 82.7% vs GLM-4.7 at 41%. On BrowseComp, GPT-5.5 leads at 84.4% vs GLM-4.7 at 52%. On Humanity's Last Exam · no tools, GPT-5.5 leads at 41.4% vs GLM-4.7 at 24.8%. On Humanity's Last Exam · with tools, GPT-5.5 leads at 52.2% vs GLM-4.7 at 42.8%. On GPQA Diamond, GPT-5.5 leads at 93.6% vs GLM-4.7 at 85.7%.

Frequently asked questions

GPT-5.5 was released by OpenAI on Apr 23 2026.

GLM-4.7 was released by Z.ai on Dec 22 2025.

GPT-5.5 leads on Terminal-Bench 2.0 — GPT-5.5 82.7% vs GLM-4.7 41%.

GPT-5.5 leads on Humanity's Last Exam · no tools — GPT-5.5 41.4% vs GLM-4.7 24.8%.

GPT-5.5 has a 1.05M context window; GLM-4.7 has 128k.

GPT-5.5 is a proprietary model released by OpenAI. GLM-4.7 is an open weight model released by Z.ai.

Other comparisons