AI Model Release Tracker - Timeline of Major AI Models from 2022-2026

Gemini 3.1 Pro vs GLM-4.7

Google
Gemini 3.1 Pro
Z.ai
GLM-4.7
Overview
CompanyGoogleZ.ai
Release dateFeb 19 2026Dec 22 2025
AccessProprietaryOpen Weight
Specifications
Context window
128k
Benchmarks
Nonsense detection
BullshitBench v2
37%
Agentic coding
SWE-Bench Pro
54.2%
Coding
SWE-Bench Verified
80.6%Best
73.8%
Agentic terminal coding
Terminal-Bench 2.1
70.3%
Agentic terminal coding
Terminal-Bench 2.0
68.5%Best
41%
Multi-step tool use
MCP Atlas
78.2%
General tool use
Toolathlon
48.8%
Web browsing
BrowseComp
85.9%Best
52%
Multidisciplinary reasoning
Humanity's Last Exam · no tools
44.4%Best
24.8%
Multidisciplinary reasoning
Humanity's Last Exam · with tools
51.4%Best
42.8%
Abstract reasoning
ARC-AGI-2
77.1%
Advanced math
FrontierMath · Tier 1–3
36.9%
Advanced math
FrontierMath · Tier 4
16.7%
Science
GPQA Diamond
94.3%Best
85.7%
Agentic computer use
OSWorld-Verified
76.2%
Agentic financial analysis
Finance Agent v2
43%
Knowledge work
GDPval-AA
1314
Knowledge work
GDPval (win/tie rate)
67.3%
Chart reasoning
CharXiv Reasoning
83.3%
Multimodal reasoning
MMMU-Pro
80.5%
Spatial reasoning
Blueprint-Bench 2
26.5%
Long context
MRCR v2 (8-needle) · 128k average
84.9%
Long context
MRCR v2 (8-needle) · 1M pointwise
26.3%
Timeline
Release gapGLM-4.7 shipped 59 days before Gemini 3.1 Pro

Which is better: Gemini 3.1 Pro or GLM-4.7?

Gemini 3.1 Pro leads GLM-4.7 on 6 of the 6 benchmarks they both report. GLM-4.7 shipped 59 days before Gemini 3.1 Pro, so benchmark comparisons should account for the intervening progress.

Gemini 3.1 Pro is proprietary, while GLM-4.7 is open weight.

On SWE-Bench Verified, Gemini 3.1 Pro leads at 80.6% vs GLM-4.7 at 73.8%. On Terminal-Bench 2.0, Gemini 3.1 Pro leads at 68.5% vs GLM-4.7 at 41%. On BrowseComp, Gemini 3.1 Pro leads at 85.9% vs GLM-4.7 at 52%. On Humanity's Last Exam · no tools, Gemini 3.1 Pro leads at 44.4% vs GLM-4.7 at 24.8%. On Humanity's Last Exam · with tools, Gemini 3.1 Pro leads at 51.4% vs GLM-4.7 at 42.8%. On GPQA Diamond, Gemini 3.1 Pro leads at 94.3% vs GLM-4.7 at 85.7%.

Frequently asked questions

Gemini 3.1 Pro was released by Google on Feb 19 2026.

GLM-4.7 was released by Z.ai on Dec 22 2025.

Gemini 3.1 Pro leads on SWE-Bench Verified — Gemini 3.1 Pro 80.6% vs GLM-4.7 73.8%.

Gemini 3.1 Pro leads on Humanity's Last Exam · no tools — Gemini 3.1 Pro 44.4% vs GLM-4.7 24.8%.

Gemini 3.1 Pro is a proprietary model released by Google. GLM-4.7 is an open weight model released by Z.ai.

Other comparisons