AI Model Release Tracker - Timeline of Major AI Models from 2022-2026

Gemini 3.1 Pro vs GLM-5.1

Google
Gemini 3.1 Pro
Z.ai
GLM-5.1
Overview
CompanyGoogleZ.ai
Release dateFeb 19 2026Apr 7 2026
AccessProprietaryOpen Weight
Specifications
Parameters
744B
Context window
200k
Benchmarks
Nonsense detection
BullshitBench v2
37%
Agentic coding
SWE-Bench Pro
54.2%
Coding
SWE-Bench Verified
80.6%
Agentic terminal coding
Terminal-Bench 2.1
70.3%
Agentic terminal coding
Terminal-Bench 2.0
68.5%Best
63.5%
Multi-step tool use
MCP Atlas
78.2%
General tool use
Toolathlon
48.8%
Web browsing
BrowseComp
85.9%Best
68%
Multidisciplinary reasoning
Humanity's Last Exam · no tools
44.4%
Multidisciplinary reasoning
Humanity's Last Exam · with tools
51.4%
Abstract reasoning
ARC-AGI-2
77.1%
Advanced math
FrontierMath · Tier 1–3
36.9%
Advanced math
FrontierMath · Tier 4
16.7%
Science
GPQA Diamond
94.3%Best
86.2%
Agentic computer use
OSWorld-Verified
76.2%
Agentic financial analysis
Finance Agent v2
43%
Knowledge work
GDPval-AA
1314
Knowledge work
GDPval (win/tie rate)
67.3%
Chart reasoning
CharXiv Reasoning
83.3%
Multimodal reasoning
MMMU-Pro
80.5%
Spatial reasoning
Blueprint-Bench 2
26.5%
Long context
MRCR v2 (8-needle) · 128k average
84.9%
Long context
MRCR v2 (8-needle) · 1M pointwise
26.3%
Timeline
Release gapGemini 3.1 Pro shipped 47 days before GLM-5.1

Which is better: Gemini 3.1 Pro or GLM-5.1?

Gemini 3.1 Pro leads GLM-5.1 on 3 of the 3 benchmarks they both report (Terminal-Bench 2.0, BrowseComp, GPQA Diamond). Gemini 3.1 Pro shipped 47 days before GLM-5.1, so benchmark comparisons should account for the intervening progress.

Gemini 3.1 Pro is proprietary, while GLM-5.1 is open weight.

On Terminal-Bench 2.0, Gemini 3.1 Pro leads at 68.5% vs GLM-5.1 at 63.5%. On BrowseComp, Gemini 3.1 Pro leads at 85.9% vs GLM-5.1 at 68%. On GPQA Diamond, Gemini 3.1 Pro leads at 94.3% vs GLM-5.1 at 86.2%.

Frequently asked questions

Gemini 3.1 Pro was released by Google on Feb 19 2026.

GLM-5.1 was released by Z.ai on Apr 7 2026.

Gemini 3.1 Pro leads on Terminal-Bench 2.0 — Gemini 3.1 Pro 68.5% vs GLM-5.1 63.5%.

Gemini 3.1 Pro leads on GPQA Diamond — Gemini 3.1 Pro 94.3% vs GLM-5.1 86.2%.

Gemini 3.1 Pro is a proprietary model released by Google. GLM-5.1 is an open weight model released by Z.ai.

Other comparisons