AI Model Release Tracker - Timeline of Major AI Models from 2022-2026

Home/Compare/Gemini 3.1 Pro vs Kimi K2 Thinking

Gemini 3.1 Pro vs Kimi K2 Thinking

1 vs 0 benchmarks won

Google
Gemini 3.1 Pro
Moonshot AI
Kimi K2 Thinking
Overview
CompanyGoogleMoonshot AI
Release dateFeb 19 2026Nov 6 2025
Model type
Open sourceNoYes
Specifications
Parameters
1T
Context window
256k
Benchmarks
Agentic coding
SWE-Bench Pro
54.2%
Coding
SWE-Bench Verified
80.6%Best
71.3%
Agentic terminal coding
Terminal-Bench 2.1
70.3%
Agentic terminal coding
Terminal-Bench 2.0
68.5%
Multi-step tool use
MCP Atlas
78.2%
General tool use
Toolathlon
48.8%
Web browsing
BrowseComp
85.9%
Multidisciplinary reasoning
Humanity's Last Exam · no tools
44.4%
Multidisciplinary reasoning
Humanity's Last Exam · with tools
51.4%
Abstract reasoning
ARC-AGI-2
77.1%
Advanced math
FrontierMath · Tier 1–3
36.9%
Advanced math
FrontierMath · Tier 4
16.7%
Science
GPQA Diamond
94.3%
Agentic computer use
OSWorld-Verified
76.2%
Agentic financial analysis
Finance Agent v2
43%
Knowledge work
GDPval-AA
1314
Knowledge work
GDPval (win/tie rate)
67.3%
Chart reasoning
CharXiv Reasoning
83.3%
Multimodal reasoning
MMMU-Pro
80.5%
Spatial reasoning
Blueprint-Bench 2
26.5%
Long context
MRCR v2 (8-needle) · 128k average
84.9%
Long context
MRCR v2 (8-needle) · 1M pointwise
26.3%
Timeline
Release gapKimi K2 Thinking shipped 105 days before Gemini 3.1 Pro

Which is better: Gemini 3.1 Pro or Kimi K2 Thinking?

Gemini 3.1 Pro leads Kimi K2 Thinking on 1 of the 1 benchmark they both report (SWE-Bench Verified). Kimi K2 Thinking shipped 105 days before Gemini 3.1 Pro, so benchmark comparisons should account for the intervening progress.

Kimi K2 Thinking is an open-source / open-weight model; Gemini 3.1 Pro is proprietary.

On SWE-Bench Verified, Gemini 3.1 Pro leads at 80.6% vs Kimi K2 Thinking at 71.3%.

Frequently asked questions

When was Gemini 3.1 Pro released?
Gemini 3.1 Pro was released by Google on Feb 19 2026.
When was Kimi K2 Thinking released?
Kimi K2 Thinking was released by Moonshot AI on Nov 6 2025.
Which is better at coding, Gemini 3.1 Pro or Kimi K2 Thinking?
Gemini 3.1 Pro leads on SWE-Bench Verified — Gemini 3.1 Pro 80.6% vs Kimi K2 Thinking 71.3%.
Is Gemini 3.1 Pro or Kimi K2 Thinking open source?
Kimi K2 Thinking is an open-source / open-weight model released by Moonshot AI. Gemini 3.1 Pro is a proprietary model released by Google.

Other comparisons