GPT-4o vs Claude 4 Sonnet vs Gemini 2.5 Pro

Complete comparison (June 2026): benchmarks, pricing, features, and recommendations.

📅Last updated: June 6, 2026

📋 Overview

Feature	GPT-4o	Claude 4 Sonnet	Gemini 2.5 Pro
Release Date	May 2024	Jun 2025	Mar 2025
Parameters	Undisclosed	Undisclosed	Undisclosed
Context Window	128K tokens	200K tokens	1M tokens
License	Proprietary	Proprietary	Proprietary
Multimodal	✅ Text, Image, Audio	✅ Text, Image	✅ Text, Image, Audio, Video
Function Calling	✅	✅	✅
Streaming	✅	✅	✅

Benchmark	GPT-4o	Claude 4 Sonnet	Gemini 2.5 Pro
MMLU	88.7	89	90 ✓
HumanEval	90.2	92.1 ✓	88.4
Chatbot Arena ELO	1387 ✓	1385	1380
MATH	76.6	78.3	83 ✓
GPQA Diamond	53.6	59.4	65 ✓

Price Point	GPT-4o	Claude 4 Sonnet	Gemini 2.5 Pro
Input / 1M tokens	$2.50	$3.00	$1.25
Output / 1M tokens	$10.00	$15.00	$10.00
Batch Input / 1M	$1.25	$1.50	$0.625
Batch Output / 1M	$5.00	$7.50	$5.00
Free Tier	✅ ChatGPT Free	❌	✅ Gemini Free

GPT-4o

Best balance of quality, speed, and cost. Huge ecosystem of integrations.

Claude 4 Sonnet

Highest HumanEval score. Excels at complex multi-file code tasks.

Gemini 2.5 Pro

1M context window handles massive documents. Strong on GPQA.

DeepSeek V3

Not on this list, but at $0.27/$1.10 per 1M tokens, it crushes on price.

Gemini 2.5 Pro

Only model that natively handles video input alongside text and audio.