ANALYSIS · LIVE FROM ARTIFICIAL ANALYSIS

Frontier model benchmarks.

Independent evaluations across the Artificial Analysis Intelligence Index, coding, math, MMLU-Pro, GPQA, LiveCodeBench, HLE, and price + throughput — refreshed every 30 minutes from artificialanalysis.ai.

528 models tracked · cached 30 min
AA INTELLIGENCE INDEX · TOP 10

Top by intelligence

01AClaude Opus 4.8 (Adaptive Reasoning, Max Effort)Anthropic2026-05-28
61.4
02OGPT-5.5 (xhigh)OpenAI2026-04-23
60.2
03OGPT-5.5 (high)OpenAI2026-04-23
58.9
04AClaude Opus 4.7 (Adaptive Reasoning, Max Effort)Anthropic2026-04-16
57.3
05GGemini 3.1 Pro PreviewGoogle2026-02-19
57.2
06OGPT-5.4 (xhigh)OpenAI2026-03-05
56.8
07OGPT-5.5 (medium)OpenAI2026-04-23
56.7
08AQwen3.7 MaxAlibaba2026-05-19
56.6
09GGemini 3.5 Flash (high)Google2026-05-19
55.3
10GGemini 3.5 Flash (medium)Google2026-05-19
54.8
AA CODING INDEX · TOP 10

Top by coding

01OGPT-5.5 (xhigh)OpenAI2026-04-23
59.1
02OGPT-5.5 (high)OpenAI2026-04-23
58.5
03OGPT-5.4 (xhigh)OpenAI2026-03-05
57.2
04AClaude Opus 4.8 (Adaptive Reasoning, Max Effort)Anthropic2026-05-28
56.7
05OGPT-5.5 (medium)OpenAI2026-04-23
56.2
06GGemini 3.1 Pro PreviewGoogle2026-02-19
55.5
07OGPT-5.3 Codex (xhigh)OpenAI2026-02-05
53.1
08AClaude Opus 4.7 (Non-reasoning, High Effort)Anthropic2026-04-16
53.1
09AClaude Opus 4.7 (Adaptive Reasoning, Max Effort)Anthropic2026-04-16
52.5
10OGPT-5.5 (low)OpenAI2026-04-23
52.1
AA MATH INDEX · TOP 10

Top by math

01OGPT-5.2 (xhigh)OpenAI2025-12-11
99.0
02OGPT-5 Codex (high)OpenAI2025-09-23
98.7
03GGemini 3 Flash Preview (Reasoning)Google2025-12-17
97.0
04OGPT-5.2 (medium)OpenAI2025-12-11
96.7
05DDeepSeek V3.2 SpecialeDeepSeek2025-12-01
96.7
06XMiMo-V2-Flash (Reasoning)Xiaomi2025-12-16
96.3
07OGPT-5.1 Codex (high)OpenAI2025-11-13
95.7
08GGemini 3 Pro Preview (high)Google2025-11-18
95.7
09ZGLM-4.7 (Reasoning)Z AI2025-12-22
95.0
10KKAT-Coder-Pro V1KwaiKAT2025-11-11
94.7
GPQA · TOP 10

GPQA (graduate-level Q&A)

01GGemini 3.1 Pro PreviewGoogle2026-02-19
0.9
02OGPT-5.5 (xhigh)OpenAI2026-04-23
0.9
03OGPT-5.5 (high)OpenAI2026-04-23
0.9
04OGPT-5.5 (medium)OpenAI2026-04-23
0.9
05AQwen3.7 MaxAlibaba2026-05-19
0.9
06GGemini 3.5 Flash (high)Google2026-05-19
0.9
07GGemini 3.5 Flash (medium)Google2026-05-19
0.9
08AClaude Opus 4.8 (Adaptive Reasoning, Max Effort)Anthropic2026-05-28
0.9
09OGPT-5.4 (xhigh)OpenAI2026-03-05
0.9
10OGPT-5.3 Codex (xhigh)OpenAI2026-02-05
0.9
MMLU-PRO · TOP 10

MMLU-Pro

01GGemini 3 Pro Preview (high)Google2025-11-18
0.9
02GGemini 3 Pro Preview (low)Google2025-11-18
0.9
03AClaude Opus 4.5 (Reasoning)Anthropic2025-11-24
0.9
04GGemini 3 Flash Preview (Reasoning)Google2025-12-17
0.9
05AClaude Opus 4.5 (Non-reasoning)Anthropic2025-11-24
0.9
06GGemini 3 Flash Preview (Non-reasoning)Google2025-12-17
0.9
07AClaude 4.1 Opus (Reasoning)Anthropic2025-08-05
0.9
08AClaude 4.5 Sonnet (Reasoning)Anthropic2025-09-29
0.9
09MMiniMax-M2.1MiniMax2025-12-23
0.9
10OGPT-5.2 (xhigh)OpenAI2025-12-11
0.9
PRICING + THROUGHPUT

Cost-to-intelligence frontier

#Model$ / 1M (blended)ThroughputTTFT
01AClaude Opus 4.8 (Adaptive Reasoning, Max Effort)Anthropic$10.9459 tok/s12.49s
02OGPT-5.5 (xhigh)OpenAI$11.2573 tok/s42.03s
03OGPT-5.5 (high)OpenAI$11.2565 tok/s19.13s
04AClaude Opus 4.7 (Adaptive Reasoning, Max Effort)Anthropic$10.9456 tok/s14.27s
05GGemini 3.1 Pro PreviewGoogle$4.50133 tok/s21.57s
06OGPT-5.4 (xhigh)OpenAI$5.6382 tok/s167.85s
07OGPT-5.5 (medium)OpenAI$11.2567 tok/s4.88s
08AQwen3.7 MaxAlibaba$3.75202 tok/s1.64s
09GGemini 3.5 Flash (high)Google$3.38221 tok/s11.57s
10GGemini 3.5 Flash (medium)Google$3.38207 tok/s11.43s
11KKimi K2.6Kimi$1.7140 tok/s1.33s
12XMiMo-V2.5-ProXiaomi$0.5451 tok/s1.93s
13OGPT-5.3 Codex (xhigh)OpenAI$4.8181 tok/s58.22s
14xGrok 4.3 (high)xAI$1.56130 tok/s23.74s
15AClaude Opus 4.6 (Adaptive Reasoning, Max Effort)Anthropic$10.9450 tok/s11.03s
← AIDB models indexSource · artificialanalysis.ai ↗