| Rank | Model | Provider | Score (0-100) | Samples | Context | Input price / 1M tokens | Output price / 1M tokens |
|---|---|---|---|---|---|---|---|
| 1 | gemini-3-pro | Google | 100.0 | 224 | 1.05M | ¥14.4 | ¥86.4 |
| 2 | gemini-3.1-pro-preview | Google | 96.4 | 86 | 1.05M | ¥14.4 | ¥86.4 |
| 3 | gemini-2.5-pro | Google | 92.9 | 808 | 1.05M | ¥9 | ¥72 |
| 4 | gemini-3-flash | Google | 89.3 | 157 | 1.05M | ¥3.6 | ¥21.6 |
| 5 | gpt-5.2-high | OpenAI | 85.7 | 79 | 400K | ¥12.6 | ¥101 |
| 6 | qwen3-vl-235b-a22b-instruct | Alibaba | 82.1 | 148 | 128K | ¥2.16 | ¥8.64 |
| 7 | gemini-2.5-flash | Google | 78.6 | 596 | 1.05M | ¥2.16 | ¥18 |
| 8 | kimi-k2.5-thinking | Moonshot | 75.0 | 89 | 262K | ¥4.32 | ¥21.6 |
| 9 | gpt-5.1-high | OpenAI | 71.4 | 100 | 400K | ¥9 | ¥72 |
| 10 | grok-4-0709 | xAI | 67.9 | 377 | 256K | ¥21.6 | ¥108 |
| 11 | gemini-3-flash (thinking-minimal) | Google | 64.3 | 138 | 1.05M | ¥3.6 | ¥21.6 |
| 12 | gemma-4-31b | Google | 60.7 | 50 | 262K | ¥3.24 | ¥7.2 |
| 13 | chatgpt-4o-latest-20250326 | OpenAI | 57.1 | 286 | 128K | ¥18 | ¥72 |
| 14 | gpt-5-chat | OpenAI | 53.6 | 403 | 400K | ¥9 | ¥72 |
| 15 | gpt-5-mini-high | OpenAI | 50.0 | 302 | 400K | ¥1.8 | ¥14.4 |
| 16 | o3-2025-04-16 | OpenAI | 46.4 | 564 | 200K | ¥14.4 | ¥57.6 |
| 17 | gpt-4.1-2025-04-14 | OpenAI | 42.9 | 443 | 1.05M | ¥14.4 | ¥57.6 |
| 18 | gpt-5-high | OpenAI | 39.3 | 382 | 400K | ¥9 | ¥72 |
| 19 | o4-mini-2025-04-16 | OpenAI | 35.7 | 442 | 200K | ¥7.92 | ¥31.7 |
| 20 | gpt-5.1 | OpenAI | 32.1 | 125 | 400K | ¥9 | ¥72 |
| 21 | gemini-2.5-flash-lite-preview-06-17-thinking | Google | 28.6 | 406 | 65.5K | ¥0.72 | ¥2.88 |
| 22 | gpt-4.1-mini-2025-04-14 | OpenAI | 25.0 | 410 | 1.05M | ¥2.88 | ¥11.5 |
| 23 | gpt-5.2 | OpenAI | 21.4 | 87 | 400K | ¥12.6 | ¥101 |
| 24 | mistral-medium-2508 | Mistral | 17.9 | 412 | 262K | ¥2.88 | ¥14.4 |
| 25 | mistral-small-3.1-24b-instruct-2503 | Mistral | 14.3 | 281 | 262K | ¥2.88 | ¥14.4 |
| 26 | gemma-3-27b-it | Google | 10.7 | 273 | 128K | ¥2.15 | ¥2.15 |
| 27 | gemini-2.0-flash-001 | Google | 7.1 | 110 | 1.05M | ¥1.08 | ¥4.32 |
| 28 | mistral-medium-2505 | Mistral | 3.6 | 168 | 262K | ¥2.88 | ¥14.4 |
| 29 | mistral-small-2506 | Mistral | 0.0 | 194 | 262K | ¥2.88 | ¥14.4 |

Top model analysis
gemini-3-pro: why it ranks first
gemini-3-pro ranks first with a percent score of 100.0 across 224 samples. Treat it as the default candidate for this leaderboard, then weigh it against the alternatives on price, context length, and availability.
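The page does not state how the percent score is computed. The published values are, however, consistent with a simple linear mapping from rank to percent over the 29 listed models: rank 1 maps to 100.0, rank 29 to 0.0, and each step is worth about 3.57 points. The sketch below reproduces the listed scores under that assumption; the function name `rank_to_percent` is hypothetical and the formula is inferred from the numbers, not confirmed by the source.

```python
def rank_to_percent(rank: int, total: int) -> float:
    """Map a 1-based rank to a 0-100 score, assuming even spacing by rank.

    Assumption: score = 100 * (total - rank) / (total - 1), rounded to
    one decimal. This is inferred from the published values only.
    """
    if total < 2:
        raise ValueError("need at least two ranked models")
    if not 1 <= rank <= total:
        raise ValueError("rank must be between 1 and total")
    return round(100.0 * (total - rank) / (total - 1), 1)


# Spot-check against scores listed on the leaderboard (29 models).
listed = {1: 100.0, 2: 96.4, 3: 92.9, 15: 50.0, 28: 3.6, 29: 0.0}
for rank, score in listed.items():
    assert rank_to_percent(rank, 29) == score
```

If the assumption holds, the score conveys ordinal position only: the 3.6-point gap between two neighbouring models says nothing about how large the underlying quality difference is.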
How to choose
Do not look only at rank #1
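Start with the leaderboard closest to your task. Compare the top models by score and sample size, then check price, context length, open or closed access, and provider availability. Sample size matters because a model with few samples, such as gemini-3.1-pro-preview at 86, has a less settled score than one with many, such as gemini-2.5-pro at 808.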
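One way to apply this checklist is to filter the leaderboard by your own constraints first and only then sort by score. The following is a minimal sketch using the top six rows of the leaderboard; the `Entry` type, the `shortlist` function, and the threshold values are all hypothetical, and the 1.05M context figure is approximated as 1,050,000 tokens.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    model: str
    score: float          # leaderboard score, 0-100
    samples: int          # number of samples behind the score
    context_tokens: int   # approximate context window in tokens
    input_price: float    # ¥ per 1M input tokens
    output_price: float   # ¥ per 1M output tokens


# Top six rows copied from the leaderboard.
ENTRIES = [
    Entry("gemini-3-pro", 100.0, 224, 1_050_000, 14.4, 86.4),
    Entry("gemini-3.1-pro-preview", 96.4, 86, 1_050_000, 14.4, 86.4),
    Entry("gemini-2.5-pro", 92.9, 808, 1_050_000, 9.0, 72.0),
    Entry("gemini-3-flash", 89.3, 157, 1_050_000, 3.6, 21.6),
    Entry("gpt-5.2-high", 85.7, 79, 400_000, 12.6, 101.0),
    Entry("qwen3-vl-235b-a22b-instruct", 82.1, 148, 128_000, 2.16, 8.64),
]


def shortlist(entries, min_samples, min_context, max_output_price):
    """Keep entries that meet every constraint, best score first."""
    kept = [
        e for e in entries
        if e.samples >= min_samples
        and e.context_tokens >= min_context
        and e.output_price <= max_output_price
    ]
    return sorted(kept, key=lambda e: e.score, reverse=True)


# Hypothetical constraints: at least 100 samples, at least 200K tokens
# of context, and at most ¥80 per 1M output tokens.
for entry in shortlist(ENTRIES, 100, 200_000, 80.0):
    print(entry.model, entry.score, entry.output_price)
```

With these example thresholds the rank #1 model drops out on output price and the shortlist starts at gemini-2.5-pro, which illustrates why rank alone is not enough. Access type and provider availability are not listed in the table, so they would need to be checked separately.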
Related leaderboards
Compare adjacent capabilities