RankModelProviderScore (0-100)SamplesContextPrice / 1M tokens
1
A
claude-opus-4-7 Anthropic
100.0
593
1M
¥36 / ¥180Input/Output
2
A
claude-opus-4-7-thinking Anthropic
98.7
688
1M
¥36 / ¥180Input/Output
3
A
claude-opus-4-6 Anthropic
97.5
1.3K
1M
¥36 / ¥180Input/Output
4
A
claude-opus-4-6-thinking Anthropic
96.2
1.2K
1M
¥36 / ¥180Input/Output
5
A
qwen3.7-max-20260517 Alibaba
94.9
213
1M
¥18 / ¥54Input/Output
6
Z
glm-5.1 Zai
93.7
530
200K
¥0 / ¥0Input/Output
7
M
muse-spark Meta
92.4
186
-
-
8
A
claude-sonnet-4-6 Anthropic
91.1
1.6K
1M
¥21.6 / ¥108Input/Output
9
O
gpt-5.5-xhigh (codex-harness) Openai
89.9
494
400K
¥9 / ¥72Input/Output
10
M
kimi-k2.6 Moonshot
88.6
532
262K
¥6.84 / ¥28.8Input/Output
11
O
gpt-5.5-high (codex-harness) Openai
87.3
523
400K
¥9 / ¥72Input/Output
12
A
claude-opus-4-5-20251101-thinking-32k Anthropic
86.1
7.9K
200K
¥108 / ¥540Input/Output
13
G
gemini-3.1-pro-preview Google
84.8
1.3K
1.05M
¥14.4 / ¥86.4Input/Output
14
O
gpt-5.5 (codex-harness) Openai
83.5
530
400K
¥9 / ¥72Input/Output
15
G
gemini-3.5-flash Google
82.3
324
1.05M
¥10.8 / ¥64.8Input/Output
16
A
qwen3.6-max-preview Alibaba
81.0
311
246K
¥9.5 / ¥56.9Input/Output
17
A
claude-opus-4-5-20251101 Anthropic
79.7
8.4K
200K
¥36 / ¥180Input/Output
18
MI
mimo-v2.5-pro Xiaomi
78.5
562
1.05M
¥7.2 / ¥21.6Input/Output
19
O
gpt-5.4-medium (codex-harness) Openai
77.2
165
400K
¥9 / ¥72Input/Output
20
D
deepseek-v4-pro-thinking Deepseek
75.9
507
1M
¥3.13 / ¥6.26Input/Output
21
A
qwen3.6-plus Alibaba
74.7
786
1M
¥3.6 / ¥21.6Input/Output
22
G
gemini-3-pro Google
73.4
13.8K
1.05M
¥14.4 / ¥86.4Input/Output
23
O
gpt-5.4-high (codex-harness) Openai
72.2
160
400K
¥9 / ¥72Input/Output
24
Z
glm-5 Zai
70.9
804
205K
¥7.2 / ¥23Input/Output
25
Z
glm-4.7 Zai
69.6
4.8K
205K
¥0 / ¥0Input/Output
26
MI
mimo-v2-pro Xiaomi
68.4
794
1.05M
¥7.2 / ¥21.6Input/Output
27
G
gemini-3-flash Google
67.1
9.2K
1.05M
¥3.6 / ¥21.6Input/Output
28
O
gpt-5.4-mini-high Openai
65.8
672
400K
¥5.4 / ¥32.4Input/Output
29
MI
mimo-v2.5 Xiaomi
64.6
451
1.05M
¥2.88 / ¥14.4Input/Output
30
O
gpt-5.3-codex (codex-harness) Openai
63.3
394
400K
¥9 / ¥72Input/Output
31
M
kimi-k2.5-thinking Moonshot
62.0
1.6K
262K
¥4.32 / ¥21.6Input/Output
32
O
gpt-5.3-codex (codex-harness) Openai
60.8
360
400K
¥9 / ¥72Input/Output
33
M
kimi-k2.5-instant Moonshot
59.5
589
262K
¥4.32 / ¥21.6Input/Output
34
O
gpt-5.2 Openai
58.2
1.5K
400K
¥12.6 / ¥101Input/Output
35
M
minimax-m2.7 Minimax
57.0
786
205K
¥0 / ¥0Input/Output
36
A
qwen3.5-397b-a17b Alibaba
55.7
1.2K
262K
¥3.1 / ¥18.6Input/Output
37
M
minimax-m2.5 Minimax
54.4
1K
205K
¥0 / ¥0Input/Output
38
M
minimax-m2.1-preview Minimax
53.2
6.8K
205K
¥0 / ¥0Input/Output
39
O
gpt-5-medium Openai
51.9
3.8K
400K
¥9 / ¥72Input/Output
40
G
gemini-3-flash (thinking-minimal) Google
50.6
6.5K
1.05M
¥3.6 / ¥21.6Input/Output
41
O
gpt-5.1-medium Openai
49.4
6.1K
400K
¥9 / ¥72Input/Output
42
A
claude-sonnet-4-5-20250929-thinking-32k Anthropic
48.1
11.3K
200K
¥21.6 / ¥108Input/Output
43
A
claude-opus-4-1-20250805 Anthropic
46.8
8.5K
200K
¥108 / ¥540Input/Output
44
A
claude-sonnet-4-5-20250929 Anthropic
45.6
12.9K
200K
¥21.6 / ¥108Input/Output
45
A
qwen3.5-27b Alibaba
44.3
928
262K
¥2.16 / ¥17.3Input/Output
46
X
grok-4.20-beta-0309-reasoning Xai
43.0
856
2M
¥14.4 / ¥43.2Input/Output
47
D
deepseek-v3.2-thinking Deepseek
41.8
4K
128K
¥2.09 / ¥3.1Input/Output
48
G
gemma-4-31b Google
40.5
371
262K
¥3.24 / ¥7.2Input/Output
49
Z
glm-4.6 Zai
39.2
8.3K
205K
¥4.32 / ¥15.8Input/Output
50
X
grok-4.3 Xai
38.0
447
1M
¥9 / ¥18Input/Output
51
MI
mimo-v2-flash (non-thinking) Xiaomi
36.7
4.1K
262K
¥0.72 / ¥2.16Input/Output
52
O
gpt-5.1 Openai
35.4
10K
400K
¥9 / ¥72Input/Output
53
TE
hunyuan-hy3-preview Tencent
34.2
189
256K
¥0 / ¥0Input/Output
54
G
gemma-4-26b-a4b Google
32.9
202
262K
¥0.94 / ¥2.88Input/Output
55
A
qwen3.5-122b-a10b Alibaba
31.6
990
262K
¥2.88 / ¥23Input/Output
56
MI
mimo-v2-flash (thinking) Xiaomi
30.4
1.2K
262K
¥0.72 / ¥2.16Input/Output
57
O
gpt-5.2-codex Openai
29.1
3.1K
400K
¥12.6 / ¥101Input/Output
58
O
gpt-5.1-codex Openai
27.8
6.2K
400K
¥9 / ¥72Input/Output
59
M
kimi-k2-thinking-turbo Moonshot
26.6
10K
262K
¥17.3 / ¥72Input/Output
60
A
qwen3.5-35b-a3b Alibaba
25.3
251
262K
¥1.8 / ¥14.4Input/Output
61
M
minimax-m2 Minimax
24.1
8.4K
197K
¥0 / ¥0Input/Output
62
A
claude-haiku-4-5-20251001 Anthropic
22.8
11.6K
200K
¥7.2 / ¥36Input/Output
63
D
deepseek-v3.2 Deepseek
21.5
5.2K
128K
¥2.09 / ¥3.1Input/Output
64
A
qwen3.5-flash Alibaba
20.3
196
1M
¥1.24 / ¥12.4Input/Output
65
D
deepseek-v3.2-exp Deepseek
19.0
4.9K
128K
¥0 / ¥0Input/Output
66
A
qwen3-coder-480b-a35b-instruct Alibaba
17.7
10.8K
262K
¥6.2 / ¥24.8Input/Output
67
UNtrinity-large-thinking
-
16.5
197
262K
¥1.8 / ¥6.48Input/Output
68
G
gemini-3.1-flash-lite-preview Google
15.2
1.1K
1.05M
¥1.8 / ¥10.8Input/Output
69
UNKAT-Coder-Pro-V1
-
13.9
1.9K
256K
¥0.22 / ¥8.64Input/Output
70
O
gpt-5.1-codex-mini Openai
12.7
1.4K
400K
¥1.8 / ¥14.4Input/Output
71
X
grok-4-1-fast-reasoning Xai
11.4
5.5K
2M
¥1.44 / ¥3.6Input/Output
72
MA
mistral-large-3 Mistral
10.1
1K
262K
¥3.6 / ¥10.8Input/Output
73
IB
granite-4.1-8b Ibm
8.9
229
131K
¥0.36 / ¥0.72Input/Output
74
X
grok-4.1-thinking Xai
7.6
1.2K
200K
¥14.4 / ¥72Input/Output
75
MA
devstral-2 Mistral
6.3
1.3K
262K
¥2.88 / ¥14.4Input/Output
76
G
gemini-2.5-pro Google
5.1
3.3K
1.05M
¥9 / ¥72Input/Output
77
IA
mercury-2 Inception Ai
3.8
100
128K
¥1.8 / ¥5.4Input/Output
78
X
grok-4-fast-reasoning Xai
2.5
933
2M
¥1.44 / ¥3.6Input/Output
79
X
grok-code-fast-1 Xai
1.3
982
256K
¥1.44 / ¥10.8Input/Output
80
MA
devstral-medium-2507 Mistral
0.0
992
262K
¥2.88 / ¥14.4Input/Output
Top model analysisclaude-opus-4-7 why it ranks first
claude-opus-4-7 ranks first with a percent score of 100.0 and 593 samples. Use it as the first option for this leaderboard, then compare price, context and availability.
How to chooseDo not only look at rank #1
Start with the leaderboard closest to your task. Compare the top models by score and sample size, then check price, context length, open or closed access, and provider availability.
Related leaderboardsCompare adjacent capabilities