Chat · Text · Business, Management & Finance Leaderboard

Ranking for Text / Business, Management & Finance, based on public preference data.

Selection guide

Business, Management & Finance model ranking guide

Ranking for Text / Business, Management & Finance, based on public preference data.

claude-opus-4-6claude-opus-4-6-thinkingclaude-opus-4-7-thinkinggpt-5.5-highgpt-5.5
Current DirectoryChat · Text · Business, Management & Finance
Models353
Published2026/05/27
Arena public preference evaluationOriginal leaderboard: Text / Industry Business And Management And Financial OperationsPublished: 2026/05/27Leaderboard dataset: LMArena latest parquetOpen Arena sourceOpen leaderboard dataset
1
claude-opus-4-6
Anthropic
100.0
7.1K
1M
¥36 / ¥180Input/Output
2
claude-opus-4-6-thinking
Anthropic
99.7
6.6K
1M
¥36 / ¥180Input/Output
3
claude-opus-4-7-thinking
Anthropic
99.4
3.9K
1M
¥36 / ¥180Input/Output
4
gpt-5.5-high
Openai
99.1
3.2K
1.05M
¥36 / ¥216Input/Output
5
gpt-5.5
Openai
98.9
3.2K
1.05M
¥36 / ¥216Input/Output
6
claude-opus-4-7
Anthropic
98.6
4.1K
1M
¥36 / ¥180Input/Output
7
gpt-5.4-high
Openai
98.3
5.5K
1.05M
¥18 / ¥108Input/Output
8
ernie-5.1
Baidu
98.0
2.8K
119K
¥5.4 / ¥21.6Input/Output
9
gpt-5.4
Openai
97.7
5.9K
1.05M
¥18 / ¥108Input/Output
10
muse-spark
Meta
97.4
2.4K
-
-
11
gemini-3.1-pro-preview
Google
97.2
8.4K
1.05M
¥14.4 / ¥86.4Input/Output
12
gemini-3-pro
Google
96.9
7.6K
1.05M
¥14.4 / ¥86.4Input/Output
13
qwen3.5-max-preview
Alibaba
96.6
4K
-
-
14
claude-sonnet-4-6
Anthropic
96.3
5.3K
1M
¥21.6 / ¥108Input/Output
15
mimo-v2.5-pro
Xiaomi
96.0
3.1K
1.05M
¥7.2 / ¥21.6Input/Output
16
gemini-3.5-flash
Google
95.7
1.8K
1.05M
¥10.8 / ¥64.8Input/Output
17
amazon-nova-experimental-chat-26-02-10
Amazon
95.5
602
-
-
18
gemini-3-flash
Google
95.2
5.6K
1.05M
¥3.6 / ¥21.6Input/Output
19
kimi-k2.6
Moonshot
94.9
3.1K
262K
¥6.84 / ¥28.8Input/Output
20
qwen3.7-max-preview
Alibaba
94.6
838
1M
¥18 / ¥54Input/Output
21
claude-opus-4-5-20251101
Anthropic
94.3
12.5K
200K
¥36 / ¥180Input/Output
22
glm-5.1
Zai
94.0
2.8K
200K
¥0 / ¥0Input/Output
23
gemini-2.5-pro
Google
93.8
22K
1.05M
¥9 / ¥72Input/Output
24
claude-sonnet-4-5-20250929
Anthropic
93.5
14.4K
200K
¥21.6 / ¥108Input/Output
25
dola-seed-2.0-pro
Bytedance
93.2
7.1K
-
-
26
mimo-v2-pro
Xiaomi
92.9
4.4K
1.05M
¥7.2 / ¥21.6Input/Output
27
qwen3-max-preview
Alibaba
92.6
5.2K
262K
¥6.2 / ¥24.8Input/Output
28
deepseek-v4-pro
Deepseek
92.3
3.3K
1M
¥3.13 / ¥6.26Input/Output
29
qwen3.6-plus
Alibaba
92.0
3.7K
1M
¥3.6 / ¥21.6Input/Output
30
gemma-4-26b-a4b
Google
91.8
1K
262K
¥0.94 / ¥2.88Input/Output
31
grok-4.20-beta-0309-reasoning
Xai
91.5
5.7K
2M
¥14.4 / ¥43.2Input/Output
32
qwen3.5-397b-a17b
Alibaba
91.2
6.1K
262K
¥3.1 / ¥18.6Input/Output
33
gpt-5.2-chat-latest-20260210
Openai
90.9
6.2K
400K
¥12.6 / ¥101Input/Output
34
claude-opus-4-5-20251101-thinking-32k
Anthropic
90.6
6.9K
200K
¥108 / ¥540Input/Output
35
deepseek-v4-pro-thinking
Deepseek
90.3
3.2K
1M
¥3.13 / ¥6.26Input/Output
36
qwen3-vl-235b-a22b-instruct
Alibaba
90.1
2.2K
128K
¥2.16 / ¥8.64Input/Output
37
gpt-5.1-high
Openai
89.8
7.6K
400K
¥9 / ¥72Input/Output
38
ernie-5.0-0110
Baidu
89.5
6.5K
128K
¥7.92 / ¥14.4Input/Output
39
claude-sonnet-4-5-20250929-thinking-32k
Anthropic
89.2
14.7K
200K
¥21.6 / ¥108Input/Output
40
mimo-v2.5
Xiaomi
88.9
3.2K
1.05M
¥2.88 / ¥14.4Input/Output
41
glm-4.7
Zai
88.6
2.2K
205K
¥0 / ¥0Input/Output
42
glm-5
Zai
88.4
4.3K
205K
¥7.2 / ¥23Input/Output
43
longcat-flash-chat
Meituan
88.1
2.2K
128K
¥1.08 / ¥10.8Input/Output
44
gemini-3-flash (thinking-minimal)
Google
87.8
10.2K
1.05M
¥3.6 / ¥21.6Input/Output
45
qwen3.6-max-preview
Alibaba
87.5
883
246K
¥9.5 / ¥56.9Input/Output
46
kimi-k2.5-thinking
Moonshot
87.2
7K
262K
¥4.32 / ¥21.6Input/Output
47
grok-4.20-beta1
Xai
86.9
4.6K
2M
¥14.4 / ¥43.2Input/Output
48
gemma-4-31b
Google
86.6
1.1K
262K
¥3.24 / ¥7.2Input/Output
49
grok-4.1
Xai
86.4
12.3K
200K
¥14.4 / ¥72Input/Output
50
chatgpt-4o-latest-20250326
Openai
86.1
13.6K
128K
¥18 / ¥72Input/Output
51
mistral-large-3
Mistral
85.8
7.9K
262K
¥3.6 / ¥10.8Input/Output
52
amazon-nova-experimental-chat-12-10
Amazon
85.5
594
-
-
53
grok-4.20-multi-agent-beta-0309
Xai
85.2
5.5K
2M
¥14.4 / ¥43.2Input/Output
54
qwen3-next-80b-a3b-instruct
Alibaba
84.9
4.3K
131K
¥1.04 / ¥4.13Input/Output
55
ernie-5.0-preview-1203
Baidu
84.7
1.9K
128K
¥7.92 / ¥14.4Input/Output
56
glm-4.5
Zai
84.4
4.3K
131K
¥4.32 / ¥15.8Input/Output
57
mimo-v2-omni
Xiaomi
84.1
595
262K
¥2.88 / ¥14.4Input/Output
58
gpt-5.1
Openai
83.8
8.1K
400K
¥9 / ¥72Input/Output
59
gpt-5.4-mini-high
Openai
83.5
5.3K
400K
¥5.4 / ¥32.4Input/Output
60
mistral-medium-2508
Mistral
83.2
17K
262K
¥2.88 / ¥14.4Input/Output
61
amazon-nova-experimental-chat-11-10
Amazon
83.0
4.7K
-
-
62
glm-4.6
Zai
82.7
6.8K
205K
¥4.32 / ¥15.8Input/Output
63
grok-4.1-thinking
Xai
82.4
12.1K
200K
¥14.4 / ¥72Input/Output
64
qwen3-235b-a22b-instruct-2507
Alibaba
82.1
17.5K
128K
¥2.09 / ¥8.23Input/Output
65
ernie-5.0-preview-1022
Baidu
81.8
921
128K
¥7.92 / ¥14.4Input/Output
66
gpt-5.2-high
Openai
81.5
9K
400K
¥12.6 / ¥101Input/Output
67
longcat-flash-chat-2602-exp
Meituan
81.3
4.6K
128K
¥1.08 / ¥10.8Input/Output
68
qwen3.5-122b-a10b
Alibaba
81.0
5K
262K
¥2.88 / ¥23Input/Output
69
deepseek-v3.2-exp
Deepseek
80.7
2.2K
128K
¥0 / ¥0Input/Output
70
hunyuan-vision-1.5-thinking
Tencent
80.4
472
-
-
71
deepseek-v3.1-terminus-thinking
Deepseek
80.1
699
128K
¥1.8 / ¥5.04Input/Output
72
deepseek-v4-flash-thinking
Deepseek
79.8
3.3K
1M
¥1.01 / ¥2.02Input/Output
73
deepseek-v4-flash
Deepseek
79.5
3.3K
1M
¥1.01 / ¥2.02Input/Output
74
qwen3-235b-a22b-thinking-2507
Alibaba
79.3
1.5K
131K
¥2.07 / ¥8.26Input/Output
75
gpt-5.2
Openai
79.0
9K
400K
¥12.6 / ¥101Input/Output
76
deepseek-v3.2
Deepseek
78.7
8.5K
128K
¥2.09 / ¥3.1Input/Output
77
grok-3-preview-02-24
Xai
78.4
4K
1M
¥9 / ¥18Input/Output
78
gpt-5.5-instant
Openai
78.1
5.2K
400K
¥9 / ¥72Input/Output
79
deepseek-v3.1-thinking
Deepseek
77.8
1.9K
128K
¥1.44 / ¥5.04Input/Output
80
deepseek-v3.2-thinking
Deepseek
77.6
7.7K
128K
¥2.09 / ¥3.1Input/Output
81
deepseek-v3.1
Deepseek
77.3
2.5K
128K
¥1.44 / ¥5.04Input/Output
82
kimi-k2.5-instant
Moonshot
77.0
1.5K
262K
¥4.32 / ¥21.6Input/Output
83
claude-opus-4-1-20250805
Anthropic
76.7
13.9K
200K
¥108 / ¥540Input/Output
84
minimax-m2.7
Minimax
76.4
4.6K
205K
¥0 / ¥0Input/Output
85
claude-opus-4-1-20250805-thinking-16k
Anthropic
76.1
9.1K
200K
¥108 / ¥540Input/Output
86
gemini-2.5-flash
Google
75.9
21.8K
1.05M
¥2.16 / ¥18Input/Output
87
amazon-nova-experimental-chat-10-20
Amazon
75.6
2.1K
-
-
88
mimo-v2-flash (non-thinking)
Xiaomi
75.3
8.4K
262K
¥0.72 / ¥2.16Input/Output
89
deepseek-r1-0528
Deepseek
75.0
2.9K
164K
¥3.6 / ¥15.5Input/Output
90
gpt-5-chat
Openai
74.7
5.6K
400K
¥9 / ¥72Input/Output
91
hunyuan-hy3-preview
Tencent
74.4
1.2K
256K
¥0 / ¥0Input/Output
92
kimi-k2-thinking-turbo
Moonshot
74.1
11.4K
262K
¥17.3 / ¥72Input/Output
93
gemini-2.5-flash-preview-09-2025
Google
73.9
6.1K
1M
¥2.16 / ¥18Input/Output
94
qwen3.5-27b
Alibaba
73.6
4.9K
262K
¥2.16 / ¥17.3Input/Output
95
qwen3-max-2025-09-23
Alibaba
73.3
1.8K
258K
¥6.19 / ¥24.7Input/Output
96
claude-haiku-4-5-20251001
Anthropic
73.0
14.8K
200K
¥7.2 / ¥36Input/Output
97
qwen3.5-flash
Alibaba
72.7
5.7K
1M
¥1.24 / ¥12.4Input/Output
98
qwen3.5-35b-a3b
Alibaba
72.4
5.4K
262K
¥1.8 / ¥14.4Input/Output
99
deepseek-v3.2-exp-thinking
Deepseek
72.2
1.8K
128K
¥0 / ¥0Input/Output
100
step-3.5-flash
Stepfun
71.9
6.5K
256K
¥0.69 / ¥2.07Input/Output
101
amazon-nova-experimental-chat-26-01-10
Amazon
71.6
603
-
-
102
grok-4-fast-reasoning
Xai
71.3
3.7K
2M
¥1.44 / ¥3.6Input/Output
103
o3-2025-04-16
Openai
71.0
9.6K
200K
¥14.4 / ¥57.6Input/Output
104
qwen3-vl-235b-a22b-thinking
Alibaba
70.7
1.6K
131K
¥2.06 / ¥8.26Input/Output
105
qwen3-235b-a22b-no-thinking
Alibaba
70.5
6.4K
131K
¥2.07 / ¥8.26Input/Output
106
gemini-3.1-flash-lite-preview
Google
70.2
6.7K
1.05M
¥1.8 / ¥10.8Input/Output
107
minimax-m2.1-preview
Minimax
69.9
3.1K
205K
¥0 / ¥0Input/Output
108
grok-4.3
Xai
69.6
3.2K
1M
¥9 / ¥18Input/Output
109
qwen3-30b-a3b-instruct-2507
Alibaba
69.3
4.2K
262K
¥2.16 / ¥3.6Input/Output
110
grok-4-0709
Xai
69.0
7.3K
256K
¥21.6 / ¥108Input/Output
111
mimo-v2-flash (thinking)
Xiaomi
68.8
2.1K
262K
¥0.72 / ¥2.16Input/Output
112
deepseek-v3.1-terminus
Deepseek
68.5
740
128K
¥1.8 / ¥5.04Input/Output
113
gpt-4.5-preview-2025-02-27
Openai
68.2
1.4K
8.19K
¥216 / ¥432Input/Output
114
grok-4-1-fast-reasoning
Xai
67.9
10.3K
2M
¥1.44 / ¥3.6Input/Output
115
grok-4-fast-chat
Xai
67.6
1.2K
2M
¥1.44 / ¥3.6Input/Output
116
hunyuan-turbos-20250416
Tencent
67.3
1.4K
131K
¥0 / ¥0Input/Output
117
gpt-5.3-chat-latest
Openai
67.0
6K
128K
¥12.6 / ¥101Input/Output
118
gpt-5-high
Openai
66.8
5.5K
400K
¥9 / ¥72Input/Output
119
amazon-nova-experimental-chat-10-09
Amazon
66.5
579
-
-
120
gemma-3-12b-it
Google
66.2
364
128K
¥1.96 / ¥1.96Input/Output
121
hunyuan-t1-20250711
Tencent
65.9
770
131K
¥0 / ¥0Input/Output
122
glm-4.5-air
Zai
65.6
5.5K
131K
¥0 / ¥0Input/Output
123
gpt-4.1-2025-04-14
Openai
65.3
8.3K
1.05M
¥14.4 / ¥57.6Input/Output
124
nova-2-lite
Amazon
65.1
2.3K
128K
¥2.38 / ¥19.8Input/Output
125
gemini-2.5-flash-lite-preview-09-2025-no-thinking
Google
64.8
9K
1.05M
¥0.72 / ¥2.88Input/Output
126
nvidia-nemotron-3-super-120b-a12b
Nvidia
64.5
1.4K
262K
¥1.44 / ¥5.76Input/Output
127
gpt-5.4-nano-high
Openai
64.2
5.1K
400K
¥1.44 / ¥9Input/Output
128
qwen3-next-80b-a3b-thinking
Alibaba
63.9
2.5K
131K
¥1.04 / ¥10.3Input/Output
129
gemma-3-27b-it
Google
63.6
6.6K
128K
¥2.15 / ¥2.15Input/Output
130
ling-flash-2.0
Ant Group
63.4
1.3K
131K
¥1.01 / ¥4.1Input/Output
131
gpt-5-mini-high
Openai
63.1
4.8K
400K
¥1.8 / ¥14.4Input/Output
132
gpt-oss-120b
Openai
62.8
5.4K
131K
¥1.08 / ¥4.32Input/Output
133
glm-4.6v
Zai
62.5
540
128K
¥2.16 / ¥6.48Input/Output
134
minimax-m2.5
Minimax
62.2
6.9K
205K
¥0 / ¥0Input/Output
135
gemini-2.5-flash-lite-preview-06-17-thinking
Google
61.9
5.5K
65.5K
¥0.72 / ¥2.88Input/Output
136
kimi-k2-0905-preview
Moonshot
61.6
2.1K
262K
¥4.32 / ¥18Input/Output
137
mistral-medium-2505
Mistral
61.4
5.1K
262K
¥2.88 / ¥14.4Input/Output
138
deepseek-v3-0324
Deepseek
61.1
6.9K
75K
¥1.44 / ¥5.76Input/Output
139
qwen2.5-max
Alibaba
60.8
3.7K
32K
¥11.5 / ¥46Input/Output
140
mercury-2
Inception Ai
60.5
478
128K
¥1.8 / ¥5.4Input/Output
141
qwen3-235b-a22b
Alibaba
60.2
4K
131K
¥2.07 / ¥8.26Input/Output
142
intellect-3
-
59.9
985
131K
¥1.44 / ¥7.92Input/Output
143
deepseek-r1
Deepseek
59.7
1.8K
164K
¥5.04 / ¥18Input/Output
144
nvidia-nemotron-3-nano-30b-a3b-bf16
Nvidia
59.4
2.9K
131K
¥0 / ¥0Input/Output
145
glm-4.7-flash
Zai
59.1
2.2K
200K
¥0 / ¥0Input/Output
146
grok-3-mini-beta
Xai
58.8
3.6K
1M
¥9 / ¥18Input/Output
147
claude-opus-4-20250514-thinking-16k
Anthropic
58.5
6.1K
200K
¥108 / ¥540Input/Output
148
qwen3-coder-480b-a35b-instruct
Alibaba
58.2
4.4K
262K
¥6.2 / ¥24.8Input/Output
149
step-3
Stepfun
58.0
1.2K
65.5K
¥1.8 / ¥4.68Input/Output
150
grok-3-mini-high
Xai
57.7
2.8K
128K
¥0 / ¥0Input/Output
151
kimi-k2-0711-preview
Moonshot
57.4
4.6K
131K
¥4.32 / ¥18Input/Output
152
minimax-m2
Minimax
57.1
1.3K
197K
¥0 / ¥0Input/Output
153
gemini-2.0-flash-001
Google
56.8
5.3K
1.05M
¥1.08 / ¥4.32Input/Output
154
claude-opus-4-20250514
Anthropic
56.5
7.2K
200K
¥108 / ¥540Input/Output
155
gemini-2.0-flash-lite-preview-02-05
Google
56.3
2.4K
1.05M
¥0.54 / ¥2.16Input/Output
156
trinity-large-thinking
-
56.0
4.7K
262K
¥1.8 / ¥6.48Input/Output
157
o1-2024-12-17
Openai
55.7
2.7K
128K
¥108 / ¥432Input/Output
158
step-1o-turbo-202506
Stepfun
55.4
1.4K
-
-
159
mistral-small-2506
Mistral
55.1
2.9K
262K
¥2.88 / ¥14.4Input/Output
160
glm-4-plus-0111
Zai
54.8
592
128K
¥72 / ¥72Input/Output
161
ring-flash-2.0
Ant Group
54.5
1.3K
131K
¥1.01 / ¥4.1Input/Output
162
o4-mini-2025-04-16
Openai
54.3
7.1K
200K
¥7.92 / ¥31.7Input/Output
163
hunyuan-turbos-20250226
Tencent
54.0
263
131K
¥0 / ¥0Input/Output
164
trinity-large-preview
-
53.7
5.4K
262K
¥1.8 / ¥6.48Input/Output
165
gpt-4.1-mini-2025-04-14
Openai
53.4
6.1K
1.05M
¥2.88 / ¥11.5Input/Output
166
qwq-32b
Alibaba
53.1
3.5K
131K
¥2.07 / ¥6.2Input/Output
167
minimax-m1
Minimax
52.8
6.1K
1M
¥0.95 / ¥9.03Input/Output
168
qwen3-32b
Alibaba
52.6
437
131K
¥2.07 / ¥8.26Input/Output
169
claude-sonnet-4-20250514
Anthropic
52.3
6.7K
200K
¥21.6 / ¥108Input/Output
170
olmo-3.1-32b-instruct
Allenai
52.0
2.2K
200K
¥14.4 / ¥57.6Input/Output
171
command-a-03-2025
Cohere
51.7
8.8K
256K
¥18 / ¥72Input/Output
172
claude-sonnet-4-20250514-thinking-32k
Anthropic
51.4
5.9K
200K
¥21.6 / ¥108Input/Output
173
o3-mini-high
Openai
51.1
1.8K
200K
¥7.92 / ¥31.7Input/Output
174
gemma-3-4b-it
Google
50.9
390
128K
¥1.44 / ¥1.44Input/Output
175
nvidia-llama-3.3-nemotron-super-49b-v1.5
Nvidia
50.6
599
131K
¥2.88 / ¥2.88Input/Output
176
gpt-5-nano-high
Openai
50.3
1.5K
400K
¥0.36 / ¥2.88Input/Output
177
glm-4.5v
Zai
50.0
921
64K
¥4.32 / ¥13Input/Output
178
hunyuan-turbo-0110
Tencent
49.7
256
-
-
179
qwen3-30b-a3b
Alibaba
49.4
4K
128K
¥0.79 / ¥7.78Input/Output
180
deepseek-v3
Deepseek
49.1
2.2K
128K
¥0 / ¥0Input/Output
181
o1-preview
Openai
48.9
3.8K
128K
¥108 / ¥432Input/Output
182
qwen-plus-0125
Alibaba
48.6
571
1M
¥0.83 / ¥2.07Input/Output
183
llama-3.1-nemotron-ultra-253b-v1
Nvidia
48.3
254
128K
¥4.32 / ¥13Input/Output
184
mercury
Inception Ai
48.0
351
128K
¥1.8 / ¥5.4Input/Output
185
gemma-3n-e4b-it
Google
47.7
3.3K
128K
¥0 / ¥0Input/Output
186
olmo-3-32b-think
Allenai
47.4
1.1K
128K
¥2.16 / ¥3.24Input/Output
187
qwen2.5-plus-1127
Alibaba
47.2
1.1K
-
-
188
o3-mini
Openai
46.9
7.7K
200K
¥7.92 / ¥31.7Input/Output
189
o1-mini
Openai
46.6
6.1K
128K
¥7.92 / ¥31.7Input/Output
190
llama-3.3-nemotron-49b-super-v1
Nvidia
46.3
241
131K
¥0 / ¥0Input/Output
191
gemini-1.5-pro-002
Google
46.0
6.4K
-
-
192
gpt-oss-20b
Openai
45.7
1.8K
131K
¥0.32 / ¥1.3Input/Output
193
step-2-16k-exp-202412
Stepfun
45.5
506
16.4K
¥37.5 / ¥118Input/Output
194
granite-4.1-8b
Ibm
45.2
587
131K
¥0.36 / ¥0.72Input/Output
195
athene-v2-chat
-
44.9
2.7K
-
-
196
claude-3-7-sonnet-20250219
Anthropic
44.6
5.8K
200K
¥21.6 / ¥108Input/Output
197
glm-4-plus
Zai
44.3
3.3K
128K
¥54 / ¥54Input/Output
198
yi-lightning
-
44.0
3.4K
12K
¥1.44 / ¥1.44Input/Output
199
claude-3-7-sonnet-20250219-thinking-32k
Anthropic
43.8
5.6K
-
-
200
gpt-4.1-nano-2025-04-14
Openai
43.5
651
1.05M
¥14.4 / ¥57.6Input/Output
201
grok-2-2024-08-13
Xai
43.2
7.5K
1M
¥9 / ¥18Input/Output
202
gpt-4o-mini-2024-07-18
Openai
42.9
7.8K
128K
¥1.08 / ¥4.32Input/Output
203
gemini-1.5-flash-002
Google
42.6
4.3K
2M
¥0.54 / ¥2.2Input/Output
204
llama-3.1-nemotron-70b-instruct
Nvidia
42.3
904
128K
¥0 / ¥0Input/Output
205
gpt-4o-2024-05-13
Openai
42.0
13.3K
128K
¥36 / ¥108Input/Output
206
llama-4-maverick-17b-128e-instruct
Meta
41.8
6.2K
1M
¥1.8 / ¥6.26Input/Output
207
deepseek-v2.5-1210
Deepseek
41.5
715
1M
¥1.01 / ¥2.02Input/Output
208
llama-4-scout-17b-16e-instruct
Meta
41.2
5K
128K
¥1.44 / ¥5.62Input/Output
209
mistral-small-3.1-24b-instruct-2503
Mistral
40.9
5.6K
262K
¥2.88 / ¥14.4Input/Output
210
qwen-max-0919
Alibaba
40.6
2.2K
131K
¥2.48 / ¥9.91Input/Output
211
hunyuan-standard-2025-02-10
Tencent
40.3
376
-
-
212
claude-3-5-sonnet-20241022
Anthropic
40.1
11.3K
200K
¥21.6 / ¥108Input/Output
213
grok-2-mini-2024-08-13
Xai
39.8
6.1K
1M
¥9 / ¥18Input/Output
214
olmo-3.1-32b-think
Allenai
39.5
1.5K
200K
¥14.4 / ¥57.6Input/Output
215
qwen2.5-72b-instruct
Alibaba
39.2
4.7K
131K
¥4.13 / ¥12.4Input/Output
216
gemini-1.5-pro-001
Google
38.9
9.3K
-
-
217
deepseek-v2.5
Deepseek
38.6
2.8K
1M
¥1.01 / ¥2.02Input/Output
218
llama-3.1-405b-instruct-fp8
Meta
38.4
6.7K
128K
¥0 / ¥0Input/Output
219
gemini-advanced-0514
Google
38.1
6.1K
-
-
220
athene-70b-0725
-
37.8
2.1K
-
-
221
llama-3.1-405b-instruct-bf16
Meta
37.5
4.5K
128K
¥0 / ¥0Input/Output
222
llama-3.3-70b-instruct
Meta
37.2
6.7K
128K
¥0 / ¥0Input/Output
223
hunyuan-large-2025-02-10
Tencent
36.9
357
-
-
224
mistral-large-2411
Mistral
36.6
3K
128K
¥14.4 / ¥43.2Input/Output
225
llama-3.1-70b-instruct
Meta
36.4
6.4K
131K
¥2.88 / ¥2.88Input/Output
226
claude-3-5-sonnet-20240620
Anthropic
36.1
9.7K
200K
¥21.6 / ¥108Input/Output
227
gpt-4o-2024-08-06
Openai
35.8
5.1K
128K
¥18 / ¥72Input/Output
228
hunyuan-large-vision
Tencent
35.5
798
-
-
229
amazon-nova-pro-v1.0
Amazon
35.2
2.6K
300K
¥5.76 / ¥23Input/Output
230
llama-3.1-tulu-3-70b
Allenai
34.9
288
-
-
231
mistral-large-2407
Mistral
34.7
5.2K
131K
¥14.4 / ¥43.2Input/Output
232
gemini-1.5-flash-001
Google
34.4
7.3K
2M
¥0.54 / ¥2.2Input/Output
233
qwen2.5-coder-32b-instruct
Alibaba
34.1
629
131K
¥2.07 / ¥6.2Input/Output
234
magistral-medium-2506
Mistral
33.8
2.2K
128K
¥14.4 / ¥36Input/Output
235
claude-3-5-haiku-20241022
Anthropic
33.5
9.2K
200K
¥5.76 / ¥28.8Input/Output
236
gpt-4-turbo-2024-04-09
Openai
33.2
10.8K
128K
¥72 / ¥216Input/Output
237
gemma-2-9b-it-simpo
-
33.0
1.1K
8.19K
¥1.44 / ¥1.44Input/Output
238
claude-3-opus-20240229
Anthropic
32.7
21.7K
200K
¥108 / ¥540Input/Output
239
amazon-nova-lite-v1.0
Amazon
32.4
2.1K
300K
¥0.43 / ¥1.73Input/Output
240
reka-core-20240904
-
32.1
837
-
-
241
gpt-4-0125-preview
Openai
31.8
9.7K
8.19K
¥216 / ¥432Input/Output
242
llama-3.1-nemotron-51b-instruct
Nvidia
31.5
421
128K
¥0 / ¥0Input/Output
243
command-r-plus-08-2024
Cohere
31.3
1.1K
128K
¥18 / ¥72Input/Output
244
ibm-granite-h-small
Ibm
31.0
1K
-
-
245
gpt-4-1106-preview
Openai
30.7
9.6K
8.19K
¥216 / ¥432Input/Output
246
gemini-1.5-flash-8b-001
Google
30.4
4.3K
2M
¥0.54 / ¥2.2Input/Output
247
c4ai-aya-expanse-32b
Cohere
30.1
3.1K
-
-
248
gemma-2-27b-it
Google
29.8
8.7K
8.19K
¥0.58 / ¥0.58Input/Output
249
mistral-small-24b-instruct-2501
Mistral
29.5
1.4K
262K
¥2.88 / ¥14.4Input/Output
250
phi-4
Microsoft
29.3
2.4K
128K
¥0.9 / ¥3.6Input/Output
251
llama-3.1-tulu-3-8b
Allenai
29.0
285
-
-
252
jamba-1.5-large
-
28.7
979
256K
¥0 / ¥0Input/Output
253
reka-flash-20240904
-
28.4
904
65.5K
¥0.72 / ¥1.44Input/Output
254
command-r-plus
Cohere
28.1
8.5K
128K
¥18 / ¥72Input/Output
255
gemma-2-9b-it
Google
27.8
6.2K
8.19K
¥1.44 / ¥1.44Input/Output
256
nemotron-4-340b-instruct
Nvidia
27.6
2.2K
-
-
257
amazon-nova-micro-v1.0
Amazon
27.3
2.1K
128K
¥0.25 / ¥1.01Input/Output
258
olmo-2-0325-32b-instruct
Allenai
27.0
336
-
-
259
hunyuan-standard-256k
Tencent
26.7
328
-
-
260
glm-4-0520
Zai
26.4
1.2K
128K
¥108 / ¥108Input/Output
261
c4ai-aya-expanse-8b
Cohere
26.1
1.1K
-
-
262
claude-3-sonnet-20240229
Anthropic
25.9
11.9K
200K
¥21.6 / ¥108Input/Output
263
ministral-8b-2410
Mistral
25.6
572
128K
¥0.72 / ¥0.72Input/Output
264
llama-3-70b-instruct
Meta
25.3
16.6K
8.19K
¥3.67 / ¥5.33Input/Output
265
llama-3.1-8b-instruct
Meta
25.0
5.7K
131K
¥0.79 / ¥0.79Input/Output
266
claude-3-haiku-20240307
Anthropic
24.7
13.2K
200K
¥1.8 / ¥9Input/Output
267
jamba-1.5-mini
-
24.4
1K
256K
¥0 / ¥0Input/Output
268
command-r-08-2024
Cohere
24.1
1.2K
128K
¥18 / ¥72Input/Output
269
deepseek-coder-v2
Deepseek
23.9
1.8K
1M
¥1.01 / ¥2.02Input/Output
270
qwen2-72b-instruct
Alibaba
23.6
4.3K
131K
¥4.13 / ¥12.4Input/Output
271
gpt-4-0314
Openai
23.3
5.1K
8.19K
¥216 / ¥432Input/Output
272
granite-3.1-8b-instruct
Ibm
23.0
316
-
-
273
command-r
Cohere
22.7
6K
128K
¥18 / ¥72Input/Output
274
qwen1.5-110b-chat
Alibaba
22.4
3.1K
-
-
275
granite-3.1-2b-instruct
Ibm
22.2
382
-
-
276
reka-flash-21b-20240226-online
-
21.9
1.7K
-
-
277
yi-1.5-34b-chat
-
21.6
2.8K
-
-
278
gemini-pro-dev-api
Google
21.3
1.7K
1.05M
¥14.4 / ¥86.4Input/Output
279
mistral-large-2402
Mistral
21.0
6.5K
262K
¥2.88 / ¥14.4Input/Output
280
llama-3-8b-instruct
Meta
20.7
10.8K
8.19K
¥0.29 / ¥0.29Input/Output
281
reka-flash-21b-20240226
-
20.5
2.7K
-
-
282
gemma-2-2b-it
Google
20.2
5.2K
128K
¥0 / ¥0Input/Output
283
gpt-4-0613
Openai
19.9
8.7K
8.19K
¥216 / ¥432Input/Output
284
mistral-medium
Mistral
19.6
3.2K
262K
¥2.88 / ¥14.4Input/Output
285
mixtral-8x22b-instruct-v0.1
Mistral
19.3
5.6K
64K
¥14.4 / ¥43.2Input/Output
286
qwen1.5-72b-chat
Alibaba
19.0
3.8K
-
-
287
internlm2_5-20b-chat
-
18.8
1.3K
-
-
288
starling-lm-7b-beta
-
18.5
1.7K
200K
¥5.4 / ¥18.7Input/Output
289
phi-3-medium-4k-instruct
Microsoft
18.2
2.8K
4.1K
¥1.22 / ¥4.9Input/Output
290
qwq-32b-preview
Alibaba
17.9
374
131K
¥2.07 / ¥6.2Input/Output
291
qwen1.5-32b-chat
Alibaba
17.6
2.3K
-
-
292
zephyr-orpo-141b-A35b-v0.1
-
17.3
441
200K
¥108 / ¥432Input/Output
293
yi-34b-chat
-
17.0
1.4K
-
-
294
mixtral-8x7b-instruct-v0.1
Mistral
16.8
7.1K
32K
¥5.04 / ¥5.04Input/Output
295
gemini-pro
Google
16.5
431
1.05M
¥14.4 / ¥86.4Input/Output
296
qwen1.5-14b-chat
Alibaba
16.2
1.9K
-
-
297
llama-2-70b-chat
Meta
15.9
3.5K
-
-
298
wizardlm-70b
Microsoft
15.6
682
-
-
299
llama2-70b-steerlm-chat
Nvidia
15.3
282
-
-
300
tulu-2-dpo-70b
-
15.1
540
-
-
301
phi-3-small-8k-instruct
Microsoft
14.8
2.2K
8.19K
¥1.08 / ¥4.32Input/Output
302
gpt-3.5-turbo-0125
Openai
14.5
6.9K
16.4K
¥3.6 / ¥10.8Input/Output
303
openchat-3.5-0106
-
14.2
1.2K
-
-
304
llama-3.2-3b-instruct
Meta
13.9
1K
131K
¥0.22 / ¥0.35Input/Output
305
starling-lm-7b-alpha
-
13.6
841
200K
¥5.4 / ¥18.7Input/Output
306
dbrx-instruct-preview
-
13.4
3.4K
-
-
307
nous-hermes-2-mixtral-8x7b-dpo
-
13.1
282
1M
¥36 / ¥180Input/Output
308
qwen1.5-7b-chat
Alibaba
12.8
447
-
-
309
gemma-1.1-7b-it
Google
12.5
2.7K
-
-
310
llama-2-13b-chat
Meta
12.2
1.7K
-
-
311
vicuna-33b
-
11.9
1.9K
-
-
312
deepseek-llm-67b-chat
Deepseek
11.6
441
1M
¥1.01 / ¥2.02Input/Output
313
granite-3.0-8b-instruct
Ibm
11.4
813
-
-
314
mistral-7b-instruct-v0.2
Mistral
11.1
1.7K
262K
¥2.88 / ¥14.4Input/Output
315
openchat-3.5
-
10.8
615
-
-
316
granite-3.0-2b-instruct
Ibm
10.5
873
-
-
317
codellama-34b-instruct
Meta
10.2
639
-
-
318
phi-3-mini-4k-instruct
Microsoft
9.9
2.5K
4.1K
¥0.94 / ¥3.74Input/Output
319
snowflake-arctic-instruct
-
9.7
3.3K
-
-
320
llama-2-7b-chat
Meta
9.4
1.3K
128K
¥4.03 / ¥48Input/Output
321
gemma-7b-it
Google
9.1
818
-
-
322
openhermes-2.5-mistral-7b
-
8.8
381
1M
¥36 / ¥180Input/Output
323
phi-3-mini-4k-instruct-june-2024
Microsoft
8.5
1.4K
4.1K
¥0.94 / ¥3.74Input/Output
324
solar-10.7b-instruct-v1.0
-
8.2
311
128K
¥0 / ¥0Input/Output
325
gpt-3.5-turbo-1106
Openai
8.0
1.4K
16.4K
¥7.2 / ¥14.4Input/Output
326
olmo-7b-instruct
Allenai
7.7
548
-
-
327
mpt-30b-chat
-
7.4
207
-
-
328
wizardlm-13b
Microsoft
7.1
634
-
-
329
vicuna-13b
-
6.8
1.6K
-
-
330
qwen-14b-chat
Alibaba
6.5
426
32.8K
¥1.04 / ¥3.1Input/Output
331
zephyr-7b-beta
-
6.3
997
-
-
332
llama-3.2-1b-instruct
Meta
6.0
1K
16.4K
¥0.07 / ¥0.08Input/Output
333
gemma-2b-it
Google
5.7
447
-
-
334
phi-3-mini-128k-instruct
Microsoft
5.4
2.1K
128K
¥0.94 / ¥3.74Input/Output
335
gemma-1.1-2b-it
Google
5.1
1.1K
-
-
336
guanaco-33b
-
4.8
261
200K
¥14.4 / ¥57.6Input/Output
337
vicuna-7b
-
4.5
555
-
-
338
stripedhyena-nous-7b
-
4.3
455
-
-
339
smollm2-1.7b-instruct
-
4.0
252
-
-
340
qwen1.5-4b-chat
Alibaba
3.7
738
-
-
341
palm-2
Google
3.4
755
-
-
342
mistral-7b-instruct
Mistral
3.1
747
262K
¥2.88 / ¥14.4Input/Output
343
koala-13b
-
2.8
503
-
-
344
chatglm3-6b
-
2.6
378
200K
¥5.4 / ¥18.7Input/Output
345
RWKV-4-Raven-14B
-
2.3
360
-
-
346
chatglm2-6b
-
2.0
234
200K
¥5.4 / ¥18.7Input/Output
347
fastchat-t5-3b
-
1.7
287
-
-
348
mpt-7b-chat
-
1.4
306
-
-
349
chatglm-6b
-
1.1
300
200K
¥5.4 / ¥18.7Input/Output
350
oasst-pythia-12b
-
0.9
454
-
-
351
alpaca-13b
-
0.6
391
-
-
352
stablelm-tuned-alpha-7b
-
0.3
238
-
-
353
dolly-v2-12b
-
0.0
233
-
-
Top model analysis

claude-opus-4-6 why it ranks first

claude-opus-4-6 ranks first with a percent score of 100.0 and 7.1K samples. Use it as the first option for this leaderboard, then compare price, context and availability.

How to choose

Do not only look at rank #1

Start with the leaderboard closest to your task. Compare the top models by score and sample size, then check price, context length, open or closed access, and provider availability.

FAQ

FAQ

商业、管理与金融排行榜看什么指标?

主要看排名、百分制分数、样本量和来源。分数用于快速比较同一榜单内模型表现,样本量用于判断结果稳定性。

为什么不同榜单不能直接混合成总分?

不同榜单的任务、样本和评测口径不同,模力榜默认只在同一榜单内排序,避免把写作、代码、图像等能力强行合并。

商业、管理与金融模型应该怎么选?

优先看与你任务最接近的榜单,再结合价格、上下文长度、开源闭源和厂商可用性。排名靠前不代表适合所有预算和部署方式。

榜单多久更新?

页面展示的是最新成功采集的公开榜单数据。当前优先使用 LMArena leaderboard dataset,并在页面来源中保留原始链接。