Chat · Text · Medicine & Healthcare Leaderboard

Ranking for Text / Medicine & Healthcare, based on public preference data.

Selection guide

Medicine & Healthcare model ranking guide

Models: 329
Published: 2026/05/27
Source: Arena public preference evaluation
Original leaderboard: Text / Industry Medicine And Healthcare
Leaderboard dataset: LMArena latest parquet
Rank | Model | Provider | Score (percent) | Samples | Context | Price (Input / Output)
1 | claude-opus-4-6 | Anthropic | 100.0 | 2.6K | 1M | ¥36 / ¥180
2 | claude-opus-4-6-thinking | Anthropic | 99.7 | 2.5K | 1M | ¥36 / ¥180
3 | claude-opus-4-7-thinking | Anthropic | 99.4 | 1.5K | 1M | ¥36 / ¥180
4 | qwen3.5-max-preview | Alibaba | 99.1 | 1.4K | - | -
5 | muse-spark | Meta | 98.8 | 864 | - | -
6 | gemini-3-pro | Google | 98.5 | 2.5K | 1.05M | ¥14.4 / ¥86.4
7 | claude-opus-4-7 | Anthropic | 98.2 | 1.5K | 1M | ¥36 / ¥180
8 | gemini-3.1-pro-preview | Google | 97.9 | 3.2K | 1.05M | ¥14.4 / ¥86.4
9 | gemini-3.5-flash | Google | 97.6 | 633 | 1.05M | ¥10.8 / ¥64.8
10 | ernie-5.1 | Baidu | 97.3 | 1.1K | 119K | ¥5.4 / ¥21.6
11 | dola-seed-2.0-pro | Bytedance | 97.0 | 2.7K | - | -
12 | gemini-2.5-pro | Google | 96.6 | 7.7K | 1.05M | ¥9 / ¥72
13 | qwen3.7-max-preview | Alibaba | 96.3 | 297 | 1M | ¥18 / ¥54
14 | ernie-5.0-preview-1203 | Baidu | 96.0 | 679 | 128K | ¥7.92 / ¥14.4
15 | deepseek-v4-pro-thinking | Deepseek | 95.7 | 1.1K | 1M | ¥3.13 / ¥6.26
16 | mimo-v2.5-pro | Xiaomi | 95.4 | 1.1K | 1.05M | ¥7.2 / ¥21.6
17 | glm-5.1 | Zai | 95.1 | 997 | 200K | ¥0 / ¥0
18 | deepseek-v3.2-exp-thinking | Deepseek | 94.8 | 438 | 128K | ¥0 / ¥0
19 | amazon-nova-experimental-chat-12-10 | Amazon | 94.5 | 230 | - | -
20 | qwen3-max-preview | Alibaba | 94.2 | 1.5K | 262K | ¥6.2 / ¥24.8
21 | deepseek-v3.1-terminus-thinking | Deepseek | 93.9 | 228 | 128K | ¥1.8 / ¥5.04
22 | gemini-3-flash | Google | 93.6 | 2.1K | 1.05M | ¥3.6 / ¥21.6
23 | grok-3-preview-02-24 | Xai | 93.3 | 1.6K | 1M | ¥9 / ¥18
24 | qwen3.6-max-preview | Alibaba | 93.0 | 319 | 246K | ¥9.5 / ¥56.9
25 | kimi-k2.6 | Moonshot | 92.7 | 1.1K | 262K | ¥6.84 / ¥28.8
26 | glm-4.7 | Zai | 92.4 | 816 | 205K | ¥0 / ¥0
27 | ernie-5.0-0110 | Baidu | 92.1 | 2.3K | 128K | ¥7.92 / ¥14.4
28 | longcat-flash-chat | Meituan | 91.8 | 668 | 128K | ¥1.08 / ¥10.8
29 | deepseek-v4-pro | Deepseek | 91.5 | 1.2K | 1M | ¥3.13 / ¥6.26
30 | gpt-5.4-high | Openai | 91.2 | 2K | 1.05M | ¥18 / ¥108
31 | grok-4.20-multi-agent-beta-0309 | Xai | 90.9 | 2K | 2M | ¥14.4 / ¥43.2
32 | glm-4.6 | Zai | 90.5 | 1.9K | 205K | ¥4.32 / ¥15.8
33 | gpt-5.1-high | Openai | 90.2 | 2.5K | 400K | ¥9 / ¥72
34 | glm-5 | Zai | 89.9 | 1.6K | 205K | ¥7.2 / ¥23
35 | grok-4.20-beta-0309-reasoning | Xai | 89.6 | 2.1K | 2M | ¥14.4 / ¥43.2
36 | qwen3.5-397b-a17b | Alibaba | 89.3 | 2.4K | 262K | ¥3.1 / ¥18.6
37 | qwen3-next-80b-a3b-instruct | Alibaba | 89.0 | 1.2K | 131K | ¥1.04 / ¥4.13
38 | grok-4.20-beta1 | Xai | 88.7 | 1.8K | 2M | ¥14.4 / ¥43.2
39 | grok-4.1-thinking | Xai | 88.4 | 4.2K | 200K | ¥14.4 / ¥72
40 | glm-4.5 | Zai | 88.1 | 1.4K | 131K | ¥4.32 / ¥15.8
41 | gemini-3-flash (thinking-minimal) | Google | 87.8 | 3.7K | 1.05M | ¥3.6 / ¥21.6
42 | mistral-large-3 | Mistral | 87.5 | 2.8K | 262K | ¥3.6 / ¥10.8
43 | chatgpt-4o-latest-20250326 | Openai | 87.2 | 4.6K | 128K | ¥18 / ¥72
44 | hunyuan-t1-20250711 | Tencent | 86.9 | 258 | 131K | ¥0 / ¥0
45 | gpt-5.5-high | Openai | 86.6 | 1.2K | 1.05M | ¥36 / ¥216
46 | mistral-medium-2508 | Mistral | 86.3 | 5.7K | 262K | ¥2.88 / ¥14.4
47 | gpt-5.5 | Openai | 86.0 | 1.2K | 1.05M | ¥36 / ¥216
48 | ernie-5.0-preview-1022 | Baidu | 85.7 | 245 | 128K | ¥7.92 / ¥14.4
49 | qwen3-235b-a22b-thinking-2507 | Alibaba | 85.4 | 522 | 131K | ¥2.07 / ¥8.26
50 | qwen3.6-plus | Alibaba | 85.1 | 1.2K | 1M | ¥3.6 / ¥21.6
51 | kimi-k2.5-thinking | Moonshot | 84.8 | 2.5K | 262K | ¥4.32 / ¥21.6
52 | amazon-nova-experimental-chat-11-10 | Amazon | 84.5 | 1.7K | - | -
53 | gemma-4-31b | Google | 84.1 | 402 | 262K | ¥3.24 / ¥7.2
54 | o3-2025-04-16 | Openai | 83.8 | 3.3K | 200K | ¥14.4 / ¥57.6
55 | deepseek-v3.1-terminus | Deepseek | 83.5 | 258 | 128K | ¥1.8 / ¥5.04
56 | gpt-5.2-chat-latest-20260210 | Openai | 83.2 | 2.3K | 400K | ¥12.6 / ¥101
57 | deepseek-v4-flash | Deepseek | 82.9 | 1.2K | 1M | ¥1.01 / ¥2.02
58 | deepseek-v4-flash-thinking | Deepseek | 82.6 | 1.2K | 1M | ¥1.01 / ¥2.02
59 | deepseek-r1-0528 | Deepseek | 82.3 | 1.2K | 164K | ¥3.6 / ¥15.5
60 | qwen3-235b-a22b-instruct-2507 | Alibaba | 82.0 | 5.6K | 128K | ¥2.09 / ¥8.23
61 | gpt-5.4 | Openai | 81.7 | 2.1K | 1.05M | ¥18 / ¥108
62 | grok-4.1 | Xai | 81.4 | 4.2K | 200K | ¥14.4 / ¥72
63 | gpt-5.1 | Openai | 81.1 | 2.6K | 400K | ¥9 / ¥72
64 | claude-opus-4-5-20251101 | Anthropic | 80.8 | 4.3K | 200K | ¥36 / ¥180
65 | claude-sonnet-4-6 | Anthropic | 80.5 | 2K | 1M | ¥21.6 / ¥108
66 | deepseek-v3.1-thinking | Deepseek | 80.2 | 703 | 128K | ¥1.44 / ¥5.04
67 | claude-sonnet-4-5-20250929 | Anthropic | 79.9 | 4.8K | 200K | ¥21.6 / ¥108
68 | mimo-v2-pro | Xiaomi | 79.6 | 1.6K | 1.05M | ¥7.2 / ¥21.6
69 | deepseek-v3.2 | Deepseek | 79.3 | 3K | 128K | ¥2.09 / ¥3.1
70 | grok-4-0709 | Xai | 79.0 | 2.3K | 256K | ¥21.6 / ¥108
71 | qwen3-vl-235b-a22b-instruct | Alibaba | 78.7 | 628 | 128K | ¥2.16 / ¥8.64
72 | deepseek-v3.1 | Deepseek | 78.4 | 938 | 128K | ¥1.44 / ¥5.04
73 | gemini-3.1-flash-lite-preview | Google | 78.0 | 2.5K | 1.05M | ¥1.8 / ¥10.8
74 | gemini-2.5-flash | Google | 77.7 | 7.6K | 1.05M | ¥2.16 / ¥18
75 | deepseek-v3.2-exp | Deepseek | 77.4 | 594 | 128K | ¥0 / ¥0
76 | qwen3.5-122b-a10b | Alibaba | 77.1 | 1.9K | 262K | ¥2.88 / ¥23
77 | gpt-5.2 | Openai | 76.8 | 3.2K | 400K | ¥12.6 / ¥101
78 | claude-opus-4-5-20251101-thinking-32k | Anthropic | 76.5 | 2.3K | 200K | ¥108 / ¥540
79 | amazon-nova-experimental-chat-26-02-10 | Amazon | 76.2 | 216 | - | -
80 | longcat-flash-chat-2602-exp | Meituan | 75.9 | 1.6K | 128K | ¥1.08 / ¥10.8
81 | gemma-4-26b-a4b | Google | 75.6 | 371 | 262K | ¥0.94 / ¥2.88
82 | claude-sonnet-4-5-20250929-thinking-32k | Anthropic | 75.3 | 4.8K | 200K | ¥21.6 / ¥108
83 | qwen3.5-27b | Alibaba | 75.0 | 1.8K | 262K | ¥2.16 / ¥17.3
84 | gemini-2.5-flash-preview-09-2025 | Google | 74.7 | 1.7K | 1M | ¥2.16 / ¥18
85 | qwen3.5-35b-a3b | Alibaba | 74.4 | 2K | 262K | ¥1.8 / ¥14.4
86 | gpt-5.2-high | Openai | 74.1 | 3.2K | 400K | ¥12.6 / ¥101
87 | deepseek-v3.2-thinking | Deepseek | 73.8 | 2.6K | 128K | ¥2.09 / ¥3.1
88 | mimo-v2.5 | Xiaomi | 73.5 | 1.1K | 1.05M | ¥2.88 / ¥14.4
89 | grok-4-fast-reasoning | Xai | 73.2 | 882 | 2M | ¥1.44 / ¥3.6
90 | kimi-k2-thinking-turbo | Moonshot | 72.9 | 3.7K | 262K | ¥17.3 / ¥72
91 | mimo-v2-flash (non-thinking) | Xiaomi | 72.6 | 3K | 262K | ¥0.72 / ¥2.16
92 | hunyuan-turbos-20250416 | Tencent | 72.3 | 589 | 131K | ¥0 / ¥0
93 | qwen3-vl-235b-a22b-thinking | Alibaba | 72.0 | 435 | 131K | ¥2.06 / ¥8.26
94 | gpt-5.5-instant | Openai | 71.6 | 1.9K | 400K | ¥9 / ¥72
95 | grok-4-fast-chat | Xai | 71.3 | 394 | 2M | ¥1.44 / ¥3.6
96 | grok-4-1-fast-reasoning | Xai | 71.0 | 3.5K | 2M | ¥1.44 / ¥3.6
97 | claude-opus-4-1-20250805-thinking-16k | Anthropic | 70.7 | 2.7K | 200K | ¥108 / ¥540
98 | claude-opus-4-1-20250805 | Anthropic | 70.4 | 4.4K | 200K | ¥108 / ¥540
99 | gpt-5-high | Openai | 70.1 | 1.8K | 400K | ¥9 / ¥72
100 | hunyuan-hy3-preview | Tencent | 69.8 | 391 | 256K | ¥0 / ¥0
101 | gpt-5-chat | Openai | 69.5 | 1.7K | 400K | ¥9 / ¥72
102 | grok-4.3 | Xai | 69.2 | 1.2K | 1M | ¥9 / ¥18
103 | gpt-4.5-preview-2025-02-27 | Openai | 68.9 | 589 | 8.19K | ¥216 / ¥432
104 | step-3.5-flash | Stepfun | 68.6 | 2.4K | 256K | ¥0.69 / ¥2.07
105 | qwen3.5-flash | Alibaba | 68.3 | 2K | 1M | ¥1.24 / ¥12.4
106 | qwen3-235b-a22b-no-thinking | Alibaba | 68.0 | 2.3K | 131K | ¥2.07 / ¥8.26
107 | kimi-k2.5-instant | Moonshot | 67.7 | 541 | 262K | ¥4.32 / ¥21.6
108 | mimo-v2-flash (thinking) | Xiaomi | 67.4 | 691 | 262K | ¥0.72 / ¥2.16
109 | amazon-nova-experimental-chat-10-20 | Amazon | 67.1 | 643 | - | -
110 | qwen3-max-2025-09-23 | Alibaba | 66.8 | 451 | 258K | ¥6.19 / ¥24.7
111 | amazon-nova-experimental-chat-26-01-10 | Amazon | 66.5 | 238 | - | -
112 | glm-4.5-air | Zai | 66.2 | 1.6K | 131K | ¥0 / ¥0
113 | minimax-m2.7 | Minimax | 65.9 | 1.5K | 205K | ¥0 / ¥0
114 | gpt-5.4-mini-high | Openai | 65.5 | 1.9K | 400K | ¥5.4 / ¥32.4
115 | qwen3-30b-a3b-instruct-2507 | Alibaba | 65.2 | 1.3K | 262K | ¥2.16 / ¥3.6
116 | grok-3-mini-high | Xai | 64.9 | 988 | 128K | ¥0 / ¥0
117 | qwen3-next-80b-a3b-thinking | Alibaba | 64.6 | 721 | 131K | ¥1.04 / ¥10.3
118 | gpt-5.3-chat-latest | Openai | 64.3 | 2.3K | 128K | ¥12.6 / ¥101
119 | gpt-oss-120b | Openai | 64.0 | 1.6K | 131K | ¥1.08 / ¥4.32
120 | gemma-3-27b-it | Google | 63.7 | 2.4K | 128K | ¥2.15 / ¥2.15
121 | glm-4-plus-0111 | Zai | 63.4 | 293 | 128K | ¥72 / ¥72
122 | minimax-m2.1-preview | Minimax | 63.1 | 1.1K | 205K | ¥0 / ¥0
123 | gemini-2.5-flash-lite-preview-09-2025-no-thinking | Google | 62.8 | 2.5K | 1.05M | ¥0.72 / ¥2.88
124 | gemini-2.5-flash-lite-preview-06-17-thinking | Google | 62.5 | 2K | 65.5K | ¥0.72 / ¥2.88
125 | nvidia-nemotron-3-super-120b-a12b | Nvidia | 62.2 | 479 | 262K | ¥1.44 / ¥5.76
126 | nova-2-lite | Amazon | 61.9 | 711 | 128K | ¥2.38 / ¥19.8
127 | qwen2.5-max | Alibaba | 61.6 | 1.6K | 32K | ¥11.5 / ¥46
128 | kimi-k2-0905-preview | Moonshot | 61.3 | 724 | 262K | ¥4.32 / ¥18
129 | deepseek-v3-0324 | Deepseek | 61.0 | 2.7K | 75K | ¥1.44 / ¥5.76
130 | claude-haiku-4-5-20251001 | Anthropic | 60.7 | 4.8K | 200K | ¥7.2 / ¥36
131 | mistral-medium-2505 | Mistral | 60.4 | 2K | 262K | ¥2.88 / ¥14.4
132 | gpt-4.1-2025-04-14 | Openai | 60.1 | 2.8K | 1.05M | ¥14.4 / ¥57.6
133 | gpt-5.4-nano-high | Openai | 59.8 | 1.9K | 400K | ¥1.44 / ¥9
134 | gpt-5-mini-high | Openai | 59.5 | 1.4K | 400K | ¥1.8 / ¥14.4
135 | mercury-2 | Inception Ai | 59.1 | 229 | 128K | ¥1.8 / ¥5.4
136 | deepseek-r1 | Deepseek | 58.8 | 830 | 164K | ¥5.04 / ¥18
137 | qwen3-32b | Alibaba | 58.5 | 212 | 131K | ¥2.07 / ¥8.26
138 | gemini-2.0-flash-001 | Google | 58.2 | 2.3K | 1.05M | ¥1.08 / ¥4.32
139 | kimi-k2-0711-preview | Moonshot | 57.9 | 1.7K | 131K | ¥4.32 / ¥18
140 | minimax-m2 | Minimax | 57.6 | 325 | 197K | ¥0 / ¥0
141 | nvidia-llama-3.3-nemotron-super-49b-v1.5 | Nvidia | 57.3 | 197 | 131K | ¥2.88 / ¥2.88
142 | claude-opus-4-20250514 | Anthropic | 57.0 | 2.6K | 200K | ¥108 / ¥540
143 | grok-3-mini-beta | Xai | 56.7 | 1.4K | 1M | ¥9 / ¥18
144 | qwen3-235b-a22b | Alibaba | 56.4 | 1.5K | 131K | ¥2.07 / ¥8.26
145 | o4-mini-2025-04-16 | Openai | 56.1 | 2.5K | 200K | ¥7.92 / ¥31.7
146 | ling-flash-2.0 | Ant Group | 55.8 | 352 | 131K | ¥1.01 / ¥4.1
147 | minimax-m2.5 | Minimax | 55.5 | 2.7K | 205K | ¥0 / ¥0
148 | trinity-large-thinking | - | 55.2 | 1.8K | 262K | ¥1.8 / ¥6.48
149 | step-3 | Stepfun | 54.9 | 347 | 65.5K | ¥1.8 / ¥4.68
150 | minimax-m1 | Minimax | 54.6 | 2.1K | 1M | ¥0.95 / ¥9.03
151 | claude-opus-4-20250514-thinking-16k | Anthropic | 54.3 | 2.2K | 200K | ¥108 / ¥540
152 | o1-2024-12-17 | Openai | 54.0 | 1.3K | 128K | ¥108 / ¥432
153 | step-2-16k-exp-202412 | Stepfun | 53.7 | 265 | 16.4K | ¥37.5 / ¥118
154 | nvidia-nemotron-3-nano-30b-a3b-bf16 | Nvidia | 53.4 | 981 | 131K | ¥0 / ¥0
155 | qwen-plus-0125 | Alibaba | 53.0 | 315 | 1M | ¥0.83 / ¥2.07
156 | trinity-large-preview | - | 52.7 | 2.1K | 262K | ¥1.8 / ¥6.48
157 | glm-4.5v | Zai | 52.4 | 238 | 64K | ¥4.32 / ¥13
158 | step-1o-turbo-202506 | Stepfun | 52.1 | 605 | - | -
159 | glm-4.7-flash | Zai | 51.8 | 804 | 200K | ¥0 / ¥0
160 | qwen3-coder-480b-a35b-instruct | Alibaba | 51.5 | 1.5K | 262K | ¥6.2 / ¥24.8
161 | gemini-2.0-flash-lite-preview-02-05 | Google | 51.2 | 1.1K | 1.05M | ¥0.54 / ¥2.16
162 | mistral-small-2506 | Mistral | 50.9 | 1.1K | 262K | ¥2.88 / ¥14.4
163 | deepseek-v3 | Deepseek | 50.6 | 1.1K | 128K | ¥0 / ¥0
164 | gemma-3-4b-it | Google | 50.3 | 197 | 128K | ¥1.44 / ¥1.44
165 | qwq-32b | Alibaba | 50.0 | 1.4K | 131K | ¥2.07 / ¥6.2
166 | hunyuan-large-2025-02-10 | Tencent | 49.7 | 212 | - | -
167 | gemma-3-12b-it | Google | 49.4 | 155 | 128K | ¥1.96 / ¥1.96
168 | command-a-03-2025 | Cohere | 49.1 | 3.1K | 256K | ¥18 / ¥72
169 | ring-flash-2.0 | Ant Group | 48.8 | 384 | 131K | ¥1.01 / ¥4.1
170 | gpt-5-nano-high | Openai | 48.5 | 480 | 400K | ¥0.36 / ¥2.88
171 | claude-sonnet-4-20250514 | Anthropic | 48.2 | 2.4K | 200K | ¥21.6 / ¥108
172 | o3-mini-high | Openai | 47.9 | 824 | 200K | ¥7.92 / ¥31.7
173 | intellect-3 | - | 47.6 | 315 | 131K | ¥1.44 / ¥7.92
174 | gpt-4.1-mini-2025-04-14 | Openai | 47.3 | 2.3K | 1.05M | ¥2.88 / ¥11.5
175 | claude-sonnet-4-20250514-thinking-32k | Anthropic | 47.0 | 2.2K | 200K | ¥21.6 / ¥108
176 | gemma-3n-e4b-it | Google | 46.6 | 1.3K | 128K | ¥0 / ¥0
177 | qwen3-30b-a3b | Alibaba | 46.3 | 1.6K | 128K | ¥0.79 / ¥7.78
178 | gpt-oss-20b | Openai | 46.0 | 561 | 131K | ¥0.32 / ¥1.3
179 | olmo-3-32b-think | Allenai | 45.7 | 315 | 128K | ¥2.16 / ¥3.24
180 | hunyuan-standard-2025-02-10 | Tencent | 45.4 | 247 | - | -
181 | granite-4.1-8b | Ibm | 45.1 | 280 | 131K | ¥0.36 / ¥0.72
182 | o1-preview | Openai | 44.8 | 1.7K | 128K | ¥108 / ¥432
183 | grok-2-2024-08-13 | Xai | 44.5 | 3.2K | 1M | ¥9 / ¥18
184 | yi-lightning | - | 44.2 | 1.5K | 12K | ¥1.44 / ¥1.44
185 | gemini-1.5-pro-002 | Google | 43.9 | 2.7K | - | -
186 | qwen2.5-plus-1127 | Alibaba | 43.6 | 474 | - | -
187 | llama-3.1-nemotron-70b-instruct | Nvidia | 43.3 | 392 | 128K | ¥0 / ¥0
188 | o3-mini | Openai | 43.0 | 3K | 200K | ¥7.92 / ¥31.7
189 | grok-2-mini-2024-08-13 | Xai | 42.7 | 2.6K | 1M | ¥9 / ¥18
190 | o1-mini | Openai | 42.4 | 2.6K | 128K | ¥7.92 / ¥31.7
191 | athene-v2-chat | - | 42.1 | 1.2K | - | -
192 | deepseek-v2.5-1210 | Deepseek | 41.8 | 291 | 1M | ¥1.01 / ¥2.02
193 | gemini-1.5-flash-002 | Google | 41.5 | 1.7K | 2M | ¥0.54 / ¥2.2
194 | glm-4-plus | Zai | 41.2 | 1.5K | 128K | ¥54 / ¥54
195 | olmo-3.1-32b-instruct | Allenai | 40.9 | 805 | 200K | ¥14.4 / ¥57.6
196 | claude-3-7-sonnet-20250219-thinking-32k | Anthropic | 40.5 | 2.3K | - | -
197 | llama-3.1-405b-instruct-bf16 | Meta | 40.2 | 1.9K | 128K | ¥0 / ¥0
198 | gpt-4o-mini-2024-07-18 | Openai | 39.9 | 3.4K | 128K | ¥1.08 / ¥4.32
199 | reka-core-20240904 | - | 39.6 | 364 | - | -
200 | gpt-4o-2024-05-13 | Openai | 39.3 | 6K | 128K | ¥36 / ¥108
201 | llama-4-maverick-17b-128e-instruct | Meta | 39.0 | 2.3K | 1M | ¥1.8 / ¥6.26
202 | gpt-4.1-nano-2025-04-14 | Openai | 38.7 | 302 | 1.05M | ¥14.4 / ¥57.6
203 | llama-4-scout-17b-16e-instruct | Meta | 38.4 | 1.8K | 128K | ¥1.44 / ¥5.62
204 | llama-3.3-70b-instruct | Meta | 38.1 | 2.7K | 128K | ¥0 / ¥0
205 | athene-70b-0725 | - | 37.8 | 1K | - | -
206 | llama-3.1-405b-instruct-fp8 | Meta | 37.5 | 3.1K | 128K | ¥0 / ¥0
207 | mistral-small-3.1-24b-instruct-2503 | Mistral | 37.2 | 2K | 262K | ¥2.88 / ¥14.4
208 | olmo-3.1-32b-think | Allenai | 36.9 | 595 | 200K | ¥14.4 / ¥57.6
209 | amazon-nova-pro-v1.0 | Amazon | 36.6 | 1.1K | 300K | ¥5.76 / ¥23
210 | claude-3-7-sonnet-20250219 | Anthropic | 36.3 | 2.3K | 200K | ¥21.6 / ¥108
211 | deepseek-v2.5 | Deepseek | 36.0 | 1.2K | 1M | ¥1.01 / ¥2.02
212 | qwen-max-0919 | Alibaba | 35.7 | 933 | 131K | ¥2.48 / ¥9.91
213 | qwen2.5-72b-instruct | Alibaba | 35.4 | 2K | 131K | ¥4.13 / ¥12.4
214 | llama-3.1-70b-instruct | Meta | 35.1 | 2.8K | 131K | ¥2.88 / ¥2.88
215 | claude-3-5-sonnet-20241022 | Anthropic | 34.8 | 4.6K | 200K | ¥21.6 / ¥108
216 | mistral-large-2411 | Mistral | 34.5 | 1.2K | 128K | ¥14.4 / ¥43.2
217 | gpt-4o-2024-08-06 | Openai | 34.1 | 2.2K | 128K | ¥18 / ¥72
218 | gemini-advanced-0514 | Google | 33.8 | 2.7K | - | -
219 | magistral-medium-2506 | Mistral | 33.5 | 781 | 128K | ¥14.4 / ¥36
220 | mistral-large-2407 | Mistral | 33.2 | 2.4K | 131K | ¥14.4 / ¥43.2
221 | claude-3-5-sonnet-20240620 | Anthropic | 32.9 | 4.5K | 200K | ¥21.6 / ¥108
222 | gpt-4-turbo-2024-04-09 | Openai | 32.6 | 5K | 128K | ¥72 / ¥216
223 | command-r-plus-08-2024 | Cohere | 32.3 | 547 | 128K | ¥18 / ¥72
224 | gpt-4-0125-preview | Openai | 32.0 | 5.2K | 8.19K | ¥216 / ¥432
225 | claude-3-opus-20240229 | Anthropic | 31.7 | 10.3K | 200K | ¥108 / ¥540
226 | hunyuan-large-vision | Tencent | 31.4 | 366 | - | -
227 | claude-3-5-haiku-20241022 | Anthropic | 31.1 | 3.7K | 200K | ¥5.76 / ¥28.8
228 | gpt-4-1106-preview | Openai | 30.8 | 5.5K | 8.19K | ¥216 / ¥432
229 | c4ai-aya-expanse-32b | Cohere | 30.5 | 1.4K | - | -
230 | gemma-2-9b-it-simpo | - | 30.2 | 516 | 8.19K | ¥1.44 / ¥1.44
231 | ibm-granite-h-small | Ibm | 29.9 | 254 | - | -
232 | mistral-small-24b-instruct-2501 | Mistral | 29.6 | 654 | 262K | ¥2.88 / ¥14.4
233 | llama-3-70b-instruct | Meta | 29.3 | 8.3K | 8.19K | ¥3.67 / ¥5.33
234 | gemini-1.5-flash-8b-001 | Google | 29.0 | 1.8K | 2M | ¥0.54 / ¥2.2
235 | amazon-nova-lite-v1.0 | Amazon | 28.7 | 872 | 300K | ¥0.43 / ¥1.73
236 | gemini-1.5-pro-001 | Google | 28.4 | 4.4K | - | -
237 | command-r-08-2024 | Cohere | 28.0 | 500 | 128K | ¥18 / ¥72
238 | amazon-nova-micro-v1.0 | Amazon | 27.7 | 952 | 128K | ¥0.25 / ¥1.01
239 | command-r-plus | Cohere | 27.4 | 4.2K | 128K | ¥18 / ¥72
240 | glm-4-0520 | Zai | 27.1 | 554 | 128K | ¥108 / ¥108
241 | c4ai-aya-expanse-8b | Cohere | 26.8 | 470 | - | -
242 | phi-4 | Microsoft | 26.5 | 1K | 128K | ¥0.9 / ¥3.6
243 | reka-flash-20240904 | - | 26.2 | 390 | 65.5K | ¥0.72 / ¥1.44
244 | gemma-2-27b-it | Google | 25.9 | 3.9K | 8.19K | ¥0.58 / ¥0.58
245 | jamba-1.5-large | - | 25.6 | 474 | 256K | ¥0 / ¥0
246 | nemotron-4-340b-instruct | Nvidia | 25.3 | 1.1K | - | -
247 | claude-3-sonnet-20240229 | Anthropic | 25.0 | 5.9K | 200K | ¥21.6 / ¥108
248 | llama-3.1-nemotron-51b-instruct | Nvidia | 24.7 | 213 | 128K | ¥0 / ¥0
249 | qwen2.5-coder-32b-instruct | Alibaba | 24.4 | 221 | 131K | ¥2.07 / ¥6.2
250 | gemini-1.5-flash-001 | Google | 24.1 | 3.5K | 2M | ¥0.54 / ¥2.2
251 | olmo-2-0325-32b-instruct | Allenai | 23.8 | 204 | - | -
252 | llama-3.1-8b-instruct | Meta | 23.5 | 2.6K | 131K | ¥0.79 / ¥0.79
253 | jamba-1.5-mini | - | 23.2 | 486 | 256K | ¥0 / ¥0
254 | ministral-8b-2410 | Mistral | 22.9 | 280 | 128K | ¥0.72 / ¥0.72
255 | qwen2-72b-instruct | Alibaba | 22.6 | 2.1K | 131K | ¥4.13 / ¥12.4
256 | gemma-2-9b-it | Google | 22.3 | 2.8K | 8.19K | ¥1.44 / ¥1.44
257 | claude-3-haiku-20240307 | Anthropic | 22.0 | 6.4K | 200K | ¥1.8 / ¥9
258 | yi-1.5-34b-chat | - | 21.6 | 1.3K | - | -
259 | command-r | Cohere | 21.3 | 2.9K | 128K | ¥18 / ¥72
260 | internlm2_5-20b-chat | - | 21.0 | 548 | - | -
261 | llama-3-8b-instruct | Meta | 20.7 | 5.4K | 8.19K | ¥0.29 / ¥0.29
262 | reka-flash-21b-20240226 | - | 20.4 | 1.3K | - | -
263 | qwen1.5-110b-chat | Alibaba | 20.1 | 1.4K | - | -
264 | gpt-4-0314 | Openai | 19.8 | 2.9K | 8.19K | ¥216 / ¥432
265 | gemma-2-2b-it | Google | 19.5 | 2.3K | 128K | ¥0 / ¥0
266 | reka-flash-21b-20240226-online | - | 19.2 | 818 | - | -
267 | qwen1.5-72b-chat | Alibaba | 18.9 | 2.4K | - | -
268 | gemini-pro-dev-api | Google | 18.6 | 1.1K | 1.05M | ¥14.4 / ¥86.4
269 | deepseek-coder-v2 | Deepseek | 18.3 | 847 | 1M | ¥1.01 / ¥2.02
270 | mistral-large-2402 | Mistral | 18.0 | 3.4K | 262K | ¥2.88 / ¥14.4
271 | mistral-medium | Mistral | 17.7 | 1.9K | 262K | ¥2.88 / ¥14.4
272 | starling-lm-7b-beta | - | 17.4 | 853 | 200K | ¥5.4 / ¥18.7
273 | gpt-4-0613 | Openai | 17.1 | 4.8K | 8.19K | ¥216 / ¥432
274 | mixtral-8x22b-instruct-v0.1 | Mistral | 16.8 | 2.8K | 64K | ¥14.4 / ¥43.2
275 | phi-3-medium-4k-instruct | Microsoft | 16.5 | 1.3K | 4.1K | ¥1.22 / ¥4.9
276 | gemini-pro | Google | 16.2 | 262 | 1.05M | ¥14.4 / ¥86.4
277 | starling-lm-7b-alpha | - | 15.9 | 568 | 200K | ¥5.4 / ¥18.7
278 | qwen1.5-32b-chat | Alibaba | 15.5 | 1.2K | - | -
279 | zephyr-orpo-141b-A35b-v0.1 | - | 15.2 | 284 | 200K | ¥108 / ¥432
280 | qwen1.5-14b-chat | Alibaba | 14.9 | 939 | - | -
281 | llama-3.2-3b-instruct | Meta | 14.6 | 441 | 131K | ¥0.22 / ¥0.35
282 | yi-34b-chat | - | 14.3 | 824 | - | -
283 | phi-3-small-8k-instruct | Microsoft | 14.0 | 977 | 8.19K | ¥1.08 / ¥4.32
284 | wizardlm-70b | Microsoft | 13.7 | 385 | - | -
285 | mixtral-8x7b-instruct-v0.1 | Mistral | 13.4 | 4.1K | 32K | ¥5.04 / ¥5.04
286 | llama-2-70b-chat | Meta | 13.1 | 2.2K | - | -
287 | gpt-3.5-turbo-0125 | Openai | 12.8 | 3.8K | 16.4K | ¥3.6 / ¥10.8
288 | openchat-3.5-0106 | - | 12.5 | 743 | - | -
289 | gemma-1.1-7b-it | Google | 12.2 | 1.3K | - | -
290 | llama-2-13b-chat | Meta | 11.9 | 970 | - | -
291 | mistral-7b-instruct-v0.2 | Mistral | 11.6 | 1.1K | 262K | ¥2.88 / ¥14.4
292 | dbrx-instruct-preview | - | 11.3 | 1.7K | - | -
293 | openchat-3.5 | - | 11.0 | 426 | - | -
294 | wizardlm-13b | Microsoft | 10.7 | 323 | - | -
295 | snowflake-arctic-instruct | - | 10.4 | 1.6K | - | -
296 | deepseek-llm-67b-chat | Deepseek | 10.1 | 385 | 1M | ¥1.01 / ¥2.02
297 | vicuna-33b | - | 9.8 | 1.2K | - | -
298 | tulu-2-dpo-70b | - | 9.5 | 348 | - | -
299 | granite-3.0-8b-instruct | Ibm | 9.1 | 356 | - | -
300 | nous-hermes-2-mixtral-8x7b-dpo | - | 8.8 | 266 | 1M | ¥36 / ¥180
301 | qwen1.5-7b-chat | Alibaba | 8.5 | 266 | - | -
302 | openhermes-2.5-mistral-7b | - | 8.2 | 285 | 1M | ¥36 / ¥180
303 | solar-10.7b-instruct-v1.0 | - | 7.9 | 210 | 128K | ¥0 / ¥0
304 | phi-3-mini-4k-instruct-june-2024 | Microsoft | 7.6 | 653 | 4.1K | ¥0.94 / ¥3.74
305 | llama-2-7b-chat | Meta | 7.3 | 764 | 128K | ¥4.03 / ¥48
306 | olmo-7b-instruct | Allenai | 7.0 | 339 | - | -
307 | phi-3-mini-4k-instruct | Microsoft | 6.7 | 1.1K | 4.1K | ¥0.94 / ¥3.74
308 | gemma-7b-it | Google | 6.4 | 538 | - | -
309 | zephyr-7b-beta | - | 6.1 | 580 | - | -
310 | codellama-34b-instruct | Meta | 5.8 | 340 | - | -
311 | vicuna-13b | - | 5.5 | 884 | - | -
312 | llama-3.2-1b-instruct | Meta | 5.2 | 445 | 16.4K | ¥0.07 / ¥0.08
313 | qwen-14b-chat | Alibaba | 4.9 | 268 | 32.8K | ¥1.04 / ¥3.1
314 | gpt-3.5-turbo-1106 | Openai | 4.6 | 906 | 16.4K | ¥7.2 / ¥14.4
315 | granite-3.0-2b-instruct | Ibm | 4.3 | 380 | - | -
316 | vicuna-7b | - | 4.0 | 291 | - | -
317 | phi-3-mini-128k-instruct | Microsoft | 3.7 | 1.1K | 128K | ¥0.94 / ¥3.74
318 | mistral-7b-instruct | Mistral | 3.4 | 477 | 262K | ¥2.88 / ¥14.4
319 | gemma-1.1-2b-it | Google | 3.0 | 628 | - | -
320 | stripedhyena-nous-7b | - | 2.7 | 327 | - | -
321 | palm-2 | Google | 2.4 | 303 | - | -
322 | koala-13b | - | 2.1 | 253 | - | -
323 | qwen1.5-4b-chat | Alibaba | 1.8 | 433 | - | -
324 | RWKV-4-Raven-14B | - | 1.5 | 177 | - | -
325 | gemma-2b-it | Google | 1.2 | 292 | - | -
326 | chatglm3-6b | - | 0.9 | 253 | 200K | ¥5.4 / ¥18.7
327 | chatglm-6b | - | 0.6 | 193 | 200K | ¥5.4 / ¥18.7
328 | alpaca-13b | - | 0.3 | 212 | - | -
329 | oasst-pythia-12b | - | 0.0 | 201 | - | -

Top model analysis

Why claude-opus-4-6 ranks first

claude-opus-4-6 ranks first with a percent score of 100.0 from 2.6K samples. Treat it as the first candidate on this leaderboard, then compare price, context length, and availability.
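
The percent scores on this page are consistent with a rank-based percentile rather than a raw rating: with 329 models, each step in rank moves the score by roughly 0.3 points. The page does not document how the score is computed, so the formula below is an inference from the published values, and `percent_score` is a hypothetical helper name. A minimal Python sketch that reproduces several rows:

```python
def percent_score(rank: int, total: int) -> float:
    """Rank-based percentile: 100 for rank 1, 0 for the last rank.

    Assumption: inferred from the published values, not a documented formula.
    """
    if total < 2 or not 1 <= rank <= total:
        raise ValueError("rank must be in 1..total and total must be >= 2")
    return round(100 * (total - rank) / (total - 1), 1)


TOTAL_MODELS = 329  # model count stated in the page header

# (rank, published score) pairs copied from the ranking table
published = [(1, 100.0), (2, 99.7), (12, 96.6), (53, 84.1), (165, 50.0), (329, 0.0)]

for rank, score in published:
    # Print the published score next to the reconstructed one for comparison.
    print(rank, score, percent_score(rank, TOTAL_MODELS))
```

If this reading is right, the gap between two scores reflects only the distance in rank, not the size of the underlying preference difference.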

How to choose

Do not look only at rank #1

Start with the leaderboard closest to your task. Compare the top models by score and sample size, then check price, context length, open or closed access, and provider availability.
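
The selection steps above can be sketched as a simple filter. This is a minimal Python sketch, not part of the site: the `Entry` type, the `shortlist` function, and every threshold value are hypothetical, and the rows are a few entries copied from the ranking table with abbreviated values written out as numbers.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Entry:
    rank: int
    model: str
    score: float
    samples: int
    context: Optional[int]         # context length as listed; None where the table shows "-"
    output_price: Optional[float]  # output price in ¥ as listed; None where the table shows "-"


# A few rows copied from the table (2.6K -> 2600, 1M -> 1_000_000, and so on).
ENTRIES = [
    Entry(1, "claude-opus-4-6", 100.0, 2600, 1_000_000, 180.0),
    Entry(5, "muse-spark", 98.8, 864, None, None),
    Entry(6, "gemini-3-pro", 98.5, 2500, 1_050_000, 86.4),
    Entry(10, "ernie-5.1", 97.3, 1100, 119_000, 21.6),
    Entry(15, "deepseek-v4-pro-thinking", 95.7, 1100, 1_000_000, 6.26),
]


def shortlist(entries: List[Entry], min_score: float, min_samples: int,
              max_output_price: float, min_context: int) -> List[Entry]:
    """Keep entries that meet every threshold.

    Rows with an unlisted price or context length are dropped, because
    they cannot be checked against the requirement.
    """
    kept = []
    for e in entries:
        if e.score < min_score or e.samples < min_samples:
            continue
        if e.output_price is None or e.output_price > max_output_price:
            continue
        if e.context is None or e.context < min_context:
            continue
        kept.append(e)
    return sorted(kept, key=lambda e: e.rank)


# Hypothetical requirements: score of at least 95, at least 1,000 samples,
# output price of at most ¥100, and a context length of at least 200K.
picks = shortlist(ENTRIES, min_score=95.0, min_samples=1000,
                  max_output_price=100.0, min_context=200_000)
print([e.model for e in picks])
```

With these example thresholds the rank #1 model is excluded on price, which is the situation the advice above is meant to catch.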

FAQ

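Which metrics matter on the Medicine & Healthcare leaderboard?

Look mainly at rank, percent score, sample size, and source. The score gives a quick comparison of model performance within a single leaderboard; the sample size indicates how stable the result is.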
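
Sample sizes on this page are abbreviated ("2.6K", "864"), so they need to be converted before they can be compared. The Python sketch below is illustrative only: `parse_count` is a hypothetical helper, and the 500-sample cutoff is an arbitrary example value, not a threshold used by the leaderboard.

```python
from typing import Optional


def parse_count(text: str) -> Optional[int]:
    """Convert an abbreviated value such as "2.6K" or "1.05M" to an integer.

    Returns None for "-", which the table uses for values that are not listed.
    """
    text = text.strip()
    if text == "-":
        return None
    multipliers = {"K": 1_000, "M": 1_000_000}
    suffix = text[-1].upper()
    if suffix in multipliers:
        return round(float(text[:-1]) * multipliers[suffix])
    return int(text)


MIN_SAMPLES = 500  # hypothetical cutoff for treating a result as stable

# (model, samples) pairs copied from the ranking table
rows = [
    ("qwen3.5-max-preview", "1.4K"),
    ("muse-spark", "864"),
    ("qwen3.7-max-preview", "297"),
    ("claude-3-opus-20240229", "10.3K"),
]

# Models whose sample count falls below the example cutoff.
low = [model for model, samples in rows if parse_count(samples) < MIN_SAMPLES]
print(low)
```

A model flagged this way is not necessarily worse; its position is simply supported by fewer samples and may move more as data accumulates.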

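Why can't different leaderboards be combined into one overall score?

Leaderboards differ in tasks, samples, and evaluation criteria. By default, 模力榜 ranks models only within a single leaderboard, to avoid forcing writing, coding, image, and other capabilities into one number.

How should I choose a Medicine & Healthcare model?

Start with the leaderboard closest to your task, then factor in price, context length, open or closed source, and provider availability. A high rank does not mean a model suits every budget or deployment method.

How often is the leaderboard updated?

The page shows the most recent successfully collected public leaderboard data. It currently prioritizes the LMArena leaderboard dataset and keeps the original link in the page's source section.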