← 返回排行榜
Anthropic

Claude 3.7 Sonnet

面向长上下文和推理的闭源模型。

闭源 API
Use Case Fit

适合的应用场景

按应用任务展示该模型被推荐的理由和证据数量。

跨 Benchmark 成绩

已收录结果

7Results
领域Benchmark排名分数指标来源更新时间
mathLMArena Math#41338 EloArena EloLMArena2026/05/30
mathMMLU-Pro Mathematics#483.2%AccuracyTIGER-Lab / MMLU-Pro2026/05/20
physicsMMLU-Pro Physics#380.7%AccuracyTIGER-Lab / MMLU-Pro2026/05/20
chemistryChemBench#278.9 ptsNormalized ScoreChemBench2026/05/28
economicsMMLU-Pro Economics#284.6%AccuracyTIGER-Lab / MMLU-Pro2026/05/20
financeOpen FinLLM Leaderboard#281.3 ptsComposite ScoreTheFinAI / Open FinLLM2026/05/29
medicineMedHELM#283.9 ptsOverall ScoreMedHELM2026/05/27