← 返回排行榜
OpenAI

GPT-4o

主流多模态闭源模型。

闭源 API
Use Case Fit

适合的应用场景

按应用任务展示该模型被推荐的理由和证据数量。

跨 Benchmark 成绩

已收录结果

7Results
领域Benchmark排名分数指标来源更新时间
mathLMArena Math#61304 EloArena EloLMArena2026/05/30
mathMMLU-Pro Mathematics#580.6%AccuracyTIGER-Lab / MMLU-Pro2026/05/20
physicsMMLU-Pro Physics#576.2%AccuracyTIGER-Lab / MMLU-Pro2026/05/20
chemistryChemBench#475.6 ptsNormalized ScoreChemBench2026/05/28
economicsMMLU-Pro Economics#480.2%AccuracyTIGER-Lab / MMLU-Pro2026/05/20
financeOpen FinLLM Leaderboard#182.7 ptsComposite ScoreTheFinAI / Open FinLLM2026/05/29
medicineMedHELM#184.2 ptsOverall ScoreMedHELM2026/05/27