o4 mini
LatestOpenAI
•
Proprietary# 61
Released
Apr 16, 2025
# 14
Knowledge Cutoff
Jun 24
# 9
Context Length
200K
Benchmarks
# 73
Code RankedAGI
55.8%
# 35
SWEBench Verified
68.1%
# 78
Agentic RankedAGI
42.3%
# 7
Coding LiveBench 25.5
74.2%
# 13
Code LMArena
1334
# 21
Cyber Gym
2.5%
# 5
Codeforces ELO
2719
# 72
Reason RankedAGI
53.1%
# 45
HLE
18.1%
# 44
GPQA Diamond
81.4%
# 11
Reason LiveBench 25.5
78.5%
# 37
Text Arena
1390
# 32
AIME 2025 I & II
84.0%
# 10
Math LiveBench 25.5
81.0%
# 6
MMMU
81.6%
# 28
Halluc. Hughes
4.6%
# 6
IF LiveBench 25.5
81.8%
# 10
Avg LiveBench 25.5
67.4%
# 14
Coding LiveBench 25.4
61.8%
# 8
Agentic LiveBench 25.5
21.7%
# 93
Math RankedAGI
61.5%
# 76
RAGI RankedAGI
51.3%
# 28
Svelte Bench v1
13.3%
# 53
Code DesignArena
1040
# 32
Toolathlon Pass@1
14.8%
Pricing
# 29
Input Cost /M
$1.1
# 34
Output Cost /M
$4.4
# 23
Cached Cost /M
$0.28