Update README.md
README.md CHANGED
@@ -53,7 +53,7 @@ For more details, including benchmark evaluation, hardware requirements, and inf
 | MMLU-Redux | 92.1 | **92.7** | 89.5 | 91.4 |
 | GPQA | **82.8** | 71.1 | 65.8 | 73.4 |
 | SuperGPQA | 57.8 | **60.7** | 51.8 | 56.8 |
-| **Reasoning** | | | | |
+| **Reasoning** | | | | |
 | AIME25 | 72.0 | 81.5 | 70.9 | **85.0** |
 | HMMT25 | 64.2 | 62.5 | 49.8 | **71.4** |
 | LiveBench 20241125 | 74.3 | **77.1** | 74.3 | 76.8 |