Humanity's Last Exam
Paper
•
2501.14249
•
Published
•
76
Note currently the hardest
Note *BB* => BBH => BBEH
Note BB => *BBH* => BBEH
Note BB => BBH => **BBEH**
Note OG MMLU !
Note IFEval
Note Coding Benchmark
Note Best for long context (as of July 2025) long context: at least 8K