SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines Paper • 2502.14739 • Published Feb 20 • 104 • 10
OmniBench: Towards The Future of Universal Omni-Language Models Paper • 2409.15272 • Published Sep 23, 2024 • 30 • 2