Global PIQA A physical commonsense reasoning benchmark for 100+ languages, written in collaboration with 300+ researchers from 65 countries. Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures Paper • 2510.24081 • Published 4 days ago • 9 mrlbenchmarks/global-piqa-nonparallel Viewer • Updated 3 days ago • 11.6k • 1.28k • 19
Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures Paper • 2510.24081 • Published 4 days ago • 9
Multilingual Leaderboards Leaderboards for languages other than English Running on CPU Upgrade 74 74 La Leaderboard 🌸 Evaluate open LLMs in the languages of LATAM and Spain. Running on CPU Upgrade 123 123 Open Chinese LLM Leaderboard 🏆 Explore and submit LLM benchmarks Running on CPU Upgrade 167 167 Open Arabic LLM Leaderboard 🏆 Track, rank and evaluate open Arabic LLMs and chatbots Runtime error 40 40 OpenLLM French leaderboard 🇫🇷 🥇 Explore and submit LLM benchmarks
Running on CPU Upgrade 74 74 La Leaderboard 🌸 Evaluate open LLMs in the languages of LATAM and Spain.
Running on CPU Upgrade 167 167 Open Arabic LLM Leaderboard 🏆 Track, rank and evaluate open Arabic LLMs and chatbots
Global PIQA A physical commonsense reasoning benchmark for 100+ languages, written in collaboration with 300+ researchers from 65 countries. Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures Paper • 2510.24081 • Published 4 days ago • 9 mrlbenchmarks/global-piqa-nonparallel Viewer • Updated 3 days ago • 11.6k • 1.28k • 19
Global PIQA: Evaluating Physical Commonsense Reasoning Across 100+ Languages and Cultures Paper • 2510.24081 • Published 4 days ago • 9
Multilingual Leaderboards Leaderboards for languages other than English Running on CPU Upgrade 74 74 La Leaderboard 🌸 Evaluate open LLMs in the languages of LATAM and Spain. Running on CPU Upgrade 123 123 Open Chinese LLM Leaderboard 🏆 Explore and submit LLM benchmarks Running on CPU Upgrade 167 167 Open Arabic LLM Leaderboard 🏆 Track, rank and evaluate open Arabic LLMs and chatbots Runtime error 40 40 OpenLLM French leaderboard 🇫🇷 🥇 Explore and submit LLM benchmarks
Running on CPU Upgrade 74 74 La Leaderboard 🌸 Evaluate open LLMs in the languages of LATAM and Spain.
Running on CPU Upgrade 167 167 Open Arabic LLM Leaderboard 🏆 Track, rank and evaluate open Arabic LLMs and chatbots