# 🦓 ZebraLogic: Benchmarking the Logical Reasoning Ability of Language Models
[📰 Blog](https://huggingface.co/blog/yuchenlin/zebra-logic) | [💻 GitHub](https://github.com/WildEval/ZeroEval) | [🤗 HuggingFace](https://huggingface.co/collections/allenai/zebra-logic-bench-6697137cbaad0b91e635e7b0) | [🐦 X](https://twitter.com/billyuchenlin/) | [💬 Discussion](https://huggingface.co/spaces/allenai/ZebraLogicBench-Leaderboard/discussions) | Updated: **{LAST_UPDATED}**