Hi, I’d like to register our benchmark dataset internlm/WildClawBench as an official benchmark on the Hub. We have added eval.yaml to the repo with evaluation_framework: wildclawbench. Could you please add it to the benchmark allow-list? Thanks!
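For reference, here’s a minimal sketch of the eval.yaml we added. Only the `evaluation_framework` field is definitive; the other fields are illustrative placeholders, not a confirmed schema:

```yaml
# eval.yaml — evaluation_framework is the field named in this thread;
# the remaining fields are illustrative placeholders.
evaluation_framework: wildclawbench
task: question-answering   # hypothetical
metric: exact_match        # hypothetical
script: evaluate.py        # hypothetical entry point
```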
Hi,
Thanks for reaching out! Your dataset internlm/WildClawBench and the included eval.yaml look good. We can add it to the official benchmark allow-list.
Before we do, please ensure that:
- The repository follows the Hub’s benchmark submission guidelines.
- The eval.yaml includes all required fields and a working evaluation script (a minimal sketch of such a script follows this list).
- Any dependencies or instructions for reproducing the benchmark are clearly documented.
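For illustration, a minimal self-contained evaluation script could look like the sketch below. This assumes Python with the `datasets` library, exact-match scoring, and `question`/`answer` column names, none of which are confirmed details of WildClawBench:

```python
# Minimal evaluation-script sketch. Assumptions: the benchmark is scored by
# exact match, and the dataset exposes "question" and "answer" columns
# (placeholder names, not the confirmed WildClawBench schema).
from datasets import load_dataset


def evaluate(predict_fn, split: str = "test") -> float:
    """Return exact-match accuracy of predict_fn over the given split."""
    ds = load_dataset("internlm/WildClawBench", split=split)
    correct = sum(
        int(predict_fn(ex["question"]).strip() == ex["answer"].strip())
        for ex in ds
    )
    return correct / len(ds)


if __name__ == "__main__":
    # Trivial baseline: always predict the empty string.
    print(f"exact-match accuracy: {evaluate(lambda q: ''):.3f}")
```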
Once confirmed, we’ll proceed with adding it to the allow-list and it should appear as an official benchmark on the Hub.
Thanks for contributing this!
Hi,
Thanks for the update! I’ve double-checked the repo, eval.yaml, and documentation, and they’re all fully aligned with the Hub’s guidelines.
The benchmark is ready for the allow-list. Looking forward to seeing it live!