Add community evaluation results for GPQA, HLE, MMLU-PRO

#39

by nielsr HF Staff - opened Jan 15

←

Jan 15

•

This PR adds community-provided evaluation results for the following benchmarks:

These results were extracted from the model card. This is based on the new evaluation results feature.

Note: This is an automated PR. Please review the evaluation results before merging.

suhara changed pull request status to merged Jan 18

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment