Text Generation
Transformers
Safetensors
PyTorch
nemotron_h
nvidia
conversational
custom_code
Eval Results

Add community evaluation results for GPQA, HLE, MMLU-PRO

#39
by nielsr HF Staff - opened

This PR adds community-provided evaluation results for the following benchmarks:

  • GPQA
    HLE
    MMLU-PRO

These results were extracted from the model card. This is based on the new evaluation results feature.

Note: This is an automated PR. Please review the evaluation results before merging.

suhara changed pull request status to merged

Sign up or log in to comment