TMLR-Group-HF/Co-rewarding-I-Llama-3.2-3B-Instruct-MATH Text Generation • 4B • Updated 18 days ago • 15
TMLR-Group-HF/Self-Certainty-Qwen3-1.7B-Base-MATH Text Generation • 2B • Updated 18 days ago • 14 • 1
TMLR-Group-HF/Self-Certainty-Llama-3.2-3B-Instruct-MATH Text Generation • 4B • Updated 18 days ago • 10