Free-form datasets, human annotations, and sample-level model outputs for "Answer Matching Outperforms Multiple Choice for Language Model Evaluation"
Nikhil Chandak
nikhilchandak
Recent Activity
Liked a dataset 14 days ago: brendel-group/MATH-Beyond
Liked a dataset 27 days ago: ricdomolm/mini-coder-trajs-400k
Liked a model 27 days ago: ricdomolm/mini-coder-1.7b