ai-coscientist 's Collections

AblationBench

This is a collection of datasets used to evaluate language models in the task of ablation planning in empirical AI research.