Predicting Task Performance with Context-aware Scaling Laws (submitted by kylemontgomery, WangLab)
Budget-aware Test-time Scaling via Discriminative Verification (submitted by kylemontgomery, WangLab)
SteeringControl: Holistic Evaluation of Alignment Steering in LLMs (submitted by ncrispino, WangLab)