Fast and Robust: Task Sampling with Posterior and Diversity Synergies for Adaptive Decision-Makers in Randomized Environments Paper • 2504.19139 • Published Apr 27 • 1
Model Predictive Task Sampling for Efficient and Robust Adaptation Paper • 2501.11039 • Published Jan 19 • 1
Can Prompt Difficulty be Online Predicted for Accelerating RL Finetuning of Reasoning Models? Paper • 2507.04632 • Published Jul 7 • 2