Artificial Intelligence
ROOT: Robust Orthogonalized Optimizer for Neural Network Training
Rethinking Large Language Model Distillation: A Constrained Markov Decision Process Perspective