On Path to Multimodal Generalist: General-Level and General-Bench Paper • 2505.04620 • Published May 7 • 82
Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment Paper • 2502.16894 • Published Feb 24 • 32
Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling Paper • 2311.00430 • Published Nov 1, 2023 • 57
How Efficient Are Today's Continual Learning Algorithms? Paper • 2303.18171 • Published Mar 29, 2023 • 1
SIESTA: Efficient Online Continual Learning with Sleep Paper • 2303.10725 • Published Mar 19, 2023 • 1