fMoE: Fine-Grained Expert Offloading for Large Mixture-of-Experts Serving Paper • 2502.05370 • Published Feb 7, 2025
Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss Paper • 2512.23447 • Published 3 days ago • 81