Hyper-Bagel: A Unified Acceleration Framework for Multimodal Understanding and Generation Paper • 2509.18824 • Published Sep 23 • 22
Cocktail: Mixing Multi-Modality Controls for Text-Conditional Image Generation Paper • 2306.00964 • Published Jun 1, 2023 • 1