Enabling Flexible Multi-LLM Integration for Scalable Knowledge Aggregation Paper • 2505.23844 • Published May 28
Token Reduction Should Go Beyond Efficiency in Generative Models -- From Vision, Language to Multimodality Paper • 2505.18227 • Published May 23