LaWa: Using Latent Space for In-Generation Image Watermarking Paper • 2408.05868 • Published Aug 11, 2024
DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models Paper • 2503.02175 • Published Mar 4, 2025
Spatial Reasoning with Vision-Language Models in Ego-Centric Multi-View Scenes Paper • 2509.06266 • Published Sep 8, 2025