UM-Text: A Unified Multimodal Model for Image Understanding Paper • 2601.08321 • Published 4 days ago • 6
From RAG to Agentic RAG for Faithful Islamic Question Answering Paper • 2601.07528 • Published 5 days ago
Prototypicality Bias Reveals Blindspots in Multimodal Evaluation Metrics Paper • 2601.04946 • Published 9 days ago
ResTok: Learning Hierarchical Residuals in 1D Visual Tokenizers for Autoregressive Image Generation Paper • 2601.03955 • Published 10 days ago • 2
FlowBlending: Stage-Aware Multi-Model Sampling for Fast and High-Fidelity Video Generation Paper • 2512.24724 • Published 17 days ago • 6
Dream2Flow: Bridging Video Generation and Open-World Manipulation with 3D Object Flow Paper • 2512.24766 • Published 17 days ago • 7
Gabliteration: Adaptive Multi-Directional Neural Weight Modification for Selective Behavioral Alteration in Large Language Models Paper • 2512.18901 • Published 26 days ago • 3
Vision-Language-Action Models for Autonomous Driving: Past, Present, and Future Paper • 2512.16760 • Published 30 days ago • 13
What matters for Representation Alignment: Global Information or Spatial Structure? Paper • 2512.10794 • Published Dec 11, 2025 • 8
Multi-Modal Data-Efficient 3D Scene Understanding for Autonomous Driving Paper • 2405.05258 • Published May 8, 2024
Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations Paper • 2507.05260 • Published Jul 7, 2025
An Empirical Study of Training State-of-the-Art LiDAR Segmentation Models Paper • 2405.14870 • Published May 23, 2024
Veila: Panoramic LiDAR Generation from a Monocular RGB Image Paper • 2508.03690 • Published Aug 5, 2025
SuperFlow++: Enhanced Spatiotemporal Consistency for Cross-Modal Data Pretraining Paper • 2503.19912 • Published Mar 25, 2025
Point Transformer V3 Extreme: 1st Place Solution for 2024 Waymo Open Dataset Challenge in Semantic Segmentation Paper • 2407.15282 • Published Jul 21, 2024
SEE4D: Pose-Free 4D Generation via Auto-Regressive Video Inpainting Paper • 2510.26796 • Published Oct 30, 2025
ThreadWeaver: Adaptive Threading for Efficient Parallel Reasoning in Language Models Paper • 2512.07843 • Published Nov 24, 2025 • 21