QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs Paper • 2510.11696 • Published 13 days ago • 165
StreamingVLM: Real-Time Understanding for Infinite Video Streams Paper • 2510.09608 • Published 16 days ago • 48
Artificial Hippocampus Networks for Efficient Long-Context Modeling Paper • 2510.07318 • Published 18 days ago • 27
Cache-to-Cache: Direct Semantic Communication Between Large Language Models Paper • 2510.03215 • Published 23 days ago • 92
SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer Paper • 2509.24695 • Published 28 days ago • 42
LongLive: Real-time Interactive Long Video Generation Paper • 2509.22622 • Published about 1 month ago • 176
AnyCap Project: A Unified Framework, Dataset, and Benchmark for Controllable Omni-modal Captioning Paper • 2507.12841 • Published Jul 17 • 41
Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation Paper • 2507.08441 • Published Jul 11 • 61
ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting Paper • 2410.17856 • Published Oct 23, 2024 • 51
LongVILA: Scaling Long-Context Visual Language Models for Long Videos Paper • 2408.10188 • Published Aug 19, 2024 • 52
Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs Paper • 2406.18629 • Published Jun 26, 2024 • 42
LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models Paper • 2309.12307 • Published Sep 21, 2023 • 89