Visual Representation Learning with Stochastic Frame Prediction Paper • 2406.07398 • Published Jun 11, 2024 • 1
Efficient Long Video Tokenization via Coordinated-based Patch Reconstruction Paper • 2411.14762 • Published Nov 22, 2024 • 11