Exploring Conditions for Diffusion models in Robotic Control Paper • 2510.15510 • Published 18 days ago • 39
Learning with Unmasked Tokens Drives Stronger Vision Learners Paper • 2310.13593 • Published Oct 20, 2023
Match me if you can: Semi-Supervised Semantic Correspondence Learning with Unpaired Images Paper • 2311.18540 • Published Nov 30, 2023
Similarity of Neural Architectures using Adversarial Attack Transferability Paper • 2210.11407 • Published Oct 20, 2022
DNNs May Determine Major Properties of Their Outputs Early, with Timing Possibly Driven by Bias Paper • 2502.08167 • Published Feb 12 • 1
Masking meets Supervision: A Strong Learning Alliance Paper • 2306.11339 • Published Jun 20, 2023
MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation Paper • 2411.19067 • Published Nov 28, 2024 • 8
Token-Supervised Value Models for Enhancing Mathematical Reasoning Capabilities of Large Language Models Paper • 2407.12863 • Published Jul 12, 2024 • 1
SeiT++: Masked Token Modeling Improves Storage-efficient Training Paper • 2312.10105 • Published Dec 15, 2023
Rethinking Channel Dimensions for Efficient Model Design Paper • 2007.00992 • Published Jul 2, 2020 • 1
Rethinking Spatial Dimensions of Vision Transformers Paper • 2103.16302 • Published Mar 30, 2021 • 1
SeiT: Storage-Efficient Vision Training with Tokens Using 1% of Pixel Storage Paper • 2303.11114 • Published Mar 20, 2023
ViDT: An Efficient and Effective Fully Transformer-based Object Detector Paper • 2110.03921 • Published Oct 8, 2021 • 1
AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights Paper • 2006.08217 • Published Jun 15, 2020
Scratching Visual Transformer's Back with Uniform Attention Paper • 2210.08457 • Published Oct 16, 2022