OneThinker: All-in-one Reasoning Model for Image and Video Paper • 2512.03043 • Published 6 days ago • 28
SCA: Improve Semantic Consistent in Unrestricted Adversarial Attacks via DDPM Inversion Paper • 2410.02240 • Published Oct 3, 2024 • 1
Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views Paper • 2510.18632 • Published Oct 21 • 21
GRE Suite: Geo-localization Inference via Fine-Tuned Vision-Language Models and Enhanced Reasoning Chains Paper • 2505.18700 • Published May 24 • 4
Detail++: Training-Free Detail Enhancer for Text-to-Image Diffusion Models Paper • 2507.17853 • Published Jul 23 • 1
X2Edit: Revisiting Arbitrary-Instruction Image Editing through Self-Constructed Data and Task-Aware Representation Learning Paper • 2508.07607 • Published Aug 11 • 1
Training-Free Watermarking for Autoregressive Image Generation Paper • 2505.14673 • Published May 20 • 12