ReXGroundingCT: A 3D Chest CT Dataset for Segmentation of Findings from Free-Text Reports Paper • 2507.22030 • Published Jul 29, 2025
Unlocking the Potential of MLLMs in Referring Expression Segmentation via a Light-weight Mask Decode Paper • 2508.04107 • Published Aug 6, 2025 • 4
Phrase-grounded Fact-checking for Automatically Generated Chest X-ray Reports Paper • 2509.21356 • Published Sep 20, 2025
UniBiomed: A Universal Foundation Model for Grounded Biomedical Image Interpretation Paper • 2504.21336 • Published Apr 30, 2025 • 4
RadGenome-Chest CT: A Grounded Vision-Language Dataset for Chest CT Analysis Paper • 2404.16754 • Published Apr 25, 2024
Self-Supervised Anatomical Consistency Learning for Vision-Grounded Medical Report Generation Paper • 2509.25963 • Published Sep 30, 2025
Med-GLIP: Advancing Medical Language-Image Pre-training with Large-scale Grounded Dataset Paper • 2508.10528 • Published Aug 14, 2025
M3D: Advancing 3D Medical Image Analysis with Multi-Modal Large Language Models Paper • 2404.00578 • Published Mar 31, 2024 • 1
Dream-VL & Dream-VLA: Open Vision-Language and Vision-Language-Action Models with Diffusion Language Model Backbone Paper • 2512.22615 • Published 21 days ago • 44
Urban Socio-Semantic Segmentation with Vision-Language Reasoning Paper • 2601.10477 • Published 2 days ago • 142