IDEA-Bench: How Far are Generative Models from Professional Designing? Paper • 2412.11767 • Published Dec 16, 2024
MME-VideoOCR: Evaluating OCR-Based Capabilities of Multimodal LLMs in Video Scenarios Paper • 2505.21333 • Published May 27 • 38
Fast Training Data Acquisition for Object Detection and Segmentation using Black Screen Luminance Keying Paper • 2405.07653 • Published May 13, 2024