DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency Paper • 2504.12080 • Published Apr 16 • 8
BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs Paper • 2504.18415 • Published Apr 25 • 47
Can Large Language Models Help Multimodal Language Analysis? MMLA: A Comprehensive Benchmark Paper • 2504.16427 • Published Apr 23 • 18