Running on Zero 165 HunyuanWorld-Mirror: Universal 3D World Reconstruction with Any Prior Prompting
Post 3893 The new Qwen2-VL models seem to perform quite well in object detection. You can prompt them to respond with bounding boxes in a reference frame of 1k x 1k pixels and then scale those boxes to the original image size. You can try it out with my space maxiw/Qwen2-VL-Detection. 6 replies
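The scaling step mentioned in the post can be sketched roughly as follows. This is a minimal illustration, not the code used in the space: the function name `scale_box`, the `(x1, y1, x2, y2)` corner ordering, and the 0 to 1000 coordinate range are assumptions made for the example.

```python
def scale_box(box, image_width, image_height, ref_size=1000):
    """Map a box from a ref_size x ref_size frame to original image pixels.

    Assumes (hypothetically) that the box is given as (x1, y1, x2, y2)
    with each coordinate in the range 0..ref_size.
    """
    x1, y1, x2, y2 = box
    # Multiply before dividing so round values stay exact in floating point.
    return (
        x1 * image_width / ref_size,
        y1 * image_height / ref_size,
        x2 * image_width / ref_size,
        y2 * image_height / ref_size,
    )


# Example: a box covering the lower-middle part of a 1920 x 1080 image.
scaled = scale_box((250, 500, 750, 1000), image_width=1920, image_height=1080)
print(scaled)  # (480.0, 540.0, 1440.0, 1080.0)
```

The x and y axes are scaled independently, so the mapping also holds for non-square images.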
Article Welcome PaliGemma 2 – New vision language models by Google +2 Dec 5, 2024 • 162
Runtime error Featured 515 Florence2 + SAM2: Segment and caption objects in images and videos