Letting Large Models Debate: The First Multilingual LLM Debate Competition
•
33
None defined yet.
Do Vision-Language Models Measure Up? Benchmarking Visual Measurement Reading with MeasureBench
Emu3.5: Native Multimodal Models are World Learners
Explore and submit LLM benchmarks
FlagEval VLM Leaderboard
URSA Text-to-Image-to-Video
Explore and compare model evaluations
Open Veo3-style Audio-Video Generation
Search for information using keywords
Leaderboard for MVRB (Massive Visualized IR Benchmark)
Segment and caption objects in images
Generate images and answer questions using text input
Generate and chat using multimodal inputs
Segment 3D medical images using prompts
Generate images from text prompts
Generate images from text prompts
Segment objects in images and videos with a single touch