FlagEval

non-profit

https://flageval.baai.ac.cn/

AI & ML interests

None defined yet.

spaces 2

FlagEval-Arena

Arena

FlagEval-Debate

Display a debate interface

models 1

FlagEval/flageval_judgemodel

Text Generation • 33B • Updated Dec 30, 2024 • 2 • 1

datasets 13

FlagEval/ERQAPlus

Viewer • Updated Nov 27, 2025 • 800 • 21 • 1

FlagEval/coco_val2014_sampled

Viewer • Updated Nov 6, 2025 • 1k • 30

FlagEval/MeasureBench

Viewer • Updated Nov 3, 2025 • 2.44k • 396 • 1

FlagEval/EmbodiedVerse-Bench

Viewer • Updated Jun 25, 2025 • 2.04k • 62

FlagEval/Where2Place

Viewer • Updated May 29, 2025 • 100 • 230

FlagEval/SAT

Viewer • Updated May 6, 2025 • 150 • 112

FlagEval/HMMT_2025

Viewer • Updated May 6, 2025 • 30 • 455 • 1

FlagEval/ERQA

Viewer • Updated Apr 22, 2025 • 400 • 1.94k • 4

FlagEval/sub_spatial

Viewer • Updated Apr 21, 2025 • 690 • 11

FlagEval/EmbSpatial-Bench

Viewer • Updated Apr 21, 2025 • 3.64k • 1.03k • 4

View 13 datasets