FlagEval
non-profit
https://flageval.baai.ac.cn/
Recent Activity
xuanricheng authored a paper about 1 month ago: FlagEval Findings Report: A Preliminary Evaluation of Large Reasoning Models on Automatically Verifiable Textual and Visual Questions
philokey authored a paper about 1 month ago: CMMU: A Benchmark for Chinese Multi-modal Multi-type Question Understanding and Reasoning
philokey authored a paper about 1 month ago: Video-SafetyBench: A Benchmark for Safety Evaluation of Video LVLMs
Team members: 11
FlagEval's Spaces: 2
FlagEval-Arena 🐢: Arena (Running, 6)
FlagEval-Debate 🐠: Display a debate interface (Running, 12)