BigCodeBench Leaderboard
Explore and analyze code completion benchmarks
Explore and analyze code completion benchmarks
Uncensored General Intelligence Leaderboard
Display LMArena Leaderboard
Embedding Leaderboard
Track, rank and evaluate open LLMs and chatbots
Submit code models for evaluation and view leaderboard
Display a web page
Display and request speech recognition model benchmarks
Image Generation and Image Editing Arena & Leaderboard
Display LLM performance leaderboards
Display and explore a leaderboard for model evaluations
imgsys.org -- arena for text guided image generation
Embed ZeroEval for evaluation
Redirect to leaderboard page
Generate visual data analysis plots
Blind vote on HF TTS models!
Tracks perf of LLMs, VLMs and agents on web navigation tasks
DABstep Reasoning Benchmark Leaderboard
Ranking of LLMs for agentic tasks