seedbench yj12869741/SeedBench Viewer • Updated Jul 1 • 8.67k • 31 SeedBench: A Multi-task Benchmark for Evaluating Large Language Models in Seed Science Paper • 2505.13220 • Published May 19 • 4
SeedBench: A Multi-task Benchmark for Evaluating Large Language Models in Seed Science Paper • 2505.13220 • Published May 19 • 4
seedbench yj12869741/SeedBench Viewer • Updated Jul 1 • 8.67k • 31 SeedBench: A Multi-task Benchmark for Evaluating Large Language Models in Seed Science Paper • 2505.13220 • Published May 19 • 4
SeedBench: A Multi-task Benchmark for Evaluating Large Language Models in Seed Science Paper • 2505.13220 • Published May 19 • 4