Runpeng Dai PRO

Leo-Dai

AI & ML interests

None yet

Recent Activity

authored a paper 13 days ago

StatEval: A Comprehensive Benchmark for Large Language Models in Statistics

upvoted a paper 13 days ago

StatEval: A Comprehensive Benchmark for Large Language Models in Statistics

commented on a paper 13 days ago

StatEval: A Comprehensive Benchmark for Large Language Models in Statistics

Organizations

upvoted a paper 13 days ago

StatEval: A Comprehensive Benchmark for Large Language Models in Statistics

Paper • 2510.09517 • Published 15 days ago • 6

upvoted a paper 19 days ago

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published 26 days ago • 133

upvoted 2 papers 23 days ago

VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal Reasoning

Paper • 2510.01444 • Published 24 days ago • 19

CLUE: Non-parametric Verification from Experience via Hidden-State Clustering

Paper • 2510.01591 • Published 24 days ago • 26

upvoted 2 papers 27 days ago

Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models

Paper • 2509.06949 • Published Sep 8 • 56

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Paper • 2509.09674 • Published Sep 11 • 78

upvoted 3 papers about 1 month ago

Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation

Paper • 2509.15194 • Published Sep 18 • 33

Look Again, Think Slowly: Enhancing Visual Reflection in Vision-Language Models

Paper • 2509.12132 • Published Sep 15 • 5

CDE: Curiosity-Driven Exploration for Efficient Reinforcement Learning in Large Language Models

Paper • 2509.09675 • Published Sep 11 • 28

upvoted 3 papers about 2 months ago

upvoted a paper 3 months ago

Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference

Paper • 2508.02193 • Published Aug 4 • 130

upvoted a collection 3 months ago

Qwen3

Collection

84 items • Updated Aug 6 • 1.36k

upvoted 2 papers 4 months ago

Learning to Reason via Mixture-of-Thought for Logical Reasoning

Paper • 2505.15817 • Published May 21 • 18

R1-RE: Cross-Domain Relationship Extraction with RLVR

Paper • 2507.04642 • Published Jul 7 • 6

upvoted a paper over 1 year ago

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 625
