Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
WJoeWeiler 's Collections
Memory

Memory

updated May 6
Upvote
-

  • Toward Evaluative Thinking: Meta Policy Optimization with Evolving Reward Models

    Paper • 2504.20157 • Published Apr 28 • 37

  • The Leaderboard Illusion

    Paper • 2504.20879 • Published Apr 29 • 72

  • ReasonIR: Training Retrievers for Reasoning Tasks

    Paper • 2504.20595 • Published Apr 29 • 53

  • RM-R1: Reward Modeling as Reasoning

    Paper • 2505.02387 • Published May 5 • 78

  • Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play

    Paper • 2505.02707 • Published May 5 • 85
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs