Privileged On-Policy Exploration

Team
classroom

AI & ML interests

None defined yet.

Recent Activity

wendyxwz  updated a dataset 8 days ago
CMU-POPE/trial
CohenQu  authored a paper 10 months ago
Recursive Introspection: Teaching Language Model Agents How to Self-Improve

Yuxiao Qu, Wen Ye

wendyxwz 
updated a dataset 8 days ago

CMU-POPE/trial

Viewer • Updated 8 days ago • 10 • 13
CohenQu 
authored 3 papers 10 months ago

Recursive Introspection: Teaching Language Model Agents How to Self-Improve

Paper • 2407.18219 • Published Jul 25, 2024 • 3

Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning

Paper • 2310.18247 • Published Oct 27, 2023

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Paper • 2503.07572 • Published Mar 10, 2025 • 47
CohenQu 
authored a paper about 1 year ago

Harnessing Webpage UIs for Text-Rich Visual Understanding

Paper • 2410.13824 • Published Oct 17, 2024 • 30