Privileged On-Policy Exploration

Team

classroom

Recent Activity

wendyxwz updated a dataset 8 days ago

CMU-POPE/trial

CohenQu authored a paper 10 months ago

Recursive Introspection: Teaching Language Model Agents How to Self-Improve

wendyxwz

updated a dataset 8 days ago

CMU-POPE/trial

Viewer • Updated 8 days ago • 10 • 13

CohenQu

authored 3 papers 10 months ago

Recursive Introspection: Teaching Language Model Agents How to Self-Improve

Paper • 2407.18219 • Published Jul 25, 2024 • 3

Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning

Paper • 2310.18247 • Published Oct 27, 2023

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Paper • 2503.07572 • Published Mar 10, 2025 • 47

CohenQu

authored a paper about 1 year ago

Harnessing Webpage UIs for Text-Rich Visual Understanding

Paper • 2410.13824 • Published Oct 17, 2024 • 30