Michal Valko's picture

Open to Collab

2 2 1

Michal Valko

misovalko

·

https://misovalko.github.io/

AI & ML interests

large language models, reasoning, fine-tuning, test-time computation, reinforcement learning with human feedback, world models

Recent Activity

upvoted a paper 13 days ago

A General Theoretical Paradigm to Understand Learning from Human Preferences

authored a paper 13 days ago

Optimal Design for Reward Modeling in RLHF

authored a paper 13 days ago

Sharp Deviations Bounds for Dirichlet Weighted Sums with Application to analysis of Bayesian algorithms

View all activity

Organizations

liked a Space almost 2 years ago

Daily Papers

Complete list of past Daily Papers