Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Open to Collab
2140.8
TFLOPS
2
2
1
Michal Valko
misovalko
Follow
Yasbok's profile picture
MTCHA's profile picture
malavwarke's profile picture
29 followers
·
109 following
https://misovalko.github.io/
misovalko
misovalko
michalvalko
misovalko.bsky.social
AI & ML interests
large language models, reasoning, fine-tuning, test-time computation, reinforcement learning with human feedback, world models
Recent Activity
upvoted
a
paper
11 days ago
A General Theoretical Paradigm to Understand Learning from Human Preferences
authored
a paper
11 days ago
Optimal Design for Reward Modeling in RLHF
authored
a paper
11 days ago
Sharp Deviations Bounds for Dirichlet Weighted Sums with Application to analysis of Bayesian algorithms
View all activity
Organizations
misovalko
's datasets
1
Sort: Recently updated
misovalko/my-research-papers
Updated
13 days ago
•
12