Perusha Moodley's picture

7 9

Perusha Moodley

moodlep

·

https://www.perusha.dev/

AI & ML interests

RL, DRL, Decision Transformers, Auxiliary signals, self-supervised methods

Organizations

upvoted an article 10 months ago

Article

SmolLM - blazingly fast and remarkably powerful

+1

Jul 16, 2024

•

435

upvoted a paper 11 months ago

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published Dec 25, 2024 • 103

upvoted a collection 12 months ago

Scaling Test-Time Compute with Open Models

Models and datasets used in our blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute • 10 items • Updated Jan 6 • 27

upvoted a collection about 1 year ago

Tulu 3 Datasets

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated 2 days ago • 96

upvoted an article over 1 year ago

Article

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

+2

Apr 22, 2024

•

81

upvoted a paper over 1 year ago

Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning

Paper • 2310.20587 • Published Oct 31, 2023 • 18