Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Benjamin Therien's picture
5

Benjamin Therien

btherien
Gargaz's profile picture lucazsh's profile picture Fishtiks's profile picture
·
https://bentherien.github.io/
  • benjamintherien
  • bentherien
  • benjamintherien

AI & ML interests

Passionate about machine learning research! Currently working on efficient foundation model pre-training and learned optimization.

Organizations

Mila – Quebec Artificial Intelligence Institute's profile picture CERC-AAI's profile picture

authored 3 papers over 1 year ago

Continual Pre-Training of Large Language Models: How to (re)warm your model?

Paper • 2308.04014 • Published Aug 8, 2023 • 2

$μ$LO: Compute-Efficient Meta-Generalization of Learned Optimizers

Paper • 2406.00153 • Published May 31, 2024 • 13

Simple and Scalable Strategies to Continually Pre-train Large Language Models

Paper • 2403.08763 • Published Mar 13, 2024 • 51
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs