Arthur Douillard's picture

5 10

Arthur Douillard

ArthurDouillard

·

https://arthurdouillard.com/

AI & ML interests

Continual Learning, Computer Vision, Transformers

Organizations

None yet

upvoted a paper 4 months ago

DiLoCoX: A Low-Communication Large-Scale Training Framework for Decentralized Cluster

Paper • 2506.21263 • Published Jun 26 • 4

upvoted 2 papers 8 months ago

Communication-Efficient Language Model Training Scales Reliably and Robustly: Scaling Laws for DiLoCo

Paper • 2503.09799 • Published Mar 12 • 15

Eager Updates For Overlapped Communication and Computation in DiLoCo

Paper • 2502.12996 • Published Feb 18 • 7

upvoted a paper 9 months ago

Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch

Paper • 2501.18512 • Published Jan 30 • 30

upvoted a paper about 1 year ago

Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2

Paper • 2408.05147 • Published Aug 9, 2024 • 40

upvoted 3 papers over 1 year ago

BOND: Aligning LLMs with Best-of-N Distillation

Paper • 2407.14622 • Published Jul 19, 2024 • 20

PaliGemma: A versatile 3B VLM for transfer

Paper • 2407.07726 • Published Jul 10, 2024 • 72

WARP: On the Benefits of Weight Averaged Rewarded Policies

Paper • 2406.16768 • Published Jun 24, 2024 • 23

upvoted 2 papers almost 2 years ago

Asynchronous Local-SGD Training for Language Modeling

Paper • 2401.09135 • Published Jan 17, 2024 • 12

DiLoCo: Distributed Low-Communication Training of Language Models

Paper • 2311.08105 • Published Nov 14, 2023 • 16