Activity Feed

AI & ML interests

Building breatkthrough AI to solve the world's biggest problems.

Recent Activity

baileyk  updated a dataset about 6 hours ago
allenai/dolma3_mix-6T-1025
baileyk  new activity about 6 hours ago
allenai/dolma3_mix-6T-1025:Full Dataset
epwalsh  authored a paper 15 days ago
Olmo 3
View all activity

allenai 's collections 33

Olmo 3.1
The latest members of the Olmo 3 family: another 3 weeks of RL for 32B Think, the 32B Instruct model, large post-training research datasets...
DataDecide
A suite of models, data, and evals over 25 corpora, 14 sizes, and 3 seeds to measure how accurately small experiments predict rankings at large scale.
Tulu V2.5 Suite
A suite of models trained using DPO and PPO across a wide variety (up to 14) of preference datasets. See https://arxiv.org/abs/2406.09279 for more!
OLMo 2 Preview Post-trained Models
These model's tokenizer did not use HF's fast tokenizer, resulting in variations in how pre-tokenization was applied. Resolved in latest versions.
Olmo 3.1
The latest members of the Olmo 3 family: another 3 weeks of RL for 32B Think, the 32B Instruct model, large post-training research datasets...
DataDecide
A suite of models, data, and evals over 25 corpora, 14 sizes, and 3 seeds to measure how accurately small experiments predict rankings at large scale.
Tulu V2.5 Suite
A suite of models trained using DPO and PPO across a wide variety (up to 14) of preference datasets. See https://arxiv.org/abs/2406.09279 for more!
OLMo 2 Preview Post-trained Models
These model's tokenizer did not use HF's fast tokenizer, resulting in variations in how pre-tokenization was applied. Resolved in latest versions.