Olmo 3 Post-training Collection All artifacts for post-training Olmo 3. Datasets follow the model that resulted from training on them. • 32 items • Updated 4 days ago • 46
RLCR Collection Collection of models and datasets for Beyond Binary Rewards: Training LMs to Reason about their Uncertainty • 10 items • Updated Aug 6 • 7
Model Merging Collection Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 249
[NeurIPS 2025] RPC Resources Collection Sampled Reasoning Paths for NeurIPS 2025 Paper: A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning • 6 items • Updated Oct 23 • 8
view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 Mar 26 • 177
view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) +2 Dec 9, 2022 • 385
view article Article PyTorchModelHubMixin: Bridging the Gap for Custom AI Models on Hugging Face Nov 11, 2024 • 20