Curated Numinamath
					Collection
				
				3 items
				โข 
				Updated
					
				
axolotl version: 0.5.0
This is an open-source fine-tuned reasoning adapter of microsoft/Phi-3.5-mini-instruct, transformed into a math reasoning model using data curated from collinear-ai/R1-Distill-SFT-Curated. It achieves the following results on the evaluation set:
This model is a LoRA adaptor and for best results merge it with base model microsoft/Phi-3.5-mini-instruct before use.
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | 
|---|---|---|---|
| No log | 0.0003 | 1 | 0.6714 | 
| 0.337 | 0.3335 | 1243 | 0.3361 | 
| 0.3248 | 0.6669 | 2486 | 0.3203 | 
The following figure shows the accuracy and the speedup of Collinear Curators C1 and C2 when compared to training on unfiltered dataset.

Base model
microsoft/Phi-3.5-mini-instruct