Model Card for Model ID

Quick Llama 3 8B finetune with ORPO. Demontration that it can be fine tune in 2 hours only. Thanks to Maxime Labonne's notebook:

https://colab.research.google.com/drive/1eHNWg9gnaXErdAa8_mcvjMupbSS6rDvi?usp=sharing

  • Number of training samples from the dataset: 1500 out of 40K
  • Hardware Type: L4
  • Hours of training: 2
  • Cloud Provider: google colab
Downloads last month
2
Safetensors
Model size
8B params
Tensor type
F16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Dataset used to train mayacinka/OrpoLlama-3-8B