Mistral-Base-7B-DPO / README.md
PeterLauLukCh's picture
Update README.md
026756a verified
metadata
license: mit
datasets:
  - trl-lib/ultrafeedback_binarized
base_model:
  - alignment-handbook/zephyr-7b-sft-full