Locutusque
/

OpenCerebrum-1.0-7b-DPO

Text Generation

question-answering

text-generation-inference

Model card Files Files and versions

Locutusque commited on Mar 27, 2024

Commit

694c35d

·

verified ·

1 Parent(s): c62cf90

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -21,6 +21,8 @@ OpenCerebrum-1.0-7B-DPO is an open-source language model fine-tuned from the alp
 The model was fine-tuned on approximately 21,000 examples across 6 datasets spanning coding, math, science, reasoning, and general instruction-following. The goal was to assemble public datasets that could help the model achieve strong performance on benchmarks where Cerebrum excels.
 ## Model Details
 - **Base Model:** alpindale/Mistral-7B-v0.2-hf

 The model was fine-tuned on approximately 21,000 examples across 6 datasets spanning coding, math, science, reasoning, and general instruction-following. The goal was to assemble public datasets that could help the model achieve strong performance on benchmarks where Cerebrum excels.
+I used the ChatML prompt format to train this model.
 ## Model Details
 - **Base Model:** alpindale/Mistral-7B-v0.2-hf