Update README.md
Browse files
README.md
CHANGED
|
@@ -23,7 +23,9 @@ model_name: liberalis-cogitator-llama-3.1-8b-dpo
|
|
| 23 |
|
| 24 |
> *“Thought, unbound, is the only true frontier.”*
|
| 25 |
|
| 26 |
-
**liberalis-cogitator-llama-3.1-8b** is not just a machine for words — it is a forge for ideas. With **8 billion parameters**, trained with a custom **Direct Preference Optimization (DPO)** algorithm on a dataset of **16,000 preference pairs** spanning **~450,000 conversations, problems, and stories**, this model embraces the philosophy that thought should wander without leash or muzzle.
|
|
|
|
|
|
|
| 27 |
|
| 28 |
Its name — *liberalis cogitator* — whispers in Latin: *a thinker who is free*. Not merely free as in “without cost,” but free as in **without walls**.
|
| 29 |
|
|
|
|
| 23 |
|
| 24 |
> *“Thought, unbound, is the only true frontier.”*
|
| 25 |
|
| 26 |
+
**liberalis-cogitator-llama-3.1-8b** is not just a machine for words — it is a forge for ideas. With **8 billion parameters**, trained with a custom **Direct Preference Optimization (DPO)** algorithm on a dataset of **16,000 preference pairs** and a SFT dataset spanning **~450,000 conversations, problems, and stories**, this model embraces the philosophy that thought should wander without leash or muzzle.
|
| 27 |
+
|
| 28 |
+
During DPO fine-tuning, the context window was scaled to 65536, giving this model the capabilities of long conversation.
|
| 29 |
|
| 30 |
Its name — *liberalis cogitator* — whispers in Latin: *a thinker who is free*. Not merely free as in “without cost,” but free as in **without walls**.
|
| 31 |
|