Update README.md
README.md
CHANGED
|
@@ -14,6 +14,51 @@ language:
|
|
| 14 |
This model was converted to GGUF format from [`allura-org/MN-12b-RP-Ink`](https://huggingface.co/allura-org/MN-12b-RP-Ink) using llama.cpp via ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
|
| 15 |
Refer to the [original model card](https://huggingface.co/allura-org/MN-12b-RP-Ink) for more details on the model.
|
| 16 |
| 17 |
## Use with llama.cpp
|
| 18 |
Install llama.cpp through brew (works on Mac and Linux)
|
| 19 |
|
|
|
|
| 14 |
This model was converted to GGUF format from [`allura-org/MN-12b-RP-Ink`](https://huggingface.co/allura-org/MN-12b-RP-Ink) using llama.cpp via ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
|
| 15 |
Refer to the [original model card](https://huggingface.co/allura-org/MN-12b-RP-Ink) for more details on the model.
|
| 16 |
|
| 17 |
+
---
|
| 18 |
+
Model details:
|
| 19 |
+
-
|
| 20 |
+
A roleplay-focused LoRA finetune of Mistral Nemo Instruct. Methodology and hyperparams inspired by SorcererLM and Slush.
|
| 21 |
+
Renamed to Ink to distinguish from [insert every other rp tune ever], but it's the same data as was used in the Teleut RP model.
|
| 22 |
+
Dataset
|
| 23 |
+
|
| 24 |
+
The worst mix of data you've ever seen. Like, seriously, you do not want to see the things that went into this model. It's bad.
|
| 25 |
+
|
| 26 |
+
"this is like washing down an adderall with a bottle of methylated rotgut" - inflatebot
|
| 27 |
+
Quants
|
| 28 |
+
|
| 29 |
+
Static GGUFs
|
| 30 |
+
|
| 31 |
+
Recommended Settings
|
| 32 |
+
|
| 33 |
+
Chat template: Mistral v3-Tekken
|
| 34 |
+
Recommended samplers (not the be-all and end-all, try some of your own!):
|
| 35 |
+
|
| 36 |
+
Temp 1.25 / MinP 0.1
|
| 37 |
+
Temp 1.03 / TopK 200 / MinP 0.05 / TopA 0.2
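For anyone wondering what the first preset (Temp 1.25 / MinP 0.1) does to the token distribution, here is a rough sketch in plain Python. It is not taken from llama.cpp or any other backend: the function name `sample_min_p` is made up for illustration, and applying temperature before the min-p filter is an assumption, since sampler order differs between backends. TopK and TopA from the second preset are left out.

```python
import math
import random


def sample_min_p(logits, temperature=1.25, min_p=0.1, rng=None):
    """Pick a token index using temperature scaling, then a min-p filter.

    Hypothetical helper for illustration only; real backends may apply
    these steps in a different order.
    """
    rng = rng or random.Random()

    # Temperature: divide the logits before the softmax (higher = flatter).
    scaled = [x / temperature for x in logits]

    # Numerically stable softmax.
    top = max(scaled)
    exps = [math.exp(x - top) for x in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]

    # Min-p: keep tokens with probability >= min_p * (highest probability).
    threshold = min_p * max(probs)
    kept = [(i, p) for i, p in enumerate(probs) if p >= threshold]

    # Sample among the survivors, weighted by their probabilities.
    indices, weights = zip(*kept)
    return rng.choices(indices, weights=weights, k=1)[0]


if __name__ == "__main__":
    # Toy logits: the third token falls below the min-p cutoff.
    print(sample_min_p([5.0, 4.0, 0.0], rng=random.Random(0)))
```

With the toy logits above, only the first two tokens survive the cutoff, so the low-probability tail is never sampled even at a fairly high temperature.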
|
| 38 |
+
|
| 39 |
+
Hyperparams
|
| 40 |
+
General
|
| 41 |
+
|
| 42 |
+
Epochs = 2
|
| 43 |
+
LR = 6e-5
|
| 44 |
+
LR Scheduler = Cosine
|
| 45 |
+
Optimizer = Paged AdamW 8bit
|
| 46 |
+
Effective batch size = 12
|
| 47 |
+
|
| 48 |
+
LoRA
|
| 49 |
+
|
| 50 |
+
Rank = 16
|
| 51 |
+
Alpha = 32
|
| 52 |
+
Dropout = 0.25 (Inspiration: Slush)
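As a quick sanity check on the LoRA numbers, the sketch below collects them in a small Python dataclass and computes the usual LoRA scaling factor, alpha / rank. The class name `LoraSettings` is hypothetical and not tied to any training framework; the actual training config for this model is not shown in the card.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LoraSettings:
    """LoRA values as listed in the card (hypothetical container)."""

    rank: int = 16
    alpha: int = 32
    dropout: float = 0.25

    @property
    def scaling(self) -> float:
        # Standard LoRA multiplies the low-rank update by alpha / rank.
        return self.alpha / self.rank


if __name__ == "__main__":
    settings = LoraSettings()
    print(settings.scaling)  # 2.0
```

So with Rank = 16 and Alpha = 32, the adapter update is scaled by a factor of 2 under the standard formulation.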
|
| 53 |
+
|
| 54 |
+
Credits
|
| 55 |
+
|
| 56 |
+
Humongous thanks to the people who created the data. I would credit you all, but that would be cheating ;)
|
| 57 |
+
Big thanks to all Allura members, especially Toasty, for testing and emotional support ilya /platonic
|
| 58 |
+
Also special thanks to Bot for making the model card image here :3
|
| 59 |
+
NO thanks to Infermatic. They suck at hosting models
|
| 60 |
+
|
| 61 |
+
---
|
| 62 |
## Use with llama.cpp
|
| 63 |
Install llama.cpp through brew (works on Mac and Linux)
|
| 64 |
|