Triangle104
/

MN-12b-RP-Ink-Q8_0-GGUF

Model card Files Files and versions

Triangle104 commited on Dec 26, 2024

Commit

e76c7e4

·

verified ·

1 Parent(s): 9f7db32

Update README.md

Files changed (1) hide show

README.md +45 -0

README.md CHANGED Viewed

@@ -14,6 +14,51 @@ language:
 This model was converted to GGUF format from [`allura-org/MN-12b-RP-Ink`](https://huggingface.co/allura-org/MN-12b-RP-Ink) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
 Refer to the [original model card](https://huggingface.co/allura-org/MN-12b-RP-Ink) for more details on the model.
 ## Use with llama.cpp
 Install llama.cpp through brew (works on Mac and Linux)

 This model was converted to GGUF format from [`allura-org/MN-12b-RP-Ink`](https://huggingface.co/allura-org/MN-12b-RP-Ink) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
 Refer to the [original model card](https://huggingface.co/allura-org/MN-12b-RP-Ink) for more details on the model.
+---
+Model details:
+-
+A roleplay-focused LoRA finetune of Mistral Nemo Instruct. Methodology and hyperparams inspired by SorcererLM and Slush.
+Renamed to Ink to distinguish from [insert every other rp tune ever], but it's the same data as was used in the Teleut RP model.
+Dataset
+The worst mix of data you've ever seen. Like, seriously, you do not want to see the things that went into this model. It's bad.
+"this is like washing down an adderall with a bottle of methylated rotgut" - inflatebot
+Quants
+    Static GGUFs
+Recommended Settings
+Chat template: Mistral v3-Tekken
+Recommended samplers (not the be-all-end-all, try some on your own!):
+    Temp 1.25 / MinP 0.1
+    Temp 1.03 / TopK 200 / MinP 0.05 / TopA 0.2
+Hyperparams
+General
+    Epochs = 2
+    LR = 6e-5
+    LR Scheduler = Cosine
+    Optimizer = Paged AdamW 8bit
+    Effective batch size = 12
+LoRA
+    Rank = 16
+    Alpha = 32
+    Dropout = 0.25 (Inspiration: Slush)
+Credits
+Humongous thanks to the people who created the data. I would credit you all, but that would be cheating ;)
+Big thanks to all Allura members, especially Toasty, for testing and emotional support ilya /platonic
+Also special thanks to Bot for making the model card image here :3
+NO thanks to Infermatic. They suck at hosting models
+---
 ## Use with llama.cpp
 Install llama.cpp through brew (works on Mac and Linux)