Update README.md

46acfd9 verified 3 months ago

724 Bytes

metadata

license: other
language:
  - en
base_model:
  - mistralai/Mistral-Small-3.2-24B-Instruct-2506

alpha 32.

Original set was created by: "Preference dataset meant to decrease repetition, measured as either copying n-grams from input or infinite / semi-infite repetition of tokens; the chosen split is V3 03/24 instructed to avoid n-gram repetition, while the rejected split consists of either V3 03/24 instructed to copy from the input or Qwen 3 8B with a rep pen of 0.7."