Add use cases in README
README.md
CHANGED
@@ -63,4 +63,52 @@ Detailed information including technical report will be released later.

|---|---|---|---|---|---|---|---|---|
||Instruct|Instruct|Non-thinking|Thinking|Non-thinking|Thinking|Non-thinking|Thinking|
|Average|67.08|50.95|54.97|77.82|54.66|79.55|54.78|78.66|
|Improvement||+31.65%|+22.02%|-13.80%|+22.72%|-15.68%|+22.45%|-14.73%|
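
The Improvement row appears to be the relative change of the first column's average (67.08) against each of the other columns, e.g. (67.08 − 50.95) / 50.95 ≈ +31.7%. This reading is an inference, since the header row naming the models sits above this hunk. A quick sketch to reproduce it:

```python
# Assumed reading of the Improvement row: relative change of the first
# column's average against each remaining column. (Interpretation inferred;
# the model-name header row is outside this diff hunk.)
ours = 67.08
baselines = [50.95, 54.97, 77.82, 54.66, 79.55, 54.78, 78.66]
print([f"{(ours - b) / b:+.2%}" for b in baselines])
# ['+31.66%', '+22.03%', '-13.80%', '+22.72%', '-15.68%', '+22.45%', '-14.72%']
# Matches the table to within 0.01 pp; the table seems to truncate rather
# than round, or was computed from unrounded averages.
```
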
## How to use in transformers

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

# Load the model; the repo ships custom modeling code, hence trust_remote_code.
model = AutoModelForCausalLM.from_pretrained(
    "Motif-Technologies/Motif-2-12.7B-Instruct",
    trust_remote_code=True,
    attn_implementation="flash_attention_2",
    dtype=torch.bfloat16,  # currently supports bf16 only, for efficiency
).cuda()

tokenizer = AutoTokenizer.from_pretrained(
    "Motif-Technologies/Motif-2-12.7B-Instruct",
    trust_remote_code=True,
)

# Build the chat prompt; enable_thinking toggles the reasoning trace (see outputs below).
query = "What is the capital city of South Korea?"
input_ids = tokenizer.apply_chat_template(
    [
        {'role': 'system', 'content': 'You are a helpful assistant.'},
        {'role': 'user', 'content': query},
    ],
    add_generation_prompt=True,
    enable_thinking=False,  # or True
    return_tensors='pt',
).cuda()

# Generate, then decode only the newly generated tokens.
output = model.generate(input_ids, max_new_tokens=1024, pad_token_id=tokenizer.eos_token_id)
output = tokenizer.decode(output[0, input_ids.shape[-1]:], skip_special_tokens=False)
print(output)
```
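
Note that `skip_special_tokens=False` is presumably what keeps the `<|endofturn|><|endoftext|>` markers visible in the samples below; for user-facing text you would normally pass `True`.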

### Outputs

```
# With enable_thinking=True, the model is FORCED to think.
Okay, the user is asking for the capital city of South Korea. Let me think. I know that South Korea's capital is Seoul. But wait, I should double-check to make sure I'm not mixing it up with other countries. For example, North Korea's capital is Pyongyang. So yes, South Korea's capital is definitely Seoul. I should just provide that as the answer.
</think>
The capital city of South Korea is **Seoul**.
<|endofturn|><|endoftext|>

# With enable_thinking=False, the model decides for itself whether to think; in this example, thinking is not worth it.
The capital city of South Korea is Seoul.
<|endofturn|><|endoftext|>
```
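
If you need the final answer separated from the reasoning trace, here is a minimal post-processing sketch. It assumes the trace is always terminated by a literal `</think>` and the turn by `<|endofturn|>`, as in the samples above; `split_thinking` is a hypothetical helper, not part of the repo:

```python
def split_thinking(text: str) -> tuple[str, str]:
    """Split a decoded completion into (thinking_trace, final_answer)."""
    thinking, sep, answer = text.partition("</think>")
    if not sep:               # no trace emitted, e.g. with enable_thinking=False
        thinking, answer = "", text
    for tok in ("<|endofturn|>", "<|endoftext|>"):
        answer = answer.replace(tok, "")  # drop end-of-turn markers
    return thinking.strip(), answer.strip()

trace, answer = split_thinking(output)  # `output` from the snippet above
print(answer)  # The capital city of South Korea is **Seoul**.
```
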
## How to use in vllm
TBD
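
Until official instructions land, here is an untested sketch of what offline inference might look like, assuming vLLM can load the repo's custom code via `trust_remote_code` (whether vLLM supports this architecture is an open question):

```python
# Untested sketch -- official vLLM support/instructions are still TBD.
from vllm import LLM, SamplingParams

llm = LLM(
    model="Motif-Technologies/Motif-2-12.7B-Instruct",
    trust_remote_code=True,   # assumes vLLM can load the repo's custom code
    dtype="bfloat16",         # matches the bf16-only note above
)

outputs = llm.chat(
    [
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "What is the capital city of South Korea?"},
    ],
    SamplingParams(max_tokens=1024),
)
print(outputs[0].outputs[0].text)
```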