LiqunMa
/

FBI-LLM_7B

Text Generation

text-generation-inference

Model card Files Files and versions

LiqunMa commited on Jul 6, 2024

Commit

740eb40

·

verified ·

1 Parent(s): 3c37a9f

Update README.md

Files changed (1) hide show

README.md +2 -3

README.md CHANGED Viewed

@@ -26,7 +26,7 @@ We use [AmberDateset](https://huggingface.co/datasets/LLM360/AmberDatasets) to t
 ## Result
-<img src=https://huggingface.co/LiqunMa/FBI-LLM_7B/resolve/main/main_result.jpg width="70%" />
 <!-- ![image](https://huggingface.co/LiqunMa/FBI-LLM_7B/resolve/main/main_result.jpg =200x) -->
 ## How to use
@@ -53,8 +53,7 @@ def load_model(model_size, model_dir):
     for p in ckpt_plist:
       weight_dict = torch.load(p)
       for k,v in _weight_dict.items():
-          if 'self_attn.rotary_emb.inv_freq' not in k:
-              weight_dict[k] = v
     model.load_state_dict(weight_dict)
     for param in model.parameters():

 ## Result
+<img src=https://huggingface.co/LiqunMa/FBI-LLM_7B/resolve/main/main_result.jpg width="90%" />
 <!-- ![image](https://huggingface.co/LiqunMa/FBI-LLM_7B/resolve/main/main_result.jpg =200x) -->
 ## How to use
     for p in ckpt_plist:
       weight_dict = torch.load(p)
       for k,v in _weight_dict.items():
+          weight_dict[k] = v
     model.load_state_dict(weight_dict)
     for param in model.parameters():