BAAI
/

AquilaChat2-70B-Expr

Text Generation

Transformers

PyTorch

aquila

Model card Files Files and versions

xet

Community

ldwang commited on Nov 29, 2023

Commit

c105074

1 Parent(s): 46d15f0

Update README.md

Browse files

Files changed (1) hide show

README.md +8 -19

README.md CHANGED Viewed

@@ -9,16 +9,15 @@ license: other
 <h4 align="center">
     <p>
         <b>English</b> |
-        <a href="https://huggingface.co/BAAI/AquilaChat2-7B/blob/main/README_zh.md">简体中文</a>
     </p>
 </h4>
-We opensource our **Aquila2** series, now including **Aquila2**, the base language models, namely **Aquila2-7B** and **Aquila2-34B**, as well as **AquilaChat2**, the chat models, namely **AquilaChat2-7B** and **AquilaChat2-34B**, as well as the long-text chat models, namely **AquilaChat2-7B-16k** and **AquilaChat2-34B-16k**
 The additional details of the Aquila model will be presented in the official technical report. Please stay tuned for updates on official channels.
-## Quick Start  AquilaChat2-7B（Chat model）
 ### 1. Inference
@@ -27,29 +26,19 @@ import torch
 from transformers import AutoTokenizer, AutoModelForCausalLM
 from transformers import BitsAndBytesConfig
-device = torch.device("cuda:0")
-model_info = "BAAI/AquilaChat2-7B"
 tokenizer = AutoTokenizer.from_pretrained(model_info, trust_remote_code=True)
-quantization_config=BitsAndBytesConfig(
-                        load_in_4bit=True,
-                        bnb_4bit_use_double_quant=True,
-                        bnb_4bit_quant_type="nf4",
-                        bnb_4bit_compute_dtype=torch.bfloat16,
-                    )
-model = AutoModelForCausalLM.from_pretrained(model_info, trust_remote_code=True, torch_dtype=torch.float16,
-                                                # quantization_config=quantization_config, # Uncomment this line for 4bit quantization
-                                                )
 model.eval()
-model.to(device)
 text = "请给出10个要到北京旅游的理由。"
 from predict import predict
 out = predict(model, text, tokenizer=tokenizer, max_gen_len=200, top_p=0.95,
-              seed=1234, topk=100, temperature=0.9, sft=True, device=device,
-              model_name="AquilaChat2-7B")
 print(out)
 ```
 ## License
-Aquila2 series open-source model is licensed under [ BAAI Aquila Model Licence Agreement](https://huggingface.co/BAAI/AquilaChat2-7B/blob/main/BAAI-Aquila-Model-License%20-Agreement.pdf)

 <h4 align="center">
     <p>
         <b>English</b> |
+        <a href="https://huggingface.co/BAAI/AquilaChat2-70B/blob/main/README_zh.md">简体中文</a>
     </p>
 </h4>
+We opensource our **Aquila2** series, now including **Aquila2**, the base language models, namely **Aquila2-7B**, **Aquila2-34B** and **Aquila2-70B** , as well as **AquilaChat2**, the chat models, namely **AquilaChat2-7B**, **AquilaChat2-34B** and **AquilaChat2-70B**, as well as the long-text chat models, namely **AquilaChat2-7B-16k** and **AquilaChat2-34B-16k**
 The additional details of the Aquila model will be presented in the official technical report. Please stay tuned for updates on official channels.
+## Quick Start
 ### 1. Inference
 from transformers import AutoTokenizer, AutoModelForCausalLM
 from transformers import BitsAndBytesConfig
+model_info = "BAAI/AquilaChat2-70B"
 tokenizer = AutoTokenizer.from_pretrained(model_info, trust_remote_code=True)
+model = AutoModelForCausalLM.from_pretrained(model_info, trust_remote_code=True, torch_dtype=torch.bfloat16)
 model.eval()
 text = "请给出10个要到北京旅游的理由。"
 from predict import predict
 out = predict(model, text, tokenizer=tokenizer, max_gen_len=200, top_p=0.95,
+              seed=1234, topk=100, temperature=0.9, sft=True,
+              model_name="AquilaChat2-70B")
 print(out)
 ```
 ## License
+Aquila2 series open-source model is licensed under [ BAAI Aquila Model Licence Agreement](https://huggingface.co/BAAI/AquilaChat2-70B/blob/main/BAAI-Aquila-Model-License-Agreement.pdf)