baidu
/

ERNIE-4.5-300B-A47B-PT

Text Generation

Model card Files Files and versions

WYF3634076 commited on Sep 1

Commit

894e3d4

·

verified ·

1 Parent(s): f7d35c3

Update README.md

vllm model card update

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -120,12 +120,12 @@ print("generate_text:", generate_text)
 ```bash
 # 80G * 16 GPU
-vllm serve baidu/ERNIE-4.5-300B-A47B-PT --trust-remote-code
 ```
 ```bash
-# FP8 online quantification 80G * 16 GPU
-vllm serve baidu/ERNIE-4.5-300B-A47B-PT --trust-remote-code --quantization fp8
 ```
 ## Best Practices

 ```bash
 # 80G * 16 GPU
+vllm serve baidu/ERNIE-4.5-300B-A47B-PT --tensor-parallel-size 16
 ```
 ```bash
+# FP8 online quantification 80G * 8 GPU
+vllm serve baidu/ERNIE-4.5-300B-A47B-PT --tensor-parallel-size 8 --quantization fp8
 ```
 ## Best Practices