File size: 548 Bytes
3f13c8c
 
600c271
3f13c8c
600c271
3f13c8c
 
 
 
600c271
3f13c8c
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
# DeepSeek V3.2

First convert huggingface model weights to the the format required by our inference demo. Set `MP` to match your available GPU count:
```bash
cd inference
export EXPERTS=256
python convert.py --hf-ckpt-path ${HF_CKPT_PATH} --save-path ${SAVE_PATH} --n-experts ${EXPERTS} --model-parallel ${MP}
```

Launch the interactive chat interface and start exploring DeepSeek's capabilities:
```bash
export CONFIG=config_671B_v3.2.json
torchrun --nproc-per-node ${MP} generate.py --ckpt-path ${SAVE_PATH} --config ${CONFIG} --interactive
```