phi3-avro-vllm / vllm_config.json
oriolrius's picture
Upload fine-tuned Phi-3 model
07c6872 verified
{
"model_type": "phi3",
"dtype": "float16",
"tensor_parallel_size": 1,
"max_model_len": 4096,
"trust_remote_code": true
}