Qwen3-0.6B-NVFP4 / recipe.yaml
llmat's picture
Add NVFP4 quantized model for Qwen/Qwen3-0.6B.
d83e6a5 verified
raw
history blame contribute delete
130 Bytes
default_stage:
default_modifiers:
QuantizationModifier:
targets: [Linear]
ignore: [lm_head]
scheme: NVFP4