llmat
/

Qwen3-0.6B-NVFP4

Text Generation

compressed-tensors

Model card Files Files and versions

Qwen3-0.6B-NVFP4 / recipe.yaml

llmat's picture

Add NVFP4 quantized model for Qwen/Qwen3-0.6B.

d83e6a5 verified 2 months ago

history blame contribute delete

130 Bytes

	default_stage:
	default_modifiers:
	QuantizationModifier:
	targets: [Linear]
	ignore: [lm_head]
	scheme: NVFP4