Qwen3OmniQuantized.modelfile · vito95311/Qwen3-Omni-30B-A3B-Thinking-GGUF-INT8FP16 at main

Qwen3-Omni-30B-A3B-Thinking-GGUF-INT8FP16 / Qwen3OmniQuantized.modelfile

Initial GGUF release: Qwen3-Omni quantized models with Ollama support

d4ef36e about 1 month ago

453 Bytes

	FROM /var/www/qwen3_omni_quantized.gguf

	PARAMETER temperature 0.7
	PARAMETER top_p 0.8
	PARAMETER top_k 40
	PARAMETER repeat_penalty 1.1

	TEMPLATE """{{ if .System }}<\|im_start\|>system
	{{ .System }}<\|im_end\|>
	{{ end }}{{ if .Prompt }}<\|im_start\|>user
	{{ .Prompt }}<\|im_end\|>
	<\|im_start\|>assistant
	{{ end }}{{ .Response }}<\|im_end\|>"""

	SYSTEM """你是Qwen3-Omni，一個由阿里雲開發的AI助手。你可以處理文本、圖像和音頻輸入。"""