Qwen3-Omni-30B-A3B-Thinking-GGUF-INT8FP16 / Qwen3OmniQuantized.modelfile
vito95311's picture
Initial GGUF release: Qwen3-Omni quantized models with Ollama support
d4ef36e
FROM /var/www/qwen3_omni_quantized.gguf
PARAMETER temperature 0.7
PARAMETER top_p 0.8
PARAMETER top_k 40
PARAMETER repeat_penalty 1.1
TEMPLATE """{{ if .System }}<|im_start|>system
{{ .System }}<|im_end|>
{{ end }}{{ if .Prompt }}<|im_start|>user
{{ .Prompt }}<|im_end|>
<|im_start|>assistant
{{ end }}{{ .Response }}<|im_end|>"""
SYSTEM """你是Qwen3-Omni,一個由阿里雲開發的AI助手。你可以處理文本、圖像和音頻輸入。"""