metadata
license: apache-2.0
tags:
- nougat
- quant
- avx512_vnni
nougat-small-onnx-quant_avx512_vnni
This was quantized from pszemraj/nougat-small-onnx using the --avx512_vnni flag. You need to have a processor with avx512_vnni instructions for this to work properly.
per_channelis set to True for better accuracy- Usage is the same as: https://huggingface.co/pszemraj/nougat-small-onnx
- verify that you have the requirements by checking for results with
lscpu | grep avx512_vnni