Update README.md
#7 opened almost 2 years ago
by
milistu
Inference Speed Compared to ExLlama?
#6 opened almost 2 years ago
by
larsskaug
32g Version
👍
1
#5 opened almost 2 years ago
by
larsskaug
How to use this AWQ model from Python code gives an error.
#4 opened almost 2 years ago
by
sudhir2016
AWQ models with transformer pipeline
4
#3 opened about 2 years ago
by
RaviNaik
Prompt for RAG
1
#2 opened about 2 years ago
by
Matthieu