Update README.md
Browse files
README.md
CHANGED
|
@@ -11,6 +11,9 @@ tags:
|
|
| 11 |
base_model:
|
| 12 |
- Qwen/Qwen2.5-72B-Instruct
|
| 13 |
---
|
|
|
|
|
|
|
|
|
|
| 14 |
# Athene-V2-Chat-72B: Rivaling GPT-4o across Benchmarks
|
| 15 |
|
| 16 |
<p align="center">
|
|
|
|
| 11 |
base_model:
|
| 12 |
- Qwen/Qwen2.5-72B-Instruct
|
| 13 |
---
|
| 14 |
+
> [!NOTE]
|
| 15 |
+
> EXL2 4.65bpw-h6 quantized version of [Nexusflow/Athene-V2-Chat](https://huggingface.co/Nexusflow/Athene-V2-Chat). Supports 32K context with Q4 cache on systems with 48 GB VRAM.
|
| 16 |
+
|
| 17 |
# Athene-V2-Chat-72B: Rivaling GPT-4o across Benchmarks
|
| 18 |
|
| 19 |
<p align="center">
|