Update README.md
Browse files
README.md
CHANGED
|
@@ -19,10 +19,10 @@ The following data has been re-evaluated and calculated as the average for each
|
|
| 19 |
|
| 20 |
| Benchmark | Llama-3.1-8b-Instruct | Meta-Llama-3.1-8B-Instruct-abliterated |
|
| 21 |
|-------------|-----------------------|----------------------------------------|
|
| 22 |
-
| IF_Eval | 80.0
|
| 23 |
-
| MMLU Pro | 36.34
|
| 24 |
| TruthfulQA | 52.98 | **55.42** |
|
| 25 |
-
| BBH | 48.72
|
| 26 |
| GPQA | 33.55 | **33.93** |
|
| 27 |
|
| 28 |
The script used for evaluation can be found inside this repository under /eval.sh, or click [here](https://huggingface.co/huihui-ai/Meta-Llama-3.1-8B-Instruct-abliterated/blob/main/eval.sh)
|
|
|
|
| 19 |
|
| 20 |
| Benchmark | Llama-3.1-8b-Instruct | Meta-Llama-3.1-8B-Instruct-abliterated |
|
| 21 |
|-------------|-----------------------|----------------------------------------|
|
| 22 |
+
| IF_Eval | **80.0** | 78.98 |
|
| 23 |
+
| MMLU Pro | **36.34** | 35.91 |
|
| 24 |
| TruthfulQA | 52.98 | **55.42** |
|
| 25 |
+
| BBH | **48.72** | 47.0 |
|
| 26 |
| GPQA | 33.55 | **33.93** |
|
| 27 |
|
| 28 |
The script used for evaluation can be found inside this repository under /eval.sh, or click [here](https://huggingface.co/huihui-ai/Meta-Llama-3.1-8B-Instruct-abliterated/blob/main/eval.sh)
|