Update README.md
### Hugging Face Leaderboard

This model is still an early alpha and we can't guarantee that there isn't any contamination.
However, the average of **71.24** would earn the #2 spot on the HF leaderboard at the time of writing.

| Metric                | Value     |
|-----------------------|-----------|
| MMLU                  | 64.7      |
| **Avg.**              | **48.87** |

Screenshot of the current (sadly no longer maintained) FastEval CoT leaderboard:



### MTBench

```json
    "average": 7.48125
}
```

Screenshot of the current FastEval MT Bench leaderboard:



## Prompt Format

This model follows the ChatML format: