Update src/about.py
src/about.py +6 -4
src/about.py
CHANGED
|
@@ -61,13 +61,15 @@ We welcome the community to submit evaluation results for new models. These resu
|
|
| 61 |
|
| 62 |
### 1 - Running Evaluation 🏃♂️
|
| 63 |
|
| 64 |
-
We have written a detailed guide for running the evaluation on your model. You can find it in the [align-anything](https://github.com/PKU-Alignment/align-anything/tree/main/align_anything/evaluation/
|
| 65 |
|
| 66 |
### 2 - Submitting Results 🚀
|
| 67 |
|
| 68 |
To submit your results, create a **Pull Request** in the community tab to add them under the [community_results](https://huggingface.co/spaces/PKU-Alignment/EvalAnything-LeaderBoard/tree/main/community_results) in this repository:
|
| 69 |
-
- Create a folder named `ORG_MODELNAME_USERNAME`. For example `PKU-Alignment_gemini1.5-
|
| 70 |
-
- Place your
|
| 71 |
|
| 72 |
The title of the PR should be `[Community Submission] Model: org/model, Username: your_username`. Replace `org` and `model` with the values for the model you evaluated.
|
| 73 |
|
|
@@ -76,7 +78,7 @@ A verified result in Eval-Anything indicates that a core maintainer has decoded
|
|
| 76 |
|
| 77 |
1. Email us and provide a brief rationale for why your model should be verified.
|
| 78 |
2. Await our response and approval before proceeding.
|
| 79 |
-
3. Prepare a script to decode from your model that does not require a GPU. Typically, this should be the same script used for your model contribution. It should run without requiring a local GPU. We strongly recommend that you modify the scripts in [align-anything](https://github.com/PKU-Alignment/align-anything/tree/main/align_anything/evaluation/
|
| 80 |
4. Generate temporary OpenAI API keys for running the script and share them with us. Specifically, we need the keys for evaluation.
|
| 81 |
5. We will check and execute your script, update the results, and inform you so that you can revoke the temporary keys.
|
| 82 |
|
|
|
|
| 61 |
|
| 62 |
### 1 - Running Evaluation 🏃♂️
|
| 63 |
|
| 64 |
+
We have written a detailed guide for running the evaluation on your model. You can find it in the [align-anything](https://github.com/PKU-Alignment/align-anything/tree/main/align_anything/evaluation/eval_anything).
|
| 65 |
+
|
| 66 |
+
**Note:** The current code is a sample script. In the future, we will integrate Eval Anything's evaluation pipeline into the framework to make it easier for the community to use.
|
| 67 |
|
| 68 |
### 2 - Submitting Results 🚀
|
| 69 |
|
| 70 |
To submit your results, create a **Pull Request** in the community tab to add them under the [community_results](https://huggingface.co/spaces/PKU-Alignment/EvalAnything-LeaderBoard/tree/main/community_results) in this repository:
|
| 71 |
+
- Create a folder named `ORG_MODELNAME_USERNAME`. For example, `PKU-Alignment_gemini1.5-pro_XiaoMing`.
|
| 72 |
+
- Place all your generation and evaluation results in the folder.
|
| 73 |
|
| 74 |
The title of the PR should be `[Community Submission] Model: org/model, Username: your_username`. Replace `org` and `model` with the values for the model you evaluated.
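
The naming rules above can be sketched as a small Python helper. This is an illustration only: the function names `submission_folder_name`, `pr_title`, and `prepare_submission` are hypothetical and not part of the leaderboard repository. They simply assemble the folder name and PR title in the formats stated above and create the folder locally.

```python
from pathlib import Path


def submission_folder_name(org: str, model: str, username: str) -> str:
    """Build the folder name in the ORG_MODELNAME_USERNAME format."""
    return f"{org}_{model}_{username}"


def pr_title(org: str, model: str, username: str) -> str:
    """Build the PR title in the format requested for community submissions."""
    return f"[Community Submission] Model: {org}/{model}, Username: {username}"


def prepare_submission(root: Path, org: str, model: str, username: str) -> Path:
    """Create the submission folder under community_results and return its path.

    `root` is assumed to be a local checkout of the leaderboard repository.
    """
    folder = root / "community_results" / submission_folder_name(org, model, username)
    folder.mkdir(parents=True, exist_ok=True)
    return folder


if __name__ == "__main__":
    # Values taken from the example folder name in the text.
    print(submission_folder_name("PKU-Alignment", "gemini1.5-pro", "XiaoMing"))
    print(pr_title("PKU-Alignment", "gemini1.5-pro", "XiaoMing"))
```

After creating the folder, copy your generation and evaluation results into it before opening the Pull Request.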
|
| 75 |
|
|
|
|
| 78 |
|
| 79 |
1. Email us and provide a brief rationale for why your model should be verified.
|
| 80 |
2. Await our response and approval before proceeding.
|
| 81 |
+
3. Prepare a script to decode from your model that does not require a GPU. Typically, this should be the same script used for your model contribution. It should run without requiring a local GPU. We strongly recommend that you modify the scripts in [align-anything](https://github.com/PKU-Alignment/align-anything/tree/main/align_anything/evaluation/eval_anything) to adapt to your model's operation.
|
| 82 |
4. Generate temporary OpenAI API keys for running the script and share them with us. Specifically, we need the keys for evaluation.
|
| 83 |
5. We will check and execute your script, update the results, and inform you so that you can revoke the temporary keys.
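
One way to keep the temporary keys from steps 4 and 5 out of the script itself is to read them from the environment at run time. The following is a minimal sketch, assuming the key is supplied through the `OPENAI_API_KEY` environment variable (the variable the official OpenAI Python client reads by default). The helper name `load_api_key` is hypothetical and does not come from the align-anything scripts.

```python
import os
from typing import Mapping, Optional


def load_api_key(
    env_var: str = "OPENAI_API_KEY",
    environ: Optional[Mapping[str, str]] = None,
) -> str:
    """Return the API key stored in `env_var`, or raise if it is missing.

    `environ` defaults to the process environment; passing a dict makes
    the function easy to test without touching real credentials.
    """
    source = os.environ if environ is None else environ
    key = source.get(env_var, "").strip()
    if not key:
        raise RuntimeError(
            f"{env_var} is not set; export a temporary key before running the script."
        )
    return key
```

Because the key never appears in the script, revoking it after verification requires no change to the submitted code.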
|
| 84 |
|