yedi-hu commited on
Commit
57cff20
·
verified ·
1 Parent(s): d1450e9

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -25
README.md CHANGED
@@ -8,29 +8,10 @@ tags:
8
  - PotatoB/evo_exp-point-4-3
9
  ---
10
 
11
- # evo_exp-point-5-3
 
 
 
 
12
 
13
- evo_exp-point-5-3 is a merge of the following models using [mergekit](https://github.com/cg123/mergekit):
14
- * [PotatoB/evo_exp-point-3-1](https://huggingface.co/PotatoB/evo_exp-point-3-1)
15
- * [PotatoB/evo_exp-point-4-3](https://huggingface.co/PotatoB/evo_exp-point-4-3)
16
-
17
- ## 🧩 Configuration
18
-
19
- ```yaml
20
- slices:
21
- - sources:
22
- - model: PotatoB/evo_exp-point-3-1
23
- layer_range: [0, 32]
24
- - model: PotatoB/evo_exp-point-4-3
25
- layer_range: [0, 32]
26
- merge_method: slerp
27
- base_model: PotatoB/evo_exp-point-3-1
28
- parameters:
29
- t:
30
- - filter: self_attn
31
- value: [0, 0.5, 0.3, 0.7, 1]
32
- - filter: mlp
33
- value: [1, 0.5, 0.7, 0.3, 0]
34
- - value: 0.5
35
- dtype: bfloat16
36
- ```
 
8
  - PotatoB/evo_exp-point-4-3
9
  ---
10
 
11
+ This model is a checkpoint produced during iterative model merging experiments.
12
+ It is not intended as a final release but rather as an intermediate artifact that captures
13
+ the progression of merging steps. Researchers and practitioners can use it to study
14
+ the effects of incremental merging strategies, compare intermediate states, or build
15
+ upon this checkpoint in further experiments.
16
 
17
+ For more details, please refer to the paper: [arXiv:2410.12613](https://arxiv.org/abs/2410.12613).