wanlige committed on
Commit
7fd12b8
·
verified ·
1 Parent(s): 05894fe

Update README.md

Files changed (1)
  1. README.md +387 -214
README.md CHANGED
@@ -1,214 +1,387 @@
1
- ---
2
- library_name: transformers
3
- tags:
4
- - mergekit
5
- - merge
6
- base_model:
7
- - Qwen/Qwen2.5-14B-Instruct
8
- - Qwen/Qwen2.5-Coder-14B
9
- - deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
10
- - huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated-v2
11
- - tanliboy/lambda-qwen2.5-14b-dpo-test
12
- - SicariusSicariiStuff/Impish_QWEN_14B-1M
13
- - Qwen/Qwen2.5-14B
14
- language:
15
- - zho
16
- - eng
17
- - fra
18
- - spa
19
- - por
20
- - deu
21
- - ita
22
- - rus
23
- - jpn
24
- - kor
25
- - vie
26
- - tha
27
- - ara
28
- model-index:
29
- - name: li-14b-v0.4
30
- results:
31
- - task:
32
- type: text-generation
33
- name: Text Generation
34
- dataset:
35
- name: IFEval (0-Shot)
36
- type: HuggingFaceH4/ifeval
37
- args:
38
- num_few_shot: 0
39
- metrics:
40
- - type: inst_level_strict_acc and prompt_level_strict_acc
41
- value: 81.33
42
- name: strict accuracy
43
- source:
44
- url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=wanlige/li-14b-v0.4
45
- name: Open LLM Leaderboard
46
- - task:
47
- type: text-generation
48
- name: Text Generation
49
- dataset:
50
- name: BBH (3-Shot)
51
- type: BBH
52
- args:
53
- num_few_shot: 3
54
- metrics:
55
- - type: acc_norm
56
- value: 50.38
57
- name: normalized accuracy
58
- source:
59
- url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=wanlige/li-14b-v0.4
60
- name: Open LLM Leaderboard
61
- - task:
62
- type: text-generation
63
- name: Text Generation
64
- dataset:
65
- name: MATH Lvl 5 (4-Shot)
66
- type: hendrycks/competition_math
67
- args:
68
- num_few_shot: 4
69
- metrics:
70
- - type: exact_match
71
- value: 55.74
72
- name: exact match
73
- source:
74
- url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=wanlige/li-14b-v0.4
75
- name: Open LLM Leaderboard
76
- - task:
77
- type: text-generation
78
- name: Text Generation
79
- dataset:
80
- name: GPQA (0-shot)
81
- type: Idavidrein/gpqa
82
- args:
83
- num_few_shot: 0
84
- metrics:
85
- - type: acc_norm
86
- value: 11.86
87
- name: acc_norm
88
- source:
89
- url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=wanlige/li-14b-v0.4
90
- name: Open LLM Leaderboard
91
- - task:
92
- type: text-generation
93
- name: Text Generation
94
- dataset:
95
- name: MuSR (0-shot)
96
- type: TAUR-Lab/MuSR
97
- args:
98
- num_few_shot: 0
99
- metrics:
100
- - type: acc_norm
101
- value: 16.35
102
- name: acc_norm
103
- source:
104
- url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=wanlige/li-14b-v0.4
105
- name: Open LLM Leaderboard
106
- - task:
107
- type: text-generation
108
- name: Text Generation
109
- dataset:
110
- name: MMLU-PRO (5-shot)
111
- type: TIGER-Lab/MMLU-Pro
112
- config: main
113
- split: test
114
- args:
115
- num_few_shot: 5
116
- metrics:
117
- - type: acc
118
- value: 46.3
119
- name: accuracy
120
- source:
121
- url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=wanlige/li-14b-v0.4
122
- name: Open LLM Leaderboard
123
- ---
124
- > [!TIP] This model is currently ranked #1 among the models up to 15B parameters and #50 among all models on the [Open LLM Leaderboard](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard).
125
-
126
- <a href="https://www.36588.com.cn/">
127
- <img src="./sjky.png" alt="关于我们" />
128
- </a>
129
-
130
- ---
131
-
132
-
133
-
134
- 世纪开元智印互联科技集团股份有限公司创立于2001年3月9日,总部位于山东省济南市。历经20余年发展,世纪开元以技术创新为核心,实现互联网与传统印刷行业的深度融合,探索出了区别于传统印刷行业的新模式、新业态。
135
-
136
- 世纪开元主要从事定制化影像、商务印刷及包装印刷类产品的研发、设计、生产及销售,通过将互联网、数字化、自动化和智能化等新模式和新技术与包装印刷行业相结合的方式,使小批量个性化定制产品订单得以相对标准化、规模化生产,旨在满足个人消费者及各类企业用户的小批量定制化需求,为用户提供一站式场景化定制印刷服务,实现全流程智能制造,已成长为业内领先的“工业互联网”印刷企业。
137
-
138
- 在未来发展中,世纪开元将一如既往地加大技术研发投入,深度融合互联网、大数据、人工智能等新一代信息技术,注重专项技术人才的培养,积极引进数字化、智能化手段优化创新业务流程和实现用户体验的提升,并通过多维度的企业发展,带动行业协同发展,促进印刷行业新旧动能转换,开拓印刷行业发展新方向。
139
-
140
- 了解更多,请访问我们的官网:[世纪开元](https://www.36588.com.cn/)
141
-
142
- ---
143
-
144
-
145
-
146
- Established on March 9, 2001, and headquartered in Jinan, Shandong Province, **Century Innovation** has grown over the past two decades by focusing on technological innovation. The company has achieved deep integration of the Internet with the traditional printing industry, pioneering new models and business formats distinct from conventional printing practices.
147
-
148
- **Century Innovation** specializes in the research, design, production, and sales of customized imaging, commercial printing, and packaging products. By combining the Internet, digitalization, automation, and intelligent technologies with the printing industry, the company enables relatively standardized and scalable production for small-batch personalized custom orders. This approach aims to meet the needs of individual consumers and various enterprise users for small-batch customization, providing users with one-stop, scenario-based custom printing services and achieving full-process intelligent manufacturing. As a result, **Century Innovation** has become a leading "Industrial Internet" printing enterprise.
149
-
150
- In the future, **Century Innovation** will continue to increase investment in technology R&D, deeply integrate the Internet, big data, artificial intelligence, and other next-generation information technologies, and focus on cultivating specialized technical talent. The company will actively adopt digital and intelligent means to optimize innovative business processes and enhance user experience. Through multi-dimensional development, it aims to drive industry collaboration, promote the transformation of old and new drivers in the printing industry, and explore new directions for its growth.
151
-
152
- To learn more, visit our official website: [Century Innovation](https://www.36588.com.cn/)
153
-
154
- ---
155
-
156
-
157
-
158
-
159
-
160
-
161
-
162
- # merge
163
-
164
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
165
-
166
- ## Merge Details
167
- ### Merge Method
168
-
169
- This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [Qwen/Qwen2.5-14B-Instruct](https://huggingface.co/Qwen/Qwen2.5-14B-Instruct) as a base.
170
-
171
- ### Models Merged
172
-
173
- The following models were included in the merge:
174
- * [Qwen/Qwen2.5-Coder-14B](https://huggingface.co/Qwen/Qwen2.5-Coder-14B)
175
- * [deepseek-ai/DeepSeek-R1-Distill-Qwen-14B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B)
176
- * [huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated-v2](https://huggingface.co/huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated-v2)
177
- * [tanliboy/lambda-qwen2.5-14b-dpo-test](https://huggingface.co/tanliboy/lambda-qwen2.5-14b-dpo-test)
178
- * [SicariusSicariiStuff/Impish_QWEN_14B-1M](https://huggingface.co/SicariusSicariiStuff/Impish_QWEN_14B-1M)
179
- * [Qwen/Qwen2.5-14B](https://huggingface.co/Qwen/Qwen2.5-14B)
180
-
181
- ### Configuration
182
-
183
- The following YAML configuration was used to produce this model:
184
-
185
- ```yaml
186
- models:
187
- - model: deepseek-ai/DeepSeek-R1-Distill-Qwen-14B #logic
188
- - model: huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated-v2 #uncensored
189
- - model: Qwen/Qwen2.5-14B #text generation
190
- - model: Qwen/Qwen2.5-14B-Instruct #chat assistant
191
- - model: Qwen/Qwen2.5-Coder-14B #coding
192
- - model: SicariusSicariiStuff/Impish_QWEN_14B-1M #math
193
- - model: tanliboy/lambda-qwen2.5-14b-dpo-test #dpo
194
- merge_method: model_stock
195
- base_model: Qwen/Qwen2.5-14B-Instruct
196
- normalize: true
197
- int8_mask: true
198
- dtype: bfloat16
199
-
200
- ```
201
-
202
- # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
203
- Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/wanlige__li-14b-v0.4-details)
204
-
205
- | Metric |Value|
206
- |-------------------|----:|
207
- |Avg. |43.66|
208
- |IFEval (0-Shot) |81.33|
209
- |BBH (3-Shot) |50.38|
210
- |MATH Lvl 5 (4-Shot)|55.74|
211
- |GPQA (0-shot) |11.86|
212
- |MuSR (0-shot) |16.35|
213
- |MMLU-PRO (5-shot) |46.30|
214
-
1
+ ---
2
+ library_name: transformers
3
+ tags:
4
+ - mergekit
5
+ - merge
6
+ base_model:
7
+ - Qwen/Qwen2.5-14B-Instruct
8
+ - Qwen/Qwen2.5-Coder-14B
9
+ - deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
10
+ - huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated-v2
11
+ - tanliboy/lambda-qwen2.5-14b-dpo-test
12
+ - SicariusSicariiStuff/Impish_QWEN_14B-1M
13
+ - Qwen/Qwen2.5-14B
14
+ model-index:
15
+
16
+ - name: li-14b-v0.4
17
+
18
+ results:
19
+
20
+ - task:
21
+
22
+ type: text-generation
23
+
24
+ name: Text Generation
25
+
26
+ dataset:
27
+
28
+ name: IFEval (0-Shot)
29
+
30
+ type: HuggingFaceH4/ifeval
31
+
32
+ args:
33
+
34
+ num_few_shot: 0
35
+
36
+ metrics:
37
+
38
+ - type: inst_level_strict_acc and prompt_level_strict_acc
39
+
40
+ value: 81.33
41
+
42
+ name: strict accuracy
43
+
44
+ source:
45
+
46
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=wanlige/li-14b-v0.4
47
+
48
+ name: Open LLM Leaderboard
49
+
50
+ - task:
51
+
52
+ type: text-generation
53
+
54
+ name: Text Generation
55
+
56
+ dataset:
57
+
58
+ name: BBH (3-Shot)
59
+
60
+ type: BBH
61
+
62
+ args:
63
+
64
+ num_few_shot: 3
65
+
66
+ metrics:
67
+
68
+ - type: acc_norm
69
+
70
+ value: 50.38
71
+
72
+ name: normalized accuracy
73
+
74
+ source:
75
+
76
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=wanlige/li-14b-v0.4
77
+
78
+ name: Open LLM Leaderboard
79
+
80
+ - task:
81
+
82
+ type: text-generation
83
+
84
+ name: Text Generation
85
+
86
+ dataset:
87
+
88
+ name: MATH Lvl 5 (4-Shot)
89
+
90
+ type: hendrycks/competition_math
91
+
92
+ args:
93
+
94
+ num_few_shot: 4
95
+
96
+ metrics:
97
+
98
+ - type: exact_match
99
+
100
+ value: 55.74
101
+
102
+ name: exact match
103
+
104
+ source:
105
+
106
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=wanlige/li-14b-v0.4
107
+
108
+ name: Open LLM Leaderboard
109
+
110
+ - task:
111
+
112
+ type: text-generation
113
+
114
+ name: Text Generation
115
+
116
+ dataset:
117
+
118
+ name: GPQA (0-shot)
119
+
120
+ type: Idavidrein/gpqa
121
+
122
+ args:
123
+
124
+ num_few_shot: 0
125
+
126
+ metrics:
127
+
128
+ - type: acc_norm
129
+
130
+ value: 11.86
131
+
132
+ name: acc_norm
133
+
134
+ source:
135
+
136
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=wanlige/li-14b-v0.4
137
+
138
+ name: Open LLM Leaderboard
139
+
140
+ - task:
141
+
142
+ type: text-generation
143
+
144
+ name: Text Generation
145
+
146
+ dataset:
147
+
148
+ name: MuSR (0-shot)
149
+
150
+ type: TAUR-Lab/MuSR
151
+
152
+ args:
153
+
154
+ num_few_shot: 0
155
+
156
+ metrics:
157
+
158
+ - type: acc_norm
159
+
160
+ value: 16.35
161
+
162
+ name: acc_norm
163
+
164
+ source:
165
+
166
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=wanlige/li-14b-v0.4
167
+
168
+ name: Open LLM Leaderboard
169
+
170
+ - task:
171
+
172
+ type: text-generation
173
+
174
+ name: Text Generation
175
+
176
+ dataset:
177
+
178
+ name: MMLU-PRO (5-shot)
179
+
180
+ type: TIGER-Lab/MMLU-Pro
181
+
182
+ config: main
183
+
184
+ split: test
185
+
186
+ args:
187
+
188
+ num_few_shot: 5
189
+
190
+ metrics:
191
+
192
+ - type: acc
193
+
194
+ value: 46.3
195
+
196
+ name: accuracy
197
+
198
+ source:
199
+
200
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=wanlige/li-14b-v0.4
201
+
202
+ name: Open LLM Leaderboard
203
+
204
+ ---
205
+
206
+ > [!TIP] This model is currently ranked #1 among the models up to 15B parameters and #50 among all models on the [Open LLM Leaderboard](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard).
207
+
208
+
209
+
210
+ <a href="https://www.36588.com.cn/">
211
+
212
+ <img src="./sjky.png" alt="关于我们" />
213
+
214
+ </a>
215
+
216
+
217
+
218
+ ---
219
+
220
+
221
+
222
+
223
+
224
+
225
+
226
+ 世纪开元智印互联科技集团股份有限公司创立于2001年3月9日,总部位于山东省济南市。历经20余年发展,世纪开元以技术创新为核心,实现互联网与传统印刷行业的深度融合,探索出了区别于传统印刷行业的新模式、新业态。
227
+
228
+
229
+
230
+ 世纪开元主要从事定制化影像、商务印刷及包装印刷类产品的研发、设计、生产及销售,通过将互联网、数字化、自动化和智能化等新模式和新技术与包装印刷行业相结合的方式,使小批量个性化定制产品订单得以相对标准化、规模化生产,旨在满足个人消费者及各类企业用户的小批量定制化需求,为用户提供一站式场景化定制印刷服务,实现全流程智能制造,已成长为业内领先的“工业互联网”印刷企业。
231
+
232
+
233
+
234
+ 在未来发展中,世纪开元将一如既往地加大技术研发投入,深度融合互联网、大数据、人工智能等新一代信息技术,注重专项技术人才的培养,积极引进数字化、智能化手段优化创新业务流程和实现用户体验的提升,并通过多维度的企业发展,带动行业协同发展,促进印刷行业新旧动能转换,开拓印刷行业发展新方向。
235
+
236
+
237
+
238
+ 了解更多,请访问我们的官网:[世纪开元](https://www.36588.com.cn/)
239
+
240
+
241
+
242
+ ---
243
+
244
+
245
+
246
+
247
+
248
+
249
+
250
+ Established on March 9, 2001, and headquartered in Jinan, Shandong Province, **Century Innovation** has grown over the past two decades by focusing on technological innovation. The company has achieved deep integration of the Internet with the traditional printing industry, pioneering new models and business formats distinct from conventional printing practices.
251
+
252
+
253
+
254
+ **Century Innovation** specializes in the research, design, production, and sales of customized imaging, commercial printing, and packaging products. By combining the Internet, digitalization, automation, and intelligent technologies with the printing industry, the company enables relatively standardized and scalable production for small-batch personalized custom orders. This approach aims to meet the needs of individual consumers and various enterprise users for small-batch customization, providing users with one-stop, scenario-based custom printing services and achieving full-process intelligent manufacturing. As a result, **Century Innovation** has become a leading "Industrial Internet" printing enterprise.
255
+
256
+
257
+
258
+ In the future, **Century Innovation** will continue to increase investment in technology R&D, deeply integrate the Internet, big data, artificial intelligence, and other next-generation information technologies, and focus on cultivating specialized technical talent. The company will actively adopt digital and intelligent means to optimize innovative business processes and enhance user experience. Through multi-dimensional development, it aims to drive industry collaboration, promote the transformation of old and new drivers in the printing industry, and explore new directions for its growth.
259
+
260
+
261
+
262
+ To learn more, visit our official website: [Century Innovation](https://www.36588.com.cn/)
263
+
264
+
265
+
266
+ ---
267
+
268
+
269
+
270
+
271
+
272
+
273
+
274
+
275
+
276
+
277
+
278
+
279
+
280
+
281
+
282
+ # merge
283
+
284
+
285
+
286
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
287
+
288
+
289
+
290
+ ## Merge Details
291
+
292
+ ### Merge Method
293
+
294
+
295
+
296
+ This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [Qwen/Qwen2.5-14B-Instruct](https://huggingface.co/Qwen/Qwen2.5-14B-Instruct) as a base.
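
As a rough illustration of what the Model Stock method does per weight tensor, here is a minimal pure-Python sketch. It assumes the interpolation rule from the linked paper: with N fine-tuned models and an angle θ between their deltas from the base weights, the ratio is t = N·cos θ / (1 + (N − 1)·cos θ), and the merged tensor is t·(average of fine-tuned weights) + (1 − t)·(base weights). The function name `model_stock_merge` and the flat-list tensor representation are hypothetical and only for illustration; this is not mergekit's actual code.

```python
import math
from itertools import combinations


def model_stock_merge(base, finetuned):
    """Merge one flattened weight tensor (hypothetical helper, not mergekit's API).

    base:      list of floats, the base-model weights
    finetuned: list of lists of floats, one per fine-tuned model
    Returns (merged_weights, t), where t is the interpolation ratio.
    """
    n = len(finetuned)
    if n < 2:
        raise ValueError("need at least two fine-tuned models")

    # Task vectors: how far each fine-tuned model moved away from the base.
    deltas = [[w - b for w, b in zip(ft, base)] for ft in finetuned]

    def cosine(u, v):
        dot = sum(a * b for a, b in zip(u, v))
        norm_u = math.sqrt(sum(a * a for a in u))
        norm_v = math.sqrt(sum(b * b for b in v))
        # Treat a zero-length delta as orthogonal to everything.
        return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

    # Assumption: estimate cos(theta) as the mean pairwise cosine of the deltas.
    pairs = list(combinations(deltas, 2))
    cos_theta = sum(cosine(u, v) for u, v in pairs) / len(pairs)

    denom = 1 + (n - 1) * cos_theta
    if denom == 0:
        raise ValueError("degenerate angle: interpolation ratio is undefined")
    t = n * cos_theta / denom

    # Interpolate between the fine-tuned average and the base weights.
    avg = [sum(col) / n for col in zip(*finetuned)]
    merged = [t * a + (1 - t) * b for a, b in zip(avg, base)]
    return merged, t


# Orthogonal deltas give t = 0, so the result falls back to the base weights.
merged_orth, t_orth = model_stock_merge([0.0, 0.0], [[1.0, 0.0], [0.0, 1.0]])
# Identical deltas give t = 1, so the result is the fine-tuned average.
merged_same, t_same = model_stock_merge([0.0, 0.0], [[1.0, 0.0], [1.0, 0.0]])
```

The two calls at the end show the limiting behaviour: the less the fine-tuned models agree on a direction, the more the merge leans on the base weights.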
297
+
298
+
299
+
300
+ ### Models Merged
301
+
302
+
303
+
304
+ The following models were included in the merge:
305
+
306
+ * [Qwen/Qwen2.5-Coder-14B](https://huggingface.co/Qwen/Qwen2.5-Coder-14B)
307
+
308
+ * [deepseek-ai/DeepSeek-R1-Distill-Qwen-14B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B)
309
+
310
+ * [huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated-v2](https://huggingface.co/huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated-v2)
311
+
312
+ * [tanliboy/lambda-qwen2.5-14b-dpo-test](https://huggingface.co/tanliboy/lambda-qwen2.5-14b-dpo-test)
313
+
314
+ * [SicariusSicariiStuff/Impish_QWEN_14B-1M](https://huggingface.co/SicariusSicariiStuff/Impish_QWEN_14B-1M)
315
+
316
+ * [Qwen/Qwen2.5-14B](https://huggingface.co/Qwen/Qwen2.5-14B)
317
+
318
+
319
+
320
+ ### Configuration
321
+
322
+
323
+
324
+ The following YAML configuration was used to produce this model:
325
+
326
+
327
+
328
+ ```yaml
329
+
330
+ models:
331
+
332
+ - model: deepseek-ai/DeepSeek-R1-Distill-Qwen-14B #logic
333
+
334
+ - model: huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated-v2 #uncensored
335
+
336
+ - model: Qwen/Qwen2.5-14B #text generation
337
+
338
+ - model: Qwen/Qwen2.5-14B-Instruct #chat assistant
339
+
340
+ - model: Qwen/Qwen2.5-Coder-14B #coding
341
+
342
+ - model: SicariusSicariiStuff/Impish_QWEN_14B-1M #math
343
+
344
+ - model: tanliboy/lambda-qwen2.5-14b-dpo-test #dpo
345
+
346
+ merge_method: model_stock
347
+
348
+ base_model: Qwen/Qwen2.5-14B-Instruct
349
+
350
+ normalize: true
351
+
352
+ int8_mask: true
353
+
354
+ dtype: bfloat16
355
+
356
+
357
+
358
+ ```
359
+
360
+
361
+
362
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
363
+
364
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/wanlige__li-14b-v0.4-details)
365
+
366
+
367
+
368
+ | Metric |Value|
369
+
370
+ |-------------------|----:|
371
+
372
+ |Avg. |43.66|
373
+
374
+ |IFEval (0-Shot) |81.33|
375
+
376
+ |BBH (3-Shot) |50.38|
377
+
378
+ |MATH Lvl 5 (4-Shot)|55.74|
379
+
380
+ |GPQA (0-shot) |11.86|
381
+
382
+ |MuSR (0-shot) |16.35|
383
+
384
+ |MMLU-PRO (5-shot) |46.30|
385
+
386
+
387
+
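
The "Avg." row in the table above appears to be the plain arithmetic mean of the six benchmark scores. A small sketch to check this, using only the values listed in the table (the dictionary name `scores` is just for illustration):

```python
# Benchmark scores copied from the results table above.
scores = {
    "IFEval (0-Shot)": 81.33,
    "BBH (3-Shot)": 50.38,
    "MATH Lvl 5 (4-Shot)": 55.74,
    "GPQA (0-shot)": 11.86,
    "MuSR (0-shot)": 16.35,
    "MMLU-PRO (5-shot)": 46.30,
}

# Unweighted mean, rounded to two decimals like the table.
average = round(sum(scores.values()) / len(scores), 2)
print(f"Avg. = {average:.2f}")
```

The six scores sum to 261.96, and dividing by 6 gives 43.66, which matches the reported average.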