Update README.md
README.md CHANGED
@@ -1,3 +1,31 @@
----
-license: apache-2.0
-
+---
+license: apache-2.0
+language:
+- en
+- zh
+base_model:
+- Qwen/Qwen2.5-14B
+- Qwen/Qwen2.5-14B-Instruct
+- Qwen/Qwen2.5-14B-Instruct-1M
+- tanliboy/lambda-qwen2.5-14b-dpo-test
+- arcee-ai/SuperNova-Medius
+- arcee-ai/Virtuoso-Small-v2
+- Azure99/Blossom-V6-14B
+- Qwen/Qwen2.5-Coder-14B
+- Qwen/Qwen2.5-Coder-14B-Instruct
+- deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
+- huihui-ai/DeepSeek-R1-Distill-Qwen-14B-abliterated-v2
+pipeline_tag: text-generation
+tags:
+- merge
+---
+
+# Qwen2.5-14B-YOYO-V4-p3
+
+This is the **final preview version** of the fourth-generation Qwen YOYO series model, and it is currently my favorite iteration.
+
+Aside from context length, it is identical to **Qwen2.5-14B-YOYO-V4** in all other aspects.
+
+The upcoming official release will not only expand the context length to 1M tokens but also disclose the complete merge recipe.
+
+Stay tuned for more updates!