Update README.md
Browse files
README.md
CHANGED
|
@@ -16,7 +16,7 @@ T-pro-it-2.0 is a model built upon the Qwen 3 model family and incorporates both
|
|
| 16 |
### 📚 Dataset
|
| 17 |
|
| 18 |
Instruction Pre-Training:
|
| 19 |
-
40B tokens of instruction data, with
|
| 20 |
|
| 21 |
Supervised Fine-Tuning (SFT):
|
| 22 |
~500K high-quality and diverse instructions with balanced complexity. Reasoning tasks make up about 20% of the dataset.
|
|
|
|
| 16 |
### 📚 Dataset
|
| 17 |
|
| 18 |
Instruction Pre-Training:
|
| 19 |
+
40B tokens of instruction data, with one-third focused on reasoning tasks.
|
| 20 |
|
| 21 |
Supervised Fine-Tuning (SFT):
|
| 22 |
~500K high-quality and diverse instructions with balanced complexity. Reasoning tasks make up about 20% of the dataset.
|