Update README.md
Browse files
README.md
CHANGED
|
@@ -35,7 +35,7 @@ For **SDAR** models, inference hyperparameters are set to: `block_length = 4`, `
|
|
| 35 |
For **Qwen3-1.7B-AR-SFT** and **Qwen3-30B-AR-SFT**, we use *greedy decoding*, and the base models **Qwen3-1.7B-Base** and **Qwen3-30B-Base** are derived from the [Qwen3 Technical Report](https://arxiv.org/abs/2505.09388).
|
| 36 |
|
| 37 |
<p align="center">
|
| 38 |
-
<img src="https://raw.githubusercontent.com/JetAstra/SDAR/main/assets/table1.png" style="max-width:
|
| 39 |
<p align="center">
|
| 40 |
|
| 41 |
### SDAR-Sci v.s. AR Baseline
|
|
@@ -44,7 +44,7 @@ This table presents a **controlled comparison** between AR and SDAR under the sa
|
|
| 44 |
The results are averaged over 8 runs for GPQA, and over 32 runs each for AIME 2024, AIME 2025, and LiveMathBench.
|
| 45 |
|
| 46 |
<p align="center">
|
| 47 |
-
<img src="https://raw.githubusercontent.com/JetAstra/SDAR/main/assets/table2.png" style="max-width:
|
| 48 |
<p align="center">
|
| 49 |
|
| 50 |
#### SDAR-Sci v.s. Other Models
|
|
@@ -53,5 +53,5 @@ This table positions **SDAR-30B-A3B-Sci(sample)** against leading open-source an
|
|
| 53 |
Scores for external models are sourced from the [InternLM/Intern-S1](https://github.com/InternLM/Intern-S1) repository.
|
| 54 |
|
| 55 |
<p align="center">
|
| 56 |
-
<img src="https://raw.githubusercontent.com/JetAstra/SDAR/main/assets/table3.png" style="max-width:
|
| 57 |
<p align="center">
|
|
|
|
| 35 |
For **Qwen3-1.7B-AR-SFT** and **Qwen3-30B-AR-SFT**, we use *greedy decoding*, and the base models **Qwen3-1.7B-Base** and **Qwen3-30B-Base** are derived from the [Qwen3 Technical Report](https://arxiv.org/abs/2505.09388).
|
| 36 |
|
| 37 |
<p align="center">
|
| 38 |
+
<img src="https://raw.githubusercontent.com/JetAstra/SDAR/main/assets/table1.png" style="max-width:100%; height:auto;">
|
| 39 |
<p align="center">
|
| 40 |
|
| 41 |
### SDAR-Sci v.s. AR Baseline
|
|
|
|
| 44 |
The results are averaged over 8 runs for GPQA, and over 32 runs each for AIME 2024, AIME 2025, and LiveMathBench.
|
| 45 |
|
| 46 |
<p align="center">
|
| 47 |
+
<img src="https://raw.githubusercontent.com/JetAstra/SDAR/main/assets/table2.png" style="max-width:100%; height:auto;">
|
| 48 |
<p align="center">
|
| 49 |
|
| 50 |
#### SDAR-Sci v.s. Other Models
|
|
|
|
| 53 |
Scores for external models are sourced from the [InternLM/Intern-S1](https://github.com/InternLM/Intern-S1) repository.
|
| 54 |
|
| 55 |
<p align="center">
|
| 56 |
+
<img src="https://raw.githubusercontent.com/JetAstra/SDAR/main/assets/table3.png" style="max-width:100%; height:auto;">
|
| 57 |
<p align="center">
|