Gen-Verse
/

TraDo-4B-Instruct

Model card Files Files and versions

Lingaaaaaaa commited on Sep 9

Commit

6992822

·

verified ·

1 Parent(s): 5d24bf8

Update README.md

Files changed (1) hide show

README.md +15 -11

README.md CHANGED Viewed

@@ -2,6 +2,17 @@
 license: mit
 ---
 <p align="center">
   <img src="https://github.com/yinjjiew/Data/raw/main/dllm-rl/figure1.png" width="100%"/>
 </p>
@@ -12,22 +23,15 @@ license: mit
 </p>
-# Introduction to TraDo
-We introduce **TraDo**, SOTA diffusion language model, trained with **TraceRL**.
-* **TraDo-4B-Instruct** and **TraDo-8B-Instruct** outperform similarly sized strong AR models across math reasoning tasks.
-* **TraDo-8B-Thinking** is the first Long-CoT diffusion language model.
-[Paper](https://arxiv.org/abs/2506.03136) | [Code](https://github.com/Gen-Verse/dLLM-RL)
 # Citation
 ```
-@article{wang2025cure,
-  title={Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning},
-  author={Wang, Yinjie and Yang, Ling and Tian, Ye and Shen, Ke and Wang, Mengdi},
-  journal={arXiv preprint arXiv:2506.03136},
   year={2025}
 }
 ```

 license: mit
 ---
+# Introduction to TraDo
+[Paper](https://arxiv.org/abs/2509.06949) | [Code](https://github.com/Gen-Verse/dLLM-RL)
+We introduce **TraDo**, SOTA diffusion language model, trained with **TraceRL**.
+* **TraDo-4B-Instruct** and **TraDo-8B-Instruct** outperform similarly sized strong AR models across math reasoning tasks.
+* **TraDo-8B-Thinking** is the first Long-CoT diffusion language model.
 <p align="center">
   <img src="https://github.com/yinjjiew/Data/raw/main/dllm-rl/figure1.png" width="100%"/>
 </p>
 </p>
 # Citation
 ```
+@article{wang2025trado,
+  title={Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models},
+  author={Wang, Yinjie and Yang, Ling and Li, Bowen and Tian, Ye and Shen, Ke and Wang, Mengdi},
+  journal={arXiv preprint arXiv:2509.06949},
   year={2025}
 }
 ```