THU-KEG/LLaDA-8B-BGPO-code

linny2002 committed on Oct 14

Commit ee75175 · verified · 1 Parent(s): 20b81f6

Update README.md

Files changed (1)

README.md +16 -17

@@ -1,22 +1,20 @@
----
-license: apache-2.0
-language:
-- en
-tags:
-- reinforcement-learning
-- code-generation
-- dllm
-- bgpo
-- llada
-size_categories:
-- 8B
-base_model:
-- GSAI-ML/LLaDA-8B-Instruct
----
 # LLaDA-8B-BGPO-code
-[![Paper](https://img.shields.io/badge/Paper-arXiv:-red)]()
 [![Code](https://img.shields.io/badge/Code-GitHub-blue)](https://github.com/THU-KEG/BGPO)
 ## Model Description
@@ -49,4 +47,5 @@ base_model:
 - Primarily designed for code generation tasks.
 - Performance may vary on other tasks.
-- Requires appropriate computational resources for inference.

+---
+license: apache-2.0
+language:
+- en
+tags:
+- reinforcement-learning
+- code-generation
+- dllm
+- bgpo
+- llada
+size_categories:
+- 8B
+---
 # LLaDA-8B-BGPO-code
+[![Paper](https://img.shields.io/badge/Paper-arXiv:2510.11683-red)](https://arxiv.org/abs/2510.11683)
 [![Code](https://img.shields.io/badge/Code-GitHub-blue)](https://github.com/THU-KEG/BGPO)
 ## Model Description
 - Primarily designed for code generation tasks.
 - Performance may vary on other tasks.
+- Requires appropriate computational resources for inference.