---
license: mit
base_model:
- Qwen/Qwen2.5-7B-Instruct
library_name: transformers
---

**SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning**

[[arXiv]](https://arxiv.org/abs/2504.19162) [[Project]](https://chen-judge.github.io/SPC/)

**Jiaqi Chen**, Bang Zhang, Ruotian Ma, Peisong Wang, Xiaodan Liang, Zhaopeng Tu, Xiaolong Li, Kwan-Yee K. Wong.