Update README.md
Browse files
README.md
CHANGED
|
@@ -6,7 +6,7 @@ base_model:
|
|
| 6 |
- Qwen/Qwen3-4B
|
| 7 |
pipeline_tag: visual-question-answering
|
| 8 |
---
|
| 9 |
-
# R-4B: Incentivizing General-Purpose Auto-Thinking
|
| 10 |
|
| 11 |
[[π Arxiv Paper (Coming soon)](https://huggingface.co/YannQi/R-4B)] [[π€ Hugging Face](https://huggingface.co/YannQi/R-4B)] [[π€οΈ ModelScope](https://huggingface.co/YannQi/R-4B)] [[π» Code](https://github.com/yannqi/R-4B)]
|
| 12 |
|
|
@@ -33,7 +33,7 @@ The development of R-4B follows a two-stage training paradigm:
|
|
| 33 |
|
| 34 |
- **Auto-thinking Mode:** Unleash **auto-thinking** that works across general topics, from simple Q&A to complex scientific analysis. It saves time and computation by thinking only when it matters.
|
| 35 |
- **Support Manual Control:** Explicitly command the model to use its `thinking` or `non-thinking` capabilities, enabling you to make your choices for every job.
|
| 36 |
-
- π **
|
| 37 |
Our model is now **fully open-source**. It achieves **state-of-the-art performance** among models of comparable size.
|
| 38 |
|
| 39 |
## π’ News
|
|
|
|
| 6 |
- Qwen/Qwen3-4B
|
| 7 |
pipeline_tag: visual-question-answering
|
| 8 |
---
|
| 9 |
+
# R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Integration
|
| 10 |
|
| 11 |
[[π Arxiv Paper (Coming soon)](https://huggingface.co/YannQi/R-4B)] [[π€ Hugging Face](https://huggingface.co/YannQi/R-4B)] [[π€οΈ ModelScope](https://huggingface.co/YannQi/R-4B)] [[π» Code](https://github.com/yannqi/R-4B)]
|
| 12 |
|
|
|
|
| 33 |
|
| 34 |
- **Auto-thinking Mode:** Unleash **auto-thinking** that works across general topics, from simple Q&A to complex scientific analysis. It saves time and computation by thinking only when it matters.
|
| 35 |
- **Support Manual Control:** Explicitly command the model to use its `thinking` or `non-thinking` capabilities, enabling you to make your choices for every job.
|
| 36 |
+
- π **Strong Performance, Open for Everyone!**
|
| 37 |
Our model is now **fully open-source**. It achieves **state-of-the-art performance** among models of comparable size.
|
| 38 |
|
| 39 |
## π’ News
|