37 172 43

KABI

dongguanting

https://dongguanting.github.io/

AI & ML interests

Reasoning and Alignment for Large Language Models

Recent Activity

upvoted a paper 2 days ago

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

upvoted a paper 3 days ago

WithAnyone: Towards Controllable and ID Consistent Image Generation

upvoted a paper 3 days ago

When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection with PsiloQA

View all activity

Organizations

liked a model about 2 months ago

meituan-longcat/LongCat-Flash-Chat

Text Generation • 562B • Updated Sep 24 • 22.6k • 495

liked a dataset about 2 months ago

inclusionAI/ASearcher-train-data

Preview • Updated Aug 13 • 213 • 20

liked 2 datasets 2 months ago

We-Math/We-Math2.0-Pro

Viewer • Updated Aug 19 • 4.55k • 136 • 19

We-Math/We-Math2.0-Standard

Viewer • Updated Aug 19 • 5.84k • 142 • 21

liked a model 2 months ago

Kwai-Klear/Klear-Reasoner-8B

8B • Updated 28 days ago • 24 • 16

liked a model 3 months ago

dongguanting/RAG-Critic-3B

Text Generation • 3B • Updated Jun 28 • 4 • 3

liked 3 datasets 3 months ago

liked 5 models 3 months ago

dongguanting/Llama3.1-8B-ARPO

Text Generation • 8B • Updated Aug 12 • 8 • 1

dongguanting/Qwen3-14B-ARPO-DeepSearch

Text Generation • 15B • Updated Aug 12 • 45 • 5

dongguanting/Qwen2.5-7B-ARPO

Text Generation • 8B • Updated Aug 19 • 38 • 2

dongguanting/Qwen3-8B-ARPO-DeepSearch

8B • Updated Jul 29 • 22 • 2

dongguanting/Qwen2.5-3B-ARPO

Text Generation • 3B • Updated Aug 12 • 14 • 2

liked 3 models 4 months ago

dongguanting/Tool-Star-Qwen-1.5B

Text Generation • 2B • Updated Jun 6 • 53 • 2

dongguanting/Tool-Star-Qwen-0.5B

Text Generation • 0.6B • Updated Jun 6 • 1

dongguanting/Tool-Star-Qwen-7B

Text Generation • 8B • Updated Jun 30 • 52 • 2

liked a dataset 4 months ago

basicv8vc/SimpleQA

Viewer • Updated Nov 5, 2024 • 4.33k • 4.58k • 28

liked 2 datasets 5 months ago

dongguanting/Tool-Star-SFT-54K

Viewer • Updated May 29 • 54k • 175 • 9

dongguanting/Multi-Tool-RL-10K

Viewer • Updated May 25 • 10k • 70 • 4

KABI

AI & ML interests

Recent Activity

Organizations

dongguanting's activity