Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
demystify-long-cot
's Collections
Demysitifying Long CoT
Demysitifying Long CoT
updated
Mar 16
Curation of resources used in the paper "Demystifying Long Chain-of-Thought Reasoning in LLMs"
Upvote
4
Demystifying Long Chain-of-Thought Reasoning in LLMs
Paper
•
2502.03373
•
Published
Feb 5
•
58
demystify-long-cot/math-train-qwq-rs-n256
Viewer
•
Updated
Jan 21
•
1.14M
•
33
•
1
demystify-long-cot/llama-3.1-8b-math-qwq-n256-rft
8B
•
Updated
Jan 20
•
7
demystify-long-cot/math-train-qwq-rs-n192
Viewer
•
Updated
Jan 21
•
854k
•
26
demystify-long-cot/llama-3.1-8b-math-qwq-n192-rft-ppo
8B
•
Updated
Jan 20
•
5
demystify-long-cot/llama-3.1-8b-math-qwq-n192-rft
8B
•
Updated
Jan 20
•
6
demystify-long-cot/math-train-qwen-rs-n256
Viewer
•
Updated
Jan 23
•
1.53M
•
30
demystify-long-cot/llama-3.1-8b-math-qwen-n256-rft
8B
•
Updated
Jan 20
•
3
demystify-long-cot/math-train-action-n40
Viewer
•
Updated
Jan 23
•
217k
•
27
demystify-long-cot/math-train-rl
Viewer
•
Updated
Jan 20
•
7.5k
•
18
Upvote
4
Share collection
View history
Collection guide
Browse collections