PrimeIntellect
/

INTELLECT-MATH-SFT

Model card Files Files and versions

justus27 commited on Jan 22

Commit

3abb996

·

verified ·

1 Parent(s): 2390dfb

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -2,7 +2,7 @@
 license: mit
 ---
-# INTELLECT-MATH: State-of-the-Art Mathematical Reasoning through Better Initializations for Reinforcement Learning
 INTELLECT-MATH is a 7B parameter model optimized for mathematical reasoning. It was trained in two stages, an SFT stage, in which the model was fine-tuned on verified QwQ outputs, and an RL stage, in which the model was trained using the [PRIME-RL](https://github.com/PRIME-RL/PRIME) recipe.

 license: mit
 ---
+# INTELLECT-MATH: Frontier Mathematical Reasoning through Better Initializations for Reinforcement Learning
 INTELLECT-MATH is a 7B parameter model optimized for mathematical reasoning. It was trained in two stages, an SFT stage, in which the model was fine-tuned on verified QwQ outputs, and an RL stage, in which the model was trained using the [PRIME-RL](https://github.com/PRIME-RL/PRIME) recipe.