Update README.md
README.md CHANGED

@@ -2,7 +2,7 @@
 license: mit
 ---
 
-# INTELLECT-MATH:
+# INTELLECT-MATH: Frontier Mathematical Reasoning through Better Initializations for Reinforcement Learning
 
 INTELLECT-MATH is a 7B parameter model optimized for mathematical reasoning. It was trained in two stages, an SFT stage, in which the model was fine-tuned on verified QwQ outputs, and an RL stage, in which the model was trained using the [PRIME-RL](https://github.com/PRIME-RL/PRIME) recipe.
 