AI & ML interests
LLMs
Organizations
None yet
rkumar1999/Llama3.2-3B-Prover-Math-openr1-distill-SFT
Text Generation
•
175k
•
Updated
•
10
rkumar1999/Phi-mini-MoE-Prover-Math-openr1-distill-SFT
Text Generation
•
2.99M
•
Updated
•
8
rkumar1999/Phi-mini-MoE-Mix-Prover-openr1-distill-SFT
Text Generation
•
2.99M
•
Updated
•
7
rkumar1999/Phi-mini-MoE-Prover-openr1-distill-SFT
Text Generation
•
2.99M
•
Updated
•
7
rkumar1999/phi-tiny-moe-lean-sft
Text Generation
•
4B
•
Updated
•
8
rkumar1999/phi-tiny-moe-math-lean-sft
Text Generation
•
4B
•
Updated
•
12
rkumar1999/Qwen1.5-MoE-A2.7B-mixed-openr1-distill-SFT
Text Generation
•
Updated
•
10
rkumar1999/Llama3.2-3B-Prover-openr1-distill-GRPO
Text Generation
•
Updated
•
8
rkumar1999/Llama3.2-3B-Prover-openr1-distill-SFT
Text Generation
•
175k
•
Updated
•
14
rkumar1999/Llama3.2-3B-Prover-openr1-SFT
Text Generation
•
3B
•
Updated
•
6
rkumar1999/llama3.2-3b-obt
Updated
rkumar1999/DeepSeek-V2-Lite-Chat-mix-logic-prover
Text Generation
•
Updated
rkumar1999/DeepSeek-V2-Lite-Chat-MixLoRA
Updated
rkumar1999/DeepSeek-V2-Lite-Chat-deepseek-prover
Text Generation
•
Updated
rkumar1999/gpt-oss-20b-deepseek-prover
Text Generation
•
Updated
rkumar1999/Llama-32.b-moe-finetuned
Updated
rkumar1999/Llama-32.b-moe
Text Generation
•
3B
•
Updated
•
4
rkumar1999/Llama-3.2-3B-Open-R1-Distill-GRPO
Text Generation
•
Updated
•
7
rkumar1999/Llama-3.2-3B-Open-R1-Distill
Text Generation
•
Updated
•
8
rkumar1999/Llama-3.2-3B-Deepseek-Prover-v1
Text Generation
•
Updated
•
11
rkumar1999/Llama-3.1-8B-Instruct-Open-R1-Distill-Lean
Text Generation
•
Updated
•
9
rkumar1999/Llama-3.1-8B-Instruct-Open-R1-Distill-GRPO
Updated
rkumar1999/Llama-3.1-8B-Instruct-Open-R1-Distill
Text Generation
•
Updated
•
8
rkumar1999/gpt2-fine-tuned-gsm8k
Updated
•
21
rkumar1999/gpt2-fine-tuned-math