Cran-May/CohenQu-DeepSeek-R1-Distill-Qwen-1.5B-GRPO-duplicate-fixed-6140715-Q4_K_M-GGUF
Tags: Transformers, GGUF, hf-cmu-collab/DeepScaleR-1.5B-Preview_on-policy_GRPO, Generated from Trainer, trl, grpo, llama-cpp, gguf-my-repo, imatrix, conversational
Branch: main
Total size: 1.12 GB
1 contributor
History: 4 commits
Latest commit: b5dd1ee (verified), by Cran-May, 10 months ago: Upload README.md with huggingface_hub
File | Size | Last commit message | Updated
.gitattributes | 1.69 kB | Upload imatrix.dat with huggingface_hub | 10 months ago
README.md | 2.59 kB | Upload README.md with huggingface_hub | 10 months ago
cohenqu-deepseek-r1-distill-qwen-1.5b-grpo-duplicate-fixed-6140715-q4_k_m-imat.gguf | 1.12 GB (xet) | Upload cohenqu-deepseek-r1-distill-qwen-1.5b-grpo-duplicate-fixed-6140715-q4_k_m-imat.gguf with huggingface_hub | 10 months ago
imatrix.dat | 2.04 MB (xet) | Upload imatrix.dat with huggingface_hub | 10 months ago