ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-F16 Reinforcement Learning • 8B • Updated Mar 25, 2025 • 620 • 90
unsloth/DeepSeek-R1-Distill-Llama-8B Text Generation • 8B • Updated Jul 18, 2025 • 2.95k • • 107