-
-
-
-
-
-
Inference Providers
Active filters:
dpo, trl
Text Generation
•
0.1B
•
Updated
•
7
•
1
Text Generation
•
8B
•
Updated
•
1
mradermacher/ToxicMist-v0.2-7B-DPO-GGUF
7B
•
Updated
•
51
powermove72/Llama-3.2-3B-Instruct-abliterated-DPO
3B
•
Updated
•
16
Text Generation
•
0.1B
•
Updated
•
5
Text Generation
•
Updated
•
5
mradermacher/II-Tulu-8B-DPO-v2-GGUF
8B
•
Updated
•
48
mradermacher/II-Tulu-8B-DPO-v2-i1-GGUF
8B
•
Updated
•
73
Text Generation
•
0.1B
•
Updated
•
6
L2zz/good-feedback-Phi3.5-mini-dDPO
Updated
LBK95/Llama-2-7b-hf-DPO-LookAhead-5_Q2_TTree1.4_TT0.9_TP0.7_TE0.2_V2
chenhunghan/SmolLM2-FT-DPO
Text Generation
•
0.1B
•
Updated
•
8
mradermacher/Llama-2-7b-sft-SPIN-gpt4o-GGUF
mradermacher/Mistral-7B-v0.1-sft-SPIN-Mistral-8x7B-Instruct-v0.1-GGUF
7B
•
Updated
•
60
mradermacher/Mistral-7B-v0.1-sft-SPIN-gpt4o-GGUF
7B
•
Updated
•
71
mradermacher/Mistral-7B-v0.1-sft-SPIN-Mistral-8x7B-Instruct-v0.1-i1-GGUF
7B
•
Updated
•
80
mradermacher/Mistral-7B-v0.1-sft-SPIN-gpt4o-i1-GGUF
7B
•
Updated
•
97
mradermacher/Llama-2-7b-sft-SPIN-gpt4o-i1-GGUF
7B
•
Updated
•
71
michaelnguyen11/TwinLlama-3.2-3B-DPO
Text Generation
•
3B
•
Updated
•
7
tensorblock/chat_gpt2_dpo-GGUF
Text Generation
•
0.2B
•
Updated
•
172
Text Generation
•
0.1B
•
Updated
•
6
bigheiniuJ/zephyr-7b-dpo-full-prompt-extend-chosen
Text Generation
•
7B
•
Updated
•
7
mradermacher/Mistral-7B-v0.3-sft-SPIN-self-GGUF
7B
•
Updated
•
37
mradermacher/Llama-3.1-8B-sft-SPIN-self-GGUF
8B
•
Updated
•
38
•
1
mradermacher/TwinLlama-3.2-3B-DPO-GGUF
3B
•
Updated
•
32
bigheiniuJ/zephyr-7b-dpo-full-prompt-extend-chosen-norandom
Text Generation
•
7B
•
Updated
•
8
bigheiniuJ/zephyr-7b-dpo-full-prompt-extend-chosen-noshort
Text Generation
•
7B
•
Updated
•
8
mradermacher/GEITje-7B-ultra-GGUF
7B
•
Updated
•
113