Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

9,239

Full-text search

Active filters: dpo, trl

tzwilliam0/humor_high

Updated Dec 19, 2024

mgat1/SmolLM2-FT-DPO

Text Generation • 0.1B • Updated Dec 19, 2024 • 7 • 1

AmeerH/FynderPearlDPO

Text Generation • 8B • Updated Dec 19, 2024 • 1

mradermacher/ToxicMist-v0.2-7B-DPO-GGUF

7B • Updated Dec 20, 2024 • 51

powermove72/Llama-3.2-3B-Instruct-abliterated-DPO

3B • Updated Dec 20, 2024 • 16

zhaoxj/SmolLM2-FT-DPO

Text Generation • 0.1B • Updated Dec 20, 2024 • 5

XeIaso/SmolLM2-FT-DPO

Text Generation • Updated Dec 20, 2024 • 5

mradermacher/II-Tulu-8B-DPO-v2-GGUF

8B • Updated Dec 21, 2024 • 48

mradermacher/II-Tulu-8B-DPO-v2-i1-GGUF

8B • Updated Dec 21, 2024 • 73

tzwilliam0/humor_model

Updated Dec 21, 2024

jucanbe/SmolLM2-FT-DPO

Text Generation • 0.1B • Updated Dec 22, 2024 • 6

L2zz/good-feedback-Phi3.5-mini-dDPO

Updated Dec 22, 2024

LBK95/Llama-2-7b-hf-DPO-LookAhead-5_Q2_TTree1.4_TT0.9_TP0.7_TE0.2_V2

Updated Dec 22, 2024 • 3

chenhunghan/SmolLM2-FT-DPO

Text Generation • 0.1B • Updated Dec 22, 2024 • 8

mradermacher/Llama-2-7b-sft-SPIN-gpt4o-GGUF

7B • Updated Feb 11 • 26

mradermacher/Mistral-7B-v0.1-sft-SPIN-Mistral-8x7B-Instruct-v0.1-GGUF

7B • Updated Dec 23, 2024 • 60

mradermacher/Mistral-7B-v0.1-sft-SPIN-gpt4o-GGUF

7B • Updated Dec 23, 2024 • 71

mradermacher/Mistral-7B-v0.1-sft-SPIN-Mistral-8x7B-Instruct-v0.1-i1-GGUF

7B • Updated Dec 23, 2024 • 80

mradermacher/Mistral-7B-v0.1-sft-SPIN-gpt4o-i1-GGUF

7B • Updated Dec 23, 2024 • 97

mradermacher/Llama-2-7b-sft-SPIN-gpt4o-i1-GGUF

7B • Updated Dec 23, 2024 • 71

michaelnguyen11/TwinLlama-3.2-3B-DPO

Text Generation • 3B • Updated Dec 24, 2024 • 7

tensorblock/chat_gpt2_dpo-GGUF

Text Generation • 0.2B • Updated Jul 9 • 172

Digish/SmolLM2-FT-DPO

Text Generation • 0.1B • Updated Dec 23, 2024 • 6

bigheiniuJ/zephyr-7b-dpo-full-prompt-extend-chosen

Text Generation • 7B • Updated Dec 24, 2024 • 7

mradermacher/Mistral-7B-v0.3-sft-SPIN-self-GGUF

7B • Updated Dec 25, 2024 • 37

mradermacher/Llama-3.1-8B-sft-SPIN-self-GGUF

8B • Updated Dec 24, 2024 • 38 • 1

mradermacher/TwinLlama-3.2-3B-DPO-GGUF

3B • Updated Dec 25, 2024 • 32

bigheiniuJ/zephyr-7b-dpo-full-prompt-extend-chosen-norandom

Text Generation • 7B • Updated Dec 24, 2024 • 8

bigheiniuJ/zephyr-7b-dpo-full-prompt-extend-chosen-noshort

Text Generation • 7B • Updated Dec 24, 2024 • 8

mradermacher/GEITje-7B-ultra-GGUF

7B • Updated Jul 31 • 113