-
-
-
-
-
-
Inference Providers
Active filters:
dpo, trl
wirthdrew1/zephyr-7b-dpo-qlora
Updated
•
29
•
1
mimicheng/mistral-7b-dpo-qlora-2ep
Updated
•
16
dctanner/sablo-pebble-mistral-dpo-lora-HelpSteer_binarized-2
Updated
•
13
Text Generation
•
3.41M
•
Updated
•
14
ondevicellm/tinyllama_moe_dpo_ultrachat_v2_epochs5
Text Generation
•
6B
•
Updated
•
23
metric-space/arceeai-cpt-sft-dpo-full
Text Generation
•
8B
•
Updated
•
17
Text Generation
•
Updated
weijie210/zephyr-7b-dpo-full
Text Generation
•
7B
•
Updated
•
11
Evan-Lin/dpo-llama2-deprecated
ondevicellm/tinyllama_moe_dpo_ultrachat_v2_epochs3
Text Generation
•
6B
•
Updated
•
14
AlekseyKorshuk/evol-codealpaca-pairwise-sharegpt-test-dpo
Text Generation
•
3B
•
Updated
•
9
dball/zephyr-7b-dpo-qlora
Updated
•
28
thobuiq/openhermes-mistral-dpo-gptq
Text Generation
•
3.41M
•
Updated
•
8
Sharathhebbar24/chat_gpt2_dpo
Text Generation
•
0.1B
•
Updated
•
67
•
1
ayoubkirouane/Mistral-SLERP-Merged7B-DPO
Text Generation
•
Updated
•
9
bartowski/zephyr-7b-dpo-full-exl2
Text Generation
•
Updated
•
4
•
1
argilla/phi2-lora-distilabel-intel-orca-dpo-pairs
Text Generation
•
Updated
•
11
•
2
janhq/llamacorn-1.1b-chat-GGUF
1B
•
Updated
•
20
•
1
dvilasuero/phi2-lora-quantized-distilabel-intel-orca-dpo-pairs
dlibf/zephyr-7b-dpo-full_sft3epoch
Text Generation
•
7B
•
Updated
•
8
dlibf/zephyr-7b-dpo-full_sft2epoch
Text Generation
•
7B
•
Updated
•
6
AlekseyKorshuk/evol-codealpaca-v1-sft-4e-5-dpo-3ep
Text Generation
•
3B
•
Updated
•
6
ondevicellm/tinyllama_mole_dpo_ep3
Text Generation
•
1B
•
Updated
•
17
AlekseyKorshuk/ultrachat-phi-2-dpo-chatml
Text Generation
•
3B
•
Updated
•
6
•
1
BramVanroy/GEITje-7B-ultra
Text Generation
•
7B
•
Updated
•
23.4k
•
49