Active filters: rlhf
LoneStriker/NeuralMonarch-7B-GPTQ • Text Generation • 1B • Updated • 1
LoneStriker/AlphaMonarch-7B-GPTQ • Text Generation • 1B • Updated • 3 • 3
mlx-community/AlphaMonarch-7B-mlx-4bit • 1B • Updated • 4 • 3
mlx-community/AlphaMonarch-7B-mlx • 1B • Updated • 6 • 4
sugatoray/mlx-neuralhermes-2.5-mistral-7b-q4bits • 1B • Updated • 7
sugatoray/mlx-alphamonarch-7b-q4bits • 1B • Updated • 5
ArchiveAI/AlphaMonarch-7B • Text Generation • 7B • Updated • 1
ContextualAI/Contextual_KTO_Mistral_PairRM • Text Generation • 7B • Updated • 56 • 32
solidrust/NeuralHermes-2.5-Mistral-7B-laser-AWQ • Text Generation • 1B • Updated
solidrust/NeuralMonarch-7B-AWQ • Text Generation • 1B • Updated • 5
solidrust/AlphaMonarch-7B-AWQ • Text Generation • 1B • Updated • 1
abdullahalzubaer/NeuralHermes-2.5-Mistral-7B • Text Generation • 7B • Updated • 3 • 1
koesn/NeuralHermes-2.5-Mistral-7B-GGUF • 7B • Updated • 32
delayedkarma/NeuralHermes-2.5-Mistral-7B • Text Generation • 7B • Updated • 1 • 1
asedmammad/Contextual_KTO_Mistral_PairRM-GGUF • 7B • Updated • 283 • 2
danilopeixoto/pandora-7b-chat • Text Generation • Updated • 1
solidrust/NeuralBeagle14-7B-AWQ • Text Generation • 1B • Updated • 3
vibhorg/rl4llm_uofm_nlpo_super_t5_arxiv • Updated
umarigan/Trendyol-LLM-7b-chat-v1.0-RLHF • Question Answering • 7B • Updated
vibhorg/rl4llm_uofm_nlpo_unsuper_t5_arxiv
mlabonne/AlphaMonarch-7B-GPTQ • Text Generation • 1B • Updated
mlabonne/AlphaMonarch-7B-AWQ • Text Generation • 1B • Updated • 1
mlabonne/AlphaMonarch-7B-2bit-HQQ • Text Generation • Updated • 6 • 7
mlabonne/AlphaMonarch-7B-5.0bpw-exl2 • Text Generation • Updated • 1
mlx-community/CapybaraHermes-2.5-Mistral-7B
mlabonne/OrpoLlama-3-8B • Text Generation • 8B • Updated • 45 • 53
solidrust/OrpoLlama-3-8B-AWQ • Text Generation • 2B • Updated • 3 • 3
PKU-Alignment/beaver-7b-v2.0 • Reinforcement Learning • 7B • Updated • 4
PKU-Alignment/beaver-7b-v2.0-reward • Reinforcement Learning • 7B • Updated • 22
PKU-Alignment/beaver-7b-v2.0-cost • Reinforcement Learning • 7B • Updated • 7