Edit Models filters

Apps

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

407

Full-text search

Active filters: rlhf

LoneStriker/NeuralMonarch-7B-GPTQ

Text Generation • 1B • Updated Feb 19, 2024 • 1

LoneStriker/AlphaMonarch-7B-GPTQ

Text Generation • 1B • Updated Feb 19, 2024 • 3 • 3

mlx-community/AlphaMonarch-7B-mlx-4bit

1B • Updated Feb 19, 2024 • 4 • 3

mlx-community/AlphaMonarch-7B-mlx

1B • Updated Feb 19, 2024 • 6 • 4

sugatoray/mlx-neuralhermes-2.5-mistral-7b-q4bits

1B • Updated Feb 25, 2024 • 7

sugatoray/mlx-alphamonarch-7b-q4bits

1B • Updated Mar 4, 2024 • 5

ArchiveAI/AlphaMonarch-7B

Text Generation • 7B • Updated Mar 1, 2024 • 1

ContextualAI/Contextual_KTO_Mistral_PairRM

Text Generation • 7B • Updated Apr 26, 2024 • 56 • 32

solidrust/NeuralHermes-2.5-Mistral-7B-laser-AWQ

Text Generation • 1B • Updated Sep 3, 2024

solidrust/NeuralMonarch-7B-AWQ

Text Generation • 1B • Updated Sep 3, 2024 • 5

solidrust/AlphaMonarch-7B-AWQ

Text Generation • 1B • Updated Sep 3, 2024 • 1

abdullahalzubaer/NeuralHermes-2.5-Mistral-7B

Text Generation • 7B • Updated Mar 13, 2024 • 3 • 1

koesn/NeuralHermes-2.5-Mistral-7B-GGUF

7B • Updated Mar 10, 2024 • 32

delayedkarma/NeuralHermes-2.5-Mistral-7B

Text Generation • 7B • Updated Mar 10, 2024 • 1 • 1

asedmammad/Contextual_KTO_Mistral_PairRM-GGUF

7B • Updated Mar 11, 2024 • 283 • 2

danilopeixoto/pandora-7b-chat

Text Generation • Updated Mar 24, 2024 • 1

solidrust/NeuralBeagle14-7B-AWQ

Text Generation • 1B • Updated Sep 3, 2024 • 3

vibhorg/rl4llm_uofm_nlpo_super_t5_arxiv

Updated Mar 20, 2024

umarigan/Trendyol-LLM-7b-chat-v1.0-RLHF

Question Answering • 7B • Updated Mar 16, 2024

vibhorg/rl4llm_uofm_nlpo_unsuper_t5_arxiv

Updated Mar 20, 2024 • 3

mlabonne/AlphaMonarch-7B-GPTQ

Text Generation • 1B • Updated Mar 28, 2024

mlabonne/AlphaMonarch-7B-AWQ

Text Generation • 1B • Updated Mar 28, 2024 • 1

mlabonne/AlphaMonarch-7B-2bit-HQQ

Text Generation • Updated Mar 28, 2024 • 6 • 7

mlabonne/AlphaMonarch-7B-5.0bpw-exl2

Text Generation • Updated Mar 28, 2024 • 1

mlx-community/CapybaraHermes-2.5-Mistral-7B

Updated Apr 7, 2024 • 3

mlabonne/OrpoLlama-3-8B

Text Generation • 8B • Updated Jun 15, 2024 • 45 • 53

solidrust/OrpoLlama-3-8B-AWQ

Text Generation • 2B • Updated Sep 3, 2024 • 3 • 3

PKU-Alignment/beaver-7b-v2.0

Reinforcement Learning • 7B • Updated May 9, 2024 • 4

PKU-Alignment/beaver-7b-v2.0-reward

Reinforcement Learning • 7B • Updated Apr 20, 2024 • 22

PKU-Alignment/beaver-7b-v2.0-cost

Reinforcement Learning • 7B • Updated Apr 20, 2024 • 7