Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
Libraries
Languages
Licenses
Other
1
Model Tree
Reset
Qwen/Qwen2-0.5B-Instruct
Adapters
Finetunes
Quantizations
Merges
Apps
llama.cpp
LM Studio
Jan
Backyard AI
Draw Things
DiffusionBee
Jellybox
RecurseChat
Msty
Sanctum
Invoke
JoyFusion
LocalAI
vLLM
node-llama-cpp
Ollama
TGI
MLX LM
Docker Model Runner
Lemonade
Inference Providers
Select all
Groq
Novita
Nebius AI
Cerebras
SambaNova
Nscale
fal
Hyperbolic
Together AI
Fireworks
Featherless AI
Zai
Replicate
Cohere
Scaleway
Public AI
OVHcloud AI Endpoints
HF Inference API
WaveSpeed
Misc
Inference Endpoints
text-generation-inference
Eval Results
Merge
4-bit precision
custom_code
8-bit precision
text-embeddings-inference
Mixture of Experts
Carbon Emissions
Apply filters
Models
486
Full-text search
Inference Available
Edit filters
Sort: Trending
Active filters:
Qwen/Qwen2-0.5B-Instruct
Clear all
jiachenjiang/Qwen2-0.5B-GRPO-test
Updated
Feb 12
jamesjje/Qwen2-0.5B-GRPO-test
Updated
Feb 13
cjayasuriya/Qwen2-0.5B-GRPO-test
Updated
Feb 13
WbjuSrceu/Qwen2-0.5B-GRPO-test
Updated
Mar 4
lidiya/Qwen2-0.5B-GRPO-test
Updated
Feb 13
jiaying0220/Qwen2-0.5B-GRPO-test
Updated
Feb 13
Human420/Qwen2-0.5B-GRPO-test
Updated
Feb 17
jiaying0220/Qwen2-0.5B-GRPO-test_2_13
Text Generation
•
0.5B
•
Updated
Feb 13
•
1
longlian/Qwen2-0.5B-GRPO-peft-demo
Updated
Feb 14
longlian/Qwen2-0.5B-GRPO-demo
Text Generation
•
0.5B
•
Updated
Feb 14
•
2
araziziml/Qwen2-0.5B-GRPO
Text Generation
•
0.5B
•
Updated
Feb 15
•
3
•
Kyleyee/Qwen2-0.5B-reward-hh
Text Classification
•
0.5B
•
Updated
Feb 16
solarcloud/Qwen2-0.5B-GRPO-test
Updated
Feb 17
Gredora/Qwen2-0.5B-GRPO-test
Updated
Feb 18
konstantin-ketterer/Qwen2-0.5B-GRPO-test
Updated
Feb 18
araziziml/Qwen2-0.5B-GRPO-exp2
Text Generation
•
0.5B
•
Updated
Feb 18
•
4
•
araziziml/Qwen2-0.5B-GRPO-exp3
Text Generation
•
0.5B
•
Updated
Feb 18
•
4
•
ananxiang88/Qwen2-0.5B-GRPO-test
Updated
Feb 21
satyakiu/Qwen2-0.5B-GRPO-test
Updated
Feb 19
araziziml/Qwen2-0.5B-DPO
Text Generation
•
0.5B
•
Updated
Feb 20
•
4
•
junchengdong/Qwen2-0.5B-GRPO-test
Updated
Feb 21
LahiruWije/Qwen2-0.5B-GRPO-test
Text Generation
•
0.5B
•
Updated
Feb 22
•
9
cbynum/Qwen2-0.5B-GRPO-test
Updated
Feb 21
lhcsnelm/Qwen2-0.5B-GRPO-test
Updated
Feb 23
valerielucro/Qwen2-0.5B-GRPO-VLLM-1-epoch
Updated
Feb 22
valerielucro/Qwen2-0.5B-GRPO-VLLM-8-epoch
Updated
Feb 22
valerielucro/Qwen2-0.5B-GRPO-VLLM-30-epoch
Updated
Feb 22
Sugarc0de/Qwen2-0.5B-GRPO-test
Updated
Feb 22
Beakal/Qwen2-0.5B-GRPO-test
Updated
Feb 24
Patrik1352/Agent_Qven
Updated
Feb 23
Previous
1
2
3
4
5
...
17
Next