Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
gaunernst 's Collections
DeepSeek testing
Gemma 3 QAT INT4 (from GGUF)
Gemma 3 QAT INT4 (from Flax)
Mini BERT models
Face Recognition Models
LLMs < 1B
LLMs 1B - 2B
LLMs 2B - 4B
Smallish LLM pre-training datasets
Llama2-compatible
Llama3-compatible

LLMs < 1B

updated Sep 29, 2024
Upvote
-

  • Qwen/Qwen2-0.5B

    Text Generation • 0.5B • Updated Oct 22, 2024 • 529k • 160

  • Qwen/Qwen2-0.5B-Instruct

    Text Generation • 0.5B • Updated Aug 21, 2024 • 200k • 196

  • HuggingFaceTB/SmolLM-135M

    Text Generation • 0.1B • Updated Aug 1, 2024 • 417k • 234

  • HuggingFaceTB/SmolLM-135M-Instruct

    Text Generation • 0.1B • Updated Sep 4, 2024 • 24.5k • 126

  • HuggingFaceTB/SmolLM-360M

    Text Generation • 0.4B • Updated Aug 1, 2024 • 19.2k • 67

  • HuggingFaceTB/SmolLM-360M-Instruct

    Text Generation • 0.4B • Updated Aug 18, 2024 • 10k • 83

  • apple/OpenELM-270M

    Text Generation • 0.3B • Updated Feb 28 • 1.24k • 75

  • apple/OpenELM-270M-Instruct

    Text Generation • 0.3B • Updated Feb 28 • 567 • 140

  • apple/OpenELM-450M

    Text Generation • 0.5B • Updated Feb 28 • 232 • 26

  • apple/OpenELM-450M-Instruct

    Text Generation • 0.5B • Updated Feb 28 • 1.42k • 49

  • facebook/opt-125m

    Text Generation • Updated Sep 15, 2023 • 4.68M • 226

  • facebook/opt-350m

    Text Generation • Updated Sep 15, 2023 • 112k • 148

  • amd/AMD-Llama-135m

    Text Generation • 0.1B • Updated Oct 9, 2024 • 6.43k • 118
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs