Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
kd-tensor 's Collections
RAG
safety-alignment
toread
Papers to read
Synthetic Data Generation

Papers to read

updated Nov 3, 2024
Upvote
-

  • Jamba: A Hybrid Transformer-Mamba Language Model

    Paper • 2403.19887 • Published Mar 28, 2024 • 111

  • The Unreasonable Ineffectiveness of the Deeper Layers

    Paper • 2403.17887 • Published Mar 26, 2024 • 82

  • Tuning Language Models by Proxy

    Paper • 2401.08565 • Published Jan 16, 2024 • 22

  • Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning

    Paper • 2402.04833 • Published Feb 7, 2024 • 5
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs