Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
lewtun 's Collections
— Awesome RL datasets 📈 —
— Long-context post-training 🧶 —
H4
Awesome RLHF
Mistral 7B + UltraChat + Arithmo checkpoints
Hub tools
Gemma RLAIF

— Long-context post-training 🧶 —

updated Sep 14

Resources for post-training LLMs with long-context samples

Upvote
5

  • zai-org/LongAlign-10k

    Viewer • Updated Feb 22, 2024 • 9.89k • 639 • 79

  • HuggingFaceTB/smoltalk2

    Viewer • Updated 30 days ago • 8.61M • 7.51k • 111

    Note Contains an English subset of LongAlign-10k, but with completions generated by Qwen3-32B: https://huggingface.co/datasets/HuggingFaceTB/smoltalk2/viewer/SFT?views%5B%5D=sft_longalign_64k_qwen3_32b_yarn_131k_think


  • zai-org/LongReward-10k

    Viewer • Updated Oct 29, 2024 • 30k • 163 • 6

  • Tongyi-Zhiwen/DocQA-RL-1.6K

    Viewer • Updated May 23 • 3.6k • 70 • 36

  • caskcsg/LongMagpie_64k_dataset

    Preview • Updated Aug 2 • 366 • 3
Upvote
5
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs