Charformer: Fast Character Transformers via Gradient-based Subword Tokenization Paper • 2106.12672 • Published Jun 23, 2021
LiPO: Listwise Preference Optimization through Learning-to-Rank Paper • 2402.01878 • Published Feb 2, 2024 • 20
PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairs Paper • 2406.02886 • Published Jun 5, 2024 • 11