-
Memory Augmented Language Models through Mixture of Word Experts
Paper • 2311.10768 • Published • 18 -
System 2 Attention (is something you might need too)
Paper • 2311.11829 • Published • 44 -
Fine-tuning Language Models for Factuality
Paper • 2311.08401 • Published • 30 -
Orca 2: Teaching Small Language Models How to Reason
Paper • 2311.11045 • Published • 77
Collections
Discover the best community collections!
Collections including paper arxiv:2312.09390
-
#InsTag: Instruction Tagging for Analyzing Supervised Fine-tuning of Large Language Models
Paper • 2308.07074 • Published -
Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing
Paper • 2310.13855 • Published • 1 -
LIMIT: Less Is More for Instruction Tuning Across Evaluation Paradigms
Paper • 2311.13133 • Published -
Group Preference Optimization: Few-Shot Alignment of Large Language Models
Paper • 2310.11523 • Published
-
ORPO: Monolithic Preference Optimization without Reference Model
Paper • 2403.07691 • Published • 69 -
sDPO: Don't Use Your Data All at Once
Paper • 2403.19270 • Published • 41 -
Teaching Large Language Models to Reason with Reinforcement Learning
Paper • 2403.04642 • Published • 50 -
Best Practices and Lessons Learned on Synthetic Data for Language Models
Paper • 2404.07503 • Published • 31
-
Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
Paper • 2312.09390 • Published • 33 -
OneLLM: One Framework to Align All Modalities with Language
Paper • 2312.03700 • Published • 24 -
Generative Multimodal Models are In-Context Learners
Paper • 2312.13286 • Published • 37 -
The LLM Surgeon
Paper • 2312.17244 • Published • 9
-
FIAT: Fusing learning paradigms with Instruction-Accelerated Tuning
Paper • 2309.04663 • Published • 6 -
Textbooks Are All You Need II: phi-1.5 technical report
Paper • 2309.05463 • Published • 88 -
Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation
Paper • 2310.08541 • Published • 18 -
Let's Synthesize Step by Step: Iterative Dataset Synthesis with Large Language Models by Extrapolating Errors from Small Models
Paper • 2310.13671 • Published • 19
-
Memory Augmented Language Models through Mixture of Word Experts
Paper • 2311.10768 • Published • 18 -
System 2 Attention (is something you might need too)
Paper • 2311.11829 • Published • 44 -
Fine-tuning Language Models for Factuality
Paper • 2311.08401 • Published • 30 -
Orca 2: Teaching Small Language Models How to Reason
Paper • 2311.11045 • Published • 77
-
ORPO: Monolithic Preference Optimization without Reference Model
Paper • 2403.07691 • Published • 69 -
sDPO: Don't Use Your Data All at Once
Paper • 2403.19270 • Published • 41 -
Teaching Large Language Models to Reason with Reinforcement Learning
Paper • 2403.04642 • Published • 50 -
Best Practices and Lessons Learned on Synthetic Data for Language Models
Paper • 2404.07503 • Published • 31
-
Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
Paper • 2312.09390 • Published • 33 -
OneLLM: One Framework to Align All Modalities with Language
Paper • 2312.03700 • Published • 24 -
Generative Multimodal Models are In-Context Learners
Paper • 2312.13286 • Published • 37 -
The LLM Surgeon
Paper • 2312.17244 • Published • 9
-
#InsTag: Instruction Tagging for Analyzing Supervised Fine-tuning of Large Language Models
Paper • 2308.07074 • Published -
Evoke: Evoking Critical Thinking Abilities in LLMs via Reviewer-Author Prompt Editing
Paper • 2310.13855 • Published • 1 -
LIMIT: Less Is More for Instruction Tuning Across Evaluation Paradigms
Paper • 2311.13133 • Published -
Group Preference Optimization: Few-Shot Alignment of Large Language Models
Paper • 2310.11523 • Published
-
FIAT: Fusing learning paradigms with Instruction-Accelerated Tuning
Paper • 2309.04663 • Published • 6 -
Textbooks Are All You Need II: phi-1.5 technical report
Paper • 2309.05463 • Published • 88 -
Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation
Paper • 2310.08541 • Published • 18 -
Let's Synthesize Step by Step: Iterative Dataset Synthesis with Large Language Models by Extrapolating Errors from Small Models
Paper • 2310.13671 • Published • 19