- 
	
	
	
SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding
Paper • 2408.15545 • Published • 37 - 
	
	
	
Controllable Text Generation for Large Language Models: A Survey
Paper • 2408.12599 • Published • 65 - 
	
	
	
To Code, or Not To Code? Exploring Impact of Code in Pre-training
Paper • 2408.10914 • Published • 44 - 
	
	
	
Automated Design of Agentic Systems
Paper • 2408.08435 • Published • 40 
Collections
Discover the best community collections!
Collections including paper arxiv:2309.03883 
						
					
				- 
	
	
	
The Unreasonable Ineffectiveness of the Deeper Layers
Paper • 2403.17887 • Published • 82 - 
	
	
	
Mixture-of-Depths: Dynamically allocating compute in transformer-based language models
Paper • 2404.02258 • Published • 107 - 
	
	
	
ReFT: Representation Finetuning for Language Models
Paper • 2404.03592 • Published • 101 - 
	
	
	
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
Paper • 2404.03715 • Published • 62 
- 
	
	
	
Training Verifiers to Solve Math Word Problems
Paper • 2110.14168 • Published • 4 - 
	
	
	
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
Paper • 2309.12284 • Published • 18 - 
	
	
	
LiteSearch: Efficacious Tree Search for LLM
Paper • 2407.00320 • Published • 40 - 
	
	
	
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models
Paper • 2309.03883 • Published • 35 
- 
	
	
	
GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers
Paper • 2210.17323 • Published • 8 - 
	
	
	
LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale
Paper • 2208.07339 • Published • 5 - 
	
	
	
Hydragen: High-Throughput LLM Inference with Shared Prefixes
Paper • 2402.05099 • Published • 20 - 
	
	
	
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads
Paper • 2401.10774 • Published • 59 
- 
	
	
	
Can large language models explore in-context?
Paper • 2403.15371 • Published • 33 - 
	
	
	
GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling
Paper • 2403.19655 • Published • 19 - 
	
	
	
WavLLM: Towards Robust and Adaptive Speech Large Language Model
Paper • 2404.00656 • Published • 11 - 
	
	
	
Enabling Memory Safety of C Programs using LLMs
Paper • 2404.01096 • Published • 1 
- 
	
	
	
LoRA+: Efficient Low Rank Adaptation of Large Models
Paper • 2402.12354 • Published • 6 - 
	
	
	
The FinBen: An Holistic Financial Benchmark for Large Language Models
Paper • 2402.12659 • Published • 23 - 
	
	
	
TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization
Paper • 2402.13249 • Published • 13 - 
	
	
	
TrustLLM: Trustworthiness in Large Language Models
Paper • 2401.05561 • Published • 69 
- 
	
	
	
SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding
Paper • 2408.15545 • Published • 37 - 
	
	
	
Controllable Text Generation for Large Language Models: A Survey
Paper • 2408.12599 • Published • 65 - 
	
	
	
To Code, or Not To Code? Exploring Impact of Code in Pre-training
Paper • 2408.10914 • Published • 44 - 
	
	
	
Automated Design of Agentic Systems
Paper • 2408.08435 • Published • 40 
- 
	
	
	
GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers
Paper • 2210.17323 • Published • 8 - 
	
	
	
LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale
Paper • 2208.07339 • Published • 5 - 
	
	
	
Hydragen: High-Throughput LLM Inference with Shared Prefixes
Paper • 2402.05099 • Published • 20 - 
	
	
	
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads
Paper • 2401.10774 • Published • 59 
- 
	
	
	
The Unreasonable Ineffectiveness of the Deeper Layers
Paper • 2403.17887 • Published • 82 - 
	
	
	
Mixture-of-Depths: Dynamically allocating compute in transformer-based language models
Paper • 2404.02258 • Published • 107 - 
	
	
	
ReFT: Representation Finetuning for Language Models
Paper • 2404.03592 • Published • 101 - 
	
	
	
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
Paper • 2404.03715 • Published • 62 
- 
	
	
	
Can large language models explore in-context?
Paper • 2403.15371 • Published • 33 - 
	
	
	
GaussianCube: Structuring Gaussian Splatting using Optimal Transport for 3D Generative Modeling
Paper • 2403.19655 • Published • 19 - 
	
	
	
WavLLM: Towards Robust and Adaptive Speech Large Language Model
Paper • 2404.00656 • Published • 11 - 
	
	
	
Enabling Memory Safety of C Programs using LLMs
Paper • 2404.01096 • Published • 1 
- 
	
	
	
Training Verifiers to Solve Math Word Problems
Paper • 2110.14168 • Published • 4 - 
	
	
	
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
Paper • 2309.12284 • Published • 18 - 
	
	
	
LiteSearch: Efficacious Tree Search for LLM
Paper • 2407.00320 • Published • 40 - 
	
	
	
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models
Paper • 2309.03883 • Published • 35 
- 
	
	
	
LoRA+: Efficient Low Rank Adaptation of Large Models
Paper • 2402.12354 • Published • 6 - 
	
	
	
The FinBen: An Holistic Financial Benchmark for Large Language Models
Paper • 2402.12659 • Published • 23 - 
	
	
	
TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization
Paper • 2402.13249 • Published • 13 - 
	
	
	
TrustLLM: Trustworthiness in Large Language Models
Paper • 2401.05561 • Published • 69