- 
	
	
	
InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU
Paper • 2502.08910 • Published • 148 - 
	
	
	
From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence Generation up to 100K Tokens
Paper • 2502.18890 • Published • 30 - 
	
	
	
MPO: Boosting LLM Agents with Meta Plan Optimization
Paper • 2503.02682 • Published • 28 - 
	
	
	
SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents
Paper • 2505.20411 • Published • 89 
Jeffrey Yang Fan Chiang
RandomHakkaDude
		AI & ML interests
GenAI, LLMs
		Recent Activity
						upvoted 
								a
								paper
							
						about 1 month ago
						
					
						
						
						DynaGuard: A Dynamic Guardrail Model With User-Defined Policies
						
						liked
								a model
							
						5 months ago
						
					
						
						
						
						nvidia/Nemotron-4-340B-Instruct
						
						updated 
								a collection
							
						5 months ago
						
					LLMs&Agents
						Organizations
None yet