LLM - a zyf515730395 Collection

zyf515730395 's Collections

M-RAG

Video Understanding

MLLM

LLM

Image Generation

Video Generation

LLM

updated Nov 1, 2025

SeerAttention-R: Sparse Attention Adaptation for Long Reasoning

Paper • 2506.08889 • Published Jun 10, 2025 • 23
MiniCPM4: Ultra-Efficient LLMs on End Devices

Paper • 2506.07900 • Published Jun 9, 2025 • 93
Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 263
OpenThoughts: Data Recipes for Reasoning Models

Paper • 2506.04178 • Published Jun 4, 2025 • 49
Qwen3 Technical Report

Paper • 2505.09388 • Published May 14, 2025 • 320
Phi-4-reasoning Technical Report

Paper • 2504.21318 • Published Apr 30, 2025 • 53
Efficient Inference for Large Reasoning Models: A Survey

Paper • 2503.23077 • Published Mar 29, 2025 • 46
The Diffusion Duality

Paper • 2506.10892 • Published Jun 12, 2025 • 37
Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4, 2025 • 194
Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published Sep 10, 2025 • 660