Attention Is All You Need for KV Cache in Diffusion LLMs Paper • 2510.14973 • Published Oct 16, 2025 • 40
From Next-Token to Next-Block: A Principled Adaptation Path for Diffusion LLMs Paper • 2512.06776 • Published Dec 7, 2025 • 25
ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding Paper • 2512.13586 • Published 23 days ago • 88
Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed Paper • 2512.14067 • Published 23 days ago • 13
DEER: Draft with Diffusion, Verify with Autoregressive Models Paper • 2512.15176 • Published 22 days ago • 42
LLaDA2.0: Scaling Up Diffusion Language Models to 100B Paper • 2512.15745 • Published 29 days ago • 78
LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding Paper • 2512.16229 • Published 21 days ago • 15