LightningRL: Breaking the Accuracy-Parallelism Trade-off of Block-wise dLLMs via Reinforcement Learning
arXiv:2603.13319v1. Abstract: Diffusion Large Language Models (dLLMs) have emerged as a promising paradigm for parallel token generation, with block-wise variants garnering significant …
Yanzhe Hu, Yijie Jin, Pengfei Liu, Kai Yu, Zhijie Deng