Academic

Academic · 1 min

On the "Induction Bias" in Sequence Models

arXiv:2602.18333v1 Announce Type: cross Abstract: Despite the remarkable practical success of transformer-based language models, recent work has raised concerns about their ability to perform state …

M. Reza Ebrahimi, Michaël Defferrard, Sunny Panchal, Roland Memisevic
Academic · 1 min

AnCoder: Anchored Code Generation via Discrete Diffusion Models

arXiv:2602.17688v1 Announce Type: new Abstract: Diffusion language models offer a compelling alternative to autoregressive code generation, enabling global planning and iterative refinement of complex program …

Anton Xue, Litu Rout, Constantine Caramanis, Sanjay Shakkottai
Academic · 1 min

Parallel Complex Diffusion for Scalable Time Series Generation

arXiv:2602.17706v1 Announce Type: new Abstract: Modeling long-range dependencies in time series generation poses a fundamental trade-off between representational capacity and computational efficiency. Traditional temporal diffusion …

Rongyao Cai, Yuxi Wan, Kexin Zhang, Ming Jin, Zhiqiang Ge, Qingsong Wen, Yong Liu