Academic

Academic · 1 min

Expressivity-Efficiency Tradeoffs for Hybrid Sequence Models

arXiv:2603.08859v1. Abstract: Hybrid sequence models, combining Transformer and state-space model layers, seek to gain the expressive versatility of attention as well as the computational …

John Cooper, Ilias Diakonikolas, Mingchen Ma, Frederic Sala
Academic · 1 min

The Coupling Within: Flow Matching via Distilled Normalizing Flows

arXiv:2603.09014v1. Abstract: Flow models have rapidly become the go-to method for training and deploying large-scale generators, owing their success to inference-time flexibility …

David Berthelot, Tianrong Chen, Jiatao Gu, Marco Cuturi, Laurent Dinh, Bhavik Chandna, Michal Klein, Josh Susskind, Shuangfei Zhai