Academic

Academic

Academic · 1 min

Stabilizing Native Low-Rank LLM Pretraining

arXiv:2602.12429v1 Announce Type: new Abstract: Foundation models have achieved remarkable success, yet their growing parameter counts pose significant computational and memory challenges. Low-rank factorization offers …

Paul Janson, Edouard Oyallon, Eugene Belilovsky
4 views
Academic · 1 min

Computationally sufficient statistics for Ising models

arXiv:2602.12449v1 Announce Type: new Abstract: Learning Gibbs distributions using only sufficient statistics has long been recognized as a computationally hard problem. On the other hand, …

Abhijith Jayakumar, Shreya Shukla, Marc Vuffray, Andrey Y. Lokhov, Sidhant Misra
29 views
Academic · 1 min

Continuous Diffusion Models Can Obey Formal Syntax

arXiv:2602.12468v1 Announce Type: new Abstract: Diffusion language models offer a promising alternative to autoregressive models due to their global, non-causal generation process, but their continuous …

Jinwoo Kim, Taylor Berg-Kirkpatrick, Loris D'Antoni
21 views
Academic · 1 min

Regularized Meta-Learning for Improved Generalization

arXiv:2602.12469v1 Announce Type: new Abstract: Deep ensemble methods often improve predictive performance, yet they suffer from three practical limitations: redundancy among base models that inflates …

Noor Islam S. Mohammad, Md Muntaqim Meherab
3 views
Academic · 1 min

On Robustness and Chain-of-Thought Consistency of RL-Finetuned VLMs

arXiv:2602.12506v1 Announce Type: new Abstract: Reinforcement learning (RL) fine-tuning has become a key technique for enhancing large language models (LLMs) on reasoning-intensive tasks, motivating its …

Rosie Zhao, Anshul Shah, Xiaoyu Zhu, Xinke Deng, Zhongyu Jiang, Yang Yang, Joerg Liebelt, Arnab Mondal
3 views