All Articles

Articles

Academic · 1 min

Anatomical Heterogeneity in Transformer Language Models

arXiv:2603.19348v1 Announce Type: new Abstract: Current transformer language models are trained with uniform computational budgets across all layers, implicitly assuming layer homogeneity. We challenge this …

Tomasz Wietrzykowski
8 views
Academic · 1 min

A Mathematical Theory of Understanding

arXiv:2603.19349v1 Announce Type: new Abstract: Generative AI has transformed the economics of information production, making explanations, proofs, examples, and analyses available at very low cost. …

Bahar Ta\c{s}kesen
23 views
Academic · 1 min

Adaptive Layerwise Perturbation: Unifying Off-Policy Corrections for LLM RL

arXiv:2603.19470v1 Announce Type: new Abstract: Off-policy problems such as policy staleness and training-inference mismatch, has become a major bottleneck for training stability and further exploration …

Chenlu Ye, Xuanchang Zhang, Yifan Hao, Zhou Yu, Ziji Zhang, Abhinav Gullapalli, Hao Chen, Jing Huang, Tong Zhang
10 views