Spectral Edge Dynamics of Training Trajectories: Signal--Noise Geometry Across Scales
arXiv:2603.15678v1 Announce Type: new Abstract: Although transformers have hundreds of millions of parameters, their training trajectories evolve along only a few coherent directions. We introduce \emph{Spectral Edge …