As Language Models Scale, Low-order Linear Depth Dynamics Emerge
arXiv:2603.12541v1 Announce Type: new Abstract: Large language models are often viewed as high-dimensional nonlinear systems and treated as black boxes. Here, we show that transformer …
Buddhika Nettasinghe, Geethu Joseph
29 views