Improving Latent Generalization Using Test-time Compute
arXiv:2604.01430v1 Announce Type: new Abstract: Language Models (LMs) exhibit two distinct mechanisms for knowledge acquisition: in-weights learning (i.e., encoding information within the model weights) and …
Arslan Chaudhry, Sridhar Thiagarajan, Andrew Lampinen
4 views