Training In-Context and In-Weights Mixtures Via Contrastive Context Sampling
arXiv:2604.01601v1 Announce Type: new Abstract: We investigate training strategies that co-develop in-context learning (ICL) and in-weights learning (IWL), and the ability to switch between them …
Deeptanshu Malu, Deevyanshu Malu, Aditya Nemiwal, Sunita Sarawagi
3 views