Academic

Academic

Academic · 1 min

Fast and Effective On-policy Distillation from Reasoning Prefixes

arXiv:2602.15260v1 Announce Type: new Abstract: On-policy distillation (OPD), which samples trajectories from the student model and supervises them with a teacher at the token level, …

Dongxu Zhang, Zhichao Yang, Sepehr Janghorbani, Jun Han, Andrew Ressler II, Qian Qian, Gregory D. Lyng, Sanjit Singh Batra, Robert E. Tillman
40 views
Academic · 1 min

Fractional-Order Federated Learning

arXiv:2602.15380v1 Announce Type: new Abstract: Federated learning (FL) allows remote clients to train a global model collaboratively while protecting client privacy. Despite its privacy-preserving benefits, …

Mohammad Partohaghighi, Roummel Marcia, YangQuan Chen
16 views
Academic · 1 min

Doubly Stochastic Mean-Shift Clustering

arXiv:2602.15393v1 Announce Type: new Abstract: Standard Mean-Shift algorithms are notoriously sensitive to the bandwidth hyperparameter, particularly in data-scarce regimes where fixed-scale density estimation leads to …

Tom Trigano, Yann Sepulcre, Itshak Lapidot
21 views