Academic

Academic · 1 min

Scaling Reward Modeling without Human Supervision

arXiv:2603.02225v1. Abstract: Learning from feedback is an instrumental process for advancing the capabilities and safety of frontier models, yet its effectiveness is …

Jingxuan Fan, Yueying Li, Zhenting Qi, Dinghuai Zhang, Kianté Brantley, Sham M. Kakade, Hanlin Zhang

Generalized Discrete Diffusion with Self-Correction

arXiv:2603.02230v1. Abstract: Self-correction is an effective technique for maintaining parallel sampling in discrete diffusion models with minimal performance degradation. Prior work has …

Linxuan Wang, Ziyi Wang, Yikun Bai, Wei Deng, Guang Lin, Qifan Song

Structured vs. Unstructured Pruning: An Exponential Gap

arXiv:2603.02234v1. Abstract: The Strong Lottery Ticket Hypothesis (SLTH) posits that large, randomly initialized neural networks contain sparse subnetworks capable of approximating a …

Davide Ferré (CNRS, COATI, UniCA, I3S), Frédéric Giroire (I3S, COATI, UniCA), Emanuele Natale (CNRS, COATI, I3S, UniCA), Frederik Mallmann-Trenn

CUDABench: Benchmarking LLMs for Text-to-CUDA Generation

arXiv:2603.02236v1. Abstract: Recent studies have demonstrated the potential of Large Language Models (LLMs) in generating GPU kernels. Current benchmarks focus on the …

Jiace Zhu, Wentao Chen, Qi Fan, Zhixing Ren, Junying Wu, Xing Zhe Chai, Chotiwit Rungrueangwutthinon, Yehan Ma, An Zou