Category

Academic

Academic · 1 min

PRISM: Demystifying Retention and Interaction in Mid-Training

arXiv:2603.17074v1 Announce Type: new Abstract: We present PRISM, a comprehensive empirical study of mid-training design choices for large language models. Through controlled experiments across seven …

Bharat Runwal, Ashish Agrawal, Anurag Roy, Rameswar Panda
5 views
Academic · 1 min

CircuitBuilder: From Polynomials to Circuits via Reinforcement Learning

arXiv:2603.17075v1 Announce Type: new Abstract: Motivated by auto-proof generation and Valiant's VP vs. VNP conjecture, we study the problem of discovering efficient arithmetic circuits to …

Weikun K. Zhang, Rohan Pandey, Bhaumik Mehta, Kaijie Jin, Naomi Morato, Archit Ganapule, Michael Ruofan Zeng, Jarod Alper
10 views
Academic · 1 min

SENSE: Efficient EEG-to-Text via Privacy-Preserving Semantic Retrieval

arXiv:2603.17109v1 Announce Type: new Abstract: Decoding brain activity into natural language is a major challenge in AI with important applications in assistive communication, neurotechnology, and …

Akshaj Murhekar, Christina Liu, Abhijit Mishra, Shounak Roychowdhury, Jacek Gwizdka
4 views
Academic · 1 min

Contextual Preference Distribution Learning

arXiv:2603.17139v1 Announce Type: new Abstract: Decision-making problems often feature uncertainty stemming from heterogeneous and context-dependent human preferences. To address this, we propose a sequential learning-and-optimization …

Benjamin Hudson, Laurent Charlin, Emma Frejinger
9 views
Academic · 1 min

REAL: Regression-Aware Reinforcement Learning for LLM-as-a-Judge

arXiv:2603.17145v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly deployed as automated evaluators that assign numeric scores to model outputs, a paradigm known …

Yasi Zhang, Tianyu Chen, Mingyuan Zhou, Oscar Leong, Ying Nian Wu, Michal Lukasik
7 views