Academic

Latest First Most Viewed Alphabetical

All Conference (266) Law Review (314) Academic (4957) Think Tank (60) News (791) Journal (139) Technology & AI (4) Business & Strategy (1) Finance & Economics (2) Legal & Compliance (1) Innovation & Research (0) International Affairs (2) Cybersecurity (2) Healthcare & Biotech (2)

Academic · 1 min

Robust Post-Training for Generative Recommenders: Why Exponential Reward-Weighted SFT Outperforms RLHF

arXiv:2603.10279v1 Announce Type: new Abstract: Aligning generative recommender systems to user preferences via post-training is critical for closing the gap between next-item prediction and actual …

Keertana Chidambaram, Sanath Kumar Krishnamurthy, Qiuling Xu, Ko-Jen Hsiao, Moumita Bhattacharya

21 views Mar 12

Academic · 1 min

Taming Score-Based Denoisers in ADMM: A Convergent Plug-and-Play Framework

arXiv:2603.10281v1 Announce Type: new Abstract: While score-based generative models have emerged as powerful priors for solving inverse problems, directly integrating them into optimization algorithms such …

Rajesh Shrestha, Xiao Fu

40 views Mar 12

Academic · 1 min

GSVD for Geometry-Grounded Dataset Comparison: An Alignment Angle Is All You Need

arXiv:2603.10283v1 Announce Type: new Abstract: Geometry-grounded learning asks models to respect structure in the problem domain rather than treating observations as arbitrary vectors. Motivated by …

Eduarda de Souza Marques, Arthur Sobrinho Ferreira da Rocha, Joao Paixao, Heudson Mirandola, Daniel Sadoc Menasche

23 views Mar 12

Academic · 1 min

Copula-ResLogit: A Deep-Copula Framework for Unobserved Confounding Effects

arXiv:2603.10284v1 Announce Type: new Abstract: A key challenge in travel demand analysis is the presence of unobserved factors that may generate non-causal dependencies, obscuring the …

Kimia Kamal, Bilal Farooq

22 views Mar 12

Academic · 1 min

GaLoRA: Parameter-Efficient Graph-Aware LLMs for Node Classification

arXiv:2603.10298v1 Announce Type: new Abstract: The rapid rise of large language models (LLMs) and their ability to capture semantic relationships has led to their adoption …

Mayur Choudhary, Saptarshi Sengupta, Katerina Potika

29 views Mar 12

Academic · 1 min

Regime-aware financial volatility forecasting via in-context learning

arXiv:2603.10299v1 Announce Type: new Abstract: This work introduces a regime-aware in-context learning framework that leverages large language models (LLMs) for financial volatility forecasting under nonstationary …

Saba Asaad, Shayan Mohajer Hamidi, Ali Bereyhi

15 views Mar 12

Academic · 1 min

What do near-optimal learning rate schedules look like?

arXiv:2603.10301v1 Announce Type: new Abstract: A basic unanswered question in neural network training is: what is the best learning rate schedule shape for a given …

Hiroki Naganuma, Atish Agarwala, Priya Kasimbeg, George E. Dahl

18 views Mar 12

Academic · 1 min

How to make the most of your masked language model for protein engineering

arXiv:2603.10302v1 Announce Type: new Abstract: A plethora of protein language models have been released in recent years. Yet comparatively little work has addressed how to …

Calvin McCarter, Nick Bhattacharya, Sebastian W. Ober, Hunter Elliott

47 views Mar 12

Academic · 1 min

Data-Driven Integration Kernels for Interpretable Nonlocal Operator Learning

arXiv:2603.10305v1 Announce Type: new Abstract: Machine learning models can represent climate processes that are nonlocal in horizontal space, height, and time, often by combining information …

Savannah L. Ferretti, Jerry Lin, Sara Shamekh, Jane W. Baldwin, Michael S. Pritchard, Tom Beucler

37 views Mar 12

Academic · 1 min

Federated Active Learning Under Extreme Non-IID and Global Class Imbalance

arXiv:2603.10341v1 Announce Type: new Abstract: Federated active learning (FAL) seeks to reduce annotation cost under privacy constraints, yet its effectiveness degrades in realistic settings with …

Chen-Chen Zong, Sheng-Jun Huang

28 views Mar 12

Academic · 1 min

Causal Concept Graphs in LLM Latent Space for Stepwise Reasoning

arXiv:2603.10377v1 Announce Type: new Abstract: Sparse autoencoders can localize where concepts live in language models, but not how they interact during multi-step reasoning. We propose …

Md Muntaqim Meherab, Noor Islam S. Mohammad, Faiza Feroz

49 views Mar 12

Academic · 1 min

Optimal Expert-Attention Allocation in Mixture-of-Experts: A Scalable Law for Dynamic Model Design

arXiv:2603.10379v1 Announce Type: new Abstract: This paper presents a novel extension of neural scaling laws to Mixture-of-Experts (MoE) models, focusing on the optimal allocation of …

Junzhuo Li, Peijie Jiang, Changxin Tian, Jia Liu, Zhiqiang Zhang, Xuming Hu

19 views Mar 12

← Previous

170 171 172 173 174

Academic

Robust Post-Training for Generative Recommenders: Why Exponential Reward-Weighted SFT Outperforms RLHF

Taming Score-Based Denoisers in ADMM: A Convergent Plug-and-Play Framework

GSVD for Geometry-Grounded Dataset Comparison: An Alignment Angle Is All You Need

Copula-ResLogit: A Deep-Copula Framework for Unobserved Confounding Effects

GaLoRA: Parameter-Efficient Graph-Aware LLMs for Node Classification

Regime-aware financial volatility forecasting via in-context learning

What do near-optimal learning rate schedules look like?

How to make the most of your masked language model for protein engineering

Data-Driven Integration Kernels for Interpretable Nonlocal Operator Learning

Federated Active Learning Under Extreme Non-IID and Global Class Imbalance

Causal Concept Graphs in LLM Latent Space for Stepwise Reasoning

Optimal Expert-Attention Allocation in Mixture-of-Experts: A Scalable Law for Dynamic Model Design

JCG, PC

HSOLLC Co., Ltd.