Tag: cs.LG

#cs.LG

Latest First Most Viewed Alphabetical

All Conference (266) Law Review (314) Academic (4957) Think Tank (60) News (791) Journal (139) Technology & AI (4) Business & Strategy (1) Finance & Economics (2) Legal & Compliance (1) Innovation & Research (0) International Affairs (2) Cybersecurity (2) Healthcare & Biotech (2)

Academic · 1 min

Extraction of linearized models from pre-trained networks via knowledge distillation

arXiv:2604.06732v1 Announce Type: new Abstract: Recent developments in hardware, such as photonic integrated circuits and optical devices, are driving demand for research on constructing machine …

Fumito Kimura, Jun Ohkubo

88 views Apr 9

Academic · 1 min

SHAPE: Stage-aware Hierarchical Advantage via Potential Estimation for LLM Reasoning

arXiv:2604.06636v1 Announce Type: new Abstract: Process supervision has emerged as a promising approach for enhancing LLM reasoning, yet existing methods fail to distinguish meaningful progress …

Zhengyang Ai, Zikang Shan, Xiaodong Ai, Jingxian Tang, Hangkai Hu, Pinyan Lu

72 views Apr 9

Academic · 1 min

Distributional Open-Ended Evaluation of LLM Cultural Value Alignment Based on Value Codebook

arXiv:2604.06210v1 Announce Type: new Abstract: As LLMs are globally deployed, aligning their cultural value orientations is critical for safety and user engagement. However, existing benchmarks …

Jaehyeok Lee, Xiaoyuan Yi, Jing Yao, Hyunjin Hwang, Roy Ka-Wei Lee, Xing Xie, JinYeong Bak

74 views Apr 9

Academic · 1 min

MO-RiskVAE: A Multi-Omics Variational Autoencoder for Survival Risk Modeling in Multiple MyelomaMO-RiskVAE

arXiv:2604.06267v1 Announce Type: new Abstract: Multimodal variational autoencoders (VAEs) have emerged as a powerful framework for survival risk modeling in multiple myeloma by integrating heterogeneous …

Zixuan Chen, Heng Zhang, YuPeng Qin, WenPeng Xing, Qiang Wang, Da Wang, Changting Lin, Meng Han

37 views Apr 9

Academic · 1 min

The Rhetoric of Machine Learning

arXiv:2604.06754v1 Announce Type: new Abstract: I examine the technology of machine learning from the perspective of rhetoric, which is simply the art of persuasion. Rather …

Robert C. Williamson

127 views Apr 9

Academic · 1 min

SubFLOT: Submodel Extraction for Efficient and Personalized Federated Learning via Optimal Transport

arXiv:2604.06631v1 Announce Type: new Abstract: Federated Learning (FL) enables collaborative model training while preserving data privacy, but its practical deployment is hampered by system and …

Zheng Jiang, Nan He, Yiming Chen, Lifeng Sun

71 views Apr 9

Academic · 1 min

Quality-preserving Model for Electronics Production Quality Tests Reduction

arXiv:2604.06451v1 Announce Type: new Abstract: Manufacturing test flows in high-volume electronics production are typically fixed during product development and executed unchanged on every unit, even …

Noufa Haneefa, Teddy Lazebnik, Einav Peretz-Andersson

74 views Apr 9

Academic · 1 min

Distributed Interpretability and Control for Large Language Models

arXiv:2604.06483v1 Announce Type: new Abstract: Large language models that require multiple GPU cards to host are usually the most capable models. It is necessary to …

Dev Arpan Desai, Shaoyi Huang, Zining Zhu

48 views Apr 9

Academic · 1 min

Optimal Rates for Pure {\varepsilon}-Differentially Private Stochastic Convex Optimization with Heavy Tails

arXiv:2604.06492v1 Announce Type: new Abstract: We study stochastic convex optimization (SCO) with heavy-tailed gradients under pure epsilon-differential privacy (DP). Instead of assuming a bound on …

Andrew Lowy

59 views Apr 9

Academic · 1 min

Drifting Fields are not Conservative

arXiv:2604.06333v1 Announce Type: new Abstract: Drifting models generate high-quality samples in a single forward pass by transporting generated samples toward the data distribution using a …

Leonard Franz, Sebastian Hoffmann, Georg Martius

54 views Apr 9

Academic · 1 min

RAGEN-2: Reasoning Collapse in Agentic RL

arXiv:2604.06268v1 Announce Type: new Abstract: RL training of multi-turn LLM agents is inherently unstable, and reasoning quality directly determines task performance. Entropy is widely used …

Zihan Wang, Chi Gui, Xing Jin, Qineng Wang, Licheng Liu, Kangrui Wang, Shiqi Chen, Linjie Li, Zhengyuan Yang, Pingyue Zhang, Yiping Lu, Jiajun Wu, Li Fei-Fei, Lijuan Wang, Yejin Choi, Manling Li

92 views Apr 9

Academic · 1 min

When Does Context Help? A Systematic Study of Target-Conditional Molecular Property Prediction

arXiv:2604.06558v1 Announce Type: new Abstract: We present the first systematic study of when target context helps molecular property prediction, evaluating context conditioning across 10 diverse …

Bryan Cheng, Jasper Zhang

63 views Apr 9

1 2 3

#cs.LG

Extraction of linearized models from pre-trained networks via knowledge distillation

SHAPE: Stage-aware Hierarchical Advantage via Potential Estimation for LLM Reasoning

Distributional Open-Ended Evaluation of LLM Cultural Value Alignment Based on Value Codebook

MO-RiskVAE: A Multi-Omics Variational Autoencoder for Survival Risk Modeling in Multiple MyelomaMO-RiskVAE

The Rhetoric of Machine Learning

SubFLOT: Submodel Extraction for Efficient and Personalized Federated Learning via Optimal Transport

Quality-preserving Model for Electronics Production Quality Tests Reduction

Distributed Interpretability and Control for Large Language Models

Optimal Rates for Pure {\varepsilon}-Differentially Private Stochastic Convex Optimization with Heavy Tails

Drifting Fields are not Conservative

RAGEN-2: Reasoning Collapse in Agentic RL

When Does Context Help? A Systematic Study of Target-Conditional Molecular Property Prediction

JCG, PC

HSOLLC Co., Ltd.