Academic

Academic · 1 min

Hardware Efficient Approximate Convolution with Tunable Error Tolerance for CNNs

arXiv:2603.10100v1 Announce Type: new Abstract: Modern CNNs' high computational demands hinder edge deployment, as traditional ``hard'' sparsity (skipping mathematical zeros) loses effectiveness in deep layers …

Vishal Shashidhar, Anupam Kumari, Roy P Paily

37 views Mar 12

Academic · 1 min

CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR

arXiv:2603.10101v1 Announce Type: new Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) has significantly advanced the reasoning capacity of Large Language Models (LLMs). However, RLVR solely …

Sijia Cui, Pengyu Cheng, Jiajun Song, Yongbo Gai, Guojun Zhang, Zhechao Yu, Jianhe Lin, Xiaoxi Jiang, Guanjun Jiang

11 views Mar 12

Academic · 1 min

Lost in the Middle at Birth: An Exact Theory of Transformer Position Bias

arXiv:2603.10123v1 Announce Type: new Abstract: The ``Lost in the Middle'' phenomenon -- a U-shaped performance curve where LLMs retrieve well from the beginning and end …

Borun D Chowdhury

17 views Mar 12

Academic · 1 min

A neural operator for predicting vibration frequency response curves from limited data

arXiv:2603.10149v1 Announce Type: new Abstract: In the design of engineered components, rigorous vibration testing is essential for performance validation and identification of resonant frequencies and …

D. Bluedorn, A. Badawy, B. E. Saunders, D. Roettgen, A. Abdelkefi

41 views Mar 12

Academic · 1 min

Mashup Learning: Faster Finetuning by Remixing Past Checkpoints

arXiv:2603.10156v1 Announce Type: new Abstract: Finetuning on domain-specific data is a well-established method for enhancing LLM performance on downstream tasks. Training on each dataset produces …

Sofia Maria Lo Cicero Vaina, Artem Chumachenko, Max Ryabinin

17 views Mar 12

Academic · 1 min

DT-BEHRT: Disease Trajectory-aware Transformer for Interpretable Patient Representation Learning

arXiv:2603.10180v1 Announce Type: new Abstract: The growing adoption of electronic health record (EHR) systems has provided unprecedented opportunities for predictive modeling to guide clinical decision …

Deyi Li, Zijun Yao, Qi Xu, Muxuan Liang, Lingyao Li, Zijian Xu, Mei Liu

18 views Mar 12

Academic · 1 min

Actor-Accelerated Policy Dual Averaging for Reinforcement Learning in Continuous Action Spaces

arXiv:2603.10199v1 Announce Type: new Abstract: Policy Dual Averaging (PDA) offers a principled Policy Mirror Descent (PMD) framework that more naturally admits value function approximation than …

Ji Gao, Caleb Ju, Guanghui Lan, Zhaohui Tong

18 views Mar 12

Academic · 1 min

Rethinking the Harmonic Loss via Non-Euclidean Distance Layers

arXiv:2603.10225v1 Announce Type: new Abstract: Cross-entropy loss has long been the standard choice for training deep neural networks, yet it suffers from interpretability limitations, unbounded …

Maxwell Miller-Golub, Kamil Faber, Marcin Pietron, Panpan Zheng, Pasquale Minervini, Roberto Corizzo

18 views Mar 12

Academic · 1 min

SiMPO: Measure Matching for Online Diffusion Reinforcement Learning

arXiv:2603.10250v1 Announce Type: new Abstract: A commonly used family of RL algorithms for diffusion policies conducts softmax reweighting over the behavior policy, which usually induces …

Haitong Ma, Chenxiao Gao, Tianyi Chen, Na Li, Bo Dai

17 views Mar 12

Academic · 1 min

Improving TabPFN's Synthetic Data Generation by Integrating Causal Structure

arXiv:2603.10254v1 Announce Type: new Abstract: Synthetic tabular data generation addresses data scarcity and privacy constraints in a variety of domains. Tabular Prior-Data Fitted Network (TabPFN), …

Davide Tugnoli, Andrea De Lorenzo, Marco Virgolin, Giovanni Cin\`a

17 views Mar 12

Academic · 1 min

Discovery of a Hematopoietic Manifold in scGPT Yields a Method for Extracting Performant Algorithms from …

arXiv:2603.10261v1 Announce Type: new Abstract: We report the discovery and extraction of a compact hematopoietic algorithm from the single-cell foundation model scGPT, to our knowledge …

Ihor Kendiukhov

19 views Mar 12

Academic · 1 min

Estimating condition number with Graph Neural Networks

arXiv:2603.10277v1 Announce Type: new Abstract: In this paper, we propose a fast method for estimating the condition number of sparse matrices using graph neural networks …

Erin Carson, Xinye Chen

35 views Mar 12

Hardware Efficient Approximate Convolution with Tunable Error Tolerance for CNNs

CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR

Lost in the Middle at Birth: An Exact Theory of Transformer Position Bias

A neural operator for predicting vibration frequency response curves from limited data

Mashup Learning: Faster Finetuning by Remixing Past Checkpoints

DT-BEHRT: Disease Trajectory-aware Transformer for Interpretable Patient Representation Learning

Actor-Accelerated Policy Dual Averaging for Reinforcement Learning in Continuous Action Spaces

Rethinking the Harmonic Loss via Non-Euclidean Distance Layers

SiMPO: Measure Matching for Online Diffusion Reinforcement Learning

Improving TabPFN's Synthetic Data Generation by Integrating Causal Structure

Discovery of a Hematopoietic Manifold in scGPT Yields a Method for Extracting Performant Algorithms from …

Estimating condition number with Graph Neural Networks

JCG, PC

HSOLLC Co., Ltd.