Academic

Integrating Inductive Biases in Transformers via Distillation for Financial Time Series Forecasting

arXiv:2603.16985v1 Announce Type: new Abstract: Transformer-based models have been widely adopted for time-series forecasting due to their high representational capacity and architectural flexibility. However, many …

Yu-Chen Den, Kuan-Yu Chen, Kendro Vincent, Darby Tien-Hao Chang

Mar 19

Transformers Can Learn Rules They've Never Seen: Proof of Computation Beyond Interpolation

arXiv:2603.17019v1 Announce Type: new Abstract: A central question in the LLM debate is whether transformers can infer rules absent from training, or whether apparent generalisation …

Andy Gray

Mar 19

Do Understanding and Generation Fight? A Diagnostic Study of DPO for Unified Multimodal Models

arXiv:2603.17044v1 Announce Type: new Abstract: Unified multimodal models share a language model backbone for both understanding and generating images. Can DPO align both capabilities simultaneously? …

Abinav Rao, Sujan Rachuri

Mar 19

SCE-LITE-HQ: Smooth visual counterfactual explanations with generative foundation models

arXiv:2603.17048v1 Announce Type: new Abstract: Modern neural networks achieve strong performance but remain difficult to interpret in high-dimensional visual domains. Counterfactual explanations (CFEs) provide a …

Ahmed Zeid, Sidney Bender

Mar 19

Early Quantization Shrinks Codebook: A Simple Fix for Diversity-Preserving Tokenization

arXiv:2603.17052v1 Announce Type: new Abstract: Vector quantization is a technique in machine learning that discretizes continuous representations into a set of discrete vectors. It is …

Wenhao Zhao, Qiran Zou, Rushi Shah, Yudi Wu, Zhouhan Lin, Dianbo Liu

Mar 19

PRISM: Demystifying Retention and Interaction in Mid-Training

arXiv:2603.17074v1 Announce Type: new Abstract: We present PRISM, a comprehensive empirical study of mid-training design choices for large language models. Through controlled experiments across seven …

Bharat Runwal, Ashish Agrawal, Anurag Roy, Rameswar Panda

Mar 19

CircuitBuilder: From Polynomials to Circuits via Reinforcement Learning

arXiv:2603.17075v1 Announce Type: new Abstract: Motivated by auto-proof generation and Valiant's VP vs. VNP conjecture, we study the problem of discovering efficient arithmetic circuits to …

Weikun K. Zhang, Rohan Pandey, Bhaumik Mehta, Kaijie Jin, Naomi Morato, Archit Ganapule, Michael Ruofan Zeng, Jarod Alper

Mar 19

SENSE: Efficient EEG-to-Text via Privacy-Preserving Semantic Retrieval

arXiv:2603.17109v1 Announce Type: new Abstract: Decoding brain activity into natural language is a major challenge in AI with important applications in assistive communication, neurotechnology, and …

Akshaj Murhekar, Christina Liu, Abhijit Mishra, Shounak Roychowdhury, Jacek Gwizdka

Mar 19

Topology-Preserving Deep Joint Source-Channel Coding for Semantic Communication

arXiv:2603.17126v1 Announce Type: new Abstract: Many wireless vision applications, such as autonomous driving, require preservation of global structural information rather than only per-pixel fidelity. However, …

Omar Erak, Omar Alhussein, Fang Fang, Sami Muhaidat

Mar 19

Contextual Preference Distribution Learning

arXiv:2603.17139v1 Announce Type: new Abstract: Decision-making problems often feature uncertainty stemming from heterogeneous and context-dependent human preferences. To address this, we propose a sequential learning-and-optimization …

Benjamin Hudson, Laurent Charlin, Emma Frejinger

Mar 19

REAL: Regression-Aware Reinforcement Learning for LLM-as-a-Judge

arXiv:2603.17145v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly deployed as automated evaluators that assign numeric scores to model outputs, a paradigm known …

Yasi Zhang, Tianyu Chen, Mingyuan Zhou, Oscar Leong, Ying Nian Wu, Michal Lukasik

Mar 19

Personalized Fall Detection by Balancing Data with Selective Feedback Using Contrastive Learning

arXiv:2603.17148v1 Announce Type: new Abstract: Personalized fall detection models can significantly improve accuracy by adapting to individual motion patterns, yet their effectiveness is often limited …

Awatif Yasmin, Tarek Mahmud, Sana Alamgeer, Anne H. H. Ngu

Mar 19
