Academic

Academic

Academic · 1 min

Linear Predictability of Attention Heads in Large Language Models

arXiv:2603.13314v1 Announce Type: new Abstract: Large language model (LLM) inference is increasingly bottlenecked by the Key-Value (KV) cache, yet the fine-grained structure of attention-head activations …

Khalid Shaikh, Asmit Kumar Singh, Rebecca Christopher Dsouza, Shikhar Shiromani
10 views
Academic · 1 min

Modular Neural Computer

arXiv:2603.13323v1 Announce Type: new Abstract: This paper introduces the Modular Neural Computer (MNC), a memory-augmented neural architecture for exact algorithmic computation on variable-length inputs. The …

Florin Leon
35 views
Academic · 1 min

The Challenge of Out-Of-Distribution Detection in Motor Imagery BCIs

arXiv:2603.13324v1 Announce Type: new Abstract: Machine Learning classifiers used in Brain-Computer Interfaces make classifications based on the distribution of data they were trained on. When …

Merlijn Quincent Mulder, Matias Valdenegro-Toro, Andreea Ioana Sburlea, Ivo Pascal de Jong
6 views
Academic · 1 min

Feature-level Interaction Explanations in Multimodal Transformers

arXiv:2603.13326v1 Announce Type: new Abstract: Multimodal Transformers often produce predictions without clarifying how different modalities jointly support a decision. Most existing multimodal explainable AI (MXAI) …

Yeji Kim, Housam Khalifa Bashier Babiker, Mi-Young Kim, Randy Goebel
6 views