All Articles

Articles

Academic · 1 min

Attention-guided Evidence Grounding for Spoken Question Answering

arXiv:2603.16292v1 Announce Type: new Abstract: Spoken Question Answering (Spoken QA) presents a challenging cross-modal problem: effectively aligning acoustic queries with textual knowledge while avoiding the …

Ke Yang, Bolin Chen, Yuejie Li, Yueying Hua, Jianhao Nie, Yueping He, Bowen Li, Chengjun Mao
12 views
Academic · 1 min

Who Benchmarks the Benchmarks? A Case Study of LLM Evaluation in Icelandic

arXiv:2603.16406v1 Announce Type: new Abstract: This paper evaluates current Large Language Model (LLM) benchmarking for Icelandic, identifies problems, and calls for improved evaluation methods in …

Finnur \'Ag\'ust Ingimundarson, Steinunn Rut Fri{\dh}riksd\'ottir, Bjarki \'Armannsson, Iris Edda Nowenstein, Stein{\th}\'or Steingr\'imsson
12 views
Academic · 1 min

On the Emotion Understanding of Synthesized Speech

arXiv:2603.16483v1 Announce Type: new Abstract: Emotion is a core paralinguistic feature in voice interaction. It is widely believed that emotion understanding models learn fundamental representations …

Yuan Ge, Haishu Zhao, Aokai Hao, Junxiang Zhang, Bei Li, Xiaoqian Liu, Chenglong Wang, Jianjin Wang, Bingsen Zhou, Bingyu Liu, Jingbo Zhu, Zhengtao Yu, Tong Xiao
9 views
Academic · 1 min

AdaMem: Adaptive User-Centric Memory for Long-Horizon Dialogue Agents

arXiv:2603.16496v1 Announce Type: new Abstract: Large language model (LLM) agents increasingly rely on external memory to support long-horizon interaction, personalized assistance, and multi-step reasoning. However, …

Shannan Yan, Jingchen Ni, Leqi Zheng, Jiajun Zhang, Peixi Wu, Dacheng Yin, Jing Lyu, Chun Yuan, Fengyun Rao
9 views