Academic

Scaling Attention via Feature Sparsity

arXiv:2603.22300v1 Abstract: Scaling Transformers to ultra-long contexts is bottlenecked by the $O(n^2 d)$ cost of self-attention. Existing methods reduce this cost along …
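The $O(n^2 d)$ bottleneck the abstract refers to comes from the $n \times n$ score matrix that standard scaled dot-product attention materializes over a sequence of $n$ tokens with dimension $d$. A minimal NumPy sketch of vanilla self-attention (not the paper's method) makes the quadratic term visible:

```python
import numpy as np

def self_attention(X, Wq, Wk, Wv):
    """Standard scaled dot-product self-attention over X of shape (n, d)."""
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    # scores has shape (n, n): computing it costs O(n^2 d) time and memory.
    scores = Q @ K.T / np.sqrt(K.shape[1])
    # Row-wise softmax (numerically stabilized).
    weights = np.exp(scores - scores.max(axis=1, keepdims=True))
    weights /= weights.sum(axis=1, keepdims=True)
    return weights @ V  # shape (n, d)

n, d = 8, 4
rng = np.random.default_rng(0)
X = rng.standard_normal((n, d))
Wq, Wk, Wv = (rng.standard_normal((d, d)) for _ in range(3))
out = self_attention(X, Wq, Wk, Wv)
print(out.shape)  # (8, 4)
```

Doubling $n$ quadruples the size of `scores`, which is why long-context methods target this matrix rather than the $O(nd)$ projections.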

Yan Xie, Tiansheng Wen, Tangda Huang, Bo Chen, Chenyu You, Stefanie Jegelka, Yifei Wang

Latent Semantic Manifolds in Large Language Models

arXiv:2603.22301v1 Abstract: Large Language Models (LLMs) perform internal computations in continuous vector spaces yet produce discrete tokens -- a fundamental mismatch whose …

Mohamed A. Mabrok