Academic

Academic · 1 min

TherapyGym: Evaluating and Aligning Clinical Fidelity and Safety in Therapy Chatbots

arXiv:2603.18008v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly used for mental-health support; yet prevailing evaluation methods--fluency metrics, preference tests, and generic dialogue …

Fangrui Huang, Souhad Chbeir, Arpandeep Khatua, Sheng Wang, Sijun Tan, Kenan Ye, Lily Bailey, Merryn Daniel, Ryan Louie, Sanmi Koyejo, Ehsan Adeli

18 views Mar 20

Academic · 1 min

Do Large Language Models Possess a Theory of Mind? A Comparative Evaluation Using the Strange …

arXiv:2603.18007v1 Announce Type: new Abstract: The study explores whether current Large Language Models (LLMs) exhibit Theory of Mind (ToM) capabilities -- specifically, the ability to …

Anna Babarczy, Andras Lukacs, Peter Vedres, Zeteny Bujka

17 views Mar 20

Academic · 1 min

Can LLM generate interesting mathematical research problems?

arXiv:2603.18813v1 Announce Type: new Abstract: This paper is the second one in a series of work on the mathematical creativity of LLM. In the first …

Xiaoyang Chen, Xiang Jiang

12 views Mar 20

Academic · 1 min

dTRPO: Trajectory Reduction in Policy Optimization of Diffusion Large Language Models

arXiv:2603.18806v1 Announce Type: new Abstract: Diffusion Large Language Models (dLLMs) introduce a new paradigm for language generation, which in turn presents new challenges for aligning …

Wenxuan Zhang, Lemeng Wu, Changsheng Zhao, Ernie Chang, Mingchen Zhuge, Zechun Liu, Andy Su, Hanxian Huang, Jun Chen, Chong Zhou, Raghuraman Krishnamoorthi, Vikas Chandra, Mohamed Elhoseiny, Wei Wen

93 views Mar 20

Academic · 1 min

Proceedings of the 2nd Workshop on Advancing Artificial Intelligence through Theory of Mind

arXiv:2603.18786v1 Announce Type: new Abstract: This volume includes a selection of papers presented at the 2nd Workshop on Advancing Artificial Intelligence through Theory of Mind …

Nitay Alon, Joseph M. Barnby, Reuth Mirsky, Stefan Sarkadi

18 views Mar 20

Academic · 1 min

A Concept is More Than a Word: Diversified Unlearning in Text-to-Image Diffusion Models

arXiv:2603.18767v1 Announce Type: new Abstract: Concept unlearning has emerged as a promising direction for reducing the risks of harmful content generation in text-to-image diffusion models …

Duc Hao Pham, Van Duy Truong, Duy Khanh Dinh, Tien Cuong Nguyen, Dien Hy Ngo, Tuan Anh Bui

22 views Mar 20

Academic · 1 min

NeuroGame Transformer: Gibbs-Inspired Attention Driven by Game Theory and Statistical Physics

arXiv:2603.18761v1 Announce Type: new Abstract: Standard attention mechanisms in transformers are limited by their pairwise formulation, which hinders the modeling of higher-order dependencies among tokens. …

Djamel Bouchaffra, Fay\c{c}al Ykhlef, Hanene Azzag, Mustapha Lebbah, Bilal Faye

12 views Mar 20

Academic · 1 min

Analysis Of Linguistic Stereotypes in Single and Multi-Agent Generative AI Architectures

arXiv:2603.18729v1 Announce Type: new Abstract: Many works in the literature show that LLM outputs exhibit discriminatory behaviour, triggering stereotype-based inferences based on the dialect in …

Martina Ullasci, Marco Rondina, Riccardo Coppola, Flavio Giobergia, Riccardo Bellanca, Gabriele Mancari Pasi, Luca Prato, Federico Spinoso, Silvia Tagliente

14 views Mar 20

Academic · 1 min

MemMA: Coordinating the Memory Cycle through Multi-Agent Reasoning and In-Situ Self-Evolution

arXiv:2603.18718v1 Announce Type: new Abstract: Memory-augmented LLM agents maintain external memory banks to support long-horizon interaction, yet most existing systems treat construction, retrieval, and utilization …

Minhua Lin, Zhiwei Zhang, Hanqing Lu, Hui Liu, Xianfeng Tang, Qi He, Xiang Zhang, Suhang Wang

11 views Mar 20

Academic · 1 min

Accurate and Efficient Multi-Channel Time Series Forecasting via Sparse Attention Mechanism

arXiv:2603.18712v1 Announce Type: new Abstract: The task of multi-channel time series forecasting is ubiquitous in numerous fields such as finance, supply chain management, and energy …

Lei Gao, Hengda Bao, Jingfei Fang, Guangzheng Wu, Weihua Zhou, Yun Zhou

14 views Mar 20

Academic · 1 min

MANAR: Memory-augmented Attention with Navigational Abstract Conceptual Representation

arXiv:2603.18676v1 Announce Type: new Abstract: MANAR (Memory-augmented Attention with Navigational Abstract Conceptual Representation), contextualization layer generalizes standard multi-head attention (MHA) by instantiating the principles of …

Zuher Jahshan, Ben Ben Ishay, Leonid Yavits

9 views Mar 20

Academic · 1 min

Thinking with Constructions: A Benchmark and Policy Optimization for Visual-Text Interleaved Geometric Reasoning

arXiv:2603.18662v1 Announce Type: new Abstract: Geometric reasoning inherently requires "thinking with constructions" -- the dynamic manipulation of visual aids to bridge the gap between problem …

Haokun Zhao, Wanshi Xu, Haidong Yuan, Songjun Cao, Long Ma, Yanghua Xiao

13 views Mar 20

TherapyGym: Evaluating and Aligning Clinical Fidelity and Safety in Therapy Chatbots

Do Large Language Models Possess a Theory of Mind? A Comparative Evaluation Using the Strange …

Can LLM generate interesting mathematical research problems?

dTRPO: Trajectory Reduction in Policy Optimization of Diffusion Large Language Models

Proceedings of the 2nd Workshop on Advancing Artificial Intelligence through Theory of Mind

A Concept is More Than a Word: Diversified Unlearning in Text-to-Image Diffusion Models

NeuroGame Transformer: Gibbs-Inspired Attention Driven by Game Theory and Statistical Physics

Analysis Of Linguistic Stereotypes in Single and Multi-Agent Generative AI Architectures

MemMA: Coordinating the Memory Cycle through Multi-Agent Reasoning and In-Situ Self-Evolution

Accurate and Efficient Multi-Channel Time Series Forecasting via Sparse Attention Mechanism

MANAR: Memory-augmented Attention with Navigational Abstract Conceptual Representation

Thinking with Constructions: A Benchmark and Policy Optimization for Visual-Text Interleaved Geometric Reasoning

JCG, PC

HSOLLC Co., Ltd.