Academic

Academic · 1 min

Detecting Multi-Agent Collusion Through Multi-Agent Interpretability

arXiv:2604.01151v1 Announce Type: new Abstract: As LLM agents are increasingly deployed in multi-agent systems, they introduce risks of covert coordination that may evade standard forms …

Aaron Rose, Carissa Cullen, Brandon Gary Kaplowitz, Christian Schroeder de Witt

8 views Apr 3

Academic · 1 min

Adversarial Moral Stress Testing of Large Language Models

arXiv:2604.01108v1 Announce Type: new Abstract: Evaluating the ethical robustness of large language models (LLMs) deployed in software systems remains challenging, particularly under sustained adversarial user …

Saeid Jamshidi, Foutse Khomh, Arghavan Moradi Dakhel, Amin Nikanjam, Mohammad Hamdaqa, Kawser Wazed Nafi

2 views Apr 3

Academic · 1 min

Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory

arXiv:2604.01007v2 Announce Type: new Abstract: AI agents increasingly operate over extended time horizons, yet their ability to retain, organize, and recall multimodal experiences remains a …

Jiaqi Liu, Zipeng Ling, Shi Qiu, Yanqing Liu, Siwei Han, Peng Xia, Haoqin Tu, Zeyu Zheng, Cihang Xie, Charles Fleming, Mingyu Ding, Huaxiu Yao

3 views Apr 3

Academic · 1 min

PsychAgent: An Experience-Driven Lifelong Learning Agent for Self-Evolving Psychological Counselor

arXiv:2604.00931v2 Announce Type: new Abstract: Existing methods for AI psychological counselors predominantly rely on supervised fine-tuning using static dialogue datasets. However, this contrasts with human …

Yutao Yang, Junsong Li, Qianjun Pan, Jie Zhou, Kai Chen, Qin Chen, Jingyuan Zhao, Ningning Zhou, Xin Li, Liang He

4 views Apr 3

Academic · 1 min

Experience as a Compass: Multi-agent RAG with Evolving Orchestration and Agent Prompts

arXiv:2604.00901v1 Announce Type: new Abstract: Multi-agent Retrieval-Augmented Generation (RAG), wherein each agent takes on a specific role, supports hard queries that require multiple steps and …

Sha Li, Naren Ramakrishnan

5 views Apr 3

Academic · 1 min

Beyond Symbolic Solving: Multi Chain-of-Thought Voting for Geometric Reasoning in Large Language Models

arXiv:2604.00890v1 Announce Type: new Abstract: Geometric Problem Solving (GPS) remains at the heart of enhancing mathematical reasoning in large language models because it requires the …

Md. Abu Bakor Siddique, Shahrin Hossain, Sadman Ahmed Siam, Syed Rifat Raiyan, Hasan Mahmud, Md Kamrul Hasan

7 views Apr 3

Academic · 1 min

Proactive Agent Research Environment: Simulating Active Users to Evaluate Proactive Assistants

arXiv:2604.00842v1 Announce Type: new Abstract: Proactive agents that anticipate user needs and autonomously execute tasks hold great promise as digital assistants, yet the lack of …

Deepak Nathani, Cheng Zhang, Chang Huan, Jiaming Shan, Yinfei Yang, Alkesh Patel, Zhe Gan, William Yang Wang, Michael Saxon, Xin Eric Wang

6 views Apr 3

Academic · 1 min

Preference Guided Iterated Pareto Referent Optimisation for Accessible Route Planning

arXiv:2604.00795v1 Announce Type: new Abstract: We propose the Preference Guided Iterated Pareto Referent Optimisation (PG-IPRO) for urban route planning for people with different accessibility requirements …

Paolo Speziali, Arno De Greef, Mehrdad Asadi, Willem R\"opke, Ann Now\'e, Diederik M. Roijers

3 views Apr 3

Academic · 1 min

RefineRL: Advancing Competitive Programming with Self-Refinement Reinforcement Learning

arXiv:2604.00790v1 Announce Type: new Abstract: While large language models (LLMs) have demonstrated strong performance on complex reasoning tasks such as competitive programming (CP), existing methods …

Shaopeng Fu, Xingxing Zhang, Li Dong, Di Wang, Furu Wei

3 views Apr 3

Academic · 1 min

UK AISI Alignment Evaluation Case-Study

arXiv:2604.00788v1 Announce Type: new Abstract: This technical report presents methods developed by the UK AI Security Institute for assessing whether advanced AI systems reliably follow …

Alexandra Souly, Robert Kirk, Jacob Merizian, Abby D'Cruz, Xander Davies

18 views Apr 3

Academic · 1 min

CircuitProbe: Predicting Reasoning Circuits in Transformers via Stability Zone Detection

arXiv:2604.00716v1 Announce Type: new Abstract: Transformer language models contain localized reasoning circuits, contiguous layer blocks that improve reasoning when duplicated at inference time. Finding these …

Rajkiran Panuganti

2 views Apr 3

Academic · 1 min

Agent psychometrics: Task-level performance prediction in agentic coding benchmarks

arXiv:2604.00594v1 Announce Type: new Abstract: As the focus in LLM-based coding shifts from static single-step code generation to multi-step agentic interaction with tools and environments, …

Chris Ge, Daria Kryvosheieva, Daniel Fried, Uzay Girit, Kaivalya Hariharan

2 views Apr 3

Detecting Multi-Agent Collusion Through Multi-Agent Interpretability

Adversarial Moral Stress Testing of Large Language Models

Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory

PsychAgent: An Experience-Driven Lifelong Learning Agent for Self-Evolving Psychological Counselor

Experience as a Compass: Multi-agent RAG with Evolving Orchestration and Agent Prompts

Beyond Symbolic Solving: Multi Chain-of-Thought Voting for Geometric Reasoning in Large Language Models

Proactive Agent Research Environment: Simulating Active Users to Evaluate Proactive Assistants

Preference Guided Iterated Pareto Referent Optimisation for Accessible Route Planning

RefineRL: Advancing Competitive Programming with Self-Refinement Reinforcement Learning

UK AISI Alignment Evaluation Case-Study

CircuitProbe: Predicting Reasoning Circuits in Transformers via Stability Zone Detection

Agent psychometrics: Task-level performance prediction in agentic coding benchmarks

JCG, PC

HSOLLC Co., Ltd.