Academic

Academic

Academic · 1 min

Contrastive explanations of BDI agents

arXiv:2602.13323v1 Announce Type: new Abstract: The ability of autonomous systems to provide explanations is important for supporting transparency and aiding the development of (appropriate) trust. …

Michael Winikoff
7 views
Academic · 1 min

Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts

arXiv:2602.13367v1 Announce Type: new Abstract: We present Nanbeige4.1-3B, a unified generalist language model that simultaneously achieves strong agentic behavior, code generation, and general reasoning with …

Chen Yang, Guangyue Peng, Jiaying Zhu, Ran Le, Ruixiang Feng, Tao Zhang, Xiyun Xu, Yang Song, Yiming Jia, Yuntao Wen, Yunzhi Xu, Zekai Wang, Zhenwei An, Zhicong Sun, Zongchao Chen
2 views
Academic · 1 min

On-Policy Supervised Fine-Tuning for Efficient Reasoning

arXiv:2602.13407v1 Announce Type: new Abstract: Large reasoning models (LRMs) are commonly trained with reinforcement learning (RL) to explore long chain-of-thought reasoning, achieving strong performance at …

Anhao Zhao, Ziyang Chen, Junlong Tong, Yingqi Fan, Fanghua Ye, Shuhao Li, Yunpu Ma, Wenjie Li, Xiaoyu Shen
4 views
Academic · 1 min

OMNI-LEAK: Orchestrator Multi-Agent Network Induced Data Leakage

arXiv:2602.13477v1 Announce Type: new Abstract: As Large Language Model (LLM) agents become more capable, their coordinated use in the form of multi-agent systems is anticipated …

Akshat Naik, Jay Culligan, Yarin Gal, Philip Torr, Rahaf Aljundi, Alasdair Paren, Adel Bibi
4 views
Academic · 1 min

SPILLage: Agentic Oversharing on the Web

arXiv:2602.13516v1 Announce Type: new Abstract: LLM-powered agents are beginning to automate user's tasks across the open web, often with access to user resources such as …

Jaechul Roh, Eugene Bagdasarian, Hamed Haddadi, Ali Shahin Shamsabadi
6 views
Academic · 1 min

REMem: Reasoning with Episodic Memory in Language Agent

arXiv:2602.13530v1 Announce Type: new Abstract: Humans excel at remembering concrete experiences along spatiotemporal contexts and performing reasoning across those events, i.e., the capacity for episodic …

Yiheng Shu, Saisri Padmaja Jonnalagedda, Xiang Gao, Bernal Jim\'enez Guti\'errez, Weijian Qi, Kamalika Das, Huan Sun, Yu Su
3 views
Academic · 1 min

OpAgent: Operator Agent for Web Navigation

arXiv:2602.13559v1 Announce Type: new Abstract: To fulfill user instructions, autonomous web agents must contend with the inherent complexity and volatile nature of real-world websites. Conventional …

Yuyu Guo, Wenjie Yang, Siyuan Yang, Ziyang Liu, Cheng Chen, Yuan Wei, Yun Hu, Yang Huang, Guoliang Hao, Dongsheng Yuan, Jianming Wang, Xin Chen, Hang Yu, Lei Lei, Peng Di
3 views