Academic

Academic

Academic · 1 min

See and Remember: A Multimodal Agent for Web Traversal

arXiv:2603.02626v1 Announce Type: new Abstract: Autonomous web navigation requires agents to perceive complex visual environments and maintain long-term context, yet current Large Language Model (LLM) …

Xinjun Wang, Shengyao Wang, Aimin Zhou, Hao Hao
4 views
Academic · 1 min

SorryDB: Can AI Provers Complete Real-World Lean Theorems?

arXiv:2603.02668v1 Announce Type: new Abstract: We present SorryDB, a dynamically-updating benchmark of open Lean tasks drawn from 78 real world formalization projects on GitHub. Unlike …

Austin Letson, Leopoldo Sarra, Auguste Poiroux, Oliver Dressler, Paul Lezeau, Dhyan Aranha, Frederick Pu, Aaron Hill, Miguel Corredera Hidalgo, Julian Berman, George Tsoukalas, Lenny Taelman
3 views
Academic · 1 min

Retrieval-Augmented Robots via Retrieve-Reason-Act

arXiv:2603.02688v1 Announce Type: new Abstract: To achieve general-purpose utility, we argue that robots must evolve from passive executors into active Information Retrieval users. In strictly …

Izat Temiraliev, Diji Yang, Yi Zhang
12 views
Academic · 1 min

A Natural Language Agentic Approach to Study Affective Polarization

arXiv:2603.02711v1 Announce Type: new Abstract: Affective polarization has been central to political and social studies, with growing focus on social media, where partisan divisions are …

Stephanie Anneris Malvicini, Ewelina Gajewska, Arda Derbent, Katarzyna Budzynska, Jaros{\l}aw A. Chudziak, Maria Vanina Martinez
12 views
Academic · 1 min

EvoSkill: Automated Skill Discovery for Multi-Agent Systems

arXiv:2603.02766v1 Announce Type: new Abstract: Coding agents are increasingly used as general-purpose problem solvers, but their flexibility does not by itself confer the domain expertise …

Salaheddin Alzubi, Noah Provenzano, Jaydon Bingham, Weiyuan Chen, Tu Vu
4 views