Academic

Academic

Academic · 1 min

When Do LLM Preferences Predict Downstream Behavior?

arXiv:2602.18971v1 Announce Type: new Abstract: Preference-driven behavior in LLMs may be a necessary precondition for AI misalignment such as sandbagging: models cannot strategically pursue misaligned …

Katarina Slama, Alexandra Souly, Dishank Bansal, Henry Davidson, Christopher Summerfield, Lennart Luettgau
3 views
Academic · 1 min

Benchmark Test-Time Scaling of General LLM Agents

arXiv:2602.18998v1 Announce Type: new Abstract: LLM agents are increasingly expected to function as general-purpose systems capable of resolving open-ended user requests. While existing benchmarks focus …

Xiaochuan Li, Ryan Ming, Pranav Setlur, Abhijay Paladugu, Andy Tang, Hao Kang, Shuai Shao, Rong Jin, Chenyan Xiong
12 views
Academic · 1 min

Defining Explainable AI for Requirements Analysis

arXiv:2602.19071v1 Announce Type: new Abstract: Explainable Artificial Intelligence (XAI) has become popular in the last few years. The Artificial Intelligence (AI) community in general, and …

Raymond Sheh, Isaac Monteath
4 views