All Articles

Articles

Latest First Most Viewed Alphabetical

All Conference (266) Law Review (314) Academic (4957) Think Tank (60) News (791) Journal (139) Technology & AI (4) Business & Strategy (1) Finance & Economics (2) Legal & Compliance (1) Innovation & Research (0) International Affairs (2) Cybersecurity (2) Healthcare & Biotech (2)

Academic · 1 min

To Mix or To Merge: Toward Multi-Domain Reinforcement Learning for Large Language Models

arXiv:2602.12566v1 Announce Type: new Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) plays a key role in stimulating the explicit reasoning capability of Large Language Models …

Haoqing Wang, Xiang Long, Ziheng Li, Yilong Xu, Tingguang Li, Yehui Tang

17 views Mar 7

Academic · 1 min

Can I Have Your Order? Monte-Carlo Tree Search for Slot Filling Ordering in Diffusion Language …

arXiv:2602.12586v1 Announce Type: new Abstract: While plan-and-infill decoding in Masked Diffusion Models (MDMs) shows promise for mathematical and code reasoning, performance remains highly sensitive to …

Joshua Ong Jun Leang, Yu Zhao, Mihaela C\u{a}t\u{a}lina Stoian, Wenda Li, Shay B. Cohen, Eleonora Giunchiglia

15 views Mar 7

Academic · 1 min

GeoAgent: Learning to Geolocate Everywhere with Reinforced Geographic Characteristics

arXiv:2602.12617v1 Announce Type: new Abstract: This paper presents GeoAgent, a model capable of reasoning closely with humans and deriving fine-grained address conclusions. Previous RL-based methods …

Modi Jin, Yiming Zhang, Boyuan Sun, Dingwen Zhang, MingMing Cheng, Qibin Hou

13 views Mar 7

Academic · 1 min

AI Agents for Inventory Control: Human-LLM-OR Complementarity

arXiv:2602.12631v1 Announce Type: new Abstract: Inventory control is a fundamental operations problem in which ordering decisions are traditionally guided by theoretically grounded operations research (OR) …

Jackie Baek, Yaopeng Fu, Will Ma, Tianyi Peng

85 views Mar 7

Academic · 1 min

Think Fast and Slow: Step-Level Cognitive Depth Adaptation for LLM Agents

arXiv:2602.12662v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly deployed as autonomous agents for multi-turn decision-making tasks. However, current agents typically rely on …

Ruihan Yang, Fanghua Ye, Xiang We, Ruoqing Zhao, Kang Luo, Xinbo Xu, Bo Zhao, Ruotian Ma, Shanyi Wang, Zhaopeng Tu, Xiaolong Li, Deqing Yang, Linus

14 views Mar 7

Academic · 1 min

Evaluating Robustness of Reasoning Models on Parameterized Logical Problems

arXiv:2602.12665v1 Announce Type: new Abstract: Logic provides a controlled testbed for evaluating LLM-based reasoners, yet standard SAT-style benchmarks often conflate surface difficulty (length, wording, clause …

Na\"im Es-sebbani, Esteban Marquer, Yakoub Salhi, Zied Bouraoui

13 views Mar 7

Academic · 1 min

X-SYS: A Reference Architecture for Interactive Explanation Systems

arXiv:2602.12748v1 Announce Type: new Abstract: The explainable AI (XAI) research community has proposed numerous technical methods, yet deploying explainability as systems remains challenging: Interactive explanation …

Tobias Labarta, Nhi Hoang, Maximilian Dreyer, Jim Berend, Oleg Hein, Jackie Ma, Wojciech Samek, Sebastian Lapuschkin

24 views Mar 7

Academic · 1 min

WebClipper: Efficient Evolution of Web Agents with Graph-based Trajectory Pruning

arXiv:2602.12852v1 Announce Type: new Abstract: Deep Research systems based on web agents have shown strong potential in solving complex information-seeking tasks, yet their search efficiency …

Junjie Wang, Zequn Xie, Dan Yang, Jie Feng, Yue Shen, Duolin Sun, Meixiu Long, Yihan Jiao, Zhehao Tan, Jian Wang, Peng Wei, Jinjie Gu

5 views Mar 7

Academic · 1 min

Information-theoretic analysis of world models in optimal reward maximizers

arXiv:2602.12963v1 Announce Type: new Abstract: An important question in the field of AI is the extent to which successful behaviour requires an internal representation of …

Alfred Harwood, Jose Faustino, Alex Altair

5 views Mar 7

Academic · 1 min

Consistency of Large Reasoning Models Under Multi-Turn Attacks

arXiv:2602.13093v2 Announce Type: new Abstract: Large reasoning models with reasoning capabilities achieve state-of-the-art performance on complex tasks, but their robustness under multi-turn adversarial pressure remains …

Yubo Li, Ramayya Krishnan, Rema Padman

16 views Mar 7

Academic · 1 min

Optimal Take-off under Fuzzy Clearances

arXiv:2602.13166v1 Announce Type: new Abstract: This paper presents a hybrid obstacle avoidance architecture that integrates Optimal Control under clearance with a Fuzzy Rule Based System …

Hugo Henry, Arthur Tsai, Kelly Cohen

8 views Mar 7

Academic · 1 min

Language-Guided Invariance Probing of Vision-Language Models

arXiv:2511.13494v1 Announce Type: cross Abstract: Recent vision-language models (VLMs) such as CLIP, OpenCLIP, EVA02-CLIP and SigLIP achieve strong zero-shot performance, but it is unclear how …

Jae Joong Lee

5 views Mar 7

← Previous

315 316 317 318 319

Articles

To Mix or To Merge: Toward Multi-Domain Reinforcement Learning for Large Language Models

Can I Have Your Order? Monte-Carlo Tree Search for Slot Filling Ordering in Diffusion Language …

GeoAgent: Learning to Geolocate Everywhere with Reinforced Geographic Characteristics

AI Agents for Inventory Control: Human-LLM-OR Complementarity

Think Fast and Slow: Step-Level Cognitive Depth Adaptation for LLM Agents

Evaluating Robustness of Reasoning Models on Parameterized Logical Problems

X-SYS: A Reference Architecture for Interactive Explanation Systems

WebClipper: Efficient Evolution of Web Agents with Graph-based Trajectory Pruning

Information-theoretic analysis of world models in optimal reward maximizers

Consistency of Large Reasoning Models Under Multi-Turn Attacks

Optimal Take-off under Fuzzy Clearances

Language-Guided Invariance Probing of Vision-Language Models

JCG, PC

HSOLLC Co., Ltd.