All Articles

Articles

Latest First Most Viewed Alphabetical

All Conference (266) Law Review (314) Academic (4957) Think Tank (60) News (791) Journal (139) Technology & AI (4) Business & Strategy (1) Finance & Economics (2) Legal & Compliance (1) Innovation & Research (0) International Affairs (2) Cybersecurity (2) Healthcare & Biotech (2)

Academic · 1 min

Measuring AI Agents' Progress on Multi-Step Cyber Attack Scenarios

arXiv:2603.11214v1 Announce Type: new Abstract: We evaluate the autonomous cyber-attack capabilities of frontier AI models on two purpose-built cyber ranges-a 32-step corporate network attack and …

Linus Folkerts, Will Payne, Simon Inman, Philippos Giavridis, Joe Skinner, Sam Deverett, James Aung, Ekin Zorer, Michael Schmatz, Mahmoud Ghanem, John Wilkinson, Alan Steer, Vy Hong, Jessica Wang

10 views Mar 13

Academic · 1 min

AI Psychometrics: Evaluating the Psychological Reasoning of Large Language Models with Psychometric Validities

arXiv:2603.11279v1 Announce Type: new Abstract: The immense number of parameters and deep neural networks make large language models (LLMs) rival the complexity of human brains, …

Yibai Li, Xiaolin Lin, Zhenghui Sha, Zhiye Jin, Xiaobing Li

10 views Mar 13

Academic · 1 min

Evaluating Explainable AI Attribution Methods in Neural Machine Translation via Attention-Guided Knowledge Distillation

arXiv:2603.11342v1 Announce Type: new Abstract: The study of the attribution of input features to the output of neural network models is an active area of …

Aria Nourbakhsh, Salima Lamsiyah, Adelaide Danilov, Christoph Schommer

22 views Mar 13

Academic · 1 min

STAIRS-Former: Spatio-Temporal Attention with Interleaved Recursive Structure Transformer for Offline Multi-task Multi-agent Reinforcement Learning

arXiv:2603.11691v1 Announce Type: new Abstract: Offline multi-agent reinforcement learning (MARL) with multi-task datasets is challenging due to varying numbers of agents across tasks and the …

Jiwon Jeon, Myungsik Cho, Youngchul Sung

18 views Mar 13

Academic · 1 min

Explicit Logic Channel for Validation and Enhancement of MLLMs on Zero-Shot Tasks

arXiv:2603.11689v1 Announce Type: new Abstract: Frontier Multimodal Large Language Models (MLLMs) exhibit remarkable capabilities in Visual-Language Comprehension (VLC) tasks. However, they are often deployed as …

Mei Chee Leong, Ying Gu, Hui Li Tan, Liyuan Li, Nancy Chen

10 views Mar 13

Academic · 1 min

The Density of Cross-Persistence Diagrams and Its Applications

arXiv:2603.11623v1 Announce Type: new Abstract: Topological Data Analysis (TDA) provides powerful tools to explore the shape and structure of data through topological features such as …

Alexander Mironenko, Evgeny. Burnaev, Serguei Barannikov

10 views Mar 13

Academic · 1 min

ThReadMed-QA: A Multi-Turn Medical Dialogue Benchmark from Real Patient Questions

arXiv:2603.11281v1 Announce Type: new Abstract: Medical question-answering benchmarks predominantly evaluate single-turn exchanges, failing to capture the iterative, clarification-seeking nature of real patient consultations. We introduce …

Monica Munnangi, Saiph Savage

22 views Mar 13

Academic · 1 min

DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use

arXiv:2603.11076v1 Announce Type: new Abstract: Recent work synthesizes agentic tasks for post-training tool-using LLMs, yet robust generalization under shifts in tasks and toolsets remains an …

Aili Chen, Chi Zhang, Junteng Liu, Jiangjie Chen, Chengyu Du, Yunji Li, Ming Zhong, Qin Wang, Zhengmao Zhu, Jiayuan Song, Ke Ji, Junxian He, Pengyu Zhao, Yanghua Xiao

19 views Mar 13

Academic · 1 min

An Automatic Text Classification Method Based on Hierarchical Taxonomies, Neural Networks and Document Embedding: The …

arXiv:2603.11770v1 Announce Type: new Abstract: This work describes an automatic text classification method implemented in a software tool called NETHIC, which takes advantage of the …

Luigi Lomasto, Rosario Di Florio, Andrea Ciapetti, Giuseppe Miscione, Giulia Ruggiero, Daniele Toti

21 views Mar 13

Academic · 1 min

CreativeBench: Benchmarking and Enhancing Machine Creativity via Self-Evolving Challenges

arXiv:2603.11863v1 Announce Type: new Abstract: The saturation of high-quality pre-training data has shifted research focus toward evolutionary systems capable of continuously generating novel artifacts, leading …

Zi-Han Wang, Lam Nguyen, Zhengyang Zhao, Mengyue Yang, Chengwei Qin, Yujiu Yang, Linyi Yang

16 views Mar 13

Academic · 1 min

Examining Users' Behavioural Intention to Use OpenClaw Through the Cognition--Affect--Conation Framework

arXiv:2603.11455v1 Announce Type: new Abstract: This study examines users' behavioural intention to use OpenClaw through the Cognition--Affect--Conation (CAC) framework. The research investigates how cognitive perceptions …

Yiran Du

13 views Mar 13

Academic · 1 min

CINDI: Conditional Imputation and Noisy Data Integrity with Flows in Power Grid Data

arXiv:2603.11745v1 Announce Type: new Abstract: Real-world multivariate time series, particularly in critical infrastructure such as electrical power grids, are often corrupted by noise and anomalies …

David Baumgartner, Helge Langseth, Heri Ramampiaro

21 views Mar 13

← Previous

186 187 188 189 190

Articles

Measuring AI Agents' Progress on Multi-Step Cyber Attack Scenarios

AI Psychometrics: Evaluating the Psychological Reasoning of Large Language Models with Psychometric Validities

Evaluating Explainable AI Attribution Methods in Neural Machine Translation via Attention-Guided Knowledge Distillation

STAIRS-Former: Spatio-Temporal Attention with Interleaved Recursive Structure Transformer for Offline Multi-task Multi-agent Reinforcement Learning

Explicit Logic Channel for Validation and Enhancement of MLLMs on Zero-Shot Tasks

The Density of Cross-Persistence Diagrams and Its Applications

ThReadMed-QA: A Multi-Turn Medical Dialogue Benchmark from Real Patient Questions

DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use

An Automatic Text Classification Method Based on Hierarchical Taxonomies, Neural Networks and Document Embedding: The …

CreativeBench: Benchmarking and Enhancing Machine Creativity via Self-Evolving Challenges

Examining Users' Behavioural Intention to Use OpenClaw Through the Cognition--Affect--Conation Framework

CINDI: Conditional Imputation and Noisy Data Integrity with Flows in Power Grid Data

JCG, PC

HSOLLC Co., Ltd.