All Articles

Articles

Latest First Most Viewed Alphabetical

All Conference (266) Law Review (314) Academic (4957) Think Tank (60) News (791) Journal (139) Technology & AI (4) Business & Strategy (1) Finance & Economics (2) Legal & Compliance (1) Innovation & Research (0) International Affairs (2) Cybersecurity (2) Healthcare & Biotech (2)

Academic · 1 min

Hybrid Self-evolving Structured Memory for GUI Agents

arXiv:2603.10291v1 Announce Type: new Abstract: The remarkable progress of vision-language models (VLMs) has enabled GUI agents to interact with computers in a human-like manner. Yet …

Sibo Zhu, Wenyi Wu, Kun Zhou, Stephen Wang, Biwei Huang

22 views Mar 12

Academic · 1 min

GATech at AbjadMed: Bidirectional Encoders vs. Causal Decoders: Insights from 82-Class Arabic Medical Classification

arXiv:2603.10008v1 Announce Type: cross Abstract: This paper presents system description for Arabic medical text classification across 82 distinct categories. Our primary architecture utilizes a fine-tuned …

Ahmed Khaled Khamis

28 views Mar 12

Academic · 1 min

FERRET: Framework for Expansion Reliant Red Teaming

arXiv:2603.10010v1 Announce Type: cross Abstract: We introduce a multi-faceted automated red teaming framework in which the goal is to generate multi-modal adversarial conversations that would …

Ninareh Mehrabi, Vitor Albiero, Maya Pavlova, Joanna Bitton

28 views Mar 12

Academic · 1 min

TAMUSA-Chat: A Domain-Adapted Large Language Model Conversational System for Research and Responsible Deployment

arXiv:2603.09992v1 Announce Type: cross Abstract: This paper presents TAMUSA-Chat, a research-oriented framework for building domain-adapted large language model conversational systems. The work addresses critical challenges …

Izzat Alsmadi, Anas Alsobeh

22 views Mar 12

Academic · 1 min

The System Hallucination Scale (SHS): A Minimal yet Effective Human-Centered Instrument for Evaluating Hallucination-Related Behavior …

arXiv:2603.09989v1 Announce Type: cross Abstract: We introduce the System Hallucination Scale (SHS), a lightweight and human-centered measurement instrument for assessing hallucination-related behavior in large language …

Heimo M\"uller, Dominik Steiger, Markus Plass, Andreas Holzinger

15 views Mar 12

Academic · 1 min

Causally Grounded Mechanistic Interpretability for LLMs with Faithful Natural-Language Explanations

arXiv:2603.09988v1 Announce Type: cross Abstract: Mechanistic interpretability identifies internal circuits responsible for model behaviors, yet translating these findings into human-understandable explanations remains an open problem. …

Ajay Pravin Mahale

12 views Mar 12

Academic · 1 min

Trajectory-Informed Memory Generation for Self-Improving Agent Systems

arXiv:2603.10600v1 Announce Type: new Abstract: LLM-powered agents face a persistent challenge: learning from their execution experiences to improve future performance. While agents can successfully complete …

Gaodan Fang, Vatche Isahagian, K. R. Jayaram, Ritesh Kumar, Vinod Muthusamy, Punleuk Oum, Gegi Thomas

32 views Mar 12

Academic · 1 min

Resource-constrained Amazons chess decision framework integrating large language models and graph attention

arXiv:2603.10512v1 Announce Type: new Abstract: Artificial intelligence has advanced significantly through the development of intelligent game-playing systems, providing rigorous testbeds for decision-making, strategic planning, and …

Tianhao Qian, Zhuoxuan Li, Jinde Cao, Xinli Shi, Hanjie Liu, Leszek Rutkowski

19 views Mar 12

Academic · 1 min

SENS-ASR: Semantic Embedding injection in Neural-transducer for Streaming Automatic Speech Recognition

arXiv:2603.10005v1 Announce Type: cross Abstract: Many Automatic Speech Recognition (ASR) applications require streaming processing of the audio data. In streaming mode, ASR systems need to …

Youness Dkhissi (LIUM), Valentin Vielzeuf (LIUM), Elys Allesiardo (LIUM), Anthony Larcher (LIUM)

13 views Mar 12

Academic · 1 min

The Dunning-Kruger Effect in Large Language Models: An Empirical Study of Confidence Calibration

arXiv:2603.09985v1 Announce Type: cross Abstract: Large language models (LLMs) have demonstrated remarkable capabilities across diverse tasks, yet their ability to accurately assess their own confidence …

Sudipta Ghosh, Mrityunjoy Panday

15 views Mar 12

Academic · 1 min

CUAAudit: Meta-Evaluation of Vision-Language Models as Auditors of Autonomous Computer-Use Agents

arXiv:2603.10577v1 Announce Type: new Abstract: Computer-Use Agents (CUAs) are emerging as a new paradigm in human-computer interaction, enabling autonomous execution of tasks in desktop environment …

Marta Sumyk, Oleksandr Kosovan

18 views Mar 12

Academic · 1 min

Assessing Cognitive Biases in LLMs for Judicial Decision Support: Virtuous Victim and Halo Effects

arXiv:2603.10016v1 Announce Type: cross Abstract: We investigate whether large language models (LLMs) display human-like cognitive biases, focusing on potential implications for assistance in judicial sentencing, …

Sierra S. Liu

41 views Mar 12

← Previous

196 197 198 199 200

Articles

Hybrid Self-evolving Structured Memory for GUI Agents

GATech at AbjadMed: Bidirectional Encoders vs. Causal Decoders: Insights from 82-Class Arabic Medical Classification

FERRET: Framework for Expansion Reliant Red Teaming

TAMUSA-Chat: A Domain-Adapted Large Language Model Conversational System for Research and Responsible Deployment

The System Hallucination Scale (SHS): A Minimal yet Effective Human-Centered Instrument for Evaluating Hallucination-Related Behavior …

Causally Grounded Mechanistic Interpretability for LLMs with Faithful Natural-Language Explanations

Trajectory-Informed Memory Generation for Self-Improving Agent Systems

Resource-constrained Amazons chess decision framework integrating large language models and graph attention

SENS-ASR: Semantic Embedding injection in Neural-transducer for Streaming Automatic Speech Recognition

The Dunning-Kruger Effect in Large Language Models: An Empirical Study of Confidence Calibration

CUAAudit: Meta-Evaluation of Vision-Language Models as Auditors of Autonomous Computer-Use Agents

Assessing Cognitive Biases in LLMs for Judicial Decision Support: Virtuous Victim and Halo Effects

JCG, PC

HSOLLC Co., Ltd.