Academic

Latest First Most Viewed Alphabetical

All Conference (266) Law Review (314) Academic (4957) Think Tank (60) News (791) Journal (139) Technology & AI (4) Business & Strategy (1) Finance & Economics (2) Legal & Compliance (1) Innovation & Research (0) International Affairs (2) Cybersecurity (2) Healthcare & Biotech (2)

Academic · 1 min

Improving LLM Performance Through Black-Box Online Tuning: A Case for Adding System Specs to Factsheets …

arXiv:2603.11340v1 Announce Type: new Abstract: In this paper, we present a novel black-box online controller that uses only end-to-end measurements over short segments, without internal …

Yonas Atinafu, Henry Lin, Robin Cohen

49 views Mar 13

Academic · 1 min

Summarize Before You Speak with ARACH: A Training-Free Inference-Time Plug-In for Enhancing LLMs via Global …

arXiv:2603.11067v1 Announce Type: new Abstract: Large language models (LLMs) achieve remarkable performance, yet further gains often require costly training. This has motivated growing interest in …

Jingtao Wang, Yucong Wang, Jun Ding, Rui Cai, Xun Wang

34 views Mar 13

Academic · 1 min

From Debate to Deliberation: Structured Collective Reasoning with Typed Epistemic Acts

arXiv:2603.11781v1 Announce Type: new Abstract: Multi-agent LLM systems increasingly tackle complex reasoning, yet their interaction patterns remain limited to voting, unstructured debate, or pipeline orchestration. …

Sunil Prakash

39 views Mar 13

Academic · 1 min

Social, Legal, Ethical, Empathetic and Cultural Norm Operationalisation for AI Agents

arXiv:2603.11864v1 Announce Type: new Abstract: As AI agents are increasingly used in high-stakes domains like healthcare and law enforcement, aligning their behaviour with social, legal, …

Radu Calinescu, Ana Cavalcanti, Marsha Chechik, Lina Marsso, Beverley Townsend

40 views Mar 13

Academic · 1 min

Understanding Wikidata Qualifiers: An Analysis and Taxonomy

arXiv:2603.11767v1 Announce Type: new Abstract: This paper presents an in-depth analysis of Wikidata qualifiers, focusing on their semantics and actual usage, with the aim of …

Gilles Falquet, Sahar Aljalbout

27 views Mar 13

Academic · 1 min

Stop Listening to Me! How Multi-turn Conversations Can Degrade Diagnostic Reasoning

arXiv:2603.11394v1 Announce Type: new Abstract: Patients and clinicians are increasingly using chatbots powered by large language models (LLMs) for healthcare inquiries. While state-of-the-art LLMs exhibit …

Kevin H. Guo, Chao Yan, Avinash Baidya, Katherine Brown, Xiang Gao, Juming Xiong, Zhijun Yin, Bradley A. Malin

43 views Mar 13

Academic · 1 min

Markovian Generation Chains in Large Language Models

arXiv:2603.11228v1 Announce Type: new Abstract: The widespread use of large language models (LLMs) raises an important question: how do texts evolve when they are repeatedly …

Mingmeng Geng, Amr Mohamed, Guokan Shang, Michalis Vazirgiannis, Thierry Poibeau

40 views Mar 13

Academic · 1 min

DeReason: A Difficulty-Aware Curriculum Improves Decoupled SFT-then-RL Training for General Reasoning

arXiv:2603.11193v1 Announce Type: new Abstract: Reinforcement learning with Verifiable Rewards (RLVR) has emerged as a powerful paradigm for eliciting reasoning capabilities in large language models, …

Hanxu Hu, Yuxuan Wang, Maggie Huan, Jannis Vamvas, Yinya Huang, Zhijiang Guo, Rico Sennrich

36 views Mar 13

Academic · 1 min

VisDoT : Enhancing Visual Reasoning through Human-Like Interpretation Grounding and Decomposition of Thought

arXiv:2603.11631v1 Announce Type: new Abstract: Large vision-language models (LVLMs) struggle to reliably detect visual primitives in charts and align them with semantic representations, which severely …

Eunsoo Lee, Jeongwoo Lee, Minki Hong, Jangho Choi, Jihie Kim

24 views Mar 13

Academic · 1 min

Adversarial Reinforcement Learning for Detecting False Data Injection Attacks in Vehicular Routing

arXiv:2603.11433v1 Announce Type: new Abstract: In modern transportation networks, adversaries can manipulate routing algorithms using false data injection attacks, such as simulating heavy traffic with …

Taha Eghtesad, Yevgeniy Vorobeychik, Aron Laszka

37 views Mar 13

Academic · 1 min

LLM-Augmented Digital Twin for Policy Evaluation in Short-Video Platforms

arXiv:2603.11333v1 Announce Type: new Abstract: Short-video platforms are closed-loop, human-in-the-loop ecosystems where platform policy, creator incentives, and user behavior co-evolve. This feedback structure makes counterfactual …

Haoting Zhang (Max), Yunduan Lin (Max), Jinghai He (Max), Denglin Jiang (Max), Zuo-Jun (Max), Shen, Zeyu Zheng

45 views Mar 13

Academic · 1 min

MaterialFigBENCH: benchmark dataset with figures for evaluating college-level materials science problem-solving abilities of multimodal large …

arXiv:2603.11414v1 Announce Type: new Abstract: We present MaterialFigBench, a benchmark dataset designed to evaluate the ability of multimodal large language models (LLMs) to solve university-level …

Michiko Yoshitake, Yuta Suzuki, Ryo Igarashi, Yoshitaka Ushiku, Keisuke Nagato

73 views Mar 13

← Previous

151 152 153 154 155

Academic

Improving LLM Performance Through Black-Box Online Tuning: A Case for Adding System Specs to Factsheets …

Summarize Before You Speak with ARACH: A Training-Free Inference-Time Plug-In for Enhancing LLMs via Global …

From Debate to Deliberation: Structured Collective Reasoning with Typed Epistemic Acts

Social, Legal, Ethical, Empathetic and Cultural Norm Operationalisation for AI Agents

Understanding Wikidata Qualifiers: An Analysis and Taxonomy

Stop Listening to Me! How Multi-turn Conversations Can Degrade Diagnostic Reasoning

Markovian Generation Chains in Large Language Models

DeReason: A Difficulty-Aware Curriculum Improves Decoupled SFT-then-RL Training for General Reasoning

VisDoT : Enhancing Visual Reasoning through Human-Like Interpretation Grounding and Decomposition of Thought

Adversarial Reinforcement Learning for Detecting False Data Injection Attacks in Vehicular Routing

LLM-Augmented Digital Twin for Policy Evaluation in Short-Video Platforms

MaterialFigBENCH: benchmark dataset with figures for evaluating college-level materials science problem-solving abilities of multimodal large …

JCG, PC

HSOLLC Co., Ltd.