Academic

Latest First Most Viewed Alphabetical

All Conference (266) Law Review (314) Academic (4957) Think Tank (60) News (791) Journal (139) Technology & AI (4) Business & Strategy (1) Finance & Economics (2) Legal & Compliance (1) Innovation & Research (0) International Affairs (2) Cybersecurity (2) Healthcare & Biotech (2)

Academic · 1 min

Engineering Reasoning and Instruction (ERI) Benchmark: A Large Taxonomy-driven Dataset for Foundation Models and Agents

arXiv:2603.02239v1 Announce Type: new Abstract: The Engineering Reasoning and Instruction (ERI) benchmark is a taxonomy-driven instruction dataset designed to train and evaluate engineering-capable large language …

MZ Naser, Ahmad Bani Awwad, Zoie McCreery, Radwa Eissa, Ahmad Naser, Gianluca Cusatis, Andrew Metcalf, Kapil Madathil, Jamal Abdalla, Venkatesh Kodur, Mohammad Reza Saeb

17 views Mar 7

Academic · 1 min

SuperLocalMemory: Privacy-Preserving Multi-Agent Memory with Bayesian Trust Defense Against Memory Poisoning

arXiv:2603.02240v1 Announce Type: new Abstract: We present SuperLocalMemory, a local-first memory system for multi-agent AI that defends against OWASP ASI06 memory poisoning through architectural isolation …

Varun Pratap Bhardwaj

50 views Mar 7

Academic · 1 min

Estimating Visual Attribute Effects in Advertising from Observational Data: A Deepfake-Informed Double Machine Learning Approach

arXiv:2603.02359v1 Announce Type: new Abstract: Digital advertising increasingly relies on visual content, yet marketers lack rigorous methods for understanding how specific visual attributes causally affect …

Yizhi Liu, Balaji Padmanabhan, Siva Viswanathan

35 views Mar 7

Academic · 1 min

Can machines be uncertain?

arXiv:2603.02365v1 Announce Type: new Abstract: The paper investigates whether and how AI systems can realize states of uncertainty. By adopting a functionalist and behavioral perspective, …

Luis Rosa

21 views Mar 7

Academic · 1 min

COOL-MC: Verifying and Explaining RL Policies for Platelet Inventory Management

arXiv:2603.02396v1 Announce Type: new Abstract: Platelets expire within five days. Blood banks face uncertain daily demand and must balance ordering decisions between costly wastage from …

Dennis Gross

31 views Mar 7

Academic · 1 min

VL-KGE: Vision-Language Models Meet Knowledge Graph Embeddings

arXiv:2603.02435v1 Announce Type: new Abstract: Real-world multimodal knowledge graphs (MKGs) are inherently heterogeneous, modeling entities that are associated with diverse modalities. Traditional knowledge graph embedding …

Athanasios Efthymiou, Stevan Rudinac, Monika Kackovic, Nachoem Wijnberg, Marcel Worring

13 views Mar 7

Academic · 1 min

Diagnosing Retrieval vs. Utilization Bottlenecks in LLM Agent Memory

arXiv:2603.02473v1 Announce Type: new Abstract: Memory-augmented LLM agents store and retrieve information from prior interactions, yet the relative importance of how memories are written versus …

Boqin Yuan, Yue Su, Kun Yao

16 views Mar 7

Academic · 1 min

PRISM: Pushing the Frontier of Deep Think via Process Reward Model-Guided Inference

arXiv:2603.02479v1 Announce Type: new Abstract: DEEPTHINK methods improve reasoning by generating, refining, and aggregating populations of candidate solutions, which enables strong performance on complex mathematical …

Rituraj Sharma, Weiyuan Chen, Noah Provenzano, Tu Vu

17 views Mar 7

Academic · 1 min

Revealing Positive and Negative Role Models to Help People Make Good Decisions

arXiv:2603.02495v1 Announce Type: new Abstract: We consider a setting where agents take action by following their role models in a social network, and study strategies …

Avrim Blum, Keziah Naggita, Matthew R. Walter, Jingyan Wang

14 views Mar 7

Academic · 1 min

NeuroProlog: Multi-Task Fine-Tuning for Neurosymbolic Mathematical Reasoning via the Cocktail Effect

arXiv:2603.02504v1 Announce Type: new Abstract: Large Language Models (LLMs) achieve strong performance on natural language tasks but remain unreliable in mathematical reasoning, frequently generating fluent …

Pratibha Zunjare, Michael Hsiao

14 views Mar 7

Academic · 1 min

LLM-MLFFN: Multi-Level Autonomous Driving Behavior Feature Fusion via Large Language Model

arXiv:2603.02528v1 Announce Type: new Abstract: Accurate classification of autonomous vehicle (AV) driving behaviors is critical for safety validation, performance diagnosis, and traffic integration analysis. However, …

Xiangyu Li, Tianyi Wang, Xi Cheng, Rakesh Chowdary Machineni, Zhaomiao Guo, Sikai Chen, Junfeng Jiao, Christian Claudel

30 views Mar 7

Academic · 1 min

A Neuropsychologically Grounded Evaluation of LLM Cognitive Abilities

arXiv:2603.02540v1 Announce Type: new Abstract: Large language models (LLMs) exhibit a unified "general factor" of capability across 10 benchmarks, a finding confirmed by our factor …

Faiz Ghifari Haznitrama, Faeyza Rishad Ardi, Alice Oh

40 views Mar 7

← Previous

278 279 280 281 282

Academic

Engineering Reasoning and Instruction (ERI) Benchmark: A Large Taxonomy-driven Dataset for Foundation Models and Agents

SuperLocalMemory: Privacy-Preserving Multi-Agent Memory with Bayesian Trust Defense Against Memory Poisoning

Estimating Visual Attribute Effects in Advertising from Observational Data: A Deepfake-Informed Double Machine Learning Approach

Can machines be uncertain?

COOL-MC: Verifying and Explaining RL Policies for Platelet Inventory Management

VL-KGE: Vision-Language Models Meet Knowledge Graph Embeddings

Diagnosing Retrieval vs. Utilization Bottlenecks in LLM Agent Memory

PRISM: Pushing the Frontier of Deep Think via Process Reward Model-Guided Inference

Revealing Positive and Negative Role Models to Help People Make Good Decisions

NeuroProlog: Multi-Task Fine-Tuning for Neurosymbolic Mathematical Reasoning via the Cocktail Effect

LLM-MLFFN: Multi-Level Autonomous Driving Behavior Feature Fusion via Large Language Model

A Neuropsychologically Grounded Evaluation of LLM Cognitive Abilities

JCG, PC

HSOLLC Co., Ltd.