Academic

Latest First Most Viewed Alphabetical

All Conference (266) Law Review (314) Academic (4957) Think Tank (60) News (791) Journal (139) Technology & AI (4) Business & Strategy (1) Finance & Economics (2) Legal & Compliance (1) Innovation & Research (0) International Affairs (2) Cybersecurity (2) Healthcare & Biotech (2)

Academic · 1 min

Artificial Intelligence for Modeling & Simulation in Digital Twins

arXiv:2602.19390v1 Announce Type: new Abstract: The convergence of modeling & simulation (M&S) and artificial intelligence (AI) is leaving its marks on advanced digital technology. Pertinent …

Philipp Zech, Istvan David

30 views Mar 7

Academic · 1 min

Hiding in Plain Text: Detecting Concealed Jailbreaks via Activation Disentanglement

arXiv:2602.19396v1 Announce Type: new Abstract: Large language models (LLMs) remain vulnerable to jailbreak prompts that are fluent and semantically coherent, and therefore difficult to detect …

Amirhossein Farzam, Majid Behabahani, Mani Malek, Yuriy Nevmyvaka, Guillermo Sapiro

43 views Mar 7

Academic · 1 min

IR$^3$: Contrastive Inverse Reinforcement Learning for Interpretable Detection and Mitigation of Reward Hacking

arXiv:2602.19416v1 Announce Type: new Abstract: Reinforcement Learning from Human Feedback (RLHF) enables powerful LLM alignment but can introduce reward hacking - models exploit spurious correlations …

Mohammad Beigi, Ming Jin, Junshan Zhang, Jiaxin Zhang, Qifan Wang, Lifu Huang

16 views Mar 7

Academic · 1 min

OptiRepair: Closed-Loop Diagnosis and Repair of Supply Chain Optimization Models with LLM Agents

arXiv:2602.19439v1 Announce Type: new Abstract: Problem Definition. Supply chain optimization models frequently become infeasible because of modeling errors. Diagnosis and repair require scarce OR expertise: …

Ruicheng Ao, David Simchi-Levi, Xinshang Wang

64 views Mar 7

Academic · 1 min

ComplLLM: Fine-tuning LLMs to Discover Complementary Signals for Decision-making

arXiv:2602.19458v1 Announce Type: new Abstract: Multi-agent decision pipelines can outperform single agent workflows when complementarity holds, i.e., different agents bring unique information to the table …

Ziyang Guo, Yifan Wu, Jason Hartline, Kenneth Holstein, Jessica Hullman

18 views Mar 7

Academic · 1 min

ReportLogic: Evaluating Logical Quality in Deep Research Reports

arXiv:2602.18446v1 Announce Type: new Abstract: Users increasingly rely on Large Language Models (LLMs) for Deep Research, using them to synthesize diverse sources into structured reports …

Jujia Zhao, Zhaoxin Huan, Zihan Wang, Xiaolu Zhang, Jun Zhou, Suzan Verberne, Zhaochun Ren

26 views Mar 7

Academic · 1 min

ConfSpec: Efficient Step-Level Speculative Reasoning via Confidence-Gated Verification

arXiv:2602.18447v1 Announce Type: new Abstract: Chain-of-Thought reasoning significantly improves the performance of large language models on complex tasks, but incurs high inference latency due to …

Siran Liu, Cyril Y. He

15 views Mar 7

Academic · 1 min

INSURE-Dial: A Phase-Aware Conversational Dataset \& Benchmark for Compliance Verification and Phase Detection

arXiv:2602.18448v1 Announce Type: new Abstract: Administrative phone tasks drain roughly 1 trillion USD annually from U.S. healthcare, with over 500 million insurance-benefit verification calls manually …

Shubham Kulkarni, Alexander Lyzhov, Preetam Joshi, Shiva Chaitanya

16 views Mar 7

Academic · 1 min

Prompt Optimization Via Diffusion Language Models

arXiv:2602.18449v1 Announce Type: new Abstract: We propose a diffusion-based framework for prompt optimization that leverages Diffusion Language Models (DLMs) to iteratively refine system prompts through …

Shiyu Wang, Haolin Chen, Liangwei Yang, Jielin Qiu, Rithesh Murthy, Ming Zhu, Zixiang Chen, Silvio Savarese, Caiming Xiong, Shelby Heinecke, Huan Wang

14 views Mar 7

Academic · 1 min

Asymptotic Semantic Collapse in Hierarchical Optimization

arXiv:2602.18450v1 Announce Type: new Abstract: Multi-agent language systems can exhibit a failure mode where a shared dominant context progressively absorbs individual semantics, yielding near-uniform behavior …

Faruk Alpay, Bugra Kilictas

33 views Mar 7

Academic · 1 min

The Million-Label NER: Breaking Scale Barriers with GLiNER bi-encoder

arXiv:2602.18487v1 Announce Type: new Abstract: This paper introduces GLiNER-bi-Encoder, a novel architecture for Named Entity Recognition (NER) that harmonizes zero-shot flexibility with industrial-scale efficiency. While …

Ihor Stepanov, Mykhailo Shtopko, Dmytro Vodianytskyi, Oleksandr Lukashov

16 views Mar 7

Academic · 1 min

Luna-2: Scalable Single-Token Evaluation with Small Language Models

arXiv:2602.18583v1 Announce Type: new Abstract: Real-time guardrails require evaluation that is accurate, cheap, and fast - yet today's default, LLM-as-a-judge (LLMAJ), is slow, expensive, and …

Vatsal Goel, Rishon Dsouza, Nikhil Ega, Amey Ramesh Rambatla, Rob Friel, Shuai Shao, Yash Sheth

14 views Mar 7

← Previous

266 267 268 269 270

Academic

Artificial Intelligence for Modeling & Simulation in Digital Twins

Hiding in Plain Text: Detecting Concealed Jailbreaks via Activation Disentanglement

IR$^3$: Contrastive Inverse Reinforcement Learning for Interpretable Detection and Mitigation of Reward Hacking

OptiRepair: Closed-Loop Diagnosis and Repair of Supply Chain Optimization Models with LLM Agents

ComplLLM: Fine-tuning LLMs to Discover Complementary Signals for Decision-making

ReportLogic: Evaluating Logical Quality in Deep Research Reports

ConfSpec: Efficient Step-Level Speculative Reasoning via Confidence-Gated Verification

INSURE-Dial: A Phase-Aware Conversational Dataset \& Benchmark for Compliance Verification and Phase Detection

Prompt Optimization Via Diffusion Language Models

Asymptotic Semantic Collapse in Hierarchical Optimization

The Million-Label NER: Breaking Scale Barriers with GLiNER bi-encoder

Luna-2: Scalable Single-Token Evaluation with Small Language Models

JCG, PC

HSOLLC Co., Ltd.