Academic



Academic · 1 min

TAB-PO: Preference Optimization with a Token-Level Adaptive Barrier for Token-Critical Structured Generation

arXiv:2603.00025v1 Announce Type: new Abstract: Direct Preference Optimization is an offline post-SFT method for aligning language models from preference pairs, with strong results in instruction …

Samah Fodeh, Linhai Ma, Ganesh Puthiaraju, Srivani Talakokkul, Afshan Khan, Ashley Hagaman, Sarah R. Lowe, Aimee Kendall Roundtree

17 views Mar 7

Academic · 1 min

ActMem: Bridging the Gap Between Memory Retrieval and Reasoning in LLM Agents

arXiv:2603.00026v1 Announce Type: new Abstract: Effective memory management is essential for large language model (LLM) agents handling long-term interactions. Current memory frameworks typically treat agents …

Xiaohui Zhang, Zequn Sun, Chengyuan Yang, Yaqin Jin, Yazhong Zhang, Wei Hu

26 views Mar 7

Academic · 1 min

EPPCMinerBen: A Novel Benchmark for Evaluating Large Language Models on Electronic Patient-Provider Communication via the …

arXiv:2603.00028v1 Announce Type: new Abstract: Effective communication in health care is critical for treatment outcomes and adherence. With patient-provider exchanges shifting to secure messaging, analyzing …

Samah Fodeh, Yan Wang, Linhai Ma, Srivani Talakokkul, Jordan M. Alpert, Sarah Schellhorn

29 views Mar 7

Academic · 1 min

Embracing Anisotropy: Turning Massive Activations into Interpretable Control Knobs for Large Language Models

arXiv:2603.00029v1 Announce Type: new Abstract: Large Language Models (LLMs) exhibit highly anisotropic internal representations, often characterized by massive activations, a phenomenon where a small subset …

Youngji Roh, Hyunjin Cho, Jaehyung Kim

17 views Mar 7

Academic · 1 min

SimpleTool: Parallel Decoding for Real-Time LLM Function Calling

arXiv:2603.00030v1 Announce Type: new Abstract: LLM-based function calling enables intelligent agents to interact with external tools and environments, yet autoregressive decoding imposes a fundamental latency …

Xiaoxin Shi, Jiaxin Wan, Linkang Dong, Wei Jiang, Yue Liu, Zengfeng Huang

16 views Mar 7

Academic · 1 min

GRIP: Geometric Refinement and Adaptive Information Potential for Data Efficiency

arXiv:2603.00031v1 Announce Type: new Abstract: The performance of Large Language Models (LLMs) is increasingly governed by data efficiency rather than raw scaling volume. However, existing …

Changhao Wang, Jiaolong Yang, Xinhao Yao, Yunfei Yu, Peng Jiao, Lu Yu, Junpeng Fang, Riccardo Cantoro, Qing Cui, Jun Zhou

16 views Mar 7

Academic · 1 min

Autorubric: A Unified Framework for Rubric-Based LLM Evaluation

arXiv:2603.00077v1 Announce Type: new Abstract: Rubric-based evaluation with large language models (LLMs) has become standard practice for assessing text generation at scale, yet the underlying …

Delip Rao, Chris Callison-Burch

27 views Mar 7

Academic · 1 min

Iterative LLM-based improvement for French Clinical Interview Transcription and Speaker Diarization

arXiv:2603.00086v1 Announce Type: new Abstract: Automatic speech recognition for French medical conversations remains challenging, with word error rates often exceeding 30% in spontaneous clinical speech. …

Ambre Marie (LaTIM), Thomas Bertin (DySoLab), Guillaume Dardenne (LaTIM), Gwenolé Quellec (LaTIM)

32 views Mar 7

Academic · 1 min

Stepwise Penalization for Length-Efficient Chain-of-Thought Reasoning

arXiv:2603.00296v1 Announce Type: new Abstract: Large reasoning models improve with more test-time computation, but often overthink, producing unnecessarily long chains-of-thought that raise cost without improving …

Xintong Li, Sha Li, Rongmei Lin, Hongye Jin, Linwei Li, Hejie Cui, Sarah Zhang, Chia-Yuan Chang, Kewei Cheng, Besnik Fetahu, Priyanka Nigam, Jingbo Shang, Bing Yin

17 views Mar 7

Academic · 1 min

From Prerequisites to Predictions: Validating a Geometric Hallucination Taxonomy Through Controlled Induction

arXiv:2603.00307v1 Announce Type: new Abstract: We test whether a geometric hallucination taxonomy -- classifying failures as center-drift (Type 1), wrong-well convergence (Type 2), or coverage gaps (Type 3) …

Matic Korun

36 views Mar 7

Academic · 1 min

When Metrics Disagree: Automatic Similarity vs. LLM-as-a-Judge for Clinical Dialogue Evaluation

arXiv:2603.00314v1 Announce Type: new Abstract: This paper details the baseline model selection, fine-tuning process, evaluation methods, and the implications of deploying more accurate LLMs in …

Bian Sun, Zhenjian Wang, Orvill de la Torre, Zirui Wang

14 views Mar 7

Academic · 1 min

Federated Inference: Toward Privacy-Preserving Collaborative and Incentivized Model Serving

arXiv:2603.02214v1 Announce Type: new Abstract: Federated Inference (FI) studies how independently trained and privately owned models can collaborate at inference time without sharing data or …

Jungwon Seo, Ferhat Ozgur Catak, Chunming Rong, Jaeyeon Jang

35 views Mar 7


