All Articles

Articles

Latest First Most Viewed Alphabetical

All Conference (266) Law Review (314) Academic (4957) Think Tank (60) News (791) Journal (139) Technology & AI (4) Business & Strategy (1) Finance & Economics (2) Legal & Compliance (1) Innovation & Research (0) International Affairs (2) Cybersecurity (2) Healthcare & Biotech (2)

Academic · 1 min

Learning Ordinal Probabilistic Reward from Preferences

arXiv:2602.12660v1 Announce Type: new Abstract: Reward models are crucial for aligning large language models (LLMs) with human values and intentions. Existing approaches follow either Generative …

Longze Chen, Lu Wang, Renke Shan, Ze Gong, Run Luo, Jiaming Li, Jing Luo, Qiyao Wang, Min Yang

5 views Mar 7

Academic · 1 min

$\mathcal{X}$-KD: General Experiential Knowledge Distillation for Large Language Models

arXiv:2602.12674v1 Announce Type: new Abstract: Knowledge Distillation (KD) for Large Language Models (LLMs) has become increasingly important as models grow in size and complexity. While …

Yuang Cai, Yuyu Yuan

53 views Mar 7

Academic · 1 min

ReFilter: Improving Robustness of Retrieval-Augmented Generation via Gated Filter

arXiv:2602.12709v1 Announce Type: new Abstract: Retrieval-augmented generation (RAG) has become a dominant paradigm for grounding large language models (LLMs) with external evidence in knowledge-intensive question …

Yixin Chen, Ying Xiong, Shangyu Wu, Xiangrui Ke, Nan Guan, Chun Jason Xue

5 views Mar 7

Academic · 1 min

Lamer-SSL: Layer-aware Mixture of LoRA Experts for Continual Multilingual Expansion of Self-supervised Models without Forgetting

arXiv:2602.12746v1 Announce Type: new Abstract: Despite their impressive performance, self-supervised speech models often struggle to generalize to new languages and tend to forget previously acquired …

Jing Xu, Minglin Wu, Xueyuan Chen, Xixin Wu, Helen Meng

28 views Mar 7

Academic · 1 min

Towards a Diagnostic and Predictive Evaluation Methodology for Sequence Labeling Tasks

arXiv:2602.12759v1 Announce Type: new Abstract: Standard evaluation in NLP typically indicates that system A is better on average than system B, but it provides little …

Elena Alvarez-Mellado, Julio Gonzalo

5 views Mar 7

Academic · 1 min

Aspect-Based Sentiment Analysis for Future Tourism Experiences: A BERT-MoE Framework for Persian User Reviews

arXiv:2602.12778v1 Announce Type: new Abstract: This study advances aspect-based sentiment analysis (ABSA) for Persian-language user reviews in the tourism domain, addressing challenges of low-resource languages. …

Hamidreza Kazemi Taskooh, Taha Zare Harofte

15 views Mar 7

Academic · 1 min

RAT-Bench: A Comprehensive Benchmark for Text Anonymization

arXiv:2602.12806v1 Announce Type: new Abstract: Data containing personal information is increasingly used to train, fine-tune, or query Large Language Models (LLMs). Text is typically scrubbed …

Nata\v{s}a Kr\v{c}o, Zexi Yao, Matthieu Meeus, Yves-Alexandre de Montjoye

22 views Mar 7

Academic · 1 min

Left-right asymmetry in predicting brain activity from LLMs' representations emerges with their formal linguistic competence

arXiv:2602.12811v1 Announce Type: new Abstract: When humans and large language models (LLMs) process the same text, activations in the LLMs correlate with brain activity measured, …

Laurent Bonnasse-Gahot, Christophe Pallier

35 views Mar 7

Academic · 1 min

AIWizards at MULTIPRIDE: A Hierarchical Approach to Slur Reclamation Detection

arXiv:2602.12818v1 Announce Type: new Abstract: Detecting reclaimed slurs represents a fundamental challenge for hate speech detection systems, as the same lexcal items can function either …

Luca Tedeschini, Matteo Fasulo

15 views Mar 7

Academic · 1 min

MentalBench: A Benchmark for Evaluating Psychiatric Diagnostic Capability of Large Language Models

arXiv:2602.12871v1 Announce Type: new Abstract: We introduce MentalBench, a benchmark for evaluating psychiatric diagnostic decision-making in large language models (LLMs). Existing mental health benchmarks largely …

Hoyun Song, Migyeong Kang, Jisu Shin, Jihyun Kim, Chanbi Park, Hangyeol Yoo, Jihyun An, Alice Oh, Jinyoung Han, KyungTae Lim

4 views Mar 7

Academic · 1 min

BaziQA-Benchmark: Evaluating Symbolic and Temporally Compositional Reasoning in Large Language Models

arXiv:2602.12889v1 Announce Type: new Abstract: We present BaziQA-Benchmark, a standardized benchmark for evaluating symbolic and temporally compositional reasoning in large language models. The benchmark is …

Jiangxi Chen, Qian Liu

14 views Mar 7

Academic · 1 min

ViMedCSS: A Vietnamese Medical Code-Switching Speech Dataset & Benchmark

arXiv:2602.12911v1 Announce Type: new Abstract: Code-switching (CS), which is when Vietnamese speech uses English words like drug names or procedures, is a common phenomenon in …

Tung X. Nguyen, Nhu Vo, Giang-Son Nguyen, Duy Mai Hoang, Chien Dinh Huynh, Inigo Jauregi Unanue, Massimo Piccardi, Wray Buntine, Dung D. Le

4 views Mar 7

← Previous

319 320 321 322 323

Articles

Learning Ordinal Probabilistic Reward from Preferences

$\mathcal{X}$-KD: General Experiential Knowledge Distillation for Large Language Models

ReFilter: Improving Robustness of Retrieval-Augmented Generation via Gated Filter

Lamer-SSL: Layer-aware Mixture of LoRA Experts for Continual Multilingual Expansion of Self-supervised Models without Forgetting

Towards a Diagnostic and Predictive Evaluation Methodology for Sequence Labeling Tasks

Aspect-Based Sentiment Analysis for Future Tourism Experiences: A BERT-MoE Framework for Persian User Reviews

RAT-Bench: A Comprehensive Benchmark for Text Anonymization

Left-right asymmetry in predicting brain activity from LLMs' representations emerges with their formal linguistic competence

AIWizards at MULTIPRIDE: A Hierarchical Approach to Slur Reclamation Detection

MentalBench: A Benchmark for Evaluating Psychiatric Diagnostic Capability of Large Language Models

BaziQA-Benchmark: Evaluating Symbolic and Temporally Compositional Reasoning in Large Language Models

ViMedCSS: A Vietnamese Medical Code-Switching Speech Dataset & Benchmark

JCG, PC

HSOLLC Co., Ltd.