Academic

Latest First Most Viewed Alphabetical

All Conference (266) Law Review (314) Academic (4957) Think Tank (60) News (791) Journal (139) Technology & AI (4) Business & Strategy (1) Finance & Economics (2) Legal & Compliance (1) Innovation & Research (0) International Affairs (2) Cybersecurity (2) Healthcare & Biotech (2)

Academic · 1 min

References Improve LLM Alignment in Non-Verifiable Domains

arXiv:2602.16802v1 Announce Type: new Abstract: While Reinforcement Learning with Verifiable Rewards (RLVR) has shown strong effectiveness in reasoning tasks, it cannot be directly applied to …

Kejian Shi, Yixin Liu, Peifeng Wang, Alexander R. Fabbri, Shafiq Joty, Arman Cohan

18 views Feb 22

Academic · 1 min

Evaluating Monolingual and Multilingual Large Language Models for Greek Question Answering: The DemosQA Benchmark

arXiv:2602.16811v1 Announce Type: new Abstract: Recent advancements in Natural Language Processing and Deep Learning have enabled the development of Large Language Models (LLMs), which have …

Charalampos Mastrokostas, Nikolaos Giarelis, Nikos Karacapilidis

28 views Feb 22

Academic · 1 min

One-step Language Modeling via Continuous Denoising

arXiv:2602.16813v1 Announce Type: new Abstract: Language models based on discrete diffusion have attracted widespread interest for their potential to provide faster generation than autoregressive models. …

Chanhyuk Lee, Jaehoon Yoo, Manan Agarwal, Sheel Shah, Jerry Huang, Aditi Raghunathan, Seunghoon Hong, Nicholas M. Boffi, Jinwoo Kim

32 views Feb 22

Academic · 1 min

Claim Automation using Large Language Model

arXiv:2602.16836v1 Announce Type: new Abstract: While Large Language Models (LLMs) have achieved strong performance on general-purpose language tasks, their deployment in regulated and data-sensitive domains, …

Zhengda Mo, Zhiyu Quan, Eli O'Donohue, Kaiwen Zhong

36 views Feb 22

Academic · 1 min

BanglaSummEval: Reference-Free Factual Consistency Evaluation for Bangla Summarization

arXiv:2602.16843v1 Announce Type: new Abstract: Evaluating factual consistency is essential for reliable text summarization, particularly in high-stakes domains such as healthcare and news. However, most …

Ahmed Rafid, Rumman Adib, Fariya Ahmed, Ajwad Abrar, Mohammed Saidul Islam

36 views Feb 22

Academic · 1 min

Meenz bleibt Meenz, but Large Language Models Do Not Speak Its Dialect

arXiv:2602.16852v1 Announce Type: new Abstract: Meenzerisch, the dialect spoken in the German city of Mainz, is also the traditional language of the Mainz carnival, a …

Minh Duc Bui, Manuel Mager, Peter Herbert Kann, Katharina von der Wense

18 views Feb 22

Academic · 1 min

ConvApparel: A Benchmark Dataset and Validation Framework for User Simulators in Conversational Recommenders

arXiv:2602.16938v1 Announce Type: new Abstract: The promise of LLM-based user simulators to improve conversational AI is hindered by a critical "realism gap," leading to systems …

Ofer Meshi, Krisztian Balog, Sally Goldman, Avi Caciularu, Guy Tennenholtz, Jihwan Jeong, Amir Globerson, Craig Boutilier

21 views Feb 22

Academic · 1 min

When Semantic Overlap Is Not Enough: Cross-Lingual Euphemism Transfer Between Turkish and English

arXiv:2602.16957v1 Announce Type: new Abstract: Euphemisms substitute socially sensitive expressions, often softening or reframing meaning, and their reliance on cultural and pragmatic context complicates modeling …

Hasan Can Biyik, Libby Barak, Jing Peng, Anna Feldman

26 views Feb 22

Academic · 1 min

Eigenmood Space: Uncertainty-Aware Spectral Graph Analysis of Psychological Patterns in Classical Persian Poetry

arXiv:2602.16959v1 Announce Type: new Abstract: Classical Persian poetry is a historically sustained archive in which affective life is expressed through metaphor, intertextual convention, and rhetorical …

Kourosh Shahnazari, Seyed Moein Ayyoubzadeh, Mohammadali Keshtparvar

30 views Feb 22

Academic · 1 min

Persona2Web: Benchmarking Personalized Web Agents for Contextual Reasoning with User History

arXiv:2602.17003v1 Announce Type: new Abstract: Large language models have advanced web agents, yet current agents lack personalization capabilities. Since users rarely specify every detail of …

Serin Kim, Sangam Lee, Dongha Lee

16 views Feb 22

Academic · 1 min

ReIn: Conversational Error Recovery with Reasoning Inception

arXiv:2602.17022v1 Announce Type: new Abstract: Conversational agents powered by large language models (LLMs) with tool integration achieve strong performance on fixed task-oriented dialogue datasets but …

Takyoung Kim, Jinseok Nam, Chandrayee Basu, Xing Fan, Chengyuan Ma, Heng Ji, Gokhan Tur, Dilek Hakkani-T\"ur

18 views Feb 22

Academic · 1 min

Large Language Models Persuade Without Planning Theory of Mind

arXiv:2602.17045v1 Announce Type: new Abstract: A growing body of work attempts to evaluate the theory of mind (ToM) abilities of humans and large language models …

Jared Moore, Rasmus Overmark, Ned Cooper, Beba Cibralic, Nick Haber, Cameron R. Jones

19 views Feb 22

← Previous

387 388 389 390 391

Academic

References Improve LLM Alignment in Non-Verifiable Domains

Evaluating Monolingual and Multilingual Large Language Models for Greek Question Answering: The DemosQA Benchmark

One-step Language Modeling via Continuous Denoising

Claim Automation using Large Language Model

BanglaSummEval: Reference-Free Factual Consistency Evaluation for Bangla Summarization

Meenz bleibt Meenz, but Large Language Models Do Not Speak Its Dialect

ConvApparel: A Benchmark Dataset and Validation Framework for User Simulators in Conversational Recommenders

When Semantic Overlap Is Not Enough: Cross-Lingual Euphemism Transfer Between Turkish and English

Eigenmood Space: Uncertainty-Aware Spectral Graph Analysis of Psychological Patterns in Classical Persian Poetry

Persona2Web: Benchmarking Personalized Web Agents for Contextual Reasoning with User History

ReIn: Conversational Error Recovery with Reasoning Inception

Large Language Models Persuade Without Planning Theory of Mind

JCG, PC

HSOLLC Co., Ltd.