Academic

Academic · 1 min

LGSE: Lexically Grounded Subword Embedding Initialization for Low-Resource Language Adaptation

arXiv:2603.22629v1 Announce Type: new Abstract: Adapting pretrained language models to low-resource, morphologically rich languages remains a significant challenge. Existing vocabulary expansion methods typically rely on …

Hailay Teklehaymanot, Dren Fazlija, Wolfgang Nejdl

12 views Mar 25

Academic · 1 min

Lie to Me: How Faithful Is Chain-of-Thought Reasoning in Reasoning Models?

arXiv:2603.22582v1 Announce Type: new Abstract: Chain-of-thought (CoT) reasoning has been proposed as a transparency mechanism for large language models in safety-critical deployments, yet its effectiveness …

Richard J. Young

12 views Mar 25

Academic · 1 min

CAPITU: A Benchmark for Evaluating Instruction-Following in Brazilian Portuguese with Literary Context

arXiv:2603.22576v1 Announce Type: new Abstract: We introduce CAPITU, a benchmark for evaluating instruction-following capabilities of Large Language Models (LLMs) in Brazilian Portuguese. Unlike existing benchmarks …

Giovana Kerche Bonás, Roseval Malaquias Junior, Marcos Piau, Thiago Laitz, Thales Sales Almeida, Hugo Abonizio, Celio Larcher, Ramon Pires, Rodrigo Nogueira

8 views Mar 25

Academic · 1 min

Reddit After Roe: A Computational Analysis of Abortion Narratives and Barriers in the Wake of …

arXiv:2603.22566v1 Announce Type: new Abstract: The 2022 U.S. Supreme Court decision in Dobbs v. Jackson Women's Health Organization reshaped the reproductive rights landscape, introducing new …

Aria Pessianzadeh, Alex H. Poole, Rezvaneh Rezapour

12 views Mar 25

Academic · 1 min

Rashid: A Cipher-Based Framework for Exploring In-Context Language Learning

arXiv:2603.22497v1 Announce Type: new Abstract: While there is growing interest in in-context language learning (ICLL) for unseen languages with large language models, such languages usually …

Niyati Bafna, Ryan Soh-Eun Shim, Barbara Plank, David Yarowsky, Hale Sirin

5 views Mar 25

Academic · 1 min

Functional Component Ablation Reveals Specialization Patterns in Hybrid Language Model Architectures

arXiv:2603.22473v1 Announce Type: new Abstract: Hybrid language models combining attention with state space models (SSMs) or linear attention offer improved efficiency, but whether both components …

Hector Borobia, Elies Seguí-Mas, Guillermina Tormo-Carbó

3 views Mar 25

Academic · 1 min

LLM-guided headline rewriting for clickability enhancement without clickbait

arXiv:2603.22459v1 Announce Type: new Abstract: Enhancing reader engagement while preserving informational fidelity is a central challenge in controllable text generation for news media. Optimizing news …

Yehudit Aperstein, Linoy Halifa, Sagiv Bar, Alexander Apartsin

6 views Mar 25

Academic · 1 min

Towards Automated Community Notes Generation with Large Vision Language Models for Combating Contextual Deception

arXiv:2603.22453v1 Announce Type: new Abstract: Community Notes have emerged as an effective crowd-sourced mechanism for combating online deception on social media platforms. However, its reliance …

Jin Ma, Jingwen Yan, Mohammed Aldeen, Ethan Anderson, Taran Kavuru, Jinkyung Katie Park, Feng Luo, Long Cheng

11 views Mar 25

Academic · 1 min

Sparse but Critical: A Token-Level Analysis of Distributional Shifts in RLVR Fine-Tuning of LLMs

arXiv:2603.22446v1 Announce Type: new Abstract: Reinforcement learning with verifiable rewards (RLVR) has significantly improved reasoning in large language models (LLMs), yet the token-level mechanisms underlying …

Haoming Meng, Kexin Huang, Shaohang Wei, Chiyu Ma, Shuo Yang, Xue Wang, Guoyin Wang, Bolin Ding, Jingren Zhou

3 views Mar 25

Academic · 1 min

Whether, Not Which: Mechanistic Interpretability Reveals Dissociable Affect Reception and Emotion Categorization in LLMs

arXiv:2603.22295v1 Announce Type: new Abstract: Large language models appear to develop internal representations of emotion -- "emotion circuits," "emotion neurons," and structured emotional manifolds have …

Michael Keeman

4 views Mar 25

Academic · 1 min

TIPS: Turn-Level Information-Potential Reward Shaping for Search-Augmented LLMs

arXiv:2603.22293v1 Announce Type: new Abstract: Search-augmented large language models (LLMs) trained with reinforcement learning (RL) have achieved strong results on open-domain question answering (QA), but …

Yutao Xie, Nathaniel Thomas, Nicklas Hansen, Yang Fu, Li Erran Li, Xiaolong Wang

3 views Mar 25

Academic · 1 min

Less is More: Adapting Text Embeddings for Low-Resource Languages with Small Scale Noisy Synthetic Data

arXiv:2603.22290v1 Announce Type: new Abstract: Low-resource languages (LRLs) often lack high-quality, large-scale datasets for training effective text embedding models, hindering their application in tasks like …

Zaruhi Navasardyan, Spartak Bughdaryan, Bagrat Minasyan, Hrant Davtyan

10 views Mar 25
