All Articles


Academic · 1 min

EvolveCoder: Evolving Test Cases via Adversarial Verification for Code Reinforcement Learning

arXiv:2603.12698v1 Announce Type: new Abstract: Reinforcement learning with verifiable rewards (RLVR) is a promising approach for improving code generation in large language models, but its …

Chi Ruan, Dongfu Jiang, Huaye Zeng, Ping Nie, Wenhu Chen

10 views Mar 16

Academic · 1 min

A Method for Learning Large-Scale Computational Construction Grammars from Semantically Annotated Corpora

arXiv:2603.12754v1 Announce Type: new Abstract: We present a method for learning large-scale, broad-coverage construction grammars from corpora of language use. Starting from utterances annotated with …

Paul Van Eecke, Katrien Beuls

13 views Mar 16

Academic · 1 min

SectEval: Evaluating the Latent Sectarian Preferences of Large Language Models

arXiv:2603.12768v1 Announce Type: new Abstract: As Large Language Models (LLMs) become a popular source for religious knowledge, it is important to know if they treat …

Aditya Maheshwari, Amit Gajkeshwar, Kaushal Sharma, Vivek Patel

9 views Mar 16

Academic · 1 min

SteerRM: Debiasing Reward Models via Sparse Autoencoders

arXiv:2603.12795v1 Announce Type: new Abstract: Reward models (RMs) are critical components of alignment pipelines, yet they exhibit biases toward superficial stylistic cues, preferring better-presented responses …

Mengyuan Sun, Zhuohao Yu, Weizheng Gu, Shikun Zhang, Wei Ye

9 views Mar 16

Academic · 1 min

Adaptive Vision-Language Model Routing for Computer Use Agents

arXiv:2603.12823v1 Announce Type: new Abstract: Computer Use Agents (CUAs) translate natural-language instructions into Graphical User Interface (GUI) actions such as clicks, keystrokes, and scrolls by …

Xunzhuo Liu, Bowei He, Xue Liu, Andy Luo, Haichen Zhang, Huamin Chen

81 views Mar 16

Academic · 1 min

Rethinking Multiple-Choice Questions for RLVR: Unlocking Potential via Distractor Design

arXiv:2603.12826v1 Announce Type: new Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) significantly enhances the reasoning capabilities of Large Language Models. When applied to RLVR, Multiple-Choice …

Xu Guo, Qiming Ge, Jian Tong, Kedi Chen, Jin Zhang, Xiaogui Yang, Xuan Gao, Haijun Lv, Zhihui Lu, Yicheng Zou, Qipeng Guo

9 views Mar 16

Academic · 1 min

CLARIN-PT-LDB: An Open LLM Leaderboard for Portuguese to assess Language, Culture and Civility

arXiv:2603.12872v1 Announce Type: new Abstract: This paper reports on the development of a leaderboard of Open Large Language Models (LLM) for European Portuguese (PT-PT), and …

João Silva, Luís Gomes, António Branco

10 views Mar 16

Academic · 1 min

Learning from Child-Directed Speech in Two-Language Scenarios: A French-English Case Study

arXiv:2603.12906v1 Announce Type: new Abstract: Research on developmentally plausible language models has largely focused on English, leaving open questions about multilingual settings. We present a …

Liel Binyamin, Elior Sulem

11 views Mar 16

Academic · 1 min

HMS-BERT: Hybrid Multi-Task Self-Training for Multilingual and Multi-Label Cyberbullying Detection

arXiv:2603.12920v1 Announce Type: new Abstract: Cyberbullying on social media is inherently multilingual and multi-faceted, where abusive behaviors often overlap across multiple categories. Existing methods are …

Zixin Feng, Xinying Cui, Yifan Sun, Zheng Wei, Jiachen Yuan, Jiazhen Hu, Ning Xin, Md Maruf Hasan

14 views Mar 16

Academic · 1 min

DS$^2$-Instruct: Domain-Specific Data Synthesis for Large Language Models Instruction Tuning

arXiv:2603.12932v1 Announce Type: new Abstract: Adapting Large Language Models (LLMs) to specialized domains requires high-quality instruction tuning datasets, which are expensive to create through human …

Ruiyao Xu, Noelle I. Samia, Han Liu

10 views Mar 16

Academic · 1 min

Long-form RewardBench: Evaluating Reward Models for Long-form Generation

arXiv:2603.12963v1 Announce Type: new Abstract: The widespread adoption of reinforcement learning-based alignment highlights the growing importance of reward models. Various benchmarks have been built to …

Hui Huang, Yancheng He, Wei Liu, Muyun Yang, Jiaheng Liu, Kehai Chen, Bing Xu, Conghui Zhu, Hailong Cao, Tiejun Zhao

9 views Mar 16

Academic · 1 min

Is Human Annotation Necessary? Iterative MBR Distillation for Error Span Detection in Machine Translation

arXiv:2603.12983v1 Announce Type: new Abstract: Error Span Detection (ESD) is a crucial subtask in Machine Translation (MT) evaluation, aiming to identify the location and severity …

Boxuan Lyu, Haiyue Song, Zhi Qu

9 views Mar 16

