Academic

Latest First Most Viewed Alphabetical

All Conference (266) Law Review (314) Academic (4957) Think Tank (60) News (791) Journal (139) Technology & AI (4) Business & Strategy (1) Finance & Economics (2) Legal & Compliance (1) Innovation & Research (0) International Affairs (2) Cybersecurity (2) Healthcare & Biotech (2)

Academic · 1 min

Conformal Margin Risk Minimization: An Envelope Framework for Robust Learning under Label Noise

arXiv:2604.06468v1 Announce Type: new Abstract: Most methods for learning with noisy labels require privileged knowledge such as noise transition matrices, clean subsets or pretrained feature …

Yuanjie Shi, Peihong Li, Zijian Zhang, Janardhan Rao Doppa, Yan Yan

73 views Apr 9

Academic · 1 min

The Illusion of Stochasticity in LLMs

arXiv:2604.06543v1 Announce Type: new Abstract: In this work, we demonstrate that reliable stochastic sampling is a fundamental yet unfulfilled requirement for Large Language Models (LLMs) …

Xiangming Gu, Soham De, Michalis Titsias, Larisa Markeeva, Petar Veli\v{c}kovi\'c, Razvan Pascanu

41 views Apr 9

Academic · 1 min

The Detection--Extraction Gap: Models Know the Answer Before They Can Say It

arXiv:2604.06613v1 Announce Type: new Abstract: Modern reasoning models continue generating long after the answer is already determined. Across five model configurations, two families, and three …

Hanyang Wang, Mingxuan Zhu

73 views Apr 9

Academic · 1 min

MedConclusion: A Benchmark for Biomedical Conclusion Generation from Structured Abstracts

arXiv:2604.06505v1 Announce Type: new Abstract: Large language models (LLMs) are widely explored for reasoning-intensive research tasks, yet resources for testing whether they can infer scientific …

Weiyue Li, Ruizhi Qian, Yi Li, Yongce Li, Yunfan Long, Jiahui Cai, Yan Luo, Mengyu Wang

61 views Apr 9

Academic · 1 min

FLeX: Fourier-based Low-rank EXpansion for multilingual transfer

arXiv:2604.06253v1 Announce Type: new Abstract: Cross-lingual code generation is critical in enterprise environments where multiple programming languages coexist. However, fine-tuning large language models (LLMs) individually …

Gaurav Narasimhan

69 views Apr 9

Academic · 1 min

Bi-level Heterogeneous Learning for Time Series Foundation Models: A Federated Learning Approach

arXiv:2604.06727v1 Announce Type: new Abstract: Heterogeneity in time series data is more pronounced than in vision or language, as temporal dynamics vary substantially across domains …

Shengchao Chen, Guodong Long, Dikai Liu, Jing Jiang

75 views Apr 9

Academic · 1 min

Fine-tuning Whisper for Pashto ASR: strategies and scale

arXiv:2604.06507v1 Announce Type: new Abstract: Pashto is absent from Whisper's pre-training corpus despite being one of CommonVoice's largest language collections, leaving off-the-shelf models unusable: all …

Hanif Rahman

51 views Apr 9

Academic · 1 min

Unsupervised Neural Network for Automated Classification of Surgical Urgency Levels in Medical Transcriptions

arXiv:2604.06214v1 Announce Type: new Abstract: Efficient classification of surgical procedures by urgency is paramount to optimize patient care and resource allocation within healthcare systems. This …

Sadaf Tabatabaee, Sarah S. Lam

60 views Apr 9

Academic · 1 min

State-of-the-Art Arabic Language Modeling with Sparse MoE Fine-Tuning and Chain-of-Thought Distillation

arXiv:2604.06421v1 Announce Type: new Abstract: This paper introduces Arabic-DeepSeek-R1, an application-driven open-source Arabic LLM that leverages a sparse MoE backbone to address the digital equity …

Navan Preet Singh, Anurag Garikipati, Ahmed Abulkhair, Jyani Akshay Jagdishbhai, Atul Yaduvanshi, Amarendra Chaudhary, Madalina Ciobanu, Qingqing Mao, Ritankar Das

46 views Apr 9

Academic · 1 min

Beyond Facts: Benchmarking Distributional Reading Comprehension in Large Language Models

arXiv:2604.06201v1 Announce Type: new Abstract: While most reading comprehension benchmarks for LLMs focus on factual information that can be answered by localizing specific textual evidence, …

Pei-Fu Guo, Ya-An Tsai, Chun-Chia Hsu, Kai-Xin Chen, Yun-Da Tsai, Kai-Wei Chang, Nanyun Peng, Mi-Yen Yeh, Shou-De Lin

64 views Apr 9

Academic · 1 min

Limits of Difficulty Scaling: Hard Samples Yield Diminishing Returns in GRPO-Tuned SLMs

arXiv:2604.06298v1 Announce Type: new Abstract: Recent alignment work on Large Language Models (LLMs) suggests preference optimization can improve reasoning by shifting probability mass toward better …

Suraj Yadav, Siddharth Yadav, Parth Goyal

60 views Apr 9

Academic · 1 min

Transformer See, Transformer Do: Copying as an Intermediate Step in Learning Analogical Reasoning

arXiv:2604.06501v1 Announce Type: new Abstract: Analogical reasoning is a hallmark of human intelligence, enabling us to solve new problems by transferring knowledge from one situation …

Philipp Hellwig, Willem Zuidema, Claire E. Stevenson, Martha Lewis

70 views Apr 9

← Previous

3 4 5 6 7

Academic

Conformal Margin Risk Minimization: An Envelope Framework for Robust Learning under Label Noise

The Illusion of Stochasticity in LLMs

The Detection--Extraction Gap: Models Know the Answer Before They Can Say It

MedConclusion: A Benchmark for Biomedical Conclusion Generation from Structured Abstracts

FLeX: Fourier-based Low-rank EXpansion for multilingual transfer

Bi-level Heterogeneous Learning for Time Series Foundation Models: A Federated Learning Approach

Fine-tuning Whisper for Pashto ASR: strategies and scale

Unsupervised Neural Network for Automated Classification of Surgical Urgency Levels in Medical Transcriptions

State-of-the-Art Arabic Language Modeling with Sparse MoE Fine-Tuning and Chain-of-Thought Distillation

Beyond Facts: Benchmarking Distributional Reading Comprehension in Large Language Models

Limits of Difficulty Scaling: Hard Samples Yield Diminishing Returns in GRPO-Tuned SLMs

Transformer See, Transformer Do: Copying as an Intermediate Step in Learning Analogical Reasoning

JCG, PC

HSOLLC Co., Ltd.