All Articles

Articles

Latest First Most Viewed Alphabetical

All Conference (266) Law Review (314) Academic (4957) Think Tank (60) News (791) Journal (139) Technology & AI (4) Business & Strategy (1) Finance & Economics (2) Legal & Compliance (1) Innovation & Research (0) International Affairs (2) Cybersecurity (2) Healthcare & Biotech (2)

Academic · 1 min

MAVRL: Learning Reward Functions from Multiple Feedback Types with Amortized Variational Inference

arXiv:2602.15206v1 Announce Type: new Abstract: Reward learning typically relies on a single feedback type or combines multiple feedback types using manually weighted loss terms. Currently, …

Rapha\"el Baur, Yannick Metz, Maria Gkoulta, Mennatallah El-Assady, Giorgia Ramponi, Thomas Kleine Buening

20 views Feb 19

Academic · 1 min

Automatically Finding Reward Model Biases

arXiv:2602.15222v1 Announce Type: new Abstract: Reward models are central to large language model (LLM) post-training. However, past work has shown that they can reward spurious …

Atticus Wang, Iv\'an Arcuschin, Arthur Conmy

26 views Feb 19

Academic · 1 min

BindCLIP: A Unified Contrastive-Generative Representation Learning Framework for Virtual Screening

arXiv:2602.15236v1 Announce Type: new Abstract: Virtual screening aims to efficiently identify active ligands from massive chemical libraries for a given target pocket. Recent CLIP-style models …

Anjie Qiao, Zhen Wang, Yaliang Li, Jiahua Rao, Yuedong Yang

24 views Feb 19

Academic · 1 min

Closing the Distribution Gap in Adversarial Training for LLMs

arXiv:2602.15238v1 Announce Type: new Abstract: Adversarial training for LLMs is one of the most promising methods to reliably improve robustness against adversaries. However, despite significant …

Chengzhi Hu, Jonas Dornbusch, David L\"udke, Stephan G\"unnemann, Leo Schwinn

25 views Feb 19

Academic · 1 min

Size Transferability of Graph Transformers with Convolutional Positional Encodings

arXiv:2602.15239v1 Announce Type: new Abstract: Transformers have achieved remarkable success across domains, motivating the rise of Graph Transformers (GTs) as attention-based architectures for graph-structured data. …

Javier Porras-Valenzuela, Zhiyang Wang, Alejandro Ribeiro

20 views Feb 19

Academic · 1 min

Scaling Laws for Masked-Reconstruction Transformers on Single-Cell Transcriptomics

arXiv:2602.15253v1 Announce Type: new Abstract: Neural scaling laws -- power-law relationships between loss, model size, and data -- have been extensively documented for language and …

Ihor Kendiukhov

35 views Feb 19

Academic · 1 min

Fast and Effective On-policy Distillation from Reasoning Prefixes

arXiv:2602.15260v1 Announce Type: new Abstract: On-policy distillation (OPD), which samples trajectories from the student model and supervises them with a teacher at the token level, …

Dongxu Zhang, Zhichao Yang, Sepehr Janghorbani, Jun Han, Andrew Ressler II, Qian Qian, Gregory D. Lyng, Sanjit Singh Batra, Robert E. Tillman

42 views Feb 19

Academic · 1 min

Complex-Valued Unitary Representations as Classification Heads for Improved Uncertainty Quantification in Deep Neural Networks

arXiv:2602.15283v1 Announce Type: new Abstract: Modern deep neural networks achieve high predictive accuracy but remain poorly calibrated: their confidence scores do not reliably reflect the …

Akbar Anbar Jafari, Cagri Ozcinar, Gholamreza Anbarjafari

30 views Feb 19

Academic · 1 min

Hybrid Federated and Split Learning for Privacy Preserving Clinical Prediction and Treatment Optimization

arXiv:2602.15304v1 Announce Type: new Abstract: Collaborative clinical decision support is often constrained by governance and privacy rules that prevent pooling patient-level records across institutions. We …

Farzana Akter, Rakib Hossain, Deb Kanna Roy Toushi, Mahmood Menon Khan, Sultana Amin, Lisan Al Amin

30 views Feb 19

Academic · 1 min

On Surprising Effectiveness of Masking Updates in Adaptive Optimizers

arXiv:2602.15322v1 Announce Type: new Abstract: Training large language models (LLMs) relies almost exclusively on dense adaptive optimizers with increasingly sophisticated preconditioners. We challenge this by …

Taejong Joo, Wenhan Xia, Cheolmin Kim, Ming Zhang, Eugene Ie

9 views Feb 19

Academic · 1 min

A Scalable Curiosity-Driven Game-Theoretic Framework for Long-Tail Multi-Label Learning in Data Mining

arXiv:2602.15330v1 Announce Type: new Abstract: The long-tail distribution, where a few head labels dominate while rare tail labels abound, poses a persistent challenge for large-scale …

Jing Yang, Keze Wang

36 views Feb 19

Academic · 1 min

Directional Reasoning Trajectory Change (DRTC): Identifying Critical Trace Segments in Reasoning Models

arXiv:2602.15332v1 Announce Type: new Abstract: Understanding how language models carry out long-horizon reasoning remains an open challenge. Existing interpretability methods often highlight tokens or spans …

Waldemar Chang

10 views Feb 19

← Previous

529 530 531 532 533

Articles

MAVRL: Learning Reward Functions from Multiple Feedback Types with Amortized Variational Inference

Automatically Finding Reward Model Biases

BindCLIP: A Unified Contrastive-Generative Representation Learning Framework for Virtual Screening

Closing the Distribution Gap in Adversarial Training for LLMs

Size Transferability of Graph Transformers with Convolutional Positional Encodings

Scaling Laws for Masked-Reconstruction Transformers on Single-Cell Transcriptomics

Fast and Effective On-policy Distillation from Reasoning Prefixes

Complex-Valued Unitary Representations as Classification Heads for Improved Uncertainty Quantification in Deep Neural Networks

Hybrid Federated and Split Learning for Privacy Preserving Clinical Prediction and Treatment Optimization

On Surprising Effectiveness of Masking Updates in Adaptive Optimizers

A Scalable Curiosity-Driven Game-Theoretic Framework for Long-Tail Multi-Label Learning in Data Mining

Directional Reasoning Trajectory Change (DRTC): Identifying Critical Trace Segments in Reasoning Models

JCG, PC

HSOLLC Co., Ltd.