Academic


Academic · 1 min

Reference-guided Policy Optimization for Molecular Optimization via LLM Reasoning

arXiv:2603.05900v1 Announce Type: new Abstract: Large language models (LLMs) benefit substantially from supervised fine-tuning (SFT) and reinforcement learning with verifiable rewards (RLVR) in reasoning tasks. …

Xuan Li, Zhanke Zhou, Zongze Li, Jiangchao Yao, Yu Rong, Lu Zhang, Bo Han

26 views Mar 9

Academic · 1 min

Stock Market Prediction Using Node Transformer Architecture Integrated with BERT Sentiment Analysis

arXiv:2603.05917v1 Announce Type: new Abstract: Stock market prediction presents considerable challenges for investors, financial institutions, and policymakers operating in complex market environments characterized by noise, …

Mohammad Al Ridhawi, Mahtab Haj Ali, Hussein Al Osman

47 views Mar 9

Academic · 1 min

Design Experiments to Compare Multi-armed Bandit Algorithms

arXiv:2603.05919v1 Announce Type: new Abstract: Online platforms routinely compare multi-armed bandit algorithms, such as UCB and Thompson Sampling, to select the best-performing policy. Unlike standard …

Huiling Meng, Ningyuan Chen, Xuefeng Gao

48 views Mar 9

Academic · 1 min

Weak-SIGReg: Covariance Regularization for Stable Deep Learning

arXiv:2603.05924v1 Announce Type: new Abstract: Modern neural network optimization relies heavily on architectural priors, such as Batch Normalization and Residual connections, to stabilize training dynamics. Without these, …

Habibullah Akbar

43 views Mar 9

Academic · 1 min

Omni-Masked Gradient Descent: Memory-Efficient Optimization via Mask Traversal with Improved Convergence

arXiv:2603.05960v1 Announce Type: new Abstract: Memory-efficient optimization methods have recently gained increasing attention for scaling full-parameter training of large language models under the GPU-memory bottleneck. …

Hui Yang, Tao Ren, Jinyang Jiang, Wan Tian, Yijie Peng

34 views Mar 9

Academic · 1 min

EvoESAP: Non-Uniform Expert Pruning for Sparse MoE

arXiv:2603.06003v1 Announce Type: new Abstract: Sparse Mixture-of-Experts (SMoE) language models achieve strong capability at low per-token compute, yet deployment remains memory- and throughput-bound because the …

Zongfang Liu, Shengkun Tang, Boyang Sun, Zhiqiang Shen, Xin Yuan

23 views Mar 9

Academic · 1 min

Preventing Learning Stagnation in PPO by Scaling to 1 Million Parallel Environments

arXiv:2603.06009v1 Announce Type: new Abstract: Plateaus, where an agent's performance stagnates at a suboptimal level, are a common problem in deep on-policy RL. Focusing on …

Michael Beukman, Khimya Khetarpal, Zeyu Zheng, Will Dabney, Jakob Foerster, Michael Dennis, Clare Lyle

37 views Mar 9

Academic · 1 min

Agnostic learning in (almost) optimal time via Gaussian surface area

arXiv:2603.06027v1 Announce Type: new Abstract: The complexity of learning a concept class under Gaussian marginals in the difficult agnostic model is closely related to its …

Lucas Pesenti, Lucas Slot, Manuel Wiedmer

55 views Mar 9

Academic · 1 min

Improved high-dimensional estimation with Langevin dynamics and stochastic weight averaging

arXiv:2603.06028v1 Announce Type: new Abstract: Significant recent work has studied the ability of gradient descent to recover a hidden planted direction $\theta^\star \in S^{d-1}$ in …

Stanley Wei, Alex Damian, Jason D. Lee

33 views Mar 9

Academic · 1 min

Latent Diffusion-Based 3D Molecular Recovery from Vibrational Spectra

arXiv:2603.06113v1 Announce Type: new Abstract: Infrared (IR) spectroscopy, a type of vibrational spectroscopy, is widely used for molecular structure determination and provides critical structural information …

Wenjin Wu, Aleš Leonardis, Linjiang Chen, Jianbo Jiao

46 views Mar 9

Academic · 1 min

Dynamic Momentum Recalibration in Online Gradient Learning

arXiv:2603.06120v1 Announce Type: new Abstract: Stochastic Gradient Descent (SGD) and its momentum variants form the backbone of deep learning optimization, yet the underlying dynamics of …

Zhipeng Yao, Rui Yu, Guisong Chang, Ying Li, Yu Zhang, Dazhou Li

26 views Mar 9

Academic · 1 min

DQE: A Semantic-Aware Evaluation Metric for Time Series Anomaly Detection

arXiv:2603.06131v1 Announce Type: new Abstract: Time series anomaly detection has achieved remarkable progress in recent years. However, evaluation practices have received comparatively less attention, despite …

Yuewei Li, Dalin Zhang, Huan Li, Xinyi Gong, Hongjun Chu, Zhaohui Song

24 views Mar 9



JCG, PC

HSOLLC Co., Ltd.