Academic

Latest First Most Viewed Alphabetical

All Conference (266) Law Review (314) Academic (4957) Think Tank (60) News (791) Journal (139) Technology & AI (4) Business & Strategy (1) Finance & Economics (2) Legal & Compliance (1) Innovation & Research (0) International Affairs (2) Cybersecurity (2) Healthcare & Biotech (2)

Academic · 1 min

Stabilizing Native Low-Rank LLM Pretraining

arXiv:2602.12429v1 Announce Type: new Abstract: Foundation models have achieved remarkable success, yet their growing parameter counts pose significant computational and memory challenges. Low-rank factorization offers …

Paul Janson, Edouard Oyallon, Eugene Belilovsky

16 views Mar 7

Academic · 1 min

Computationally sufficient statistics for Ising models

arXiv:2602.12449v1 Announce Type: new Abstract: Learning Gibbs distributions using only sufficient statistics has long been recognized as a computationally hard problem. On the other hand, …

Abhijith Jayakumar, Shreya Shukla, Marc Vuffray, Andrey Y. Lokhov, Sidhant Misra

47 views Mar 7

Academic · 1 min

Continuous Diffusion Models Can Obey Formal Syntax

arXiv:2602.12468v1 Announce Type: new Abstract: Diffusion language models offer a promising alternative to autoregressive models due to their global, non-causal generation process, but their continuous …

Jinwoo Kim, Taylor Berg-Kirkpatrick, Loris D'Antoni

45 views Mar 7

Academic · 1 min

Regularized Meta-Learning for Improved Generalization

arXiv:2602.12469v1 Announce Type: new Abstract: Deep ensemble methods often improve predictive performance, yet they suffer from three practical limitations: redundancy among base models that inflates …

Noor Islam S. Mohammad, Md Muntaqim Meherab

17 views Mar 7

Academic · 1 min

Tight Bounds for Logistic Regression with Large Stepsize Gradient Descent in Low Dimension

arXiv:2602.12471v1 Announce Type: new Abstract: We consider the optimization problem of minimizing the logistic loss with gradient descent to train a linear model for binary …

Michael Crawshaw, Mingrui Liu

14 views Mar 7

Academic · 1 min

Geometric separation and constructive universal approximation with two hidden layers

arXiv:2602.12482v1 Announce Type: new Abstract: We give a geometric construction of neural networks that separate disjoint compact subsets of $\Bbb R^n$, and use it to …

Chanyoung Sung

20 views Mar 7

Academic · 1 min

A Theoretical Analysis of Mamba's Training Dynamics: Filtering Relevant Features for Generalization in State Space …

arXiv:2602.12499v1 Announce Type: new Abstract: The recent empirical success of Mamba and other selective state space models (SSMs) has renewed interest in non-attention architectures for …

Mugunthan Shandirasegaran, Hongkang Li, Songyang Zhang, Meng Wang, Shuai Zhang

42 views Mar 7

Academic · 1 min

On Robustness and Chain-of-Thought Consistency of RL-Finetuned VLMs

arXiv:2602.12506v1 Announce Type: new Abstract: Reinforcement learning (RL) fine-tuning has become a key technique for enhancing large language models (LLMs) on reasoning-intensive tasks, motivating its …

Rosie Zhao, Anshul Shah, Xiaoyu Zhu, Xinke Deng, Zhongyu Jiang, Yang Yang, Joerg Liebelt, Arnab Mondal

15 views Mar 7

Academic · 1 min

Bench-MFG: A Benchmark Suite for Learning in Stationary Mean Field Games

arXiv:2602.12517v1 Announce Type: new Abstract: The intersection of Mean Field Games (MFGs) and Reinforcement Learning (RL) has fostered a growing family of algorithms designed to …

Lorenzo Magnino, Jiacheng Shen, Matthieu Geist, Olivier Pietquin, Mathieu Lauri\`ere

39 views Mar 7

Academic · 1 min

Multi-Agent Model-Based Reinforcement Learning with Joint State-Action Learned Embeddings

arXiv:2602.12520v1 Announce Type: new Abstract: Learning to coordinate many agents in partially observable and highly dynamic environments requires both informative representations and data-efficient training. To …

Zhizun Wang, David Meger

24 views Mar 7

Academic · 1 min

Analytical Results for Two Exponential Family Distributions in Hierarchical Dirichlet Processes

arXiv:2602.12527v1 Announce Type: new Abstract: The Hierarchical Dirichlet Process (HDP) provides a flexible Bayesian nonparametric framework for modeling grouped data with a shared yet unbounded …

Naiqi Li

31 views Mar 7

Academic · 1 min

Flow-Factory: A Unified Framework for Reinforcement Learning in Flow-Matching Models

arXiv:2602.12529v1 Announce Type: new Abstract: Reinforcement learning has emerged as a promising paradigm for aligning diffusion and flow-matching models with human preferences, yet practitioners face …

Bowen Ping, Chengyou Jia, Minnan Luo, Hangwei Qian, Ivor Tsang

24 views Mar 7

← Previous

249 250 251 252 253

Academic

Stabilizing Native Low-Rank LLM Pretraining

Computationally sufficient statistics for Ising models

Continuous Diffusion Models Can Obey Formal Syntax

Regularized Meta-Learning for Improved Generalization

Tight Bounds for Logistic Regression with Large Stepsize Gradient Descent in Low Dimension

Geometric separation and constructive universal approximation with two hidden layers

A Theoretical Analysis of Mamba's Training Dynamics: Filtering Relevant Features for Generalization in State Space …

On Robustness and Chain-of-Thought Consistency of RL-Finetuned VLMs

Bench-MFG: A Benchmark Suite for Learning in Stationary Mean Field Games

Multi-Agent Model-Based Reinforcement Learning with Joint State-Action Learned Embeddings

Analytical Results for Two Exponential Family Distributions in Hierarchical Dirichlet Processes

Flow-Factory: A Unified Framework for Reinforcement Learning in Flow-Matching Models

JCG, PC

HSOLLC Co., Ltd.