On the "Induction Bias" in Sequence Models
arXiv:2602.18333v1 Announce Type: cross Abstract: Despite the remarkable practical success of transformer-based language models, recent work has raised concerns about their ability to perform state …
arXiv:2602.18417v1 Announce Type: cross Abstract: This paper presents a direct framework for sequence models with hidden states on closed subgroups of U(d). We use a …
arXiv:2408.03099v2 Announce Type: replace Abstract: Large language models (LLMs) are increasingly used for topic modeling, outperforming classical topic models such as LDA. Commonly, pre-trained LLM …
arXiv:2602.17679v1 Announce Type: new Abstract: Bayesian optimization (BO) is a powerful method for optimizing black-box manufacturing processes, but its performance is often limited when dealing …
arXiv:2602.17680v1 Announce Type: new Abstract: Existing Protein Language Models (PLMs) often suffer from limited adaptability to multiple tasks and exhibit poor generalization across diverse biological …
arXiv:2602.17682v1 Announce Type: new Abstract: Consistency-based generative models like Shortcut and MeanFlow achieve impressive results via a target-aware design for solving the Probability Flow ODE …
arXiv:2602.17683v1 Announce Type: new Abstract: Accurate short-term forecasting of vegetation dynamics is a key enabler for data-driven decision support in precision agriculture. Normalized Difference Vegetation …
arXiv:2602.17685v1 Announce Type: new Abstract: This paper addresses the challenge of multi-target active debris removal (ADR) in Low Earth Orbit (LEO) by introducing a …
arXiv:2602.17688v1 Announce Type: new Abstract: Diffusion language models offer a compelling alternative to autoregressive code generation, enabling global planning and iterative refinement of complex program …
arXiv:2602.17697v1 Announce Type: new Abstract: Large Language Models (LLMs) are being increasingly used across a wide range of tasks. However, their substantial computational demands raise …
arXiv:2602.17706v1 Announce Type: new Abstract: Modeling long-range dependencies in time series generation poses a fundamental trade-off between representational capacity and computational efficiency. Traditional temporal diffusion …
arXiv:2602.17743v1 Announce Type: new Abstract: Large language models adapt to new tasks through in-context learning (ICL) without parameter updates. Current theoretical explanations for this capability …