All Articles

Articles

Latest First Most Viewed Alphabetical

All Conference (266) Law Review (314) Academic (4957) Think Tank (60) News (791) Journal (139) Technology & AI (4) Business & Strategy (1) Finance & Economics (2) Legal & Compliance (1) Innovation & Research (0) International Affairs (2) Cybersecurity (2) Healthcare & Biotech (2)

Academic · 1 min

On Robustness and Chain-of-Thought Consistency of RL-Finetuned VLMs

arXiv:2602.12506v1 Announce Type: new Abstract: Reinforcement learning (RL) fine-tuning has become a key technique for enhancing large language models (LLMs) on reasoning-intensive tasks, motivating its …

Rosie Zhao, Anshul Shah, Xiaoyu Zhu, Xinke Deng, Zhongyu Jiang, Yang Yang, Joerg Liebelt, Arnab Mondal

5 views Mar 7

Academic · 1 min

Bench-MFG: A Benchmark Suite for Learning in Stationary Mean Field Games

arXiv:2602.12517v1 Announce Type: new Abstract: The intersection of Mean Field Games (MFGs) and Reinforcement Learning (RL) has fostered a growing family of algorithms designed to …

Lorenzo Magnino, Jiacheng Shen, Matthieu Geist, Olivier Pietquin, Mathieu Lauri\`ere

24 views Mar 7

Academic · 1 min

Multi-Agent Model-Based Reinforcement Learning with Joint State-Action Learned Embeddings

arXiv:2602.12520v1 Announce Type: new Abstract: Learning to coordinate many agents in partially observable and highly dynamic environments requires both informative representations and data-efficient training. To …

Zhizun Wang, David Meger

14 views Mar 7

Academic · 1 min

Analytical Results for Two Exponential Family Distributions in Hierarchical Dirichlet Processes

arXiv:2602.12527v1 Announce Type: new Abstract: The Hierarchical Dirichlet Process (HDP) provides a flexible Bayesian nonparametric framework for modeling grouped data with a shared yet unbounded …

Naiqi Li

18 views Mar 7

Academic · 1 min

Flow-Factory: A Unified Framework for Reinforcement Learning in Flow-Matching Models

arXiv:2602.12529v1 Announce Type: new Abstract: Reinforcement learning has emerged as a promising paradigm for aligning diffusion and flow-matching models with human preferences, yet practitioners face …

Bowen Ping, Chengyou Jia, Minnan Luo, Hangwei Qian, Ivor Tsang

7 views Mar 7

Academic · 1 min

AMPS: Adaptive Modality Preference Steering via Functional Entropy

arXiv:2602.12533v1 Announce Type: new Abstract: Multimodal Large Language Models (MLLMs) often exhibit significant modality preference, which is a tendency to favor one modality over another. …

Zihan Huang, Xintong Li, Rohan Surana, Tong Yu, Rui Wang, Julian McAuley, Jingbo Shang, Junda Wu

12 views Mar 7

Academic · 1 min

Exploring Accurate and Transparent Domain Adaptation in Predictive Healthcare via Concept-Grounded Orthogonal Inference

arXiv:2602.12542v1 Announce Type: new Abstract: Deep learning models for clinical event prediction on electronic health records (EHR) often suffer performance degradation when deployed under different …

Pengfei Hu, Chang Lu, Feifan Liu, Yue Ning

7 views Mar 7

Academic · 1 min

Fractional Order Federated Learning for Battery Electric Vehicle Energy Consumption Modeling

arXiv:2602.12567v1 Announce Type: new Abstract: Federated learning on connected electric vehicles (BEVs) faces severe instability due to intermittent connectivity, time-varying client participation, and pronounced client-to-client …

Mohammad Partohaghighi, Roummel Marcia, Bruce J. West, YangQuan Chen

5 views Mar 7

Academic · 1 min

VI-CuRL: Stabilizing Verifier-Independent RL Reasoning via Confidence-Guided Variance Reduction

arXiv:2602.12579v1 Announce Type: new Abstract: Reinforcement Learning with Verifiable Rewards (RLVR) has emerged as a dominant paradigm for enhancing Large Language Models (LLMs) reasoning, yet …

Xin-Qiang Cai, Masashi Sugiyama

7 views Mar 7

Academic · 1 min

Vehicle behaviour estimation for abnormal event detection using distributed fiber optic sensing

arXiv:2602.12591v1 Announce Type: new Abstract: The distributed fiber-optic sensing (DFOS) system is a cost-effective wide-area traffic monitoring technology that utilizes existing fiber infrastructure to effectively …

Hemant Prasad, Daisuke Ikefuji, Shin Tominaga, Hitoshi Sakurai, Manabu Otani

7 views Mar 7

Academic · 1 min

Power Interpretable Causal ODE Networks: A Unified Model for Explainable Anomaly Detection and Root Cause …

arXiv:2602.12592v1 Announce Type: new Abstract: Anomaly detection and root cause analysis (RCA) are critical for ensuring the safety and resilience of cyber-physical systems such as …

Yue Sun, Likai Wang, Rick S. Blum, Parv Venkitasubramaniam

4 views Mar 7

Academic · 1 min

Block-Sample MAC-Bayes Generalization Bounds

arXiv:2602.12605v1 Announce Type: new Abstract: We present a family of novel block-sample MAC-Bayes bounds (mean approximately correct). While PAC-Bayes bounds (probably approximately correct) typically give …

Matthias Frey, Jingge Zhu, Michael C. Gastpar

19 views Mar 7

← Previous

323 324 325 326 327

Articles

On Robustness and Chain-of-Thought Consistency of RL-Finetuned VLMs

Bench-MFG: A Benchmark Suite for Learning in Stationary Mean Field Games

Multi-Agent Model-Based Reinforcement Learning with Joint State-Action Learned Embeddings

Analytical Results for Two Exponential Family Distributions in Hierarchical Dirichlet Processes

Flow-Factory: A Unified Framework for Reinforcement Learning in Flow-Matching Models

AMPS: Adaptive Modality Preference Steering via Functional Entropy

Exploring Accurate and Transparent Domain Adaptation in Predictive Healthcare via Concept-Grounded Orthogonal Inference

Fractional Order Federated Learning for Battery Electric Vehicle Energy Consumption Modeling

VI-CuRL: Stabilizing Verifier-Independent RL Reasoning via Confidence-Guided Variance Reduction

Vehicle behaviour estimation for abnormal event detection using distributed fiber optic sensing

Power Interpretable Causal ODE Networks: A Unified Model for Explainable Anomaly Detection and Root Cause …

Block-Sample MAC-Bayes Generalization Bounds

JCG, PC

HSOLLC Co., Ltd.