Academic

Latest First Most Viewed Alphabetical

All Conference (266) Law Review (314) Academic (4957) Think Tank (60) News (791) Journal (139) Technology & AI (4) Business & Strategy (1) Finance & Economics (2) Legal & Compliance (1) Innovation & Research (0) International Affairs (2) Cybersecurity (2) Healthcare & Biotech (2)

Academic · 1 min

Adapting Text LLMs to Speech via Multimodal Depth Up-Scaling

arXiv:2604.00489v1 Announce Type: new Abstract: Adapting pre-trained text Large Language Models (LLMs) into Speech Language Models (Speech LMs) via continual pretraining on speech data is …

Kazuki Yano, Jun Suzuki, Shinji Watanabe

42 views Apr 3

Academic · 1 min

Massively Parallel Exact Inference for Hawkes Processes

arXiv:2604.01342v1 Announce Type: new Abstract: Multivariate Hawkes processes are a widely used class of self-exciting point processes, but maximum likelihood estimation naively scales as $O(N^2)$ …

Ahmer Raza, Hudson Smith

27 views Apr 3

Academic · 1 min

Finding and Reactivating Post-Trained LLMs' Hidden Safety Mechanisms

arXiv:2604.00012v1 Announce Type: cross Abstract: Despite the impressive performance of general-purpose large language models (LLMs), they often require fine-tuning or post-training to excel at specific …

Mingjie Li, Wai Man Si, Michael Backes, Yang Zhang, Yisen Wang

35 views Apr 3

Academic · 1 min

Towards Intrinsically Calibrated Uncertainty Quantification in Industrial Data-Driven Models via Diffusion Sampler

arXiv:2604.01870v1 Announce Type: new Abstract: In modern process industries, data-driven models are important tools for real-time monitoring when key performance indicators are difficult to measure …

Yiran Ma, Jerome Le Ny, Zhichao Chen, Zhihuan Song

86 views Apr 3

Academic · 1 min

Open, Reliable, and Collective: A Community-Driven Framework for Tool-Using AI Agents

arXiv:2604.00137v1 Announce Type: new Abstract: Tool-integrated LLMs can retrieve, compute, and take real-world actions via external tools, but reliability remains a key bottleneck. We argue …

Hy Dang, Quang Dao, Meng Jiang

64 views Apr 3

Academic · 1 min

Agent Q-Mix: Selecting the Right Action for LLM Multi-Agent Systems through Reinforcement Learning

arXiv:2604.00344v1 Announce Type: new Abstract: Large Language Models (LLMs) have shown remarkable performance in completing various tasks. However, solving complex problems often requires the coordination …

Eric Hanchen Jiang, Levina Li, Rui Sun, Xiao Liang, Yubei Li, Yuchen Wu, Haozheng Luo, Hengli Li, Zhi Zhang, Zhaolu Kang, Kai-Wei Chang, Ying Nian Wu

50 views Apr 3

Academic · 1 min

Koopman-Based Nonlinear Identification and Adaptive Control of a Turbofan Engine

arXiv:2604.01730v1 Announce Type: new Abstract: This paper investigates Koopman operator-based approaches for multivariable control of a two-spool turbofan engine. A physics-based component-level model is developed …

David Grasev

41 views Apr 3

Academic · 1 min

FourierMoE: Fourier Mixture-of-Experts Adaptation of Large Language Models

arXiv:2604.01762v1 Announce Type: new Abstract: Parameter-efficient fine-tuning (PEFT) has emerged as a crucial paradigm for adapting large language models (LLMs) under constrained computational budgets. However, …

Juyong Jiang, Fan Wang, Hong Qi, Sunghun Kim, Jing Tang

74 views Apr 3

Academic · 1 min

DDCL: Deep Dual Competitive Learning: A Differentiable End-to-End Framework for Unsupervised Prototype-Based Representation Learning

arXiv:2604.01740v1 Announce Type: new Abstract: A persistent structural weakness in deep clustering is the disconnect between feature learning and cluster assignment. Most architectures invoke an …

Giansalvo Cirrincione

85 views Apr 3

Academic · 1 min

A Reliability Evaluation of Hybrid Deterministic-LLM Based Approaches for Academic Course Registration PDF Information Extraction

arXiv:2604.00003v1 Announce Type: cross Abstract: This study evaluates the reliability of information extraction approaches from KRS documents using three strategies: LLM only, Hybrid Deterministic - …

Muhammad Anis Al Hilmi, Neelansh Khare, Noel Framil Iglesias

75 views Apr 3

Academic · 1 min

CircuitProbe: Predicting Reasoning Circuits in Transformers via Stability Zone Detection

arXiv:2604.00716v1 Announce Type: new Abstract: Transformer language models contain localized reasoning circuits, contiguous layer blocks that improve reasoning when duplicated at inference time. Finding these …

Rajkiran Panuganti

23 views Apr 3

Academic · 1 min

Model Merging via Data-Free Covariance Estimation

arXiv:2604.01329v1 Announce Type: new Abstract: Model merging provides a way of cheaply combining individual models to produce a model that inherits each individual's capabilities. While …

Marawan Gamal Abdel Hameed, Derek Tam, Pascal Jr Tikeng Notsawo, Colin Raffel, Guillaume Rabusseau

56 views Apr 3

← Previous

48 49 50 51 52

Academic

Adapting Text LLMs to Speech via Multimodal Depth Up-Scaling

Massively Parallel Exact Inference for Hawkes Processes

Finding and Reactivating Post-Trained LLMs' Hidden Safety Mechanisms

Towards Intrinsically Calibrated Uncertainty Quantification in Industrial Data-Driven Models via Diffusion Sampler

Open, Reliable, and Collective: A Community-Driven Framework for Tool-Using AI Agents

Agent Q-Mix: Selecting the Right Action for LLM Multi-Agent Systems through Reinforcement Learning

Koopman-Based Nonlinear Identification and Adaptive Control of a Turbofan Engine

FourierMoE: Fourier Mixture-of-Experts Adaptation of Large Language Models

DDCL: Deep Dual Competitive Learning: A Differentiable End-to-End Framework for Unsupervised Prototype-Based Representation Learning

A Reliability Evaluation of Hybrid Deterministic-LLM Based Approaches for Academic Course Registration PDF Information Extraction

CircuitProbe: Predicting Reasoning Circuits in Transformers via Stability Zone Detection

Model Merging via Data-Free Covariance Estimation

JCG, PC

HSOLLC Co., Ltd.