All Articles

Articles

Academic · 1 min

CapTrack: Multifaceted Evaluation of Forgetting in LLM Post-Training

arXiv:2603.06610v1 Announce Type: new Abstract: Large language model (LLM) post-training enhances latent skills, unlocks value alignment, improves performance, and enables domain adaptation. Unfortunately, post-training is …

Lukas Thede, Stefan Winzeck, Zeynep Akata, Jonathan Richard Schwarz
12 views
Academic · 1 min

Correlation Analysis of Generative Models

arXiv:2603.06614v1 Announce Type: new Abstract: Based on literature review about existing diffusion models and flow matching with a neural network to predict a predefined target …

Zhengguo Li, Chaobing Zheng, Wei Wang
19 views
Academic · 1 min

Evo: Autoregressive-Diffusion Large Language Models with Evolving Balance

arXiv:2603.06617v1 Announce Type: new Abstract: We introduce \textbf{Evo}, a duality latent trajectory model that bridges autoregressive (AR) and diffusion-based language generation within a continuous evolutionary …

Junde Wu, Minhao Hu, Jiayuan Zhu, Yuyuan Liu, Tianyi Zhang, Kang Li, Jingkun Chen, Jiazhen Pan, Min Xu, Yueming Jin
10 views