Category

Academic

Academic · 1 min

TARo: Token-level Adaptive Routing for LLM Test-time Alignment

arXiv:2603.18411v1 Announce Type: new Abstract: Large language models (LLMs) exhibit strong reasoning capabilities but typically require expensive post-training to reach high performance. Recent test-time alignment …

Arushi Rai, Qiang Zhang, Hanqing Zeng, Yunkai Zhang, Dipesh Tamboli, Xiangjun Fan, Zhuokai Zhao
11 views
Academic · 1 min

Cross-Lingual LLM-Judge Transfer via Evaluation Decomposition

arXiv:2603.18557v1 Announce Type: new Abstract: As large language models are increasingly deployed across diverse real-world applications, extending automated evaluation beyond English has become a critical …

Ivaxi Sheth, Zeno Jonke, Amin Mantrach, Saab Mansour
9 views