Academic

Academic · 1 min

TopoChunker: Topology-Aware Agentic Document Chunking Framework

arXiv:2603.18409v1 Announce Type: new Abstract: Current document chunking methods for Retrieval-Augmented Generation (RAG) typically linearize text. This forced linearization strips away intrinsic topological hierarchies, creating …

Xiaoyu Liu

9 views Mar 20

Academic · 1 min

TARo: Token-level Adaptive Routing for LLM Test-time Alignment

arXiv:2603.18411v1 Announce Type: new Abstract: Large language models (LLMs) exhibit strong reasoning capabilities but typically require expensive post-training to reach high performance. Recent test-time alignment …

Arushi Rai, Qiang Zhang, Hanqing Zeng, Yunkai Zhang, Dipesh Tamboli, Xiangjun Fan, Zhuokai Zhao

11 views Mar 20

Academic · 1 min

Multimodal Task Interference: A Benchmark and Analysis of History-Target Mismatch in Multimodal LLMs

arXiv:2603.18425v1 Announce Type: new Abstract: Task interference, the performance degradation caused by task switches within a single conversation, has been studied exclusively in text-only settings …

Masayuki Kawarada, Tatsuya Ishigaki, Hiroya Takamura

8 views Mar 20

Academic · 1 min

Adaptive Decoding via Test-Time Policy Learning for Self-Improving Generation

arXiv:2603.18428v1 Announce Type: new Abstract: Decoding strategies largely determine the quality of Large Language Model (LLM) outputs, yet widely used heuristics such as greedy or …

Asmita Bhardwaj, Yuya Jeremy Ong, Eelaaf Zahid, Basel Shbita

10 views Mar 20

Academic · 1 min

UT-ACA: Uncertainty-Triggered Adaptive Context Allocation for Long-Context Inference

arXiv:2603.18446v1 Announce Type: new Abstract: Long-context inference remains challenging for large language models due to attention dilution and out-of-distribution degradation. Context selection mitigates this limitation …

Lang Zhou, Shuxuan Li, Zhuohao Li, Shi Liu, Zhilin Zhao, Wei-Shi Zheng

13 views Mar 20

Academic · 1 min

GAIN: A Benchmark for Goal-Aligned Decision-Making of Large Language Models under Imperfect Norms

arXiv:2603.18469v1 Announce Type: new Abstract: We introduce GAIN (Goal-Aligned Decision-Making under Imperfect Norms), a benchmark designed to evaluate how large language models (LLMs) balance adherence …

Masayuki Kawarada, Kodai Watanabe, Soichiro Murakami

8 views Mar 20

Academic · 1 min

WASD: Locating Critical Neurons as Sufficient Conditions for Explaining and Controlling LLM Behavior

arXiv:2603.18474v1 Announce Type: new Abstract: Precise behavioral control of large language models (LLMs) is critical for complex applications. However, existing methods often incur high training …

Haonan Yu, Junhao Liu, Zhenyu Yan, Haoran Lin, Xin Zhang

8 views Mar 20

Academic · 1 min

The Truncation Blind Spot: How Decoding Strategies Systematically Exclude Human-Like Token Choices

arXiv:2603.18482v1 Announce Type: new Abstract: Standard decoding strategies for text generation, including top-k, nucleus sampling, and contrastive search, select tokens based on likelihood, restricting selection …

Esteban Garces Arias, Nurzhan Sapargali, Christian Heumann, Matthias Aßenmacher

11 views Mar 20

Academic · 1 min

EntropyCache: Decoded Token Entropy Guided KV Caching for Diffusion Language Models

arXiv:2603.18489v1 Announce Type: new Abstract: Diffusion-based large language models (dLLMs) rely on bidirectional attention, which prevents lossless KV caching and requires a full forward pass …

Minsoo Cheong, Donghyun Son, Woosang Lim, Sungjoo Yoo

9 views Mar 20

Academic · 1 min

When Names Change Verdicts: Intervention Consistency Reveals Systematic Bias in LLM Decision-Making

arXiv:2603.18530v1 Announce Type: new Abstract: Large language models (LLMs) are increasingly used for high-stakes decisions, yet their susceptibility to spurious features remains poorly characterized. We …

Abhinaba Basu, Pavan Chakraborty

29 views Mar 20

Academic · 1 min

Cross-Lingual LLM-Judge Transfer via Evaluation Decomposition

arXiv:2603.18557v1 Announce Type: new Abstract: As large language models are increasingly deployed across diverse real-world applications, extending automated evaluation beyond English has become a critical …

Ivaxi Sheth, Zeno Jonke, Amin Mantrach, Saab Mansour

9 views Mar 20

Academic · 1 min

ICE: Intervention-Consistent Explanation Evaluation with Statistical Grounding for LLMs

arXiv:2603.18579v1 Announce Type: new Abstract: Evaluating whether explanations faithfully reflect a model's reasoning remains an open problem. Existing benchmarks use single interventions without statistical testing, …

Abhinaba Basu, Pavan Chakraborty

23 views Mar 20
