Academic

Academic

Academic · 1 min

One Size Does Not Fit All: Token-Wise Adaptive Compression for KV Cache

arXiv:2603.04411v1 Announce Type: new Abstract: Despite the remarkable progress of Large Language Models (LLMs), the escalating memory footprint of the Key-Value (KV) cache remains a …

Liming Lu, Kaixi Qiu, Jiayu Zhou, Jushi Kai, Haoyan Zhang, Huanyu Wang, Jingwen Leng, Ziwei He, Zhouhan Lin
4 views
Academic · 1 min

Do Mixed-Vendor Multi-Agent LLMs Improve Clinical Diagnosis?

arXiv:2603.04421v1 Announce Type: new Abstract: Multi-agent large language model (LLM) systems have emerged as a promising approach for clinical diagnosis, leveraging collaboration among agents to …

Grace Chang Yuan, Xiaoman Zhang, Sung Eun Kim, Pranav Rajpurkar
5 views