This platform requires JavaScript for full functionality. Please enable JavaScript in your browser settings.

Quality follows upgrading

Toshiaki Koike-Akino, Jing Liu, Ye Wang

Articles by Toshiaki Koike-Akino, Jing Liu, Ye Wang

Academic · 1 min

TTQ: Activation-Aware Test-Time Quantization to Accelerate LLM Inference On The Fly

arXiv:2603.19296v1 Announce Type: new Abstract: To tackle the huge computational demand of large foundation models, activation-aware compression techniques without retraining have been introduced. However, since …

9 views Mar 23

Toshiaki Koike-Akino, Jing Liu, Ye Wang

Articles by Toshiaki Koike-Akino, Jing Liu, Ye Wang

TTQ: Activation-Aware Test-Time Quantization to Accelerate LLM Inference On The Fly

JCG, PC

HSOLLC Co., Ltd.