Frequency Matters: Fast Model-Agnostic Data Curation for Pruning and Quantization
arXiv:2603.16105v1
Abstract: Post-training model compression is essential for enhancing the portability of Large Language Models (LLMs) while preserving their performance. While several …
Francesco Pio Monaco, Elia Cunegatti, Flavio Vella, Giovanni Iacca