Duration Aware Scheduling for ASR Serving Under Workload Drift
arXiv:2603.11273v1 Announce Type: new Abstract: Scheduling policies in large-scale Automatic Speech Recognition (ASR) serving pipelines play a key role in determining end-to-end (E2E) latency. Yet, …
Darshan Makwana, Yash Jogi, Harsh Kotta, Aayush Kubba
3 views