A Hierarchical End-of-Turn Model with Primary Speaker Segmentation for Real-Time Conversational AI
arXiv:2603.13379v1
Abstract: We present a real-time front-end for voice-based conversational AI to enable natural turn-taking in two-speaker scenarios by combining primary speaker …
Karim Helwani, Hoang Do, James Luan, Sriram Srinivasan