User Preference Modeling for Conversational LLM Agents: Weak Rewards from Retrieval-Augmented Interaction
arXiv:2603.20939v1 Announce Type: new Abstract: Large language models are increasingly used as personal assistants, yet most lack a persistent user model, forcing users to repeatedly …
Yuren Hao, Shuhaib Mehri, ChengXiang Zhai, Dilek Hakkani-T\"ur
17 views