A Spectral Revisit of the Distributional Bellman Operator under the Cram\'er Metric
arXiv:2603.12576v1 Announce Type: new Abstract: Distributional reinforcement learning (DRL) studies the evolution of full return distributions under Bellman updates rather than focusing on expected values. …
Keru Wang, Yixin Deng, Yao Lyu, Stephen Redmond, Shengbo Eben Li
14 views