FastDSAC: Unlocking the Potential of Maximum Entropy RL in High-Dimensional Humanoid Control
arXiv:2603.12612v1 Announce Type: new Abstract: Scaling Maximum Entropy Reinforcement Learning (RL) to high-dimensional humanoid control remains a formidable challenge, as the ``curse of dimensionality'' induces …
Jun Xue, Junze Wang, Xinming Zhang, Shanze Wang, Yanjun Chen, Wei Zhang
8 views