This platform requires JavaScript for full functionality. Please enable JavaScript in your browser settings.

Quality follows upgrading

Rohan Deb, Stephen J. Wright, Arindam Banerjee

Articles by Rohan Deb, Stephen J. Wright, Arindam Banerjee

Academic · 1 min

Model Predictive Control with Differentiable World Models for Offline Reinforcement Learning

arXiv:2603.22430v1 Announce Type: new Abstract: Offline Reinforcement Learning (RL) aims to learn optimal policies from fixed offline datasets, without further interactions with the environment. Such …

3 views Mar 25

Rohan Deb, Stephen J. Wright, Arindam Banerjee

Articles by Rohan Deb, Stephen J. Wright, Arindam Banerjee

Model Predictive Control with Differentiable World Models for Offline Reinforcement Learning

JCG, PC

HSOLLC Co., Ltd.