Learning Invariant Visual Representations for Planning with Joint-Embedding Predictive World Models
arXiv:2602.18639v1 Announce Type: new Abstract: World models learned from high-dimensional visual observations allow agents to make decisions and plan directly in latent space, avoiding pixel-level …