Researchers propose the use of latent space generative world models to address the covariate shift problem in autonomous driving.
The driving policy can effectively mitigate covariate shift without requiring an excessive amount of training data by leveraging a world model during training.
The policy learns how to recover from errors by aligning with states observed in human demonstrations during end-to-end training.
Qualitative and quantitative results demonstrate significant improvements upon prior state of the art in closed-loop testing in the CARLA simulator.