menu
techminis

A naukri.com initiative

google-web-stories
Home

>

ML News

>

Towards Ca...
source image

Arxiv

2M

read

454

img
dot

Image Credit: Arxiv

Towards Causal Model-Based Policy Optimization

  • Real-world decision-making problems often have complex, uncertain dynamics.
  • Traditional model-based reinforcement learning approaches don't consider underlying causal mechanisms, leading to spurious correlations.
  • Causal Model-Based Policy Optimization (C-MBPO) integrates causal learning into the MBRL pipeline.
  • C-MBPO infers a Causal Markov Decision Process (C-MDP) and learned Structural Causal Models (SCMs) for more robust and generalizable policy learning.

Read Full Article

like

27 Likes

For uninterrupted reading, download the app