Real-world decision-making problems often have complex, uncertain dynamics.Traditional model-based reinforcement learning approaches don't consider underlying causal mechanisms, leading to spurious correlations.Causal Model-Based Policy Optimization (C-MBPO) integrates causal learning into the MBRL pipeline.C-MBPO infers a Causal Markov Decision Process (C-MDP) and learned Structural Causal Models (SCMs) for more robust and generalizable policy learning.