Reinforcement learning agents often exploit spurious correlations in training data, resulting in brittle behaviors that fail to generalize to new environments.
The Causal Object-centric Model Extraction Tool (COMET) is an algorithm designed to learn interpretable causal world models (CWMs).
COMET extracts object-centric state descriptions from observations and models object-centric transitions using symbolic regression.
COMET constructs CWMs that align with the true causal structure of the environment, enabling better planning and decision-making in reinforcement learning.