<ul data-eligibleForWebStory="true">Modeling spatial-temporal interactions among neighboring agents is crucial for multi-agent problems like motion forecasting and crowd navigation.Recent representations may not fully capture the causal relationships in agent interactions.A metric learning approach is introduced to enhance causal awareness in representations by regularizing latent features with causal annotations.Experiments demonstrate that the proposed approach improves causal awareness and enhances out-of-distribution robustness.A sim-to-real causal transfer method through cross-domain multi-task learning is proposed to apply these concepts in real-world scenarios.Experiments on pedestrian datasets show significant performance improvements even without real-world causal annotations.The research offers insights into challenges and solutions for developing causally-aware representations of multi-agent interactions.The code for the approach is available at https://github.com/vita-epfl/CausalSim2Real.