Behavioral metric learning is a key approach to state abstraction in reinforcement learning environments.
Accurately estimating these metrics remains challenging, and prior evaluations mainly focused on final returns.
A study evaluated five recent metric learning approaches in deep reinforcement learning across various tasks and noise settings.
The study introduced a denoising factor evaluation and an isolated metric estimation setting, and released an open-source codebase for future research.