Back-stepping experience replay (BER) is a reinforcement learning technique that can accelerate learning efficiency in reversible environments.
An enhanced version called Generalized BER (GBER) is proposed, which extends the original algorithm to sparse-reward environments.
GBER improves the performance of BER by introducing relabeling mechanism and applying diverse sampling strategies.
Experimental results show that GBER significantly boosts the performance and stability of the baseline algorithm in various sparse-reward environments.