The study focuses on the charging scheduling problem for Electric Buses (EBs) using Deep Reinforcement Learning (DRL).
Hierarchical DRL (HDRL) is proposed to tackle the challenge of long-range multi-phase planning with sparse rewards.
The algorithm involves Hierarchical Double Deep Q-Network (HDDQN)-Hindsight Experience Replay (HER) for decision-making at different temporal resolutions.
Numerical experiments with real-world data have been conducted to assess the algorithm's performance.