Multi-task reinforcement learning aims to efficiently learn multiple tasks simultaneously by leveraging shared information.
A new framework called Cross-Task Policy Guidance (CTPG) is introduced to provide guidance for unmastered tasks by utilizing control policies of proficient tasks.
CTPG uses guide policies to select behavior policies from various tasks, enhancing training trajectories.
Empirical evaluations show that integrating CTPG with existing approaches improves performance in manipulation and locomotion benchmarks.