Reinforcement learning methods have proposed promising traffic signal control policy that can be trained on large road networks.MacLight, Multi-Scene Aggregation Convolutional Learning, offers faster training speeds and more stable performance.It utilizes variational autoencoders for global representation and proximal policy optimization algorithm for value evaluation.Experimental results demonstrate superior stability, optimized convergence levels, and the highest time efficiency.