This study explores the application of deep reinforcement learning for real-time drum control in nuclear microreactors.Deep reinforcement learning controllers demonstrate similar or better load-following performance compared to traditional PID control.RL agents can reduce tracking error rate in short transients and maintain accuracy in longer, more complex load-following scenarios.Multi-agent RL enables independent drum control and maintains reactor symmetry constraints without sacrificing performance.