Tradable credit schemes (TCS) are an alternative to congestion pricing, addressing equity and revenue neutrality.This paper focuses on the day-to-day dynamic tolling problem under TCS using reinforcement learning (RL).RL algorithms achieve comparable travel times and social welfare, even with varying capacities and demand levels.Challenges include scaling to large networks, but transfer learning can improve computational efficiency.