An independent team has developed a Pokémon model with just 10 million parameters, significantly smaller than frontier AI models.The small model outperforms larger models due to the 'depth curse', a counterintuitive issue in AI.The model was trained using Reinforcement Learning algorithm and achieved the goal efficiently.The article discusses imitation learning and exploration learning as two training paradigms in AI.