Machine learning methods have shown promise in learning chaotic dynamical systems.In large-scale, spatiotemporally chaotic systems, data-driven machine learning methods suffer from inefficiencies.Clustered echo state networks, incorporating the spatial coupling structure, outperform existing models in learning chaotic systems.The approach remains effective even with imperfect prior coupling knowledge and noise in training data.