Provable Guarantees for Nonlinear Feature Learning in Three-Layer Neural NetworksDeep learning theory seeks to understand how neural networks learn hierarchical features.This study focuses on three-layer neural networks and their richer feature learning capabilities.They present a theorem that bounds sample complexity and width needed for low test error when the target has hierarchical structure.