Practitioners often treat gradients of neural networks as inputs to task-specific algorithms for optimization, editing, and analysis.
A new paper introduces GradMetaNet, an architecture designed specifically for processing gradients, guided by principles such as equivariance to the permutation symmetries of neurons and an efficient representation of the gradients themselves.
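To make the "efficient gradient representation" principle concrete: for a linear layer, the per-example weight gradient is the outer product of the backpropagated output gradient and the input activation, so it can be stored via two vectors instead of a full matrix. Here is a minimal PyTorch sketch of that rank-1 structure (illustrative only, not the paper's code):

```python
import torch

torch.manual_seed(0)
W = torch.randn(4, 3, requires_grad=True)
x = torch.randn(3)

y = W @ x                      # linear layer output
loss = y.pow(2).sum()
loss.backward()

g_out = (2 * y).detach()       # dL/dy for this squared loss
rank1 = torch.outer(g_out, x)  # dL/dW = (dL/dy) x^T, a rank-1 matrix
print(torch.allclose(W.grad, rank1))  # True
```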
GradMetaNet is shown to approximate natural-gradient-based functions that previous approaches cannot, and it outperforms prior methods on tasks such as learned optimization, INR editing, and loss-landscape curvature estimation.
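For intuition on why such functions require sets of gradients across data points: curvature-related quantities like the natural gradient depend on second-order information that can be estimated from per-example gradients, for instance via the empirical Fisher matrix. A hedged sketch, assuming a matrix `G` whose rows are per-example gradients (the setup and names are ours, not the paper's):

```python
import torch

torch.manual_seed(0)
n, p = 32, 10
G = torch.randn(n, p)        # rows: per-example gradients (assumed given)

fisher = G.T @ G / n         # empirical Fisher, a standard curvature proxy
eigvals = torch.linalg.eigvalsh(fisher)
print(eigvals[-3:])          # magnitudes of the sharpest directions

# A (damped) natural-gradient step preconditions the mean gradient:
mean_grad = G.mean(dim=0)
nat_grad = torch.linalg.solve(fisher + 1e-3 * torch.eye(p), mean_grad)
```

No single averaged gradient would suffice here: the Fisher estimate is built from the spread of gradients over examples, which is why processing gradient sets matters.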
The architecture is built from simple equivariant blocks, comes with universality guarantees, and proves effective across a variety of gradient-based tasks on MLPs and Transformers.
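To illustrate what a simple equivariant block can look like, here is a DeepSets-style layer that is equivariant to permutations of its input rows. This is a generic stand-in to convey the idea, not the paper's block, which targets the specific neuron-permutation symmetries of gradients:

```python
import torch
import torch.nn as nn

class EquivariantBlock(nn.Module):
    """Row-permutation-equivariant layer: a per-row map plus a pooled term."""
    def __init__(self, d_in, d_out):
        super().__init__()
        self.local = nn.Linear(d_in, d_out)    # applied to each row
        self.pooled = nn.Linear(d_in, d_out)   # applied to the mean row

    def forward(self, x):                      # x: (n, d_in)
        return self.local(x) + self.pooled(x.mean(dim=0, keepdim=True))

block = EquivariantBlock(8, 16)
x = torch.randn(5, 8)
perm = torch.randperm(5)
# Permuting inputs then applying the block equals applying then permuting.
print(torch.allclose(block(x)[perm], block(x[perm]), atol=1e-6))  # True
```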