Training data attribution (TDA) methods measure the impact of training data on a model's predictions.
Gradient-based attribution methods are theoretically grounded but computationally expensive, leading to impracticality for large-scale applications.
AirRep is introduced as a scalable, representation-based approach optimized for TDA by learning task-specific and model-aligned representations.
AirRep achieves comparable performance to gradient-based approaches but is significantly more efficient at inference time, showcasing robustness and generalization.