A study investigated the impact of data augmentation and preprocessing strategies in self-supervised learning for lung ultrasound.
Semantics-preserving data augmentation resulted in the greatest performance for COVID-19 classification, while cropping-based methods yielded the greatest performance on B-line and pleural effusion object classification tasks.
Increased downstream performance for multiple tasks was observed with semantics-preserving ultrasound image preprocessing.
Guidance regarding data augmentation and preprocessing strategies in self-supervised learning for ultrasound was provided.