The study explores efficient annotation strategies for Sound Event Detection (SED) applications using Active Learning (AL).
A novel uncertainty aggregation strategy called Top K Entropy is introduced, which prioritizes the most uncertain segments in an audio recording for annotation.
Compared to random sampling and Mean Entropy, Top K Entropy leads to improved annotation efficiency in sparse data scenarios.
Using Top K Entropy, the study demonstrates comparable model performance with only 8% of the labels compared to training on the fully labeled dataset.