Enhancing Aviation Communication Transcription: Fine-Tuning Distil-Whisper with LoRA
The paper discusses the use of a Parameter-Efficient Fine-tuning method called Low-Rank Adaptation (LoRA) to fine-tune a more computationally efficient version of the automatic speech recognition model, Whisper, for aviation communication transcription.
The authors used the Air Traffic Control Corpus dataset and performed a grid search to optimize the hyperparameters of distil-Whisper using a 5-fold cross-validation.
The fine-tuned model achieved an average word error rate of 3.86% across five folds, indicating its potential for accurate transcription of aviation communication.