Understanding patient feedback is crucial for improving healthcare services, yet analyzing unlabeled short-text feedback presents significant challenges due to limited data and domain-specific nuances.
This study explores unsupervised methods to extract meaningful topics from patient feedback collected from a healthcare system in Wisconsin, USA.
The study employed a keyword-based filtering approach and explored various topic modeling methods, including LDA, GSDMM, and BERTopic.
The integration of BERT embeddings with k-means clustering, called kBERT, outperformed other models, achieving high coherence and distinct topic separation in short-text healthcare feedback analysis.