Researchers have addressed the challenge of incorporating document-level metadata into topic modeling to improve topic mixture estimation.
They propose a graph-structured topic modeling approach that incorporates document-level covariates or known similarities between documents.
The approach is based on a fast graph-regularized iterative singular value decomposition (SVD) that encourages similar documents to share similar topic mixture proportions.
Experiments on synthetic datasets and real-world corpora validate the model, showing improved performance and faster inference compared to existing Bayesian methods.