Real-world event sequences often have clustering structures, but most existing temporal point processes (TPPs) ignore them.A new study proposes learning structure-enhanced TPPs with Gromov-Wasserstein (GW) regularization.The proposed method imposes clustering structures on TPPs for improved interpretability in modeling and prediction.The learned TPPs demonstrate clustered sequence embeddings and competitive predictive and clustering performance.