Reduced order models (ROMs) are important in fluid mechanics for low-cost predictions in engineering applications.A new nonlinear reduction strategy is presented for transient flows, incorporating parametrization and uncertainty quantification.The strategy uses a variational auto-encoder (VAE) with variational inference for confidence measurement.The incorporation of attention mechanisms enhances generalization across different dynamics, improving model performance.