Discrete diffusion models gradually undo noise with Markov processMasking diffusion model performs best despite not denoising graduallyMasking diffusion utilizes fundamental difference in discrete Markov processesSchedule-conditioned discrete diffusion (SCUD) outperforms masking diffusion