menu
techminis

A naukri.com initiative

google-web-stories
Home

>

ML News

>

The Diffus...
source image

Arxiv

2d

read

195

img
dot

Image Credit: Arxiv

The Diffusion Duality

  • Uniform-state discrete diffusion models, while promising for fast text generation, often lag behind autoregressive and masked diffusion models.
  • A new method called Duo aims to narrow this performance gap by leveraging insights from Gaussian diffusion processes.
  • Duo incorporates curriculum learning, guided by Gaussian processes, to improve training speed and reduce variance.
  • Models trained with this approach outperform autoregressive models in zero-shot perplexity on multiple benchmarks.
  • The method also introduces Discrete Consistency Distillation to enhance few-step generation in diffusion language models by significantly accelerating sampling.
  • Code and model checkpoints for Duo are available on the project page: http://s-sahoo.github.io/duo

Read Full Article

like

11 Likes

For uninterrupted reading, download the app