The Cosine Schedule is Fisher-Rao-Optimal for Masked Discrete Diffusion Models

The Cosine Schedule is Fisher-Rao-Optimal for Masked Discrete Diffusion Models










arXiv:2508.04884v1 Announce Type: new
Abstract: In this work, we study the problem of choosing the discretisation schedule for sampling from masked discrete diffusion models in terms of the information geometry of the induced probability path. Specifically, we show that the optimal schedule under the Fisher-Rao geometry recovers the popularly-used cosine schedule.






Leo Zhang





Go to original source





Posted

in

, ,

by