https://lilianweng.github.io/posts/2021-07-11-diffusion-mode...
Personally, I find the core diffusion papers pretty dense and difficult to follow, so the blog post is where I'd begin.
https://arxiv.org/pdf/1503.03585.pdf
This paper is a decent starting point on the literature side, but it's a doozy.
Both the paper and the blog post are pretty math-heavy. I have not yet found a really clear, intuitive explanation that doesn't get down in the weeds of the math, and it took me a long time to understand what the hell the math is trying to say (and there are some parts I still don't fully understand!).