Autoregressive transformer models are usually memory bound, whereas SD is comput...

ninkendo · on Oct 11, 2023

M1 has (fast) unified memory between GPU and CPU, so something being memory bound ought not to have much bearing on whether it belongs on CPU or GPU… at least in theory. I’m a total noob here though so I may be wrong.

GaggiX · on Oct 11, 2023

We were mostly discussing the NPU; I don't know if that makes a difference.

lib-dev · on Oct 11, 2023

From https://en.wikipedia.org/wiki/Apple_M1#Memory

> The M1 uses a 128-bit LPDDR4X SDRAM in a unified memory configuration shared by all the components of the processor.

I assume that includes the NPU, media engine, etc.
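For a rough sense of what "memory bound" means on that bus, here is a back-of-envelope sketch. The 128-bit width is from the quote above; the 4266 MT/s transfer rate and the 3.5 GB model size (roughly a 7B-parameter model at 4 bits per weight) are my own assumptions, and the function names are just made up for illustration. It also assumes every weight is read once per generated token, which ignores caches, KV-cache traffic, and any compute limits.

```python
# Back-of-envelope estimate, not a benchmark.

def peak_bandwidth_gb_s(bus_width_bits: int, transfers_per_sec: float) -> float:
    """Theoretical peak bandwidth in GB/s: bytes per transfer times transfers per second."""
    return bus_width_bits / 8 * transfers_per_sec / 1e9


def max_tokens_per_sec(bandwidth_gb_s: float, model_size_gb: float) -> float:
    """Upper bound on decode speed if each token requires reading all weights once."""
    return bandwidth_gb_s / model_size_gb


M1_BUS_WIDTH_BITS = 128             # from the Wikipedia quote
ASSUMED_TRANSFERS_PER_SEC = 4266e6  # assumption: LPDDR4X-4266
ASSUMED_MODEL_SIZE_GB = 3.5         # assumption: ~7B params at 4 bits each

bandwidth = peak_bandwidth_gb_s(M1_BUS_WIDTH_BITS, ASSUMED_TRANSFERS_PER_SEC)
ceiling = max_tokens_per_sec(bandwidth, ASSUMED_MODEL_SIZE_GB)

print(f"peak bandwidth: {bandwidth:.1f} GB/s")
print(f"token rate ceiling: {ceiling:.1f} tokens/s")
```

If the memory really is shared by every block, this ceiling would be the same whether the weights are streamed by the CPU, GPU, or NPU, which is the point being made above; what each block can actually sustain in practice is a separate question.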