I use it directly from Python.
I can generate a 512x512 on my 10gb 3080 no problem (or three 384x384 at a time)
It measures memory usage as well.