
Try: https://github.com/lllyasviel/Fooocus

I also recommend a good photorealistic base model, like RealVis XL.

In my experience it's like DALL-E but straight up better, more customizable, and local. And that's before you start trying finetunes and LoRAs.

Other UIs will do SDXL, but every one I tried is terrible without all those default Fooocus augmentations.



SDXL is great, but it's in no way better than DALL-E as far as straight text-to-image goes, apart from the lack of censorship.

It has plenty of other advantages, but you can't tell it "make me a cute illustration of a 2-year-old girl with Blaze from Blaze and the Monster Machines on a birthday cake with a large 2 candle on it."

DALL-E will nail that, more or less. SDXL very much won't.


Here's what I got, pasting your prompt in DALL-E 3:

- https://ibb.co/k0NCWG7
- https://ibb.co/Vm3GZcR
- https://ibb.co/bvSC4w3
- https://ibb.co/VqSdYbZ

I'm surprised that it didn't complain about copyrighted characters; it tends to do that a lot for me.


I used that as an example as I recently asked for it. I did find I had to tell it that "monster" in the title referred to monster trucks, not actual monsters. That helped it not put actual monsters in (as yours are half Blaze/half monsters), though my generations were way better at doing Blaze than yours were - they just had cute little monsters around too.


SDXL understands prompts much better than 1.5. So the next version of SD might be comparable to DALL-E without censorship.


Heh, yeah, that is true

https://ibb.co/1m0bLWC

More cherry-picking and messing with styles is getting closer, but nothing like DALL-E's first try, I'm sure.



