I'm curious to give this a go. I've been training a lot of LoRAs for FLUX dev re...

spython · 2024-10-03T22:30:52 1727994652

This looks really good! What is your process to get this kind of high quality LoRAs?

davidbarker · 2024-10-03T23:27:22 1727998042

Thank you!

A reasonable amount of training images (50 or so), and then I train for 2,000-ish steps for a new style.

Many of them work well with Flux, particularly if they're illustration-based. Some don't seem to work at all, so I didn't upload those!

stavros · 2024-10-03T23:57:24 1727999844

How long does this take, and on what equipment? It's amazing to me that you can do this from just 50 images, I would have thought tens of thousands.

davidbarker · 2024-10-04T01:05:53 1728003953

It's very impressive. I aim for around 50 images if I'm training a style, but only 10 to 20 if training a concept (like an object or a face).

I have a MacBook Air so I train using the various API providers.

For training a style, I use Replicate: https://replicate.com/ostris/flux-dev-lora-trainer/train

For training a concept/person, I use fal: https://fal.ai/models/fal-ai/flux-lora-fast-training

With fal, you can train a concept in around 2 minutes and only pay $2. Incredibly cheap. (You could also use it for training a style if you wanted to. I just found I seem to get slightly better results using Replicate's trainer for a style.)

throw14082020 · 2024-10-04T04:25:02 1728015902

$2 for 2 minutes? Can't you get less than $2 for 1 hour using GPU machines from providers like runpod or AirGPU? I found it a bit expensive to use replicate and fal after 10 minutes of prompting.

I have not used runpod or airgpu, and not affiliated.

reissbaker · 2024-10-04T06:25:45 1728023145

Yes, renting raw compute via Runpod and friends will generally be much cheaper than renting a higher level service that uses that compute e.g. fal.ai or Replicate. For example, an A6000 on fal.ai is a little over $2/hr (they only show you the price in seconds, perhaps to make it more difficult to compare with ordinary GPU providers); on Runpod an A6000 is less than half that, $0.76/hr in their managed "Secure Cloud." If you're willing to take some risk of boxes disappearing, and don't need much security, Runpod's "Community Cloud" is even cheaper at $0.49/hr.

Similar deal with Replicate: an A100 there is over $5/hr, whereas on Runpod it's $1.64/hr.

And if you use the "serverless" services, the pricing becomes even more astronomical; as you note, $1/minute is unreasonably expensive: that's over 20x the cost of renting 8xH100s on Runpod's "Secure Cloud" (and 8xH100s are extreme overkill for finetuning image generators: even 1xH100 would be sufficient, meaning it's actually 160x markup).

stavros · 2024-10-04T01:08:13 1728004093

Wow, fantastic, thanks! I thought it would be much, much more expensive than this. Thanks for the info!

davidbarker · 2024-10-04T01:39:57 1728005997

Happy to help! It's a lot of fun. And it becomes even more fun when you combine LoRAs. So you could train one on your face, and then use that with a style LoRA, giving you a stylised version of your face.

If you do end up training one on yourself with fal, it should ultimately take you here (https://fal.ai/models/fal-ai/flux-lora) with your new LoRA pre-filled.

Then:

1. Click 'Add item' to add another LoRA and enter the URL of a style LoRA's SafeTensor file (with Civitai, go to any style you like and copy the URL from the download button) (you can also find LoRAs on Hugging Face)

2. Paste that SafeTensor URL as the second LoRA, remembering to include the trigger word for yourself (you set this when you start the training) and the trigger word for the style (it tells you on the Civitai page)

3. Play with the strength for the LoRAs if you want it to look more like you or more like the style, etc.

-----

If you want a style LoRA to try, this one of SNL title cards I trained actually makes some great photographic images. https://civitai.com/models/773477/flux-lora-snl-portrait (the download link would be https://civitai.com/api/download/models/865105?type=Model&fo...)

-----

There's a lot of trial and error to get the best combinations. Have fun!

throwup238 · 2024-10-04T02:42:18 1728009738

Have you tried img2text when training a style?

I want to make a LoRA of Peokudin-Gorskii photographs from the Library of Congress collection and they have thousands of photos, so I’m curious whether that’s effective for autogenerating the caption for images.

davidbarker · 2024-10-04T02:50:34 1728010234

It's funny you should ask. I recently released a plugin (https://community-en.eagle.cool/plugin/4B56113D-EB3E-4020-A8...) for Eagle (an asset library management app) that allows you to write rules to caption/tag images and videos using various AI models.

I have a preset in there that I sometimes use to generate captions using GPT-4o.

If you use Replicate, they'll also generate captions for you automatically if you wish. (I think they use LLaVA behind the scenes.) I typically use this just because it's easier, and seems to work well enough.

throwup238 · 2024-10-04T03:09:32 1728011372

That’s awesome! Thank you for the replicate link too. I didn’t know they also did LoRA training. They’ve been kind of hitting it lit the park lately.

stavros · 2024-10-04T01:42:04 1728006124

Thanks for all this! I had created a SD LoRA of my face back in the day, time for another one!

davidbarker · 2024-10-04T01:42:59 1728006179

Awesome! :)