
[Feature Request] Decouple linear1 and linear2 Flux layers in network_args #1613

Open
EricBCoding opened this issue Sep 19, 2024 · 5 comments

Comments

@EricBCoding

Hi,

There's a popular discussion thread that suggests training the proj_out (linear2) module of single blocks 7 and 20 for Flux LoRAs:

https://old.reddit.com/r/StableDiffusion/comments/1f523bd/good_flux_loras_can_be_less_than_45mb_128_dim/

As far as I can tell, it is not yet possible to isolate linear2 through the sd-scripts network_args flag. Perhaps this is as close as it gets:

--network_args "train_double_block_indices=none" "train_single_block_indices=7,20" "single_mod_dim=0"

I propose replacing the single_dim argument with e.g. single_linear1_dim and single_linear2_dim. That way, we can specify single_linear1_dim=0 to reproduce the training method outlined in the thread above.
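
For illustration, the training method from the Reddit thread could then be reproduced with something like the following (single_linear1_dim here is the proposed, not-yet-existing argument; single_linear2_dim would simply be left at the default dim):

--network_args "train_double_block_indices=none" "train_single_block_indices=7,20" "single_mod_dim=0" "single_linear1_dim=0"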

Or is this already possible with a different set of arguments?

Thanks!

@whmc76

whmc76 commented Sep 19, 2024

+1

@kohya-ss
Owner

This is interesting. Since linear1 and linear2 belong to the same attention, I don't think there is any need to separate them.

I think we can get almost the same effect by training linear1 and linear2 with half the dim (rank). Have you tried it?
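
For example (untested, just reusing the arguments from the first post and halving whatever rank you would otherwise pass, e.g. 64 instead of the 128 mentioned in the Reddit thread):

--network_dim 64 --network_args "train_double_block_indices=none" "train_single_block_indices=7,20" "single_mod_dim=0"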

@EricBCoding
Author

EricBCoding commented Sep 19, 2024

> I think we can get almost the same effect by training linear1 and linear2 with half the dim (rank). Have you tried it?

No, but I'll give it a go! I modified my copy of lora_flux.py to isolate linear2 and have already trained a couple of models that way. Let me try your suggestion and see how the results differ.

@envy-ai

envy-ai commented Sep 21, 2024

> I think we can get almost the same effect by training linear1 and linear2 with half the dim (rank). Have you tried it?

> No, but I'll give it a go! I modified my copy of lora_flux.py to isolate linear2 and have already trained a couple of models that way. Let me try your suggestion and see how the results differ.

Any chance you could post the patch?

@EricBCoding
Author

EricBCoding commented Sep 21, 2024

> Any chance you could post the patch?

My sd-scripts is heavily customized, but here's how you can apply it to yours:

  • Look for ("single_blocks", "linear") in networks/lora_flux.py. It's currently on line 665:

("single_blocks", "linear"),

  • Replace this line with ("linear1"), and save the file.

  • Add "single_dim=0" to your network_args flag (in addition to the other values I provided in the OP). This will now skip the linear1 modules and allow you to isolate linear2 for training.

  • If patched correctly, your console should state that you are targeting only 2 unet modules (see the sketch below).
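
If it helps to see why that ends up at 2 modules, here's a rough, standalone sketch. It is not the actual lora_flux.py code, and the module names are made up to resemble sd-scripts' naming; assume train_single_block_indices=7,20 has already narrowed things down to those two blocks:

def matches(lora_name, pattern):
    # mimics the substring-style check used to decide which dim override applies
    return all(token in lora_name for token in pattern)

# hypothetical module names for single blocks 7 and 20 (the only blocks left after
# train_single_block_indices=7,20)
candidate_names = [
    "lora_unet_single_blocks_7_linear1",
    "lora_unet_single_blocks_7_linear2",
    "lora_unet_single_blocks_20_linear1",
    "lora_unet_single_blocks_20_linear2",
]

# unpatched pattern: single_dim=0 would apply to all four names, skipping linear2 as well
print([n for n in candidate_names if matches(n, ("single_blocks", "linear"))])

# patched pattern: single_dim=0 now only hits linear1, so the two linear2 modules keep
# their default dim and are the only unet modules left to train
print([n for n in candidate_names if matches(n, ("linear1",))])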

Hope that helps.

I'm still in the process of running some tests on training linear1 and linear2 in conjunction, and will report back on that in the next few days.
