[Flux ControlNet] ControlNet initialization from transformer seems to be broken #9540

Closed
sayakpaul opened this issue Sep 27, 2024 · 5 comments

@sayakpaul
Member

Originally caught in #9324.

Reproduction:

from diffusers import FluxTransformer2DModel, FluxControlNetModel

transformer = FluxTransformer2DModel.from_pretrained(
    "hf-internal-testing/tiny-flux-pipe", subfolder="transformer"
)
controlnet = FluxControlNetModel.from_transformer(
    transformer=transformer, num_layers=1, num_single_layers=1, attention_head_dim=16, num_attention_heads=1
)

Leads to:

RuntimeError: Error(s) in loading state_dict for CombinedTimestepTextProjEmbeddings:
        size mismatch for timestep_embedder.linear_1.weight: copying a param with shape torch.Size([32, 256]) from checkpoint, the shape in current model is torch.Size([16, 256]).
        size mismatch for timestep_embedder.linear_1.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([16]).
        size mismatch for timestep_embedder.linear_2.weight: copying a param with shape torch.Size([32, 32]) from checkpoint, the shape in current model is torch.Size([16, 16]).
        size mismatch for timestep_embedder.linear_2.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([16]).
        size mismatch for text_embedder.linear_1.weight: copying a param with shape torch.Size([32, 32]) from checkpoint, the shape in current model is torch.Size([16, 32]).
        size mismatch for text_embedder.linear_1.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([16]).
        size mismatch for text_embedder.linear_2.weight: copying a param with shape torch.Size([32, 32]) from checkpoint, the shape in current model is torch.Size([16, 16]).
        size mismatch for text_embedder.linear_2.bias: copying a param with shape torch.Size([32]) from checkpoint, the shape in current model is torch.Size([16]).
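
For context, the shapes in the traceback line up with the embedding dimension being computed from the arguments passed to from_transformer rather than taken from the source transformer. A minimal sketch of the arithmetic (assuming the tiny test transformer uses num_attention_heads=2 and attention_head_dim=16, which matches the working call shown further down):

# Flux builds its inner/embedding dimension as num_attention_heads * attention_head_dim.
checkpoint_dim = 2 * 16  # tiny-flux-pipe transformer -> 32, the "checkpoint" shape above
controlnet_dim = 1 * 16  # values passed to from_transformer -> 16, the "current model" shape
# The time/text embedders are created with controlnet_dim, so copying the 32-wide
# transformer weights into them fails with the size mismatches shown above.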

Cc: @PromeAIpro

I think it makes sense to make this more robust and have dedicated testing for it.

@yiyixuxu possible to look into it?

@PromeAIpro
Contributor

I explicitly pass them in, and it works:

flux_controlnet = FluxControlNetModel.from_transformer(
            flux_transformer,
+            attention_head_dim=flux_transformer.config["attention_head_dim"],
+            num_attention_heads=flux_transformer.config["num_attention_heads"],
            num_layers=args.num_double_layers,
            num_single_layers=args.num_single_layers,
        )
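
A quick way to double-check that the shapes now line up (a sketch; flux_transformer and flux_controlnet as in the snippet above):

# Both models should share the same joint embedding width after the fix.
assert (
    flux_controlnet.config.num_attention_heads * flux_controlnet.config.attention_head_dim
    == flux_transformer.config.num_attention_heads * flux_transformer.config.attention_head_dim
)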

@PromeAIpro
Contributor

Why do we need to update the parameters here? Shouldn't they be passed in from the transformer?
[screenshot of the current from_transformer implementation]

@sayakpaul
Member Author

Yeah I would assume so as well.

So, if we match the parameters of the base transformer exactly, then it works:

controlnet = FluxControlNetModel.from_transformer(
    transformer=transformer, num_layers=1, num_single_layers=1, attention_head_dim=16, num_attention_heads=2
)

In this case, num_layers, num_single_layers, attention_head_dim, and num_attention_heads have been set to the values used by the transformer. But exposing these arguments can lead users to believe they are freely configurable.

For now, I am going to close this issue but we can revisit this later.
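
For the dedicated testing mentioned above, a minimal regression test could pin down the path that works today (a sketch only; the test name and placement are hypothetical):

import torch
from diffusers import FluxTransformer2DModel, FluxControlNetModel


def test_flux_controlnet_from_transformer_matching_config():
    transformer = FluxTransformer2DModel.from_pretrained(
        "hf-internal-testing/tiny-flux-pipe", subfolder="transformer"
    )
    controlnet = FluxControlNetModel.from_transformer(
        transformer=transformer,
        num_layers=1,
        num_single_layers=1,
        attention_head_dim=transformer.config.attention_head_dim,
        num_attention_heads=transformer.config.num_attention_heads,
    )
    # The shared time/text embedder should carry over the transformer's weights unchanged.
    for p_c, p_t in zip(
        controlnet.time_text_embed.parameters(), transformer.time_text_embed.parameters()
    ):
        assert torch.equal(p_c, p_t)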

@PromeAIpro
Contributor

PromeAIpro commented Sep 27, 2024

I think the defaults should come from the transformer configuration; only when the user explicitly specifies them do we need to change attention_head_dim and num_attention_heads.

@PromeAIpro
Contributor

PromeAIpro commented Sep 27, 2024

Something like this:

    @classmethod
    def from_transformer(
        cls,
        transformer,
        num_layers: int = 4,
        num_single_layers: int = 10,
        attention_head_dim=None,
        num_attention_heads=None,
        load_weights_from_transformer=True,
    ):
        config = transformer.config
        config["num_layers"] = num_layers
        config["num_single_layers"] = num_single_layers
        # Fall back to the transformer's head configuration unless the caller overrides it.
        config["attention_head_dim"] = attention_head_dim if attention_head_dim is not None else config["attention_head_dim"]
        config["num_attention_heads"] = num_attention_heads if num_attention_heads is not None else config["num_attention_heads"]

        controlnet = cls(**config)

        if load_weights_from_transformer:
            # Copy the shared embedders and the weights of the blocks the ControlNet keeps.
            controlnet.pos_embed.load_state_dict(transformer.pos_embed.state_dict())
            controlnet.time_text_embed.load_state_dict(transformer.time_text_embed.state_dict())
            controlnet.context_embedder.load_state_dict(transformer.context_embedder.state_dict())
            controlnet.x_embedder.load_state_dict(transformer.x_embedder.state_dict())
            controlnet.transformer_blocks.load_state_dict(transformer.transformer_blocks.state_dict(), strict=False)
            controlnet.single_transformer_blocks.load_state_dict(
                transformer.single_transformer_blocks.state_dict(), strict=False
            )

            # The ControlNet-specific conditioning embedder starts zero-initialized.
            controlnet.controlnet_x_embedder = zero_module(controlnet.controlnet_x_embedder)

        return controlnet
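
With that change, the call from the reproduction would work without restating the head configuration (a sketch of the intended usage, assuming the None defaults fall back to the transformer's config as above):

# Head configuration is inherited from the transformer unless explicitly overridden.
controlnet = FluxControlNetModel.from_transformer(
    transformer=transformer, num_layers=1, num_single_layers=1
)
assert controlnet.config.attention_head_dim == transformer.config.attention_head_dim
assert controlnet.config.num_attention_heads == transformer.config.num_attention_heads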
