correct attention_head_dim for JointTransformerBlock #8608

Merged
merged 5 commits into main from attn-dim on Jul 2, 2024

Conversation

yiyixuxu
Collaborator

No description provided.

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@yiyixuxu yiyixuxu requested a review from DN6 June 18, 2024 02:48
@yiyixuxu
Collaborator Author

should finish #6893 and #7027

@@ -81,7 +81,7 @@ def __init__(
                 JointTransformerBlock(
                     dim=self.inner_dim,
                     num_attention_heads=num_attention_heads,
-                    attention_head_dim=self.inner_dim,
+                    attention_head_dim=attention_head_dim,
Collaborator

Should this also be self.config.attention_head_dim to match transformer_sd3.py?

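For context on the question above: inside a diffusers model __init__ decorated with @register_to_config, the constructor argument and its self.config counterpart should hold the same value, so the suggestion is about matching the style of transformer_sd3.py rather than changing behavior. A minimal sketch of that pattern, assuming current diffusers behavior; TinyModel and its default values are hypothetical, only ConfigMixin, ModelMixin, and register_to_config are real diffusers APIs:

```python
# Sketch: @register_to_config records the __init__ kwargs on self.config before
# the decorated __init__ body runs, so `attention_head_dim` and
# `self.config.attention_head_dim` are interchangeable during construction.
from diffusers.configuration_utils import ConfigMixin, register_to_config
from diffusers.models.modeling_utils import ModelMixin


class TinyModel(ModelMixin, ConfigMixin):
    @register_to_config
    def __init__(self, num_attention_heads: int = 24, attention_head_dim: int = 64):
        super().__init__()
        # Same value, two spellings; transformer_sd3.py uses the self.config form.
        assert attention_head_dim == self.config.attention_head_dim
        self.inner_dim = num_attention_heads * attention_head_dim


model = TinyModel()
print(model.inner_dim)  # 1536
```
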
@@ -128,9 +128,9 @@ def __init__(self, dim, num_attention_heads, attention_head_dim, context_pre_only
             query_dim=dim,
             cross_attention_dim=None,
             added_kv_proj_dim=dim,
-            dim_head=attention_head_dim // num_attention_heads,
+            dim_head=attention_head_dim,
Collaborator

This won't break? Wouldn't the value of dim_head be computed differently?

Collaborator Author

@yiyixuxu yiyixuxu Jul 1, 2024

Well, no.

Currently dim_head=attention_head_dim // num_attention_heads, with attention_head_dim and num_attention_heads passed from SD3ControlNetModel like this:

* attention_head_dim=self.inner_dim
* self.inner_dim = self.config.num_attention_heads * self.config.attention_head_dim

So the attention_head_dim this block receives is really num_attention_heads * attention_head_dim, while num_attention_heads is unchanged. Dividing the two just gives back the attention_head_dim we used to configure the model, so if we pass it down correctly we can use it directly (see the sketch below).

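As a concrete illustration of the arithmetic above (the values are hypothetical, picked only to make the division visible):

```python
# Hypothetical config values, named after the constructor arguments discussed above.
num_attention_heads = 24
attention_head_dim = 64
inner_dim = num_attention_heads * attention_head_dim  # 1536

# Before this PR: SD3ControlNetModel passed attention_head_dim=self.inner_dim,
# so JointTransformerBlock recovered the per-head size by dividing:
old_dim_head = inner_dim // num_attention_heads  # 1536 // 24 == 64

# After this PR: the configured attention_head_dim is passed down directly,
# so it can be used as dim_head without the division:
new_dim_head = attention_head_dim  # 64

assert old_dim_head == new_dim_head  # same value either way, so nothing breaks
```
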
Collaborator

Ahh I see. Thanks for explaining! 🙏🏽

@yiyixuxu yiyixuxu merged commit d9f71ab into main Jul 2, 2024
18 checks passed
@yiyixuxu yiyixuxu deleted the attn-dim branch July 2, 2024 17:42
sayakpaul pushed a commit that referenced this pull request Dec 23, 2024
* add

* update sd3 controlnet

* Update src/diffusers/models/controlnet_sd3.py

---------

Co-authored-by: yiyixuxu <yixu310@gmail,com>
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>