[Tencent Hunyuan Team] Add HunyuanDiT-v1.2 Support #8747
The config attributes {'use_style_cond_and_image_meta_size': False} were passed to HunyuanDiT2DModel, but are not expected and will be ignored. Please verify your config.json configuration file.
ValueError Traceback (most recent call last)
in <cell line: 4>()
4 frames
/usr/local/lib/python3.10/dist-packages/diffusers/models/modeling_utils.py in from_pretrained(cls, pretrained_model_name_or_path, **kwargs)
ValueError: Cannot load <class 'diffusers.models.transformers.hunyuan_transformer_2d.HunyuanDiT2DModel'> from /root/.cache/huggingface/hub/models--Tencent-Hunyuan--HunyuanDiT-v1.2-Diffusers/snapshots/bf329a9a93c2346d0986d91263207d3226d2858d/transformer because the following keys are missing: |
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update. |
don't forget |
Please use the code provided in the current PR to load the model |
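The warning quoted above ("config attributes ... are not expected and will be ignored") suggests that the installed HunyuanDiT2DModel predates this PR and does not know the new config key. One way to check this is to compare the config keys against the constructor signature. The following is only a minimal sketch: the helper name unexpected_config_keys and the two stand-in classes are hypothetical and are not part of diffusers.

```python
import inspect


def unexpected_config_keys(cls, config):
    """Return the config keys that cls.__init__ does not accept.

    Hypothetical helper for illustration; it is not a diffusers API.
    """
    params = inspect.signature(cls.__init__).parameters
    # A **kwargs parameter accepts any key, so nothing counts as unexpected.
    if any(p.kind is inspect.Parameter.VAR_KEYWORD for p in params.values()):
        return []
    return sorted(key for key in config if key not in params)


# Stand-in classes (assumptions): one without the new flag, one with it.
class ModelWithoutFlag:
    def __init__(self, hidden_size=8):
        self.hidden_size = hidden_size


class ModelWithFlag:
    def __init__(self, hidden_size=8, use_style_cond_and_image_meta_size=True):
        self.hidden_size = hidden_size
        self.use_style_cond_and_image_meta_size = use_style_cond_and_image_meta_size


config = {"hidden_size": 8, "use_style_cond_and_image_meta_size": False}
print(unexpected_config_keys(ModelWithoutFlag, config))
print(unexpected_config_keys(ModelWithFlag, config))
```

With diffusers installed, the same helper could presumably be pointed at HunyuanDiT2DModel and the contents of the transformer's config.json. A non-empty result would indicate that the installed version lacks the changes from this PR.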
What makes HYDIT so slow compared to other 50-step DiT models such as SD3 and PixArt-Sigma? |
Hello, I still run into this with pipe = HunyuanDiTPipeline.from_pretrained("Tencent-Hunyuan/HunyuanDiT-v1.2-Diffusers", torch_dtype=torch.float16) |
@neonhuang |
The model I am using was downloaded automatically by pipe = HunyuanDiTPipeline.from_pretrained("Tencent-Hunyuan/HunyuanDiT-v1.2-Diffusers", torch_dtype=torch.float16). |
* add v1.2 support --------- Co-authored-by: xingchaoliu <xingchaoliu@tencent.com> Co-authored-by: yiyixuxu <yixu310@gmail.com>
We slightly changed hunyuan_transformer_2d.py and embeddings.py to support Hunyuan-DiT v1.2 inference. The change adds logic to skip style_embedder and image_meta_size, since they have no effect in the current inference framework anyway. Please have a look. Thank you.
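The gating described above can be pictured with a toy example. This is a sketch under stated assumptions, not the code from this PR: build_extra_cond is a hypothetical function, plain Python lists stand in for tensors, and the flag name is borrowed from the config key use_style_cond_and_image_meta_size seen in the thread.

```python
def build_extra_cond(pooled_text, image_meta_size=None, style=None,
                     use_style_cond_and_image_meta_size=True):
    """Assemble the extra conditioning vector (toy illustration only).

    When the flag is False, the image-size and style inputs are skipped
    entirely and only the pooled text features are used.
    """
    extra = list(pooled_text)
    if use_style_cond_and_image_meta_size:
        # Append size and style conditioning only when the flag is on.
        extra += list(image_meta_size) + list(style)
    return extra


# Flag on: size and style values are appended to the text features.
with_extras = build_extra_cond([0.1, 0.2], image_meta_size=[1024, 1024], style=[0.0])

# Flag off: the caller does not need to supply size or style at all.
without_extras = build_extra_cond([0.1, 0.2], use_style_cond_and_image_meta_size=False)

print(len(with_extras), len(without_extras))
```

Because the conditioning vector is shorter when the flag is off, any layer that consumes it would need a matching input width. That is one reason such a switch usually lives in the model config and not in the call arguments.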
Test script: