Quantized Flux with IP-Adapter #10728

hlky · 2025-02-05T15:48:50Z

What does this PR do?

Code (4bit)

from diffusers import BitsAndBytesConfig as DiffusersBitsAndBytesConfig
from transformers import BitsAndBytesConfig as TransformersBitsAndBytesConfig
import torch
from diffusers import FluxTransformer2DModel, FluxPipeline
from transformers import T5EncoderModel
from diffusers.utils import load_image


quant_config = TransformersBitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,
)

text_encoder_2_4bit = T5EncoderModel.from_pretrained(
    "black-forest-labs/FLUX.1-dev",
    subfolder="text_encoder_2",
    quantization_config=quant_config,
    torch_dtype=torch.float16,
)

quant_config = DiffusersBitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,
)

transformer_4bit = FluxTransformer2DModel.from_pretrained(
    "black-forest-labs/FLUX.1-dev",
    subfolder="transformer",
    quantization_config=quant_config,
    torch_dtype=torch.float16,
)

pipe = FluxPipeline.from_pretrained(
    "black-forest-labs/FLUX.1-dev",
    transformer=transformer_4bit,
    text_encoder_2=text_encoder_2_4bit,
    torch_dtype=torch.float16,
)

image = load_image(
    "https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/flux_ip_adapter_input.jpg"
).resize((1024, 1024))

pipe.load_ip_adapter(
    "XLabs-AI/flux-ip-adapter",
    weight_name="ip_adapter.safetensors",
    image_encoder_pretrained_model_name_or_path="openai/clip-vit-large-patch14",
    torch_dtype=torch.float16,
)

pipe.enable_model_cpu_offload()

pipe.set_ip_adapter_scale(1.0)

image = pipe(
    width=1024,
    height=1024,
    prompt="wearing sunglasses",
    negative_prompt="",
    true_cfg_scale=4.0,
    generator=torch.Generator().manual_seed(4444),
    ip_adapter_image=image,
).images[0]

image.save("flux_ip_adapter_output.jpg")

4bit	8bit

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@yiyixuxu

HuggingFaceDocBuilderDev · 2025-02-05T15:55:39Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

yiyixuxu · 2025-02-06T01:16:44Z

src/diffusers/loaders/transformer_flux.py

@@ -177,5 +177,3 @@ def _load_ip_adapter_weights(self, state_dicts, low_cpu_mem_usage=False):

        self.encoder_hid_proj = MultiIPAdapterImageProjection(image_projection_layers)
        self.config.encoder_hid_dim_type = "ip_image_proj"
-


let's do this?

self.encoder_hid_proj.to(dtype=self.dtype, device=self.device)

if this works, can we apply this to other ip-adapter loader as well?

I don't think we need to cast it?

oh I think you're right!

yiyixuxu

can we do the same for the other ip-adapters?

Quantized Flux with IP-Adapter

8f29d42

hlky mentioned this pull request Feb 5, 2025

FLUX IPAdapter fails when transformers are quantized #10337

Closed

yiyixuxu reviewed Feb 6, 2025

View reviewed changes

yiyixuxu approved these changes Feb 6, 2025

View reviewed changes

yiyixuxu merged commit d43ce14 into huggingface:main Feb 6, 2025
12 checks passed

guiyrt mentioned this pull request Feb 13, 2025

SD3 IP-Adapter runtime checkpoint conversion #10718

Merged

6 tasks

hlky mentioned this pull request Feb 20, 2025

Support multiple IP adapter in Flux #10775

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Quantized Flux with IP-Adapter #10728

Quantized Flux with IP-Adapter #10728

Uh oh!

hlky commented Feb 5, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Feb 5, 2025

Uh oh!

yiyixuxu Feb 6, 2025

Uh oh!

hlky Feb 6, 2025

Uh oh!

yiyixuxu Feb 6, 2025

Uh oh!

yiyixuxu left a comment

Uh oh!

Uh oh!

Uh oh!

		@@ -177,5 +177,3 @@ def _load_ip_adapter_weights(self, state_dicts, low_cpu_mem_usage=False):

		self.encoder_hid_proj = MultiIPAdapterImageProjection(image_projection_layers)
		self.config.encoder_hid_dim_type = "ip_image_proj"

Quantized Flux with IP-Adapter #10728

Quantized Flux with IP-Adapter #10728

Uh oh!

Conversation

hlky commented Feb 5, 2025

What does this PR do?

Who can review?

Uh oh!

HuggingFaceDocBuilderDev commented Feb 5, 2025

Uh oh!

yiyixuxu Feb 6, 2025

Choose a reason for hiding this comment

Uh oh!

hlky Feb 6, 2025

Choose a reason for hiding this comment

Uh oh!

yiyixuxu Feb 6, 2025

Choose a reason for hiding this comment

Uh oh!

yiyixuxu left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!