
Add SEGA for FLUX #3

Draft
Marlon154 wants to merge 709 commits into ml-research:sega-dits from Marlon154:main

Conversation

@Marlon154
Collaborator

What does this PR do?

This PR adds SEGA for FLUX, enabling users to guide the diffusion process with editing prompts.
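For context, SEGA (Semantic Guidance) steers generation by adding or subtracting the direction between an edit-prompt noise estimate and the unconditional estimate, keeping only the strongest components of that direction. A minimal plain-Python sketch of the core update rule (parameter names mirror the existing SemanticStableDiffusionPipeline; the FLUX pipeline's actual API in this PR may differ):

```python
def sega_update(noise_uncond, noise_edit, edit_scale=5.0,
                threshold=0.9, reverse=False):
    """Sketch of the SEGA guidance term: scaled edit direction,
    masked to the largest-magnitude components by quantile threshold."""
    # Semantic edit direction, scaled by the edit guidance scale.
    guidance = [edit_scale * (e - u) for u, e in zip(noise_uncond, noise_edit)]
    # Quantile thresholding: zero out all but the strongest components.
    mags = sorted(abs(g) for g in guidance)
    cutoff = mags[min(int(threshold * len(mags)), len(mags) - 1)]
    guidance = [g if abs(g) >= cutoff else 0.0 for g in guidance]
    # A reversed editing direction removes the concept instead of adding it.
    if reverse:
        guidance = [-g for g in guidance]
    return guidance

# Toy usage: the resulting guidance is sparse and follows the edit direction.
uncond = [0.0] * 8
edit = [0.1, 0.2, 0.0, 0.0, 0.0, 0.0, 0.0, 1.0]
g = sega_update(uncond, edit, edit_scale=2.0, threshold=0.9)
```

In the real pipeline this term is added to the noise prediction each denoising step (after a warmup period); the sketch only shows the per-step arithmetic.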

@Marlon154 Marlon154 added the enhancement New feature or request label Nov 4, 2024
@Marlon154 Marlon154 self-assigned this Nov 4, 2024
hlky and others added 27 commits March 2, 2025 17:10
* Add `remote_decode` to `remote_utils`

* test dependency

* test dependency

* dependency

* dependency

* dependency

* docstrings

* changes

* make style

* apply

* revert, add new options

* Apply style fixes

* deprecate base64, headers not needed

* address comments

* add license header

* init test_remote_decode

* more

* more test

* more test

* skeleton for xl, flux

* more test

* flux test

* flux packed

* no scaling

* -save

* hunyuanvideo test

* Apply style fixes

* init docs

* Update src/diffusers/utils/remote_utils.py

Co-authored-by: Sayak Paul <[email protected]>

* comments

* Apply style fixes

* comments

* hybrid_inference/vae_decode

* fix

* tip?

* tip

* api reference autodoc

* install tip

---------

Co-authored-by: sayakpaul <[email protected]>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
* fix-copies went uncaught it seems.

* remove more unneeded encode_prompt() tests

* Revert "fix-copies went uncaught it seems."

This reverts commit eefb302.

* empty
…eneration model (huggingface#10626)

* Update EasyAnimate V5.1

* Add docs && add tests && Fix comments problems in transformer3d and vae

* delete comments and remove useless import

* delete process

* Update EXAMPLE_DOC_STRING

* rename transformer file

* make fix-copies

* make style

* refactor pt. 1

* update toctree.yml

* add model tests

* Update layer_norm for norm_added_q and norm_added_k in Attention

* Fix processor problem

* refactor vae

* Fix problem in comments

* refactor tiling; remove einops dependency

* fix docs path

* make fix-copies

* Update src/diffusers/pipelines/easyanimate/pipeline_easyanimate_control.py

* update _toctree.yml

* fix test

* update

* update

* update

* make fix-copies

* fix tests

---------

Co-authored-by: Aryan <[email protected]>
Co-authored-by: Aryan <[email protected]>
Co-authored-by: YiYi Xu <[email protected]>
Co-authored-by: Dhruv Nair <[email protected]>
* Fix SD2.X clip single file load projection_dim

Infer projection_dim from the checkpoint before loading
from pretrained, override any incorrect hub config.

Hub configuration for SD2.X specifies projection_dim=512
which is incorrect for SD2.X checkpoints loaded from civitai
and similar.

Exception was previously thrown upon attempting to
load_model_dict_into_meta for SD2.X single file checkpoints.

Such LDM models usually require projection_dim=1024
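The fix reads `projection_dim` off a tensor shape in the checkpoint itself rather than trusting the hub config. A toy sketch of the idea, with a fake state dict standing in for a real checkpoint (the key name follows the OpenCLIP convention but is illustrative; for a square projection matrix, `shape[0]` and `shape[1]` are identical, which is why the commit below could revert between them):

```python
def infer_projection_dim(checkpoint, default=512):
    """Infer the text projection_dim from the checkpoint's own weights,
    overriding a possibly incorrect hub config value."""
    key = "cond_stage_model.model.text_projection"
    if key in checkpoint:
        # SD2.x checkpoints carry a 1024-dim projection even when the
        # hub config claims 512.
        return checkpoint[key].shape[0]
    return default

class FakeTensor:
    """Stands in for a torch.Tensor; only the shape matters here."""
    def __init__(self, shape):
        self.shape = shape

ckpt = {"cond_stage_model.model.text_projection": FakeTensor((1024, 1024))}
dim = infer_projection_dim(ckpt)
```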

* convert_open_clip_checkpoint use hidden_size for text_proj_dim

* convert_open_clip_checkpoint, revert checkpoint[text_proj_key].shape[1] -> [0]

values are identical

---------

Co-authored-by: Teriks <[email protected]>
Co-authored-by: Dhruv Nair <[email protected]>
* Update pipeline_animatediff.py

* Update pipeline_animatediff_controlnet.py

* Update pipeline_animatediff_sparsectrl.py

* Update pipeline_animatediff_video2video.py

* Update pipeline_animatediff_video2video_controlnet.py

---------

Co-authored-by: Dhruv Nair <[email protected]>
* Add example of Ip-Adapter-Callback.

* Add image links from HF Hub.
* Update ip_adapter.py

* Update ip_adapter.py

* Update ip_adapter.py

* Update ip_adapter.py

* Update ip_adapter.py

* Apply style fixes

---------

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: hlky <[email protected]>
* initial commit

* fix empty cache

* fix one more

* fix style

* update device functions

* update

* update

* Update src/diffusers/utils/testing_utils.py

Co-authored-by: hlky <[email protected]>

* Update src/diffusers/utils/testing_utils.py

Co-authored-by: hlky <[email protected]>

* Update src/diffusers/utils/testing_utils.py

Co-authored-by: hlky <[email protected]>

* Update tests/pipelines/controlnet/test_controlnet.py

Co-authored-by: hlky <[email protected]>

* Update src/diffusers/utils/testing_utils.py

Co-authored-by: hlky <[email protected]>

* Update src/diffusers/utils/testing_utils.py

Co-authored-by: hlky <[email protected]>

* Update tests/pipelines/controlnet/test_controlnet.py

Co-authored-by: hlky <[email protected]>

* with gc.collect

* update

* make style

* check_torch_dependencies

* add mps empty cache

* add changes

* bug fix

* enable on xpu

* update more cases

* revert

* revert back

* Update test_stable_diffusion_xl.py

* Update tests/pipelines/stable_diffusion/test_stable_diffusion.py

Co-authored-by: hlky <[email protected]>

* Update tests/pipelines/stable_diffusion/test_stable_diffusion.py

Co-authored-by: hlky <[email protected]>

* Update tests/pipelines/stable_diffusion/test_stable_diffusion_img2img.py

Co-authored-by: hlky <[email protected]>

* Update tests/pipelines/stable_diffusion/test_stable_diffusion_img2img.py

Co-authored-by: hlky <[email protected]>

* Update tests/pipelines/stable_diffusion/test_stable_diffusion_img2img.py

Co-authored-by: hlky <[email protected]>

* Apply suggestions from code review

Co-authored-by: hlky <[email protected]>

* add test marker

---------

Co-authored-by: hlky <[email protected]>
* Update evaluation.md

* Update docs/source/en/conceptual/evaluation.md

Co-authored-by: Steven Liu <[email protected]>

---------

Co-authored-by: Steven Liu <[email protected]>
* feat: support non-diffusers lumina2 LoRAs.

* revert ipynb changes (but I don't know why this is required ☹️)

* empty

---------

Co-authored-by: Dhruv Nair <[email protected]>
Co-authored-by: YiYi Xu <[email protected]>
…e#10927)

* [Quantization] support pass MappingType for TorchAoConfig

* Apply style fixes

---------

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
* update

* refactor image-to-video pipeline

* update

* fix copied from

* use FP32LayerNorm
)

* Fix seed initialization to handle args.seed = 0 correctly
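The bug fixed here is a classic truthiness check: `if args.seed:` silently skips seeding when the user passes `--seed 0`, because `0` is falsy. A minimal illustration of the broken and corrected conditions (function names are hypothetical, not the script's actual code):

```python
def should_seed_buggy(seed):
    # Buggy: 0 is falsy, so an explicitly requested seed of 0 is ignored.
    return bool(seed)

def should_seed_fixed(seed):
    # Fixed: only skip seeding when no seed was passed at all.
    return seed is not None
```

The same pattern applies to any optional numeric argument where `0` is a legitimate value: compare against `None`, never rely on truthiness.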

* Apply style fixes

---------

Co-authored-by: Sayak Paul <[email protected]>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
…SDXL (huggingface#10951)

* feat: add Mixture-of-Diffusers ControlNet Tile upscaler Pipeline for SDXL

* make style make quality
* Update pipeline_cogview4.py

* Use GLM instead of T5 in doc
* fix t5 training bug

* Apply style fixes

---------

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
use style bot GH action from hfh

Co-authored-by: Sayak Paul <[email protected]>
…hs` is passed in a distributed training env (huggingface#10973)

* updated train_dreambooth_lora to fix the LR schedulers for `num_train_epochs` in distributed training env

* fixed formatting

* remove trailing newlines

* fixed style error
ishan-modi and others added 30 commits April 21, 2025 09:56
* update

* update

* addressed PR comments

* update

---------

Co-authored-by: YiYi Xu <[email protected]>
…ce#11369)

* Add stochastic sampling to FlowMatchEulerDiscreteScheduler

This PR adds stochastic sampling to FlowMatchEulerDiscreteScheduler based on Lightricks/LTX-Video@b1aeddd  ltx_video/schedulers/rf.py

* Apply style fixes

* Use config value directly

* Apply style fixes

* Swap order

* Update src/diffusers/schedulers/scheduling_flow_match_euler_discrete.py

Co-authored-by: YiYi Xu <[email protected]>

* Update src/diffusers/schedulers/scheduling_flow_match_euler_discrete.py

Co-authored-by: YiYi Xu <[email protected]>

---------

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: YiYi Xu <[email protected]>
…e#11281)

* initial commit

* initial commit

* initial commit

* initial commit

* initial commit

* initial commit

* Update examples/dreambooth/train_dreambooth_lora_hidream.py

Co-authored-by: Bagheera <[email protected]>

* move prompt embeds, pooled embeds outside

* Update examples/dreambooth/train_dreambooth_lora_hidream.py

Co-authored-by: hlky <[email protected]>

* Update examples/dreambooth/train_dreambooth_lora_hidream.py

Co-authored-by: hlky <[email protected]>

* fix import

* fix import and tokenizer 4, text encoder 4 loading

* te

* prompt embeds

* fix naming

* shapes

* initial commit to add HiDreamImageLoraLoaderMixin

* fix init

* add tests

* loader

* fix model input

* add code example to readme

* fix default max length of text encoders

* prints

* nullify training cond in unpatchify for temp fix to incompatible shaping of transformer output during training

* smol fix

* unpatchify

* unpatchify

* fix validation

* flip pred and loss

* fix shift!!!

* revert unpatchify changes (for now)

* smol fix

* Apply style fixes

* workaround moe training

* workaround moe training

* remove prints

* to reduce some memory, keep vae in `weight_dtype` same as we have for flux (as it's the same vae)
https://github.com/huggingface/diffusers/blob/bbd0c161b55ba2234304f1e6325832dd69c60565/examples/dreambooth/train_dreambooth_lora_flux.py#L1207

* refactor to align with HiDream refactor

* refactor to align with HiDream refactor

* refactor to align with HiDream refactor

* add support for cpu offloading of text encoders

* Apply style fixes

* adjust lr and rank for train example

* fix copies

* Apply style fixes

* update README

* update README

* update README

* fix license

* keep prompt2,3,4 as None in validation

* remove reverse ode comment

* Update examples/dreambooth/train_dreambooth_lora_hidream.py

Co-authored-by: Sayak Paul <[email protected]>

* Update examples/dreambooth/train_dreambooth_lora_hidream.py

Co-authored-by: Sayak Paul <[email protected]>

* vae offload change

* fix text encoder offloading

* Apply style fixes

* cleaner to_kwargs

* fix module name in copied from

* add requirements

* fix offloading

* fix offloading

* fix offloading

* update transformers version in reqs

* try AutoTokenizer

* try AutoTokenizer

* Apply style fixes

* empty commit

* Delete tests/lora/test_lora_layers_hidream.py

* change tokenizer_4 to load with AutoTokenizer as well

* make text_encoder_four and tokenizer_four configurable

* save model card

* save model card

* revert T5

* fix test

* remove non diffusers lumina2 conversion

---------

Co-authored-by: Bagheera <[email protected]>
Co-authored-by: hlky <[email protected]>
Co-authored-by: Sayak Paul <[email protected]>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Small change: rename requirements_sana.txt to requirements_hidream.txt
* Kolors additional pipelines, community contrib

---------

Co-authored-by: Teriks <[email protected]>
Co-authored-by: Linoy Tsaban <[email protected]>
* 1. add pre-computation of prompt embeddings when custom prompts are used as well
2. save model card even if model is not pushed to hub
3. remove scheduler initialization from code example - not necessary anymore (it's now in the base model's config)
4. add skip_final_inference - to allow to run with validation, but skip the final loading of the pipeline with the lora weights to reduce memory reqs
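Point 1 above amounts to encoding each unique prompt once before the training loop, so the text encoders can be freed early and peak memory drops. A minimal sketch of that caching pattern (the `encode_fn` stand-in is hypothetical; the real script calls the HiDream text encoders):

```python
def precompute_embeddings(prompts, encode_fn):
    """Encode each unique prompt exactly once; afterwards the text
    encoders are no longer needed and can be deleted."""
    cache = {}
    for p in prompts:
        if p not in cache:
            cache[p] = encode_fn(p)
    return cache

# Toy usage with a fake encoder that records how often it is called.
calls = []
def fake_encode(p):
    calls.append(p)
    return [float(len(p))]

cache = precompute_embeddings(["a photo", "a photo", "3d icon"], fake_encode)
```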

* pre encode validation prompt as well

* Update examples/dreambooth/train_dreambooth_lora_hidream.py

Co-authored-by: Sayak Paul <[email protected]>

* Update examples/dreambooth/train_dreambooth_lora_hidream.py

Co-authored-by: Sayak Paul <[email protected]>

* Update examples/dreambooth/train_dreambooth_lora_hidream.py

Co-authored-by: Sayak Paul <[email protected]>

* pre encode validation prompt as well

* Apply style fixes

* empty commit

* change default trained modules

* empty commit

* address comments + change encoding of validation prompt (before it was only pre-encoded if custom prompts are provided, but should be pre-encoded either way)

* Apply style fixes

* empty commit

* fix validation_embeddings definition

* fix final inference condition

* fix pipeline deletion in last inference

* Apply style fixes

* empty commit

* layers

* remove readme remarks on only pre-computing when instance prompt is provided and change example to 3d icons

* smol fix

* empty commit

---------

Co-authored-by: Sayak Paul <[email protected]>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Fix Flux IP adapter argument in the example

IP-Adapter example had a wrong argument. Fix `true_cfg` -> `true_cfg_scale`
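For reference, `true_cfg_scale` in the Flux pipeline enables true classifier-free guidance, which combines the negative- and positive-prompt predictions. A plain-Python sketch of the standard CFG formula (not the pipeline's exact code):

```python
def true_cfg(pred_neg, pred_pos, true_cfg_scale):
    # Classifier-free guidance: move the prediction away from the
    # negative-prompt estimate, toward the positive one, by the scale.
    return [n + true_cfg_scale * (p - n) for n, p in zip(pred_neg, pred_pos)]

out = true_cfg([0.0, 1.0], [1.0, 1.0], true_cfg_scale=2.0)
```

With `true_cfg_scale=1.0` the negative prompt has no effect; larger values push harder toward the positive prompt.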
…for resizing (huggingface#11421)

* Set LANCZOS as default interpolation mode for resizing

* [train_dreambooth_lora.py] Set LANCZOS as default interpolation mode for resizing
…s in pipelines during torch.compile() (huggingface#11085)

* test for better torch.compile stuff.

* fixes

* recompilation and graph break.

* clear compilation cache.

* change to modeling level test.

* allow running compilation tests during nightlies.
* enable group_offload cases and quanto cases on XPU

Signed-off-by: YAO Matrix <[email protected]>

* use backend APIs

Signed-off-by: Yao Matrix <[email protected]>

* fix style

Signed-off-by: Yao Matrix <[email protected]>

---------

Signed-off-by: YAO Matrix <[email protected]>
Signed-off-by: Yao Matrix <[email protected]>
* enable test_layerwise_casting_memory cases on XPU

Signed-off-by: Yao Matrix <[email protected]>

* fix style

Signed-off-by: Yao Matrix <[email protected]>

---------

Signed-off-by: Yao Matrix <[email protected]>
…ipts follow up (huggingface#11427)

* Update train_text_to_image_lora.py

* update_train_text_to_image_lora
* enable gguf test cases on XPU

Signed-off-by: YAO Matrix <[email protected]>

* make SD35LargeGGUFSingleFileTests::test_pipeline_inference pass

Signed-off-by: root <[email protected]>

* make FluxControlLoRAGGUFTests::test_lora_loading pass

Signed-off-by: Yao Matrix <[email protected]>

* polish code

Signed-off-by: Yao Matrix <[email protected]>

* Apply style fixes

---------

Signed-off-by: YAO Matrix <[email protected]>
Signed-off-by: root <[email protected]>
Signed-off-by: Yao Matrix <[email protected]>
Co-authored-by: root <[email protected]>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
remove unnecessary pipeline moving to cpu in validation

Co-authored-by: Sayak Paul <[email protected]>
* Fixing missing provider options argument

* Adding if else for provider options

* Apply suggestions from code review

Co-authored-by: YiYi Xu <[email protected]>

* Apply style fixes

* Update src/diffusers/pipelines/onnx_utils.py

Co-authored-by: YiYi Xu <[email protected]>

* Update src/diffusers/pipelines/onnx_utils.py

Co-authored-by: YiYi Xu <[email protected]>

---------

Co-authored-by: Uros Petkovic <[email protected]>
Co-authored-by: Sayak Paul <[email protected]>
Co-authored-by: YiYi Xu <[email protected]>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
…lNet training (huggingface#11449)

Set LANCZOS as the default interpolation for image resizing
* enable unidiffuser cases on XPU

Signed-off-by: Yao Matrix <[email protected]>

* fix a typo

Signed-off-by: Yao Matrix <[email protected]>

* fix style

Signed-off-by: Yao Matrix <[email protected]>

---------

Signed-off-by: Yao Matrix <[email protected]>
