ComfyUI-ELLA-wrapper

ComfyUI-ELLA-wrapper
★ 113

ComfyUI集成ELLA模型Diffusers封装自动下载依赖
为ComfyUI提供对Diffusers实现的ELLA模型的封装节点,自动下载模型与依赖,简化在ComfyUI中使用ELLA(已被腾讯官方节点取代)
💡 在ComfyUI中快速集成并使用Diffusers版ELLA模型
🍴 9 Forks💻 Python🔄 2024-05-21
📦
网盘下载
复制链接后前往夸克网盘下载
https://pan.quark.cn/s/b45acaa3411d
📦 requirements.txt
diffusers>=0.26.0
accelerate
omegaconf
sentencepiece
image
image
📄 README

UPDATE:

Tencent has released official Comfy ELLA nodes that should now be used instead of this:

https://github.com/TencentQQGYLab/ComfyUI-ELLA

ComfyUI wrapper nodes to use the Diffusers implementation of ELLA

Installing

Either use the Manager and it’s install from git -feature, or clone this repo to custom_nodes and run:

pip install -r requirements.txt

or if you use portable (run this in ComfyUI_windows_portable -folder):

python_embeded\python.exe -m pip install -r ComfyUI\custom_nodes\ComfyUI-ELLA-wrapper\requirements.txt

You can use any 1.5 model, rest is auto downloaded, sources:

ELLA model (132MB):

https://huggingface.co/QQGYLab/ELLA/blob/main/ella-sd1.5-tsc-t5xl.safetensors

Also requires google-flan-t5-xl, for which I have opted to autodownload this repo, 6GB):

https://huggingface.co/ybelkada/flan-t5-xl-sharded-bf16/tree/main

Original repo:

ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment

Xiwei Hu*,

Rui Wang*,

Yixiao Fang*,

Bin Fu*,

Pei Cheng,

Gang Yu&#10022

  • Equal contributions, &#10022 Corresponding Author
  • Official code of “ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment”.

    🌟 Changelog

  • [2024.4.9] 🔥🔥🔥 Release ELLA-SD1.5 Checkpoint! Welcome to try!
  • [2024.3.11] 🔥 Release DPG-Bench! Welcome to try!
  • [2024.3.7] Initial update
  • Inference

    ELLA-SD1.5

    # get ELLA-SD1.5 at https://huggingface.co/QQGYLab/ELLA/blob/main/ella-sd1.5-tsc-t5xl.safetensors
    
    # comparing ella-sd1.5 and sd1.5
    # will generate images at `./assets/ella-inference-examples`
    python3 inference.py test --save_folder ./assets/ella-inference-examples --ella_path /path/to/ella-sd1.5-tsc-t5xl.safetensors
    
    # build a demo for ella-sd1.5
    GRADIO_SERVER_NAME=0.0.0.0 GRADIO_SERVER_PORT=8082 python3 ./inference.py demo /path/to/ella-sd1.5-tsc-t5xl.safetensors

    📊 DPG-Bench

    The guideline of DPG-Bench:

  • Generate your images according to our prompts.
  • It is recommended to generate 4 images per prompt and grid them to 2×2 format. Please Make sure your generated image’s filename is the same with the prompt’s filename.

  • Run the following command to conduct evaluation.
  • “`bash

    bash dpg_bench/dist_eval.sh $YOUR_IMAGE_PATH $RESOLUTION

    “`

    Thanks to the excellent work of DSG sincerely, we follow their instructions to generate questions and answers of DPG-Bench.

    📝 TODO

  • [ ] add huggingface demo link
  • [x] release checkpoint
  • [x] release inference code
  • [x] release DPG-Bench
  • 💡 Others

    We have also found LaVi-Bridge, another independent but similar work completed almost concurrently, which offers additional insights not covered by ELLA. The difference between ELLA and LaVi-Bridge can be found in issue 13. We are delighted to welcome other researchers and community users to promote the development of this field.

    😉 Citation

    If you find ELLA useful for your research and applications, please cite us using this BibTeX:

    @misc{hu2024ella,
          title={ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment}, 
          author={Xiwei Hu and Rui Wang and Yixiao Fang and Bin Fu and Pei Cheng and Gang Yu},
          year={2024},
          eprint={2403.05135},
          archivePrefix={arXiv},
          primaryClass={cs.CV}
    }