timm==0.4.12 transformers==4.15.0 fairscale==0.4.4 pycocoevalcap
Inside ComfyUI_windows_portable\python_embeded, run:
python.exe -m pip install fairscale
And, inside ComfyUI_windows_portable\ComfyUI\custom_nodes\, run:
git clone https://github.com/paulo-coronado/comfy_clip_blip_node
Add a cell with the following code:
!pip install fairscale
!cd custom_nodes && git clone https://github.com/paulo-coronado/comfy_clip_blip_node
The implementation of CLIPTextEncodeBLIP relies on resources from BLIP, ALBEF, Huggingface Transformers, and timm. We thank the original authors for their open-sourcing.