# Core dependencies
# torch==2.5.0+cu124
# transformers==4.47.1
# numpy==1.26.3
# Pillow==10.1.0
# torch
# transformers
# numpy
# Pillow

# ComfyUI specific
#

# Custom node utilities
# yors_comfyui_node_setup==0.10.1
# yors_pano_ansi_color==1.2.1
yors_comfyui_node_setup
yors_pano_ansi_color

# Optional dependencies (for advanced features)
# bitsandbytes==0.45.0
# bitsandbytes

🤖 ComfyUI custom nodes for captioning images with Joy Caption
# cd to comfyui/custom_nodes
git clone https://github.com/ymc-github/ymc_node_joy
Essential components:
- google/siglip-so400m-patch14-384 (Vision model)
- unsloth/Meta-Llama-3.1-8B-bnb-4bit or meta-llama/Meta-Llama-3.1-8B (LLM)
- Joy_caption/image_adapter.pt (Custom adapter)

<comfyui_root>/
├── models/
│ ├── clip/ # SigLIP Vision Model
│ │ └── siglip-so400m-patch14-384/
│ ├── llm/ # Llama Language Model
│ │ ├── Meta-Llama-3.1-8B-bnb-4bit/
│ │ └── Meta-Llama-3.1-8B/
│ └── Joy_caption/ # Custom Components
│ └── image_adapter.pt # Dimension Adapter
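
The layout above can be checked before starting ComfyUI. The following is a minimal sketch, not part of this repository: the names `REQUIRED` and `missing_components` are hypothetical, and it assumes that either one of the two Llama folders is enough, as the "or" in the component list suggests.

```python
from pathlib import Path

# Paths relative to <comfyui_root>, copied from the directory tree.
# Each inner list is a group of alternatives: one existing entry is enough.
REQUIRED = [
    ["models/clip/siglip-so400m-patch14-384"],
    ["models/llm/Meta-Llama-3.1-8B-bnb-4bit", "models/llm/Meta-Llama-3.1-8B"],
    ["models/Joy_caption/image_adapter.pt"],
]


def missing_components(comfyui_root):
    """Return every group for which none of the alternatives exists on disk."""
    root = Path(comfyui_root)
    return [
        group
        for group in REQUIRED
        if not any((root / rel).exists() for rel in group)
    ]
```

Calling `missing_components("/path/to/ComfyUI")` returns an empty list when all three components are in place; otherwise each returned group names the paths that would satisfy it.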
google/siglip-so400m-patch14-384 (Vision model)
- International: https://huggingface.co/google/siglip-so400m-patch14-384
- China Mirror: https://hf-mirror.com/google/siglip-so400m-patch14-384

unsloth/Meta-Llama-3.1-8B-bnb-4bit (LLM)
- International: https://huggingface.co/unsloth/Meta-Llama-3.1-8B-bnb-4bit
- China Mirror: https://hf-mirror.com/unsloth/Meta-Llama-3.1-8B-bnb-4bit

meta-llama/Meta-Llama-3.1-8B (LLM)
- International: https://huggingface.co/meta-llama/Meta-Llama-3.1-8B (Access approval required)
- China Mirror: https://hf-mirror.com/meta-llama/Meta-Llama-3.1-8B

Joy_caption/image_adapter.pt (Custom adapter)
- International: https://huggingface.co/spaces/fancyfeast/joy-caption-pre-alpha/tree/main/wpkklhc6
- China Mirror: https://www.modelscope.cn/models/fireicewolf/joy-caption-pre-alpha/files
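
To keep track of which download goes where, the links can be paired with their target folders. This is only an illustrative sketch: `HOSTS`, `TARGETS`, and `download_plan` are hypothetical names, nothing is downloaded, and the image adapter is left out because its two sources do not share a URL pattern.

```python
# Base URLs for the two hosts listed in the links.
HOSTS = {
    "international": "https://huggingface.co",
    "china": "https://hf-mirror.com",
}

# Repository id -> folder under <comfyui_root>/models, per the directory tree.
TARGETS = {
    "google/siglip-so400m-patch14-384": "clip/siglip-so400m-patch14-384",
    "unsloth/Meta-Llama-3.1-8B-bnb-4bit": "llm/Meta-Llama-3.1-8B-bnb-4bit",
    "meta-llama/Meta-Llama-3.1-8B": "llm/Meta-Llama-3.1-8B",
}


def download_plan(region="international"):
    """Return (source URL, target folder) pairs for the chosen host."""
    host = HOSTS[region]
    return [
        (f"{host}/{repo}", f"models/{folder}")
        for repo, folder in TARGETS.items()
    ]
```

For example, `download_plan("china")` lists the hf-mirror.com page for each model next to the folder it belongs in. Only one of the two Llama entries needs to be fetched.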
joy, caption)
- ymc/caption
- utils/ymc/caption (as alias)

~~comfy node registry-install ymc_node_joy~~

name|email|description
:--|:--|:--
yemiancheng||
chenxinghua|<455758525@qq.com>|Code reference from StartHua/Comfyui_CXH_joy_caption
MIT