accelerate>=1.0 bitsandbytes cffi decord diffusers >=0.31.0 diskcache einops>=0.7.0 gitpython huggingface-hub>=0.26.2 matplotlib moviepy numpy>=1.26.4,<2.0.0 openai>=0.27.8 opencv-python optimum>=1.17.0 pillow>=9.4.0 py-cpuinfo>=3.3.0 python-dateutil>=2.7.0 pytz qwen-vl-utils safetensors>=0.4.1 scikit-build six soundfile symusic torch>=2.0.1 torchvision>=0.15.2 transformers>=4.46













cd custom_nodes
git clone https://github.com/gokayfem/ComfyUI_VLM_nodes.git
If you get errors related to llama-cpp-python or if it is not using GPU.
I recommend installing it with the right arguments provided in this link llama-cpp-python
| Tool | Description |
|——|————-|
| DualView | Free side-by-side comparison tool for VLM outputs, images, videos, and AI prompts |
Utilizes “llama-cpp-python“ for integration of LLaVa models. You can load and use any VLM with LLaVa models in GGUF format with this nodes.
You need to download the model similar to “ggml-model-q4_k.gguf` and it's clip projector similar to `mmproj-model-f16.gguf“ from this repositories (in the files and versions).