torch torchvision numpy pillow huggingface_hub accelerate qwen-vl-utils bitsandbytes optimum transformers>=4.57.1


This repository provides ComfyUI nodes that wrap the latest vision-language and language-only checkpoints from the Qwen family. Both Qwen3 VL and Qwen2.5 VL models are supported for multimodal reasoning, alongside text-only Qwen2.5 models for prompt generation.
Qwen3-VL-4B-Thinking, Qwen3-VL-8B-Thinking, etc.).

- workflow/Qwen2VL.json
- workflow/qwen25.json

You can install through ComfyUI Manager (search for Qwen-VL wrapper for ComfyUI) or manually:
```bash
git clone https://github.com/alexcong/ComfyUI_QwenVL.git
```
```bash
cd ComfyUI_QwenVL
```
```bash
pip install -r requirements.txt
```
Both nodes expose parameters for temperature, maximum token count, quantization (none/4-bit/8-bit), and manual seeding. Set keep_model_loaded to True to cache models between runs.
Downloaded models are stored under ComfyUI/models/LLM/.
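
To check whether a model is already on disk, the storage location can be resolved with a few lines of Python. This is a sketch only: `model_dir` is a hypothetical helper, and the assumption that each model gets a subfolder named after the last part of its repository id is not confirmed by this README.

```python
from pathlib import Path


def model_dir(comfyui_root: str, model_id: str) -> Path:
    """Build the expected folder for a model under ComfyUI/models/LLM/.

    Assumes (unverified) one subfolder per model, named after the last
    component of the repository id.
    """
    name = model_id.rstrip("/").split("/")[-1]
    return Path(comfyui_root) / "models" / "LLM" / name


# Example with an illustrative repository id.
target = model_dir("ComfyUI", "Qwen/Qwen3-VL-4B-Thinking")
print(target.as_posix())
print("already downloaded" if target.is_dir() else "not downloaded yet")
```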