ComfyUI_QwenVL

★ 139

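Multimodal Qwen-VL image-and-text query plugin for ComfyUI
A ComfyUI wrapper for Qwen multimodal models. It supports text and single-image queries, is compatible with the Qwen2.5 and Qwen3-VL series, and uses Qwen2.5 instruct checkpoints for text-only workflows.
💡 Use Qwen3/2.5 in ComfyUI for text or single-image question answering and analysis.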
🍴 22 Forks · 💻 Python · 🔄 2025-11-29
📦 requirements.txt
torch
torchvision
numpy
pillow
huggingface_hub
accelerate
qwen-vl-utils
bitsandbytes
optimum
transformers>=4.57.1
[Image: Qwen VL workflow]
[Image: Qwen text workflow]
📄 README

ComfyUI Qwen VL Nodes

This repository provides ComfyUI nodes that wrap the latest vision-language and language-only checkpoints from the Qwen family. Both Qwen3 VL and Qwen2.5 VL models are supported for multimodal reasoning, alongside text-only Qwen2.5 models for prompt generation.

What’s New

  • Added support for the Qwen3 VL family (Qwen3-VL-4B-Thinking, Qwen3-VL-8B-Thinking, etc.).
  • Retained compatibility with existing Qwen2.5 VL models.
  • Text-only workflows continue to use the Qwen2.5 instruct checkpoints.
Sample Workflows

  • Multimodal workflow example: workflow/Qwen2VL.json
  • Text generation workflow example: workflow/qwen25.json
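
To check which nodes a sample workflow uses before loading it, a small script along these lines may help. It is a minimal sketch, not part of this repository: it assumes the JSON is either a ComfyUI UI export (a top-level `nodes` list whose entries carry a `type` field) or an API-format export (a mapping of node ids to objects with a `class_type` field). The helper names `load_workflow` and `node_types` are hypothetical.

```python
import json
from typing import Any, Dict, List


def load_workflow(path: str) -> Dict[str, Any]:
    """Read a workflow JSON file from disk."""
    with open(path, encoding="utf-8") as handle:
        return json.load(handle)


def node_types(workflow: Dict[str, Any]) -> List[str]:
    """Return the sorted, de-duplicated node type names in a workflow.

    Assumption: UI exports keep nodes under "nodes" with a "type" key,
    while API exports map node ids to dicts with a "class_type" key.
    """
    nodes = workflow.get("nodes")
    if isinstance(nodes, list):
        return sorted({n["type"] for n in nodes if isinstance(n, dict) and "type" in n})
    return sorted(
        {n["class_type"] for n in workflow.values()
         if isinstance(n, dict) and "class_type" in n}
    )


if __name__ == "__main__":
    # Inline stand-in for a real file; use load_workflow("workflow/Qwen2VL.json")
    # when running from the repository root.
    demo = {"nodes": [{"id": 1, "type": "LoadImage"}, {"id": 2, "type": "Qwen2VL"}]}
    print(node_types(demo))
```

If a node type from this repository does not appear in the list, the workflow file probably targets a different version of the nodes.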
Installation

You can install through ComfyUI Manager (search for "Qwen-VL wrapper for ComfyUI") or manually:

  • Clone the repository:

    ```bash
    git clone https://github.com/alexcong/ComfyUI_QwenVL.git
    ```

  • Change into the project directory:

    ```bash
    cd ComfyUI_QwenVL
    ```

  • Install dependencies (ensure you are inside your ComfyUI virtual environment if you use one):

    ```bash
    pip install -r requirements.txt
    ```
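
After installing, it can be worth confirming that the environment satisfies the one pinned requirement, `transformers>=4.57.1`. The snippet below is a rough sketch using only the standard library. The helpers `installed_version` and `meets_minimum` are hypothetical, and the comparison looks only at the first three numeric components, so pre-release suffixes are ignored.

```python
import re
from importlib import metadata
from typing import Optional, Tuple


def installed_version(package: str) -> Optional[str]:
    """Return the installed version string, or None if the package is missing."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def _numeric_prefix(version: str) -> Tuple[int, ...]:
    """Turn '4.57.1' into (4, 57, 1); simplistic, ignores dev/rc suffixes."""
    parts = [int(p) for p in re.findall(r"\d+", version)[:3]]
    return tuple(parts + [0] * (3 - len(parts)))


def meets_minimum(version: str, minimum: str) -> bool:
    """Compare two version strings on their first three numeric components."""
    return _numeric_prefix(version) >= _numeric_prefix(minimum)


if __name__ == "__main__":
    found = installed_version("transformers")
    if found is None:
        print("transformers is not installed in this environment")
    elif meets_minimum(found, "4.57.1"):
        print(f"transformers {found} satisfies >=4.57.1")
    else:
        print(f"transformers {found} is older than 4.57.1; upgrade it")
```

Run it with the same Python interpreter that ComfyUI uses, otherwise it reports on a different environment.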

Supported Nodes

  • Qwen2VL node – Multimodal generation with Qwen3 VL and Qwen2.5 VL checkpoints. Accepts images or videos as optional inputs alongside text prompts.
  • Qwen2 node – Text-only generation backed by Qwen2.5 instruct models, with optional quantization for lower memory usage.
  • Both nodes expose parameters for temperature, maximum token count, quantization (none/4-bit/8-bit), and manual seeding. Set keep_model_loaded to True to cache models between runs.
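
The quantization and `keep_model_loaded` options described above can be pictured with the sketch below. It is illustrative only and does not reproduce the nodes' actual code. The option strings `"none"`, `"4-bit"` and `"8-bit"`, the `quantization_kwargs` helper, and the `ModelCache` class are all assumptions. The returned flags use the `load_in_4bit` / `load_in_8bit` names accepted by `transformers.BitsAndBytesConfig`, but the sketch itself imports nothing heavy.

```python
from typing import Any, Callable, Dict, Hashable, Optional


def quantization_kwargs(mode: str) -> Dict[str, bool]:
    """Map a quantization choice to bitsandbytes-style flags.

    The result could be passed as BitsAndBytesConfig(**flags) when it is
    non-empty; an empty dict means "load in full precision".
    """
    table = {
        "none": {},
        "4-bit": {"load_in_4bit": True},
        "8-bit": {"load_in_8bit": True},
    }
    if mode not in table:
        raise ValueError(f"unknown quantization mode: {mode!r}")
    return dict(table[mode])


class ModelCache:
    """Hold at most one loaded model, mimicking a keep_model_loaded switch."""

    def __init__(self, loader: Callable[[Hashable], Any]) -> None:
        self._loader = loader          # hypothetical function that loads a model
        self._key: Optional[Hashable] = None
        self._model: Any = None

    def get(self, key: Hashable, keep_model_loaded: bool) -> Any:
        # Reuse the cached model only when the same key was kept earlier.
        if self._model is not None and self._key == key:
            model = self._model
        else:
            model = self._loader(key)
        if keep_model_loaded:
            self._key, self._model = key, model
        else:
            # Drop the reference so memory can be reclaimed after this run.
            self._key, self._model = None, None
        return model
```

Keying the cache on both the model name and the quantization mode, for example `("some-model", "4-bit")`, avoids reusing a model that was loaded with different settings.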

Model Storage

Downloaded models are stored under ComfyUI/models/LLM/.
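
To locate a model on disk or fetch it ahead of time, something like the following sketch could be used. It assumes each model lives in a subfolder named after the last part of its Hugging Face repo id, which is a guess about the layout rather than documented behavior. The helpers `local_model_dir` and `ensure_model` are hypothetical, and the repo id in the example is illustrative. The download uses `huggingface_hub.snapshot_download`, imported lazily so the path helper works without it.

```python
from pathlib import Path


def local_model_dir(comfy_root: str, repo_id: str) -> Path:
    """Return the assumed folder for a model under ComfyUI/models/LLM/."""
    folder_name = repo_id.split("/")[-1]  # e.g. "org/name" -> "name"
    return Path(comfy_root) / "models" / "LLM" / folder_name


def ensure_model(comfy_root: str, repo_id: str) -> Path:
    """Download the model into the assumed folder if it is not there yet."""
    target = local_model_dir(comfy_root, repo_id)
    if not target.exists():
        # Imported here so the rest of the module has no hard dependency.
        from huggingface_hub import snapshot_download

        snapshot_download(repo_id=repo_id, local_dir=str(target))
    return target


if __name__ == "__main__":
    # Only prints the path; call ensure_model(...) to trigger a download.
    print(local_model_dir("ComfyUI", "Qwen/Qwen3-VL-4B-Thinking"))
```

Pre-downloading this way is mainly useful on machines without internet access at run time. Check the folder name against what the nodes create on first use before relying on it.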