ComfyUI-nunchaku

★ 2,820

量化推理加速显存优化ComfyUI插件

为ComfyUI提供Nunchaku推理节点，支持SVDQuant将权重与激活量化到1-4位，实现显存与延迟大幅下降，便于在笔记本或受限GPU上高速生成。

💡 在ComfyUI中用SVDQuant实现低精度高效率图像生成

🍴 153 Forks💻 Python🔄 2026-02-19

🔗 GitHub 原文

📦

网盘下载

复制链接后前往夸克网盘下载

https://pan.quark.cn/s/cbe6be756051

📦 requirements.txt

diffusers>=0.35
transformers>=4.54
sentencepiece
protobuf
huggingface_hub>=0.34
tomli
peft>=0.17
accelerate>=1.10
insightface
opencv-python
facexlib
onnxruntime
timm

📄 README

Paper | Docs | Website | Blog | Demo | Hugging Face | ModelScope

This repository provides the ComfyUI plugin for Nunchaku, an efficient inference engine for 4-bit neural networks quantized with SVDQuant. For the quantization library, check out DeepCompressor.

Join our user groups on Discord and WeChat for discussions—details here. If you have any questions, run into issues, or are interested in contributing, feel free to share your thoughts with us!

Nunchaku ComfyUI Plugin

News

[2026-01-12] 🚀 v1.2.0 Released! Enjoy a 20–30% Z-Image performance boost, seamless LoRA support with native ComfyUI nodes, and INT4 support for 20-series GPUs!

[2025-12-26] 🚀 v1.1.0: Support 4-bit Tongyi-MAI/Z-Image-Turbo! Download on Hugging Face or ModelScope, and try it with this workflow!

[2025-09-24] 🔥 Released 4-bit 4/8-step Qwen-Image-Edit-2509 lightning models at Hugging Face! Try them out with this workflow!

[2025-09-24] 🔥 Released 4-bit Qwen-Image-Edit-2509! Models are available on Hugging Face. Try them out with this workflow!

[2025-09-09] 🔥 Released 4-bit Qwen-Image-Edit together with the 4/8-step Lightning variants! Models are available on Hugging Face. Try them out with this workflow!

[2025-09-04] 🚀 Official release of Nunchaku v1.0.0! Qwen-Image now supports asynchronous offloading, cutting Transformer VRAM usage to as little as 3 GiB with no performance loss. You can also try our pre-quantized 4/8-step Qwen-Image-Lightning models on Hugging Face or ModelScope.

[2025-08-23] 🚀 v1.0.0 adds support for Qwen-Image! Check this workflow to get started. LoRA support is coming soon.

[2025-07-17] 📘 The official ComfyUI-nunchaku documentation is now live! Explore comprehensive guides and resources to help you get started.

[2025-06-29] 🔥 v0.3.3 now supports FLUX.1-Kontext-dev! Download the quantized model from Hugging Face or ModelScope and use this workflow to get started.

[2025-06-11] Starting from v0.3.2, you can now easily install or update the Nunchaku wheel using this workflow!

[2025-06-07] 🚀 Release Patch v0.3.1! We bring back FB Cache support and fix 4-bit text encoder loading. PuLID nodes are now optional and won’t interfere with other nodes. We’ve also added a NunchakuWheelInstaller node to help you install the correct Nunchaku wheel.

[2025-06-01] 🚀 Release v0.3.0! This update adds support for multiple-batch inference, ControlNet-Union-Pro 2.0 and initial integration of PuLID. You can now load Nunchaku FLUX models as a single file, and our upgraded 4-bit T5 encoder now matches FP8 T5 in quality!

[2025-04-16] 🎥 Released tutorial videos in both English and Chinese to assist installation and usage.

[2025-04-09] 📢 Published the April roadmap and an FAQ to help the community get started and stay up to date with Nunchaku’s development.

[2025-04-05] 🚀 Release v0.2.0! This release introduces multi-LoRA and ControlNet support, with enhanced performance using FP16 attention and First-Block Cache. We’ve also added 20-series GPU compatibility and official workflows for FLUX.1-redux!

Getting Started

Custom Model Quantization: DeepCompressor

Contribution Guide

Frequently Asked Questions

Star History

[](https://www.star-history.com/#nunchaku-ai/ComfyUI-nunchaku&Date)